TOP ACTIVATIONS
MAX = 8.699

mom said . " R az ors are not toys ,
â M az zy was so excited about
it . A lic ia smiled while she looked
realised he wasn â t scared to go to
â M az zy could barely contain her
the swing . She was breath less with excitement as she
like he wasn â t successful . <|endoftext|> Once
â M az zy â s
brave and wasn â t scared at all !
ball ?" A lic ia nodded and said ,
although she wasn â t sure what to do
knew she wasn â t a threat and it
But Emma wasn â t very excited about the
it might make you p ucker your lips ."
. He wasn â t scared anymore ; he
But they weren â t scared anymore . Now
left and it wasn 't long before the little girl grew
because he didn â t know who she was
! Why don â t you make something special
three and didn â t have any money either

MIN ACTIVATIONS
MIN = -10.770

the tree to its once former beauty . He was happy
did something special . On Wed nes days she liked to
There was a gr umpy ranc her named Tom . Tom
to work cleaning the re ind eer . With her help
He slowly walked towards the Gor illa , and it made
regretted getting close to the Gor illa and not listening to
a big box of ch ocol ates to share with the
wanted to learn more about veterin arians . She decided to
to be restored to its former glory ."
a large selection of ch ocol ates . Ell a picked
dirty stable filled with re ind eer . Emma gasped in
started eating the delicious ch ocol ates . She was so
a big box of ch ocol ates . It was so
a big box of ch ocol ates ! <|endoftext|> Once upon
all go riding together on Tues days . That made the
The big , empty house belonged to a family of four
his parents about the mysterious Gor illa . His parents were
wide , green field that belonged to a special family .
her a box of ch ocol ates . Lily opened the
a normal shoe . It belonged to a little boy .

INTERVAL 3.832 - 8.699
CONTAINS 2.319%

I 'm glad I was able to help and make things
The scarf was more than a m ement o ; it
Please , Ben , please ," she said , pleading .
" Look , mom ," she said . " I
He is very helpful ," Ben says . L

INTERVAL -1.036 - 3.832
CONTAINS 72.664%

the crew and Joe was one happy pirate ! <|endoftext|> Once
the village meeting . When Jack arrived , he saw that
box . They all loved it and played with the toys
the chamber and unw ra pped the blanket . He grabbed
all celebrated the completion of the house ! Jack

INTERVAL -5.903 - -1.036
CONTAINS 24.234%

to perform tricks for his friends . Tim had a toy
to everyone he knew . His mom , dad , and
to the flower shop and found the most perfect bunch of
noise and whim pered . She was very miserable .
The frightened squirrel ran and ran , trying to escape

INTERVAL -10.770 - -5.903
CONTAINS 0.783%

her toys . One day her mom gave her a toy
her . When she opened her eyes , she spotted a
woods behind his house . One day , he found a
little girl named Lily . One day , her mom gifted
, I â m
Token mom
Token ID1995
Feature activation+1.58e+00

Pos logit contributions
 habitats+0.118
esty+0.113
 habitat+0.111
asured+0.107
ind+0.107
Neg logit contributions
 big-0.058
 it-0.056
 the-0.055
!-0.055
 bite-0.054
Token said
Token ID531
Feature activation+8.96e-01

Pos logit contributions
 big+0.017
 it+0.015
 bite+0.015
 long+0.014
!+0.014
Neg logit contributions
esty-0.073
 habitats-0.072
asured-0.072
ried-0.072
years-0.070
Token.
Token ID13
Feature activation+6.34e-01

Pos logit contributions
 big+0.011
 bite+0.010
 jugg+0.010
 Emily+0.009
 it+0.009
Neg logit contributions
esty-0.039
ried-0.039
 habitats-0.039
asured-0.038
ities-0.038
Token "
Token ID366
Feature activation-2.04e+00

Pos logit contributions
 bite+0.008
!+0.008
 big+0.008
 yell+0.007
 it+0.007
Neg logit contributions
 habitats-0.028
esty-0.028
 habitat-0.027
ried-0.026
asured-0.026
TokenR
Token ID49
Feature activation-7.59e-01

Pos logit contributions
 habitats+0.086
ishes+0.083
 habitat+0.082
ried+0.081
esty+0.079
Neg logit contributions
 it-0.039
 the-0.038
 big-0.038
 -0.034
 bite-0.033
Tokenaz
Token ID1031
Feature activation+8.70e+00

Pos logit contributions
 habitats+0.022
ried+0.021
years+0.020
esty+0.020
uries+0.020
Neg logit contributions
!-0.024
,-0.023
 it-0.021
 big-0.021
 bubble-0.020
Tokenors
Token ID669
Feature activation-1.62e-01

Pos logit contributions
 it+0.281
!+0.280
 the+0.279
 big+0.270
 bite+0.267
Neg logit contributions
 habitats-0.241
esty-0.227
 habitat-0.222
asured-0.213
ried-0.210
Token are
Token ID389
Feature activation+5.40e-01

Pos logit contributions
 habitats+0.007
esty+0.007
 habitat+0.007
asured+0.007
ind+0.007
Neg logit contributions
 the-0.003
 it-0.003
!-0.003
 big-0.003
 bite-0.003
Token not
Token ID407
Feature activation+2.59e+00

Pos logit contributions
 the+0.008
 big+0.008
 it+0.008
!+0.007
 bite+0.006
Neg logit contributions
 habitats-0.027
 habitat-0.026
esty-0.026
amb-0.025
ried-0.025
Token toys
Token ID14958
Feature activation-1.41e-02

Pos logit contributions
 big+0.046
 it+0.045
 bite+0.044
!+0.044
 the+0.044
Neg logit contributions
 habitats-0.107
esty-0.104
 habitat-0.100
asured-0.099
ind-0.099
Token,
Token ID11
Feature activation+2.08e+00

Pos logit contributions
 habitats+0.001
esty+0.001
 habitat+0.001
ind+0.001
years+0.001
Neg logit contributions
!-0.000
 it-0.000
 big-0.000
 the-0.000
,-0.000
Tokenâ
Token ID22940
Feature activation-2.11e+00

Pos logit contributions
 habitats+0.029
ind+0.027
esty+0.027
asured+0.027
 habitat+0.027
Neg logit contributions
 -0.012
 it-0.012
 the-0.011
!-0.011
 Emily-0.011
Token
Token ID26391
Feature activation+2.03e+00

Pos logit contributions
 habitats+0.074
esty+0.070
 habitat+0.068
ind+0.066
asured+0.066
Neg logit contributions
 big-0.052
!-0.052
 it-0.052
 the-0.052
 bite-0.051
Token
Token ID198
Feature activation+5.61e-01

Pos logit contributions
 bite+0.038
 big+0.038
 the+0.036
 it+0.035
 Emily+0.035
Neg logit contributions
 habitats-0.082
esty-0.079
 habitat-0.077
ind-0.074
ried-0.074
Token
Token ID198
Feature activation+4.15e-01

Pos logit contributions
 bite+0.006
 big+0.006
!+0.006
 Emily+0.006
 noise+0.005
Neg logit contributions
 habitats-0.026
esty-0.025
 habitat-0.025
ind-0.024
asured-0.024
TokenM
Token ID44
Feature activation+1.38e-01

Pos logit contributions
 big+0.008
 it+0.008
 the+0.008
!+0.007
 bite+0.007
Neg logit contributions
 habitats-0.017
esty-0.016
 habitat-0.016
asured-0.016
ried-0.016
Tokenaz
Token ID1031
Feature activation+8.26e+00

Pos logit contributions
!+0.004
 big+0.003
,+0.003
 it+0.003
 loud+0.003
Neg logit contributions
esty-0.005
 habitats-0.005
ishes-0.004
uries-0.004
ried-0.004
Tokenzy
Token ID7357
Feature activation+1.45e+00

Pos logit contributions
 big+0.246
 it+0.240
 bite+0.240
 the+0.238
!+0.237
Neg logit contributions
 habitats-0.248
esty-0.234
 habitat-0.224
ind-0.221
asured-0.220
Token was
Token ID373
Feature activation+6.01e-01

Pos logit contributions
!+0.017
 the+0.016
 big+0.016
 it+0.016
 bite+0.015
Neg logit contributions
 habitats-0.071
esty-0.069
 habitat-0.067
ind-0.066
ities-0.066
Token so
Token ID523
Feature activation+1.17e+00

Pos logit contributions
 big+0.008
 it+0.008
 the+0.007
!+0.007
 bite+0.007
Neg logit contributions
 habitats-0.028
esty-0.027
 habitat-0.026
ind-0.026
asured-0.026
Token excited
Token ID6568
Feature activation-2.46e-01

Pos logit contributions
 big+0.022
 the+0.021
 it+0.021
!+0.020
 +0.019
Neg logit contributions
 habitats-0.052
esty-0.050
 habitat-0.048
ities-0.047
ind-0.047
Token about
Token ID546
Feature activation-3.05e-02

Pos logit contributions
 habitats+0.012
 habitat+0.011
esty+0.011
asured+0.011
ind+0.011
Neg logit contributions
 bite-0.003
 big-0.002
 video-0.002
 yell-0.002
 swirl-0.002
Token it
Token ID340
Feature activation+1.65e+00

Pos logit contributions
 big+0.006
 it+0.006
 the+0.005
!+0.005
 bite+0.004
Neg logit contributions
 habitats-0.113
esty-0.111
 habitat-0.108
ind-0.107
asured-0.107
Token.
Token ID13
Feature activation-6.40e-01

Pos logit contributions
 big+0.013
 it+0.013
!+0.013
 the+0.012
 bite+0.012
Neg logit contributions
 habitats-0.086
esty-0.083
 habitat-0.081
ind-0.080
asured-0.080
Token
Token ID198
Feature activation+7.72e-01

Pos logit contributions
 habitats+0.029
esty+0.028
 habitat+0.027
ried+0.026
ities+0.026
Neg logit contributions
 bite-0.009
 big-0.009
!-0.009
 it-0.008
 the-0.008
Token
Token ID198
Feature activation+5.39e-01

Pos logit contributions
 bite+0.011
 swirl+0.010
 noise+0.010
!+0.010
 yell+0.009
Neg logit contributions
 habitats-0.034
esty-0.032
 habitat-0.031
uzz-0.031
years-0.031
TokenA
Token ID32
Feature activation+1.42e+00

Pos logit contributions
 bite+0.013
!+0.013
 it+0.013
 yell+0.013
 the+0.012
Neg logit contributions
 habitats-0.019
esty-0.018
 habitat-0.017
years-0.016
ind-0.016
Tokenlic
Token ID677
Feature activation+8.16e+00

Pos logit contributions
 bite+0.022
 Jack+0.022
 Amy+0.021
 it+0.021
 happen+0.020
Neg logit contributions
 habitats-0.055
esty-0.052
years-0.049
 habitat-0.047
 ancestors-0.046
Tokenia
Token ID544
Feature activation+2.60e+00

Pos logit contributions
 bite+0.248
 yell+0.243
 it+0.241
 +0.236
 the+0.233
Neg logit contributions
 habitats-0.240
esty-0.218
 habitat-0.208
ried-0.203
edly-0.200
Token smiled
Token ID13541
Feature activation+1.28e-01

Pos logit contributions
 big+0.033
!+0.033
 it+0.032
 the+0.032
 bite+0.032
Neg logit contributions
 habitats-0.123
esty-0.121
 habitat-0.116
ities-0.115
asured-0.115
Token while
Token ID981
Feature activation+7.28e-01

Pos logit contributions
!+0.001
 big+0.001
 the+0.001
 it+0.001
,+0.001
Neg logit contributions
 habitats-0.007
esty-0.006
 habitat-0.006
ried-0.006
ind-0.006
Token she
Token ID673
Feature activation+2.18e-02

Pos logit contributions
 bite+0.009
 big+0.009
!+0.009
 it+0.009
 yell+0.008
Neg logit contributions
 habitats-0.033
esty-0.032
 habitat-0.031
years-0.031
ried-0.031
Token looked
Token ID3114
Feature activation+2.37e+00

Pos logit contributions
 big+0.000
 it+0.000
!+0.000
 the+0.000
 bite+0.000
Neg logit contributions
 habitats-0.001
esty-0.001
 habitat-0.001
ind-0.001
asured-0.001
Token realised
Token ID19169
Feature activation-3.70e-02

Pos logit contributions
 the+0.007
 it+0.007
 big+0.007
!+0.006
 bite+0.006
Neg logit contributions
 habitats-0.019
esty-0.018
 habitat-0.017
ities-0.017
ind-0.017
Token he
Token ID339
Feature activation-9.76e-01

Pos logit contributions
 habitats+0.002
esty+0.002
 habitat+0.002
ind+0.002
ried+0.002
Neg logit contributions
 it-0.000
 big-0.000
 the-0.000
!-0.000
 bite-0.000
Token wasn
Token ID2492
Feature activation-1.57e-01

Pos logit contributions
 habitats+0.048
esty+0.046
 habitat+0.046
ind+0.045
ities+0.044
Neg logit contributions
 it-0.011
 big-0.011
!-0.011
 the-0.011
 bite-0.010
Tokenâ
Token ID22940
Feature activation-1.07e+00

Pos logit contributions
ind+0.007
years+0.007
onel+0.007
ace+0.006
ried+0.006
Neg logit contributions
 Emily-0.002
 noise-0.002
 bite-0.002
 video-0.002
 Lisa-0.002
Token
Token ID26391
Feature activation+2.92e+00

Pos logit contributions
 habitats+0.038
 habitat+0.035
esty+0.035
ishes+0.034
ind+0.034
Neg logit contributions
!-0.026
 bite-0.026
 the-0.026
 it-0.025
 big-0.025
Token
Token ID8151
Feature activation+8.15e+00

Pos logit contributions
 big+0.078
 bite+0.077
 it+0.077
 the+0.076
!+0.075
Neg logit contributions
 habitats-0.094
esty-0.091
 habitat-0.087
asured-0.084
ind-0.084
Tokent
Token ID83
Feature activation+3.58e+00

Pos logit contributions
 noise+0.147
 it+0.146
 bite+0.143
 big+0.142
 swirl+0.140
Neg logit contributions
esty-0.311
 habitats-0.309
ried-0.305
years-0.302
ind-0.301
Token scared
Token ID12008
Feature activation+4.70e+00

Pos logit contributions
 big+0.066
 bite+0.065
 it+0.065
!+0.064
 the+0.062
Neg logit contributions
 habitats-0.143
esty-0.138
 habitat-0.134
asured-0.134
ried-0.132
Token to
Token ID284
Feature activation+1.57e+00

Pos logit contributions
 bite+0.082
 +0.079
 big+0.078
 Emily+0.078
 it+0.077
Neg logit contributions
 habitats-0.185
esty-0.178
 habitat-0.176
ind-0.176
asured-0.174
Token go
Token ID467
Feature activation-1.70e+00

Pos logit contributions
 big+0.026
 it+0.026
 bite+0.025
 the+0.025
!+0.025
Neg logit contributions
 habitats-0.065
esty-0.063
 habitat-0.062
ind-0.060
asured-0.060
Token to
Token ID284
Feature activation+5.37e-01

Pos logit contributions
 habitats+0.074
esty+0.071
 habitat+0.068
ities+0.068
asured+0.067
Neg logit contributions
 big-0.031
!-0.030
 the-0.030
 it-0.030
 bite-0.028
Tokenâ
Token ID22940
Feature activation-2.87e+00

Pos logit contributions
 habitats+0.045
esty+0.043
ind+0.043
ried+0.042
asured+0.042
Neg logit contributions
 -0.022
 it-0.020
 the-0.020
!-0.019
 Emily-0.019
Token
Token ID26391
Feature activation+1.38e+00

Pos logit contributions
 habitats+0.106
esty+0.099
 habitat+0.099
ind+0.096
asured+0.096
Neg logit contributions
!-0.066
 the-0.065
 it-0.064
 big-0.063
 bite-0.063
Token
Token ID198
Feature activation-7.95e-02

Pos logit contributions
 bite+0.030
 big+0.030
 the+0.029
 it+0.029
 noise+0.028
Neg logit contributions
 habitats-0.051
esty-0.049
 habitat-0.048
ind-0.046
ried-0.046
Token
Token ID198
Feature activation-1.93e-01

Pos logit contributions
 habitats+0.004
esty+0.004
 habitat+0.004
ind+0.004
asured+0.004
Neg logit contributions
 bite-0.001
 big-0.001
 it-0.001
!-0.001
 the-0.001
TokenM
Token ID44
Feature activation-8.78e-02

Pos logit contributions
 habitats+0.008
esty+0.008
 habitat+0.008
ishes+0.008
asured+0.008
Neg logit contributions
 big-0.003
 it-0.003
 the-0.003
!-0.003
 -0.003
Tokenaz
Token ID1031
Feature activation+8.12e+00

Pos logit contributions
esty+0.003
 habitats+0.003
uries+0.003
ishes+0.003
 habitat+0.003
Neg logit contributions
!-0.002
 big-0.002
,-0.002
 loud-0.002
 it-0.002
Tokenzy
Token ID7357
Feature activation+1.30e+00

Pos logit contributions
 big+0.230
 it+0.226
 bite+0.224
 the+0.223
!+0.222
Neg logit contributions
 habitats-0.255
esty-0.242
 habitat-0.233
ind-0.227
asured-0.227
Token could
Token ID714
Feature activation+1.45e+00

Pos logit contributions
!+0.016
 the+0.016
 big+0.016
 it+0.016
 bite+0.014
Neg logit contributions
 habitats-0.062
esty-0.060
 habitat-0.058
ind-0.057
asured-0.057
Token barely
Token ID8523
Feature activation-2.46e+00

Pos logit contributions
 big+0.021
 bite+0.021
!+0.021
 it+0.020
 Emily+0.020
Neg logit contributions
 habitats-0.063
esty-0.060
 habitat-0.059
asured-0.058
ried-0.058
Token contain
Token ID3994
Feature activation+2.20e+00

Pos logit contributions
 habitats+0.102
esty+0.098
 habitat+0.095
ind+0.094
asured+0.094
Neg logit contributions
 big-0.045
 it-0.044
 the-0.044
 bite-0.043
!-0.043
Token her
Token ID607
Feature activation-5.03e+00

Pos logit contributions
 the+0.055
 it+0.054
!+0.052
 big+0.052
,+0.048
Neg logit contributions
 habitats-0.083
esty-0.082
ried-0.078
 habitat-0.077
ishes-0.077
Token the
Token ID262
Feature activation+9.48e-01

Pos logit contributions
 Emily+0.000
 big+0.000
 million+0.000
!+0.000
 bite+0.000
Neg logit contributions
 habitats-0.000
esty-0.000
 habitat-0.000
asured-0.000
ried-0.000
Token swing
Token ID9628
Feature activation+1.12e+00

Pos logit contributions
 big+0.017
 it+0.016
 the+0.016
!+0.016
 bite+0.016
Neg logit contributions
 habitats-0.040
esty-0.038
 habitat-0.037
ind-0.037
asured-0.037
Token.
Token ID13
Feature activation-3.55e-01

Pos logit contributions
 big+0.012
 it+0.011
 bite+0.011
 the+0.011
!+0.011
Neg logit contributions
 habitats-0.054
esty-0.053
 habitat-0.052
asured-0.051
ried-0.051
Token She
Token ID1375
Feature activation-1.11e+00

Pos logit contributions
 habitats+0.015
esty+0.015
ind+0.014
asured+0.014
 habitat+0.014
Neg logit contributions
 it-0.006
 the-0.006
 big-0.006
 -0.006
!-0.006
Token was
Token ID373
Feature activation+7.22e-01

Pos logit contributions
 habitats+0.045
esty+0.043
asured+0.042
ried+0.042
 habitat+0.042
Neg logit contributions
 bite-0.018
 noise-0.018
 Emily-0.017
 video-0.017
 big-0.017
Token breath
Token ID8033
Feature activation+8.10e+00

Pos logit contributions
 bite+0.010
 big+0.010
!+0.010
 it+0.010
 Emily+0.009
Neg logit contributions
 habitats-0.031
esty-0.031
asured-0.030
ried-0.030
 habitat-0.030
Tokenless
Token ID1203
Feature activation+1.56e+00

Pos logit contributions
 it+0.122
 Emily+0.120
 Amy+0.119
 bite+0.119
 big+0.115
Neg logit contributions
 habitats-0.333
esty-0.330
ind-0.323
ried-0.320
 habitat-0.318
Token with
Token ID351
Feature activation+1.21e+00

Pos logit contributions
 big+0.017
 bite+0.016
 it+0.016
 the+0.016
!+0.015
Neg logit contributions
 habitats-0.075
esty-0.073
 habitat-0.071
asured-0.070
ried-0.070
Token excitement
Token ID14067
Feature activation+2.05e+00

Pos logit contributions
 big+0.013
 it+0.013
 the+0.012
!+0.012
 bite+0.012
Neg logit contributions
 habitats-0.060
esty-0.058
 habitat-0.056
asured-0.055
ind-0.055
Token as
Token ID355
Feature activation-2.37e-01

Pos logit contributions
 big+0.019
!+0.018
 it+0.018
 the+0.018
 bite+0.016
Neg logit contributions
 habitats-0.104
esty-0.101
 habitat-0.099
asured-0.097
ind-0.097
Token she
Token ID673
Feature activation-8.99e-02

Pos logit contributions
 habitats+0.012
esty+0.012
 habitat+0.012
asured+0.012
ried+0.011
Neg logit contributions
 it-0.002
 the-0.002
 big-0.002
!-0.002
 Emily-0.002
Token like
Token ID588
Feature activation-9.49e-01

Pos logit contributions
 big+0.028
 it+0.028
 the+0.027
!+0.027
 bite+0.027
Neg logit contributions
 habitats-0.106
esty-0.103
 habitat-0.100
ind-0.098
asured-0.098
Token he
Token ID339
Feature activation-1.15e-01

Pos logit contributions
esty+0.050
 habitats+0.050
ried+0.048
ishes+0.048
ind+0.047
Neg logit contributions
 it-0.011
 the-0.011
 big-0.009
!-0.009
,-0.007
Token wasn
Token ID2492
Feature activation+7.42e-02

Pos logit contributions
 habitats+0.006
esty+0.006
 habitat+0.006
ind+0.005
ities+0.005
Neg logit contributions
!-0.001
 it-0.001
 big-0.001
 the-0.001
 bite-0.001
Tokenâ
Token ID22940
Feature activation-1.18e+00

Pos logit contributions
 Emily+0.001
 bite+0.001
 Amy+0.001
 video+0.001
 swirl+0.001
Neg logit contributions
ind-0.003
ried-0.003
onel-0.003
ace-0.003
esty-0.003
Token
Token ID26391
Feature activation+3.10e+00

Pos logit contributions
 habitats+0.044
esty+0.041
 habitat+0.040
ind+0.039
asured+0.039
Neg logit contributions
 big-0.027
!-0.027
 it-0.027
 the-0.027
 bite-0.027
Token
Token ID8151
Feature activation+8.08e+00

Pos logit contributions
 big+0.079
 it+0.078
 bite+0.077
 the+0.077
!+0.076
Neg logit contributions
 habitats-0.105
esty-0.101
 habitat-0.097
ind-0.095
asured-0.095
Tokent
Token ID83
Feature activation+3.95e+00

Pos logit contributions
 noise+0.152
 it+0.150
 bite+0.149
 big+0.146
 swirl+0.146
Neg logit contributions
esty-0.301
 habitats-0.297
ried-0.293
ind-0.292
years-0.287
Token successful
Token ID4388
Feature activation+2.99e+00

Pos logit contributions
 big+0.076
 bite+0.075
 it+0.074
 Emily+0.072
!+0.072
Neg logit contributions
 habitats-0.150
esty-0.147
asured-0.144
 habitat-0.142
ried-0.141
Token.
Token ID13
Feature activation-5.42e-01

Pos logit contributions
!+0.026
 it+0.026
 big+0.026
 the+0.025
 bite+0.023
Neg logit contributions
 habitats-0.153
esty-0.149
 habitat-0.145
ind-0.143
asured-0.142
Token<|endoftext|>
Token ID50256
Feature activation-4.15e+00

Pos logit contributions
 habitats+0.029
esty+0.028
ind+0.027
 habitat+0.027
ried+0.027
Neg logit contributions
 -0.004
 it-0.004
 the-0.004
!-0.004
 big-0.004
TokenOnce
Token ID7454
Feature activation-2.25e+00

Pos logit contributions
 habitats+0.129
esty+0.120
 habitat+0.118
ind+0.114
ried+0.114
Neg logit contributions
 bite-0.109
 yell-0.104
 big-0.104
 noise-0.103
 loud-0.102
Tokenâ
Token ID22940
Feature activation-2.71e+00

Pos logit contributions
 habitats+0.026
esty+0.025
ind+0.025
asured+0.024
ities+0.024
Neg logit contributions
 it-0.007
 -0.007
 the-0.007
!-0.007
 Emily-0.006
Token
Token ID26391
Feature activation+1.49e+00

Pos logit contributions
 habitats+0.091
esty+0.085
 habitat+0.084
ishes+0.081
ities+0.081
Neg logit contributions
!-0.072
 the-0.071
 -0.070
 it-0.069
 big-0.068
Token
Token ID198
Feature activation-8.14e-01

Pos logit contributions
 big+0.018
 bite+0.018
 it+0.016
 the+0.016
 noise+0.015
Neg logit contributions
 habitats-0.070
esty-0.068
 habitat-0.067
ind-0.065
ried-0.065
Token
Token ID198
Feature activation-9.58e-01

Pos logit contributions
 habitats+0.041
esty+0.040
 habitat+0.039
ind+0.039
asured+0.038
Neg logit contributions
 big-0.007
 bite-0.007
 it-0.006
!-0.006
 the-0.006
TokenM
Token ID44
Feature activation-3.88e-01

Pos logit contributions
 habitats+0.042
esty+0.041
ishes+0.040
 habitat+0.040
ities+0.039
Neg logit contributions
 big-0.016
 it-0.016
!-0.016
 the-0.016
 -0.015
Tokenaz
Token ID1031
Feature activation+8.07e+00

Pos logit contributions
esty+0.014
 habitats+0.013
ried+0.013
ishes+0.013
uries+0.013
Neg logit contributions
!-0.010
 big-0.010
,-0.010
 replied-0.009
 the-0.009
Tokenzy
Token ID7357
Feature activation+1.20e+00

Pos logit contributions
 big+0.235
 it+0.231
 bite+0.229
 the+0.229
!+0.228
Neg logit contributions
 habitats-0.247
esty-0.234
 habitat-0.225
asured-0.219
ind-0.219
Tokenâ
Token ID22940
Feature activation-2.59e+00

Pos logit contributions
 big+0.015
!+0.015
 the+0.015
 it+0.015
 bite+0.014
Neg logit contributions
 habitats-0.056
esty-0.055
 habitat-0.053
asured-0.052
ind-0.052
Token
Token ID26391
Feature activation+2.07e+00

Pos logit contributions
 habitats+0.090
esty+0.085
 habitat+0.083
ishes+0.082
ind+0.082
Neg logit contributions
!-0.064
 the-0.063
 -0.062
 bite-0.062
 it-0.061
Token
Token ID8151
Feature activation+5.41e+00

Pos logit contributions
 big+0.050
 bite+0.049
 it+0.048
 the+0.048
 noise+0.047
Neg logit contributions
 habitats-0.072
esty-0.069
 habitat-0.068
ind-0.065
ried-0.065
Tokens
Token ID82
Feature activation-3.81e+00

Pos logit contributions
 big+0.113
 bite+0.112
 it+0.109
 noise+0.108
 the+0.106
Neg logit contributions
 habitats-0.206
esty-0.196
ried-0.190
 habitat-0.190
uzz-0.186
Token brave
Token ID14802
Feature activation+3.43e+00

Pos logit contributions
 big+0.011
 it+0.011
 the+0.011
!+0.011
 bite+0.011
Neg logit contributions
 habitats-0.024
esty-0.023
 habitat-0.022
asured-0.022
ind-0.022
Token and
Token ID290
Feature activation-9.56e-01

Pos logit contributions
 big+0.036
 bite+0.035
 it+0.035
 the+0.034
!+0.033
Neg logit contributions
 habitats-0.166
esty-0.161
 habitat-0.157
asured-0.155
ind-0.154
Token wasn
Token ID2492
Feature activation-3.11e-02

Pos logit contributions
 habitats+0.040
esty+0.039
ities+0.038
 habitat+0.038
ind+0.038
Neg logit contributions
 it-0.019
 the-0.019
 big-0.018
!-0.017
 bite-0.016
Tokenâ
Token ID22940
Feature activation-1.36e+00

Pos logit contributions
ried+0.001
years+0.001
ind+0.001
esty+0.001
onel+0.001
Neg logit contributions
 Emily-0.000
 video-0.000
 noise-0.000
 swirl-0.000
 bite-0.000
Token
Token ID26391
Feature activation+2.89e+00

Pos logit contributions
 habitats+0.053
 habitat+0.049
esty+0.049
ishes+0.049
asured+0.048
Neg logit contributions
!-0.030
 the-0.029
 it-0.028
 bite-0.028
 -0.028
Token
Token ID8151
Feature activation+8.00e+00

Pos logit contributions
 big+0.075
 it+0.074
 bite+0.074
 the+0.074
!+0.073
Neg logit contributions
 habitats-0.096
esty-0.092
 habitat-0.089
ind-0.086
ried-0.086
Tokent
Token ID83
Feature activation+3.41e+00

Pos logit contributions
 it+0.134
 bite+0.133
 big+0.133
 noise+0.130
 the+0.129
Neg logit contributions
 habitats-0.325
esty-0.321
ried-0.312
ind-0.309
years-0.305
Token scared
Token ID12008
Feature activation+4.32e+00

Pos logit contributions
 bite+0.066
 big+0.064
 video+0.063
 Emily+0.062
 +0.061
Neg logit contributions
 habitats-0.127
esty-0.124
ried-0.123
asured-0.122
 habitat-0.120
Token at
Token ID379
Feature activation-1.76e+00

Pos logit contributions
 bite+0.053
 big+0.051
 it+0.049
 video+0.048
 +0.047
Neg logit contributions
 habitats-0.195
esty-0.189
 habitat-0.185
ind-0.185
ried-0.183
Token all
Token ID477
Feature activation-4.81e-01

Pos logit contributions
 habitats+0.084
esty+0.081
 habitat+0.080
asured+0.078
ind+0.078
Neg logit contributions
 big-0.020
 it-0.020
 bite-0.019
!-0.019
 the-0.019
Token!
Token ID0
Feature activation+3.75e-01

Pos logit contributions
 habitats+0.022
esty+0.021
 habitat+0.021
ind+0.021
ried+0.021
Neg logit contributions
 bite-0.005
 big-0.005
 jugg-0.004
 swirl-0.004
 video-0.004
Token ball
Token ID2613
Feature activation-2.88e-01

Pos logit contributions
 it+0.043
 the+0.042
 Emily+0.040
!+0.040
 Amy+0.039
Neg logit contributions
 habitats-0.116
esty-0.115
 habitat-0.110
ind-0.109
uries-0.106
Token?"
Token ID1701
Feature activation+3.18e+00

Pos logit contributions
 habitats+0.014
esty+0.014
 habitat+0.013
ind+0.013
ities+0.013
Neg logit contributions
!-0.003
 big-0.003
 it-0.003
 the-0.003
 bite-0.003
Token
Token ID198
Feature activation+1.15e+00

Pos logit contributions
 bite+0.054
 big+0.045
 video+0.044
 swirl+0.043
 bubble+0.043
Neg logit contributions
 habitats-0.134
esty-0.126
 habitat-0.124
amb-0.123
years-0.120
Token
Token ID198
Feature activation+7.90e-01

Pos logit contributions
 bite+0.021
 noise+0.018
 video+0.017
 swirl+0.017
 yell+0.017
Neg logit contributions
 habitats-0.045
uzz-0.043
esty-0.043
amb-0.042
ind-0.042
TokenA
Token ID32
Feature activation+1.49e+00

Pos logit contributions
 bite+0.016
 it+0.015
 big+0.015
!+0.014
 noise+0.014
Neg logit contributions
 habitats-0.031
esty-0.030
ind-0.028
years-0.028
 habitat-0.028
Tokenlic
Token ID677
Feature activation+7.99e+00

Pos logit contributions
 bite+0.031
't+0.026
 it+0.026
 Emily+0.026
 happen+0.026
Neg logit contributions
 habitats-0.050
esty-0.048
ber-0.047
years-0.047
ind-0.045
Tokenia
Token ID544
Feature activation+2.73e+00

Pos logit contributions
 bite+0.213
 it+0.209
!+0.204
 Emily+0.203
 the+0.201
Neg logit contributions
 habitats-0.256
esty-0.241
 habitat-0.234
ried-0.233
ind-0.231
Token nodded
Token ID14464
Feature activation+2.31e+00

Pos logit contributions
 big+0.039
!+0.039
 the+0.038
 it+0.038
 bite+0.035
Neg logit contributions
 habitats-0.125
esty-0.124
ities-0.119
 habitat-0.118
asured-0.118
Token and
Token ID290
Feature activation-1.09e+00

Pos logit contributions
 big+0.027
 it+0.026
 the+0.026
!+0.025
 bite+0.025
Neg logit contributions
 habitats-0.112
esty-0.109
 habitat-0.106
ind-0.104
asured-0.104
Token said
Token ID531
Feature activation-1.00e+00

Pos logit contributions
 habitats+0.052
esty+0.051
ried+0.049
 habitat+0.049
asured+0.049
Neg logit contributions
 bite-0.011
 big-0.011
!-0.011
 it-0.010
 yell-0.009
Token,
Token ID11
Feature activation+1.32e+00

Pos logit contributions
esty+0.050
 habitats+0.050
ried+0.049
 habitat+0.048
amb+0.048
Neg logit contributions
 bite-0.007
 big-0.006
 yell-0.006
 video-0.005
 noise-0.005
Token although
Token ID3584
Feature activation-1.52e+00

Pos logit contributions
 big+0.018
 bite+0.017
 it+0.016
!+0.016
 the+0.016
Neg logit contributions
 habitats-0.070
esty-0.067
 habitat-0.066
asured-0.065
ried-0.065
Token she
Token ID673
Feature activation-3.25e-01

Pos logit contributions
 habitats+0.086
esty+0.085
 habitat+0.082
ind+0.081
asured+0.081
Neg logit contributions
 it-0.009
 the-0.008
 big-0.007
!-0.007
 Emily-0.004
Token wasn
Token ID2492
Feature activation-1.99e-01

Pos logit contributions
 habitats+0.015
esty+0.015
 habitat+0.015
ind+0.014
ities+0.014
Neg logit contributions
 big-0.004
!-0.004
 it-0.004
 the-0.004
 bite-0.004
Tokenâ
Token ID22940
Feature activation-1.28e+00

Pos logit contributions
ried+0.008
ind+0.008
ace+0.008
years+0.008
onel+0.008
Neg logit contributions
 video-0.003
 noise-0.003
 Emily-0.002
 swirl-0.002
 bubble-0.002
Token
Token ID26391
Feature activation+2.72e+00

Pos logit contributions
 habitats+0.049
ishes+0.045
 habitat+0.045
esty+0.045
ind+0.044
Neg logit contributions
!-0.029
 bite-0.028
 the-0.027
 it-0.027
 -0.026
Token
Token ID8151
Feature activation+7.98e+00

Pos logit contributions
 bite+0.068
 big+0.068
 it+0.067
 the+0.066
!+0.064
Neg logit contributions
 habitats-0.092
esty-0.089
 habitat-0.085
ried-0.082
ind-0.082
Tokent
Token ID83
Feature activation+3.28e+00

Pos logit contributions
 noise+0.147
 it+0.143
 bite+0.143
 swirl+0.140
 big+0.139
Neg logit contributions
esty-0.303
 habitats-0.298
ried-0.297
years-0.292
ind-0.292
Token sure
Token ID1654
Feature activation+1.82e+00

Pos logit contributions
 bite+0.064
 big+0.063
 it+0.062
 +0.061
!+0.061
Neg logit contributions
 habitats-0.123
esty-0.119
asured-0.118
ried-0.117
 habitat-0.116
Token what
Token ID644
Feature activation+2.56e+00

Pos logit contributions
 it+0.038
 the+0.037
!+0.036
 big+0.036
,+0.034
Neg logit contributions
 habitats-0.076
esty-0.075
 habitat-0.072
ind-0.070
asured-0.070
Token to
Token ID284
Feature activation+1.02e+00

Pos logit contributions
 big+0.022
 bite+0.022
!+0.020
 it+0.020
 the+0.020
Neg logit contributions
 habitats-0.128
esty-0.123
 habitat-0.120
ind-0.119
asured-0.119
Token do
Token ID466
Feature activation+4.10e+00

Pos logit contributions
 big+0.017
 +0.016
 Emily+0.016
 million+0.016
 swirl+0.015
Neg logit contributions
 habitats-0.038
 property-0.037
ities-0.036
adow-0.036
esty-0.036
Token knew
Token ID2993
Feature activation-2.81e+00

Pos logit contributions
 habitats+0.060
esty+0.057
 habitat+0.056
ind+0.055
ities+0.055
Neg logit contributions
 the-0.020
 it-0.020
 big-0.020
!-0.019
 bite-0.018
Token she
Token ID673
Feature activation-9.92e-01

Pos logit contributions
 habitats+0.158
esty+0.156
 habitat+0.151
ind+0.148
ishes+0.147
Neg logit contributions
 it-0.027
 the-0.025
!-0.020
 big-0.018
,-0.015
Token wasn
Token ID2492
Feature activation-3.57e-01

Pos logit contributions
 habitats+0.049
 habitat+0.047
esty+0.047
ities+0.045
ind+0.045
Neg logit contributions
 it-0.013
!-0.012
 big-0.012
 the-0.012
 bite-0.011
Tokenâ
Token ID22940
Feature activation-1.42e+00

Pos logit contributions
ind+0.015
ace+0.015
ried+0.015
years+0.015
esty+0.015
Neg logit contributions
 Emily-0.004
 noise-0.004
 video-0.004
 swirl-0.004
 bubble-0.004
Token
Token ID26391
Feature activation+2.41e+00

Pos logit contributions
 habitats+0.053
ishes+0.049
 habitat+0.049
esty+0.047
ind+0.047
Neg logit contributions
!-0.033
 bite-0.033
 the-0.032
 it-0.031
 yell-0.031
Token
Token ID8151
Feature activation+7.86e+00

Pos logit contributions
 big+0.069
 it+0.068
 bite+0.068
 the+0.068
!+0.066
Neg logit contributions
 habitats-0.074
esty-0.071
 habitat-0.068
ind-0.065
ried-0.065
Tokent
Token ID83
Feature activation+3.43e+00

Pos logit contributions
 big+0.128
 it+0.128
 bite+0.127
 noise+0.126
 the+0.124
Neg logit contributions
 habitats-0.319
esty-0.317
ried-0.307
ind-0.305
years-0.302
Token a
Token ID257
Feature activation+5.50e+00

Pos logit contributions
 big+0.062
 it+0.061
!+0.060
 bite+0.059
 the+0.059
Neg logit contributions
 habitats-0.138
esty-0.134
 habitat-0.130
asured-0.129
ried-0.129
Token threat
Token ID2372
Feature activation+1.24e-01

Pos logit contributions
 Emily+0.104
 the+0.100
 Luke+0.100
 +0.100
 it+0.099
Neg logit contributions
esty-0.208
 habitats-0.205
ind-0.197
 habitat-0.197
 property-0.193
Token and
Token ID290
Feature activation-1.94e-01

Pos logit contributions
 it+0.001
!+0.001
 big+0.001
 the+0.001
 bite+0.001
Neg logit contributions
 habitats-0.007
esty-0.006
 habitat-0.006
ind-0.006
asured-0.006
Token it
Token ID340
Feature activation+2.01e+00

Pos logit contributions
 habitats+0.009
esty+0.008
ind+0.008
ities+0.008
 habitat+0.008
Neg logit contributions
 it-0.004
 the-0.004
 big-0.003
!-0.003
 bite-0.003
Token But
Token ID887
Feature activation-5.69e-01

Pos logit contributions
 habitats+0.011
esty+0.011
 habitat+0.010
asured+0.010
ind+0.010
Neg logit contributions
 big-0.005
 it-0.005
 bite-0.005
 the-0.005
!-0.005
Token Emma
Token ID18966
Feature activation+2.22e+00

Pos logit contributions
 habitats+0.026
esty+0.025
 habitat+0.024
ind+0.024
ishes+0.024
Neg logit contributions
 it-0.010
 the-0.010
!-0.009
 big-0.009
,-0.009
Token wasn
Token ID2492
Feature activation+8.47e-02

Pos logit contributions
 it+0.033
!+0.032
 the+0.031
 big+0.031
 bite+0.029
Neg logit contributions
 habitats-0.102
esty-0.098
 habitat-0.096
ind-0.094
asured-0.094
Tokenâ
Token ID22940
Feature activation-1.13e+00

Pos logit contributions
 Emily+0.001
 video+0.001
 bite+0.001
 swirl+0.001
 noise+0.001
Neg logit contributions
oll-0.003
years-0.003
ind-0.003
ace-0.003
asured-0.003
Token
Token ID26391
Feature activation+2.73e+00

Pos logit contributions
 habitats+0.043
esty+0.040
 habitat+0.040
ind+0.039
asured+0.038
Neg logit contributions
!-0.025
 the-0.025
 it-0.025
 big-0.025
 bite-0.025
Token
Token ID8151
Feature activation+7.86e+00

Pos logit contributions
 big+0.072
 it+0.071
!+0.070
 the+0.070
 bite+0.069
Neg logit contributions
 habitats-0.091
esty-0.087
 habitat-0.084
asured-0.082
ind-0.082
Tokent
Token ID83
Feature activation+3.59e+00

Pos logit contributions
 noise+0.156
 bite+0.151
 it+0.150
 swirl+0.147
 bubble+0.147
Neg logit contributions
esty-0.287
 habitats-0.286
ried-0.282
years-0.279
uzz-0.278
Token very
Token ID845
Feature activation+1.63e+00

Pos logit contributions
 bite+0.070
 big+0.070
 it+0.069
 Emily+0.068
!+0.067
Neg logit contributions
 habitats-0.132
esty-0.131
asured-0.128
ried-0.127
 habitat-0.124
Token excited
Token ID6568
Feature activation+7.08e-01

Pos logit contributions
 it+0.034
 big+0.033
 bite+0.033
!+0.033
 the+0.033
Neg logit contributions
 habitats-0.061
esty-0.059
asured-0.057
ried-0.057
 habitat-0.057
Token about
Token ID546
Feature activation+3.79e-01

Pos logit contributions
 big+0.009
 bite+0.009
 +0.008
 yell+0.008
 it+0.008
Neg logit contributions
 habitats-0.032
esty-0.031
 habitat-0.030
asured-0.030
ities-0.030
Token the
Token ID262
Feature activation+1.11e+00

Pos logit contributions
 it+0.004
 big+0.004
 the+0.004
!+0.004
 bite+0.003
Neg logit contributions
 habitats-0.019
esty-0.019
 habitat-0.018
ried-0.018
asured-0.018
Token it
Token ID340
Feature activation+1.47e+00

Pos logit contributions
 it+0.013
 the+0.012
 big+0.012
!+0.010
 bite+0.010
Neg logit contributions
 habitats-0.039
esty-0.038
 habitat-0.037
ried-0.037
ind-0.036
Token might
Token ID1244
Feature activation-6.74e-02

Pos logit contributions
 big+0.023
 it+0.023
 bite+0.022
 the+0.022
!+0.022
Neg logit contributions
 habitats-0.066
esty-0.063
 habitat-0.062
asured-0.060
ind-0.060
Token make
Token ID787
Feature activation+3.78e+00

Pos logit contributions
 habitats+0.003
esty+0.003
 habitat+0.003
asured+0.003
ried+0.003
Neg logit contributions
 Emily-0.001
 Luke-0.001
 -0.001
 Amy-0.001
 it-0.001
Token you
Token ID345
Feature activation-2.00e+00

Pos logit contributions
 big+0.073
 bite+0.072
!+0.072
 it+0.072
 the+0.071
Neg logit contributions
 habitats-0.149
esty-0.143
 habitat-0.139
ind-0.137
asured-0.136
Token p
Token ID279
Feature activation-8.03e-01

Pos logit contributions
 habitats+0.084
esty+0.082
 habitat+0.080
asured+0.079
ried+0.078
Neg logit contributions
 it-0.030
 big-0.030
 Emily-0.030
!-0.030
 -0.029
Tokenucker
Token ID12603
Feature activation+7.84e+00

Pos logit contributions
 habitats+0.023
esty+0.021
 habitat+0.020
ities+0.020
 belongings+0.019
Neg logit contributions
 big-0.026
!-0.025
 bite-0.025
 it-0.025
 the-0.025
Token your
Token ID534
Feature activation-4.21e+00

Pos logit contributions
 bite+0.118
 big+0.118
 Emily+0.113
 +0.111
 it+0.111
Neg logit contributions
 habitats-0.331
esty-0.330
ind-0.315
ishes-0.312
 habitat-0.311
Token lips
Token ID11914
Feature activation+3.27e+00

Pos logit contributions
 habitats+0.147
esty+0.138
 habitat+0.134
ind+0.132
asured+0.132
Neg logit contributions
 big-0.107
 it-0.105
 bite-0.104
 the-0.104
!-0.104
Token."
Token ID526
Feature activation+1.61e+00

Pos logit contributions
 big+0.046
 it+0.045
 bite+0.044
 the+0.044
 Emily+0.043
Neg logit contributions
 habitats-0.144
esty-0.141
 habitat-0.137
asured-0.136
ried-0.135
Token
Token ID198
Feature activation+1.12e+00

Pos logit contributions
 big+0.011
 it+0.010
 the+0.010
!+0.010
 bite+0.010
Neg logit contributions
 habitats-0.085
esty-0.083
 habitat-0.081
asured-0.080
ind-0.080
Token
Token ID198
Feature activation+8.37e-01

Pos logit contributions
 bite+0.013
 big+0.011
!+0.011
 swirl+0.011
 yell+0.011
Neg logit contributions
 habitats-0.053
ind-0.050
esty-0.050
 habitat-0.049
uzz-0.049
Token.
Token ID13
Feature activation-4.26e-01

Pos logit contributions
!+0.017
 it+0.017
 the+0.017
 big+0.015
,+0.014
Neg logit contributions
 habitats-0.072
esty-0.070
 habitat-0.068
ind-0.067
ishes-0.067
Token He
Token ID679
Feature activation-1.12e+00

Pos logit contributions
 habitats+0.020
esty+0.019
 habitat+0.018
ind+0.018
asured+0.018
Neg logit contributions
 big-0.006
 it-0.006
 the-0.006
!-0.006
 bite-0.006
Token wasn
Token ID2492
Feature activation-1.15e-01

Pos logit contributions
 habitats+0.052
esty+0.051
 habitat+0.049
ities+0.049
ind+0.048
Neg logit contributions
 it-0.017
 big-0.017
 the-0.016
!-0.016
 bite-0.015
Tokenâ
Token ID22940
Feature activation-1.32e+00

Pos logit contributions
ried+0.005
onel+0.005
years+0.005
ace+0.005
esty+0.005
Neg logit contributions
 Emily-0.001
 video-0.001
 bite-0.001
 noise-0.001
 Amy-0.001
Token
Token ID26391
Feature activation+2.50e+00

Pos logit contributions
 habitats+0.048
 habitat+0.044
ishes+0.044
esty+0.043
ind+0.043
Neg logit contributions
!-0.032
 the-0.031
 bite-0.031
 it-0.031
 -0.030
Token
Token ID8151
Feature activation+7.82e+00

Pos logit contributions
 big+0.062
 it+0.062
!+0.061
 the+0.061
 bite+0.059
Neg logit contributions
 habitats-0.088
esty-0.084
 habitat-0.081
ind-0.080
asured-0.080
Tokent
Token ID83
Feature activation+3.42e+00

Pos logit contributions
 noise+0.142
 it+0.141
 bite+0.140
 big+0.136
 the+0.135
Neg logit contributions
esty-0.300
 habitats-0.295
ried-0.294
years-0.289
ind-0.289
Token scared
Token ID12008
Feature activation+4.29e+00

Pos logit contributions
 bite+0.067
 video+0.065
 big+0.065
 Emily+0.065
 +0.064
Neg logit contributions
 habitats-0.123
asured-0.122
esty-0.121
ried-0.120
years-0.117
Token anymore
Token ID7471
Feature activation+2.06e+00

Pos logit contributions
 bite+0.063
 big+0.060
 video+0.060
 +0.060
 Emily+0.059
Neg logit contributions
 habitats-0.180
esty-0.173
 habitat-0.172
ind-0.171
asured-0.169
Token;
Token ID26
Feature activation-1.42e+00

Pos logit contributions
 big+0.019
 bite+0.018
 it+0.018
 the+0.017
!+0.016
Neg logit contributions
 habitats-0.103
esty-0.099
 habitat-0.097
asured-0.096
ind-0.095
Token he
Token ID339
Feature activation-4.69e-01

Pos logit contributions
 habitats+0.081
esty+0.080
 habitat+0.077
ind+0.077
ishes+0.076
Neg logit contributions
 it-0.009
 the-0.008
 big-0.006
!-0.006
 bite-0.004
Token But
Token ID887
Feature activation-5.13e-01

Pos logit contributions
 habitats+0.021
esty+0.021
 habitat+0.020
asured+0.019
ind+0.019
Neg logit contributions
 big-0.010
 bite-0.010
 it-0.010
!-0.010
 the-0.010
Token they
Token ID484
Feature activation+1.40e+00

Pos logit contributions
 habitats+0.026
esty+0.025
 habitat+0.024
ind+0.024
ried+0.024
Neg logit contributions
 it-0.007
 the-0.006
 big-0.006
!-0.005
,-0.005
Token weren
Token ID6304
Feature activation-1.86e-01

Pos logit contributions
 it+0.025
 the+0.025
 big+0.025
!+0.024
 bite+0.024
Neg logit contributions
 habitats-0.060
esty-0.058
 habitat-0.057
ind-0.056
asured-0.055
Tokenâ
Token ID22940
Feature activation-1.44e+00

Pos logit contributions
ace+0.007
ind+0.007
onel+0.007
ried+0.007
years+0.007
Neg logit contributions
 Emily-0.003
 bite-0.002
 video-0.002
 Luke-0.002
 jugg-0.002
Token
Token ID26391
Feature activation+2.61e+00

Pos logit contributions
 habitats+0.049
esty+0.045
 habitat+0.045
asured+0.043
ishes+0.043
Neg logit contributions
!-0.038
 the-0.037
 bite-0.037
 it-0.037
 big-0.037
Token
Token ID8151
Feature activation+7.80e+00

Pos logit contributions
 bite+0.061
 big+0.059
 it+0.058
 the+0.057
 yell+0.056
Neg logit contributions
 habitats-0.093
esty-0.090
 habitat-0.087
ried-0.084
 belongings-0.083
Tokent
Token ID83
Feature activation+3.04e+00

Pos logit contributions
 noise+0.157
 it+0.151
 bite+0.150
 bubble+0.149
 swirl+0.147
Neg logit contributions
esty-0.283
 habitats-0.281
ried-0.280
uzz-0.277
ind-0.277
Token scared
Token ID12008
Feature activation+4.07e+00

Pos logit contributions
 bite+0.058
 big+0.056
 +0.054
 Emily+0.054
 video+0.054
Neg logit contributions
 habitats-0.113
asured-0.111
esty-0.110
ried-0.110
ind-0.109
Token anymore
Token ID7471
Feature activation+1.92e+00

Pos logit contributions
 bite+0.067
 big+0.064
 +0.063
 video+0.062
 jugg+0.061
Neg logit contributions
 habitats-0.163
ind-0.157
 habitat-0.155
esty-0.155
asured-0.152
Token.
Token ID13
Feature activation-4.70e-01

Pos logit contributions
 big+0.019
 bite+0.018
 it+0.017
 the+0.017
 Emily+0.016
Neg logit contributions
 habitats-0.094
esty-0.090
 habitat-0.089
asured-0.088
ind-0.087
Token Now
Token ID2735
Feature activation-1.09e-01

Pos logit contributions
 habitats+0.022
esty+0.021
 habitat+0.020
ind+0.020
asured+0.020
Neg logit contributions
 it-0.006
 the-0.006
 big-0.006
 -0.006
!-0.006
Token left
Token ID1364
Feature activation-2.56e+00

Pos logit contributions
 habitats+0.192
adow+0.186
 habitat+0.185
asured+0.183
esty+0.182
Neg logit contributions
 yell-0.056
 bite-0.053
 Emily-0.053
 shout-0.050
 jugg-0.049
Token and
Token ID290
Feature activation-1.30e+00

Pos logit contributions
esty+0.127
 habitats+0.126
asured+0.118
years+0.118
 habitat+0.118
Neg logit contributions
 the-0.034
 it-0.032
 big-0.031
!-0.030
,-0.027
Token it
Token ID340
Feature activation+1.95e+00

Pos logit contributions
esty+0.057
 habitats+0.057
ind+0.054
ities+0.054
amb+0.053
Neg logit contributions
 the-0.024
 it-0.024
 big-0.023
!-0.021
 bite-0.021
Token wasn
Token ID2492
Feature activation+2.21e-01

Pos logit contributions
 big+0.027
 it+0.026
 the+0.026
!+0.026
 bite+0.025
Neg logit contributions
 habitats-0.094
esty-0.091
ind-0.087
ities-0.087
 habitat-0.087
Token't
Token ID470
Feature activation+3.96e+00

Pos logit contributions
 big+0.001
 it+0.001
 the+0.001
!+0.001
 bite+0.001
Neg logit contributions
 habitats-0.012
esty-0.012
 habitat-0.012
asured-0.012
ind-0.012
Token long
Token ID890
Feature activation+7.78e+00

Pos logit contributions
 big+0.069
 bite+0.069
 it+0.068
!+0.066
 the+0.066
Neg logit contributions
 habitats-0.161
esty-0.156
asured-0.152
 habitat-0.151
ried-0.150
Token before
Token ID878
Feature activation+2.91e+00

Pos logit contributions
!+0.115
 big+0.113
 it+0.112
 the+0.111
 bite+0.109
Neg logit contributions
 habitats-0.351
esty-0.347
ind-0.330
 habitat-0.328
asured-0.325
Token the
Token ID262
Feature activation+1.31e+00

Pos logit contributions
 big+0.023
 bite+0.021
 it+0.021
 the+0.020
!+0.020
Neg logit contributions
 habitats-0.150
esty-0.145
 habitat-0.142
ind-0.140
asured-0.140
Token little
Token ID1310
Feature activation+4.17e-01

Pos logit contributions
 big+0.025
 it+0.024
 the+0.024
 bite+0.024
!+0.024
Neg logit contributions
 habitats-0.054
esty-0.053
asured-0.051
 habitat-0.051
ried-0.050
Token girl
Token ID2576
Feature activation+9.23e-01

Pos logit contributions
 big+0.009
 it+0.009
!+0.008
 the+0.008
 bite+0.008
Neg logit contributions
 habitats-0.016
esty-0.015
 habitat-0.015
ind-0.015
asured-0.015
Token grew
Token ID6348
Feature activation-2.27e+00

Pos logit contributions
!+0.012
 it+0.011
 the+0.011
 big+0.011
 bite+0.010
Neg logit contributions
 habitats-0.045
esty-0.044
 habitat-0.042
ind-0.042
asured-0.042
Token because
Token ID780
Feature activation-2.16e+00

Pos logit contributions
 big+0.018
 bite+0.018
 it+0.016
 the+0.016
 Emily+0.016
Neg logit contributions
 habitats-0.084
esty-0.081
 habitat-0.080
asured-0.079
ind-0.078
Token he
Token ID339
Feature activation+7.77e-02

Pos logit contributions
 habitats+0.117
esty+0.116
 habitat+0.112
ried+0.110
ind+0.110
Neg logit contributions
 it-0.015
 the-0.014
 big-0.014
!-0.012
 bite-0.011
Token didn
Token ID1422
Feature activation+2.32e+00

Pos logit contributions
 big+0.001
 it+0.001
 the+0.001
!+0.001
 bite+0.001
Neg logit contributions
 habitats-0.003
esty-0.003
 habitat-0.003
ind-0.003
asured-0.003
Tokenâ
Token ID22940
Feature activation-1.35e+00

Pos logit contributions
 Emily+0.036
 Amy+0.033
 tasted+0.033
 bite+0.032
 Lisa+0.031
Neg logit contributions
ind-0.087
ace-0.085
amb-0.084
uzz-0.083
asured-0.082
Token
Token ID26391
Feature activation+2.61e+00

Pos logit contributions
 habitats+0.053
 habitat+0.049
ishes+0.049
ind+0.048
esty+0.048
Neg logit contributions
!-0.029
 bite-0.028
 the-0.028
 -0.027
 it-0.027
Token
Token ID8151
Feature activation+7.76e+00

Pos logit contributions
 big+0.072
 bite+0.071
 it+0.071
 the+0.071
!+0.070
Neg logit contributions
 habitats-0.083
esty-0.079
 habitat-0.076
asured-0.074
ind-0.073
Tokent
Token ID83
Feature activation+3.10e+00

Pos logit contributions
 bite+0.138
 noise+0.137
 big+0.136
 it+0.134
 the+0.130
Neg logit contributions
 habitats-0.301
esty-0.299
ried-0.293
ind-0.289
years-0.286
Token know
Token ID760
Feature activation+1.25e+00

Pos logit contributions
 big+0.060
 it+0.058
!+0.057
 bite+0.057
 the+0.057
Neg logit contributions
 habitats-0.119
esty-0.116
asured-0.111
 habitat-0.111
ried-0.111
Token who
Token ID508
Feature activation-5.31e-01

Pos logit contributions
 it+0.027
 the+0.027
!+0.026
 big+0.026
,+0.024
Neg logit contributions
 habitats-0.050
esty-0.050
 habitat-0.048
ried-0.047
asured-0.046
Token she
Token ID673
Feature activation-5.57e-01

Pos logit contributions
 habitats+0.026
esty+0.026
 habitat+0.025
asured+0.025
ind+0.025
Neg logit contributions
 it-0.006
 the-0.006
 big-0.006
!-0.006
 bite-0.005
Token was
Token ID373
Feature activation+3.83e-01

Pos logit contributions
 habitats+0.030
esty+0.030
 habitat+0.029
ishes+0.028
ind+0.028
Neg logit contributions
!-0.004
 it-0.003
 big-0.003
 the-0.003
 bite-0.003
Token!
Token ID0
Feature activation+3.46e-01

Pos logit contributions
 it+0.002
!+0.002
 the+0.002
 big+0.001
 bite+0.001
Neg logit contributions
 habitats-0.006
esty-0.005
 habitat-0.005
ind-0.005
asured-0.005
Token Why
Token ID4162
Feature activation+2.51e+00

Pos logit contributions
 +0.007
 it+0.007
 the+0.007
!+0.007
 Emily+0.007
Neg logit contributions
 habitats-0.015
ind-0.014
 habitat-0.013
asured-0.013
esty-0.013
Token don
Token ID836
Feature activation+2.23e+00

Pos logit contributions
 big+0.070
 bite+0.069
 it+0.068
!+0.068
 the+0.067
Neg logit contributions
 habitats-0.077
esty-0.074
 habitat-0.070
ities-0.069
asured-0.069
Tokenâ
Token ID22940
Feature activation-9.31e-01

Pos logit contributions
 bite+0.038
 Amy+0.037
 Emily+0.036
 Lisa+0.035
 tasted+0.032
Neg logit contributions
amb-0.076
ind-0.074
oll-0.074
aying-0.073
beh-0.071
Token
Token ID26391
Feature activation+2.67e+00

Pos logit contributions
 habitats+0.034
esty+0.032
 habitat+0.032
ind+0.031
asured+0.031
Neg logit contributions
 big-0.022
 it-0.022
 the-0.021
!-0.021
 bite-0.021
Token
Token ID8151
Feature activation+7.74e+00

Pos logit contributions
 big+0.052
 it+0.051
 the+0.051
!+0.051
 bite+0.050
Neg logit contributions
 habitats-0.107
esty-0.103
 habitat-0.100
ind-0.099
asured-0.099
Tokent
Token ID83
Feature activation+3.50e+00

Pos logit contributions
 noise+0.141
 big+0.140
 swirl+0.138
 bite+0.136
 it+0.134
Neg logit contributions
 habitats-0.296
esty-0.291
ried-0.285
years-0.277
ind-0.277
Token you
Token ID345
Feature activation-1.77e-01

Pos logit contributions
 big+0.077
 bite+0.075
 it+0.075
 the+0.074
!+0.074
Neg logit contributions
 habitats-0.129
esty-0.125
 habitat-0.120
ind-0.119
asured-0.118
Token make
Token ID787
Feature activation+4.02e+00

Pos logit contributions
 habitats+0.008
esty+0.007
 habitat+0.007
asured+0.007
ind+0.007
Neg logit contributions
 big-0.003
 it-0.003
 the-0.003
!-0.003
 bite-0.003
Token something
Token ID1223
Feature activation+2.60e+00

Pos logit contributions
 bite+0.062
 yell+0.058
 Amy+0.057
 big+0.055
 Emily+0.054
Neg logit contributions
 habitats-0.165
esty-0.164
ind-0.163
ishes-0.160
 habitat-0.159
Token special
Token ID2041
Feature activation-1.34e+00

Pos logit contributions
 +0.044
 bite+0.043
 Emily+0.043
 Amy+0.042
 yell+0.042
Neg logit contributions
 habitats-0.098
esty-0.098
asured-0.097
ind-0.095
ishes-0.095
Token three
Token ID1115
Feature activation+3.65e+00

Pos logit contributions
 habitats+0.022
esty+0.020
 habitat+0.020
ind+0.020
asured+0.019
Neg logit contributions
 big-0.011
 the-0.011
 it-0.010
!-0.010
 bite-0.010
Token and
Token ID290
Feature activation-5.03e-01

Pos logit contributions
 big+0.095
!+0.095
 it+0.094
 the+0.094
 bite+0.091
Neg logit contributions
 habitats-0.124
esty-0.119
 habitat-0.114
ried-0.112
asured-0.111
Token didn
Token ID1422
Feature activation+2.07e+00

Pos logit contributions
 habitats+0.025
esty+0.024
ind+0.023
 habitat+0.023
ishes+0.023
Neg logit contributions
 the-0.008
 it-0.008
 big-0.007
!-0.007
,-0.006
Tokenâ
Token ID22940
Feature activation-1.38e+00

Pos logit contributions
 Emily+0.024
 bite+0.023
 Amy+0.021
 swirl+0.021
 big+0.021
Neg logit contributions
ind-0.086
ace-0.085
esty-0.084
asured-0.083
uries-0.083
Token
Token ID26391
Feature activation+2.70e+00

Pos logit contributions
 habitats+0.054
 habitat+0.050
esty+0.049
ind+0.049
ishes+0.049
Neg logit contributions
!-0.029
 bite-0.029
 the-0.029
 -0.028
 it-0.028
Token
Token ID8151
Feature activation+7.73e+00

Pos logit contributions
 big+0.068
 it+0.067
!+0.067
 the+0.066
 bite+0.065
Neg logit contributions
 habitats-0.093
esty-0.089
 habitat-0.086
ind-0.084
asured-0.084
Tokent
Token ID83
Feature activation+2.92e+00

Pos logit contributions
 big+0.126
 bite+0.125
 it+0.123
 noise+0.120
 the+0.120
Neg logit contributions
 habitats-0.319
esty-0.314
ried-0.304
ind-0.301
 habitat-0.299
Token have
Token ID423
Feature activation+3.16e+00

Pos logit contributions
 big+0.055
 it+0.054
 bite+0.053
 the+0.052
!+0.052
Neg logit contributions
 habitats-0.116
esty-0.113
asured-0.108
 habitat-0.108
ried-0.108
Token any
Token ID597
Feature activation-3.04e-01

Pos logit contributions
 big+0.061
 it+0.059
 bite+0.059
!+0.059
 the+0.058
Neg logit contributions
 habitats-0.127
esty-0.122
 habitat-0.118
asured-0.116
ind-0.115
Token money
Token ID1637
Feature activation-1.35e+00

Pos logit contributions
 habitats+0.011
esty+0.011
 habitat+0.011
ind+0.010
ried+0.010
Neg logit contributions
 big-0.007
!-0.007
 it-0.007
 the-0.007
 bite-0.007
Token either
Token ID2035
Feature activation-2.35e+00

Pos logit contributions
 habitats+0.074
esty+0.072
 habitat+0.070
ind+0.070
asured+0.068
Neg logit contributions
!-0.011
 the-0.010
 it-0.010
 big-0.009
,-0.008
Token the
Token ID262
Feature activation-3.88e-01

Pos logit contributions
 habitats+0.015
esty+0.015
 habitat+0.015
ind+0.015
asured+0.015
Neg logit contributions
 bite-0.002
 big-0.002
!-0.001
 it-0.001
 Emily-0.001
Token tree
Token ID5509
Feature activation-5.97e-01

Pos logit contributions
 habitats+0.019
esty+0.019
ried+0.018
asured+0.018
amb+0.018
Neg logit contributions
 big-0.006
 bite-0.005
 the-0.005
!-0.005
 it-0.005
Token to
Token ID284
Feature activation-3.07e-01

Pos logit contributions
 habitats+0.028
esty+0.027
 habitat+0.027
asured+0.026
ind+0.026
Neg logit contributions
 big-0.007
 it-0.007
 the-0.007
 bite-0.007
!-0.007
Token its
Token ID663
Feature activation-3.79e+00

Pos logit contributions
 habitats+0.015
esty+0.014
 habitat+0.014
asured+0.014
ind+0.014
Neg logit contributions
 big-0.004
 it-0.003
 bite-0.003
!-0.003
 the-0.003
Token once
Token ID1752
Feature activation-1.74e+00

Pos logit contributions
esty+0.134
 habitats+0.131
ried+0.128
ities+0.127
isher+0.127
Neg logit contributions
 big-0.103
 the-0.096
!-0.095
 bite-0.091
 long-0.089
Token former
Token ID1966
Feature activation-9.49e+00

Pos logit contributions
 habitats+0.078
esty+0.075
 habitat+0.073
ind+0.072
asured+0.072
Neg logit contributions
 big-0.026
 it-0.026
 the-0.025
!-0.025
 bite-0.025
Token beauty
Token ID8737
Feature activation-3.93e+00

Pos logit contributions
esty+0.253
ried+0.233
 habitats+0.231
amb+0.229
years+0.228
Neg logit contributions
 big-0.319
!-0.316
 the-0.308
,-0.305
 it-0.296
Token.
Token ID13
Feature activation-9.75e-01

Pos logit contributions
 habitats+0.202
esty+0.196
 habitat+0.191
asured+0.188
ind+0.188
Neg logit contributions
 big-0.032
 it-0.031
 the-0.029
 bite-0.029
!-0.029
Token He
Token ID679
Feature activation-1.59e+00

Pos logit contributions
 habitats+0.044
esty+0.044
ind+0.043
ishes+0.043
ities+0.042
Neg logit contributions
 -0.016
 it-0.015
 the-0.015
!-0.014
 big-0.014
Token was
Token ID373
Feature activation+5.46e-02

Pos logit contributions
 habitats+0.079
esty+0.077
 habitat+0.075
ities+0.075
ind+0.074
Neg logit contributions
 it-0.019
 the-0.019
 big-0.018
!-0.018
 bite-0.016
Token happy
Token ID3772
Feature activation-4.77e-01

Pos logit contributions
 the+0.001
 big+0.001
 it+0.001
!+0.001
 +0.001
Neg logit contributions
 habitats-0.003
esty-0.003
 habitat-0.003
ind-0.002
ities-0.002
Token did
Token ID750
Feature activation+2.97e+00

Pos logit contributions
 habitats+0.085
esty+0.082
 habitat+0.080
ities+0.078
ind+0.077
Neg logit contributions
 it-0.035
 big-0.034
 the-0.034
!-0.033
 bite-0.031
Token something
Token ID1223
Feature activation+2.01e+00

Pos logit contributions
 bite+0.049
 Emily+0.049
 Luke+0.045
 yell+0.045
 Amy+0.044
Neg logit contributions
 habitats-0.123
 habitat-0.115
esty-0.114
asured-0.114
 belongings-0.114
Token special
Token ID2041
Feature activation-2.11e+00

Pos logit contributions
 bite+0.039
 Luke+0.038
 Emily+0.037
 +0.037
 yell+0.036
Neg logit contributions
 habitats-0.073
esty-0.072
 belongings-0.071
ind-0.070
asured-0.069
Token.
Token ID13
Feature activation-1.16e+00

Pos logit contributions
 habitats+0.111
esty+0.108
 habitat+0.106
ind+0.105
ried+0.104
Neg logit contributions
!-0.018
 it-0.018
 the-0.017
 big-0.016
 bite-0.014
Token On
Token ID1550
Feature activation-2.40e+00

Pos logit contributions
 habitats+0.050
esty+0.048
 habitat+0.047
asured+0.046
ind+0.045
Neg logit contributions
 big-0.019
 bite-0.019
 it-0.018
!-0.018
 the-0.018
Token Wed
Token ID3300
Feature activation-9.47e+00

Pos logit contributions
 habitats+0.109
 habitat+0.103
esty+0.102
ind+0.102
asured+0.101
Neg logit contributions
 bite-0.030
 yell-0.027
 big-0.027
!-0.027
 noise-0.026
Tokennes
Token ID2516
Feature activation-3.61e+00

Pos logit contributions
 habitats+0.241
esty+0.229
 habitat+0.220
ishes+0.218
 belongings+0.210
Neg logit contributions
 it-0.320
 big-0.317
 the-0.317
 bite-0.314
!-0.313
Tokendays
Token ID12545
Feature activation-6.17e+00

Pos logit contributions
 habitats+0.084
esty+0.082
ities+0.076
 habitat+0.074
ishes+0.071
Neg logit contributions
!-0.132
 it-0.130
 the-0.130
 big-0.129
 bite-0.129
Token she
Token ID673
Feature activation-3.58e+00

Pos logit contributions
esty+0.350
 habitats+0.349
ind+0.337
ities+0.337
 habitat+0.335
Neg logit contributions
,-0.038
 the-0.037
!-0.032
 it-0.027
 big-0.026
Token liked
Token ID8288
Feature activation-2.73e+00

Pos logit contributions
 habitats+0.158
esty+0.155
ities+0.151
 habitat+0.149
ried+0.147
Neg logit contributions
 the-0.061
 big-0.061
 it-0.060
!-0.057
,-0.055
Token to
Token ID284
Feature activation-3.84e-01

Pos logit contributions
 habitats+0.144
esty+0.141
 habitat+0.137
ried+0.135
asured+0.135
Neg logit contributions
 big-0.020
 it-0.020
 the-0.019
!-0.018
 bite-0.017
TokenThere
Token ID1858
Feature activation+3.32e-01

Pos logit contributions
 habitats+0.174
esty+0.166
 habitat+0.165
uzz+0.159
ried+0.159
Neg logit contributions
 bite-0.066
 yell-0.063
 noise-0.059
 loud-0.059
 break-0.058
Token was
Token ID373
Feature activation-1.63e+00

Pos logit contributions
 bite+0.004
 big+0.004
 yell+0.004
 Emily+0.004
!+0.004
Neg logit contributions
 habitats-0.015
esty-0.015
 habitat-0.015
ind-0.014
asured-0.014
Token a
Token ID257
Feature activation+3.06e+00

Pos logit contributions
 habitats+0.079
 habitat+0.076
asured+0.075
esty+0.075
ind+0.074
Neg logit contributions
 bite-0.016
!-0.014
 yell-0.014
 it-0.014
 big-0.013
Token gr
Token ID1036
Feature activation-2.44e+00

Pos logit contributions
 it+0.049
 bite+0.048
!+0.047
 the+0.046
 Emily+0.045
Neg logit contributions
 habitats-0.127
esty-0.125
ind-0.122
 habitat-0.118
asured-0.117
Tokenumpy
Token ID32152
Feature activation-6.86e-01

Pos logit contributions
 habitats+0.046
esty+0.042
ind+0.040
 habitat+0.039
asured+0.038
Neg logit contributions
 it-0.100
 the-0.098
 bite-0.098
 big-0.098
!-0.096
Token ranc
Token ID32205
Feature activation-9.39e+00

Pos logit contributions
 habitats+0.028
esty+0.027
 habitat+0.026
asured+0.026
ried+0.026
Neg logit contributions
 big-0.013
 it-0.013
 the-0.013
!-0.012
 bite-0.012
Tokenher
Token ID372
Feature activation-2.49e+00

Pos logit contributions
 habitats+0.283
esty+0.268
 habitat+0.258
ind+0.258
asured+0.253
Neg logit contributions
 it-0.273
 big-0.273
 bite-0.271
 the-0.269
!-0.267
Token named
Token ID3706
Feature activation-3.45e+00

Pos logit contributions
 habitats+0.121
esty+0.120
asured+0.116
 habitat+0.116
ities+0.113
Neg logit contributions
 the-0.031
!-0.031
,-0.028
 big-0.028
 it-0.027
Token Tom
Token ID4186
Feature activation+4.14e-01

Pos logit contributions
 habitats+0.127
esty+0.124
asured+0.123
isher+0.121
 habitat+0.119
Neg logit contributions
 the-0.090
 it-0.088
 big-0.086
!-0.083
 -0.081
Token.
Token ID13
Feature activation-1.23e+00

Pos logit contributions
!+0.003
 the+0.003
 it+0.003
 big+0.002
,+0.002
Neg logit contributions
 habitats-0.023
esty-0.022
 habitat-0.022
ried-0.021
ities-0.021
Token Tom
Token ID4186
Feature activation+1.31e+00

Pos logit contributions
 habitats+0.050
esty+0.046
 habitat+0.046
asured+0.044
ities+0.043
Neg logit contributions
 it-0.023
 bite-0.023
 big-0.023
!-0.023
 Emily-0.023
Token to
Token ID284
Feature activation+1.54e+00

Pos logit contributions
 habitats+0.023
esty+0.022
 habitat+0.022
ind+0.021
asured+0.021
Neg logit contributions
 big-0.006
 it-0.006
 the-0.006
!-0.005
 bite-0.005
Token work
Token ID670
Feature activation+1.13e+00

Pos logit contributions
 big+0.022
 bite+0.022
!+0.022
 it+0.021
 Emily+0.021
Neg logit contributions
 habitats-0.066
esty-0.063
 habitat-0.062
asured-0.061
ind-0.061
Token cleaning
Token ID12724
Feature activation-1.91e+00

Pos logit contributions
 bite+0.013
 big+0.012
 Emily+0.011
 yell+0.011
 it+0.011
Neg logit contributions
 habitats-0.051
esty-0.049
asured-0.049
ried-0.048
ind-0.048
Token the
Token ID262
Feature activation-1.28e+00

Pos logit contributions
 habitats+0.089
esty+0.087
ind+0.085
 habitat+0.084
asured+0.084
Neg logit contributions
 bite-0.022
 big-0.021
 it-0.019
!-0.019
 yell-0.019
Token re
Token ID302
Feature activation-7.02e+00

Pos logit contributions
uzz+0.048
 habitats+0.045
years+0.044
beh+0.044
ried+0.044
Neg logit contributions
 big-0.040
 the-0.035
 noise-0.033
 long-0.033
 loud-0.033
Tokenind
Token ID521
Feature activation-9.23e+00

Pos logit contributions
ities+0.103
onel+0.102
esty+0.101
 habitats+0.099
ried+0.098
Neg logit contributions
!-0.285
 it-0.272
s-0.269
 big-0.269
!"-0.263
Tokeneer
Token ID28153
Feature activation-2.69e+00

Pos logit contributions
esty+0.195
onel+0.186
years+0.186
 habitats+0.180
beh+0.177
Neg logit contributions
 the-0.370
 big-0.354
,-0.354
!-0.350
 it-0.344
Token.
Token ID13
Feature activation-2.09e-01

Pos logit contributions
 habitats+0.139
esty+0.132
 habitat+0.129
ind+0.128
years+0.126
Neg logit contributions
!-0.028
 the-0.028
 big-0.028
 it-0.027
 bite-0.024
Token With
Token ID2080
Feature activation+1.16e+00

Pos logit contributions
 habitats+0.009
esty+0.009
ind+0.009
asured+0.009
 habitat+0.009
Neg logit contributions
 -0.004
 the-0.004
 it-0.004
 big-0.003
 Emily-0.003
Token her
Token ID607
Feature activation-3.92e+00

Pos logit contributions
 the+0.011
 big+0.011
 it+0.011
!+0.010
 bite+0.009
Neg logit contributions
 habitats-0.060
esty-0.059
 habitat-0.057
ried-0.056
ind-0.056
Token help
Token ID1037
Feature activation+2.23e+00

Pos logit contributions
 habitats+0.169
esty+0.163
 habitat+0.159
ried+0.157
asured+0.155
Neg logit contributions
 big-0.076
 the-0.072
 it-0.070
!-0.069
 bite-0.068
Token He
Token ID679
Feature activation-1.68e+00

Pos logit contributions
 habitats+0.039
esty+0.039
 habitat+0.037
ried+0.036
amb+0.035
Neg logit contributions
 bite-0.019
 swirl-0.017
 big-0.017
 bubble-0.016
 jugg-0.016
Token slowly
Token ID6364
Feature activation+1.11e+00

Pos logit contributions
 habitats+0.075
esty+0.073
 habitat+0.071
ind+0.070
ities+0.069
Neg logit contributions
 big-0.026
 it-0.026
 the-0.025
!-0.025
 bite-0.024
Token walked
Token ID6807
Feature activation-3.16e+00

Pos logit contributions
 big+0.021
 it+0.021
 the+0.021
!+0.021
 bite+0.021
Neg logit contributions
 habitats-0.045
esty-0.043
 habitat-0.042
ind-0.041
asured-0.041
Token towards
Token ID3371
Feature activation-2.04e+00

Pos logit contributions
 habitats+0.124
esty+0.122
ried+0.117
 habitat+0.116
ind+0.115
Neg logit contributions
 bite-0.060
 big-0.060
 it-0.058
!-0.058
 the-0.056
Token the
Token ID262
Feature activation-3.38e-02

Pos logit contributions
 habitats+0.113
esty+0.109
 habitat+0.108
ind+0.107
ried+0.106
Neg logit contributions
 big-0.006
!-0.005
 bite-0.005
 Emily-0.004
 it-0.003
Token Gor
Token ID19097
Feature activation-9.18e+00

Pos logit contributions
 habitats+0.001
esty+0.001
 habitat+0.001
ind+0.001
asured+0.001
Neg logit contributions
 Emily-0.001
 it-0.001
!-0.001
 the-0.001
 bite-0.001
Tokenilla
Token ID5049
Feature activation-2.18e+00

Pos logit contributions
 habitats+0.173
esty+0.163
 habitat+0.148
asured+0.144
ried+0.141
Neg logit contributions
!-0.371
 big-0.368
 it-0.365
 bite-0.364
 the-0.364
Token,
Token ID11
Feature activation+1.27e+00

Pos logit contributions
 habitats+0.106
esty+0.103
 habitat+0.100
asured+0.099
ind+0.099
Neg logit contributions
 big-0.023
 it-0.023
 the-0.022
 bite-0.022
!-0.022
Token and
Token ID290
Feature activation-9.46e-01

Pos logit contributions
 big+0.021
 it+0.020
 the+0.020
!+0.020
 bite+0.020
Neg logit contributions
 habitats-0.055
esty-0.053
 habitat-0.051
ind-0.051
asured-0.051
Token it
Token ID340
Feature activation+1.48e+00

Pos logit contributions
 habitats+0.043
esty+0.042
 habitat+0.041
asured+0.040
ind+0.040
Neg logit contributions
 big-0.013
 it-0.013
 the-0.013
!-0.013
 bite-0.012
Token made
Token ID925
Feature activation+2.41e+00

Pos logit contributions
 big+0.018
 it+0.018
 bite+0.018
!+0.017
 the+0.017
Neg logit contributions
 habitats-0.069
esty-0.067
 habitat-0.065
ried-0.064
asured-0.064
Token regretted
Token ID44652
Feature activation-2.00e-01

Pos logit contributions
 habitats+0.069
esty+0.067
 habitat+0.065
ities+0.064
ind+0.064
Neg logit contributions
 it-0.024
 big-0.024
 the-0.024
!-0.023
 bite-0.023
Token getting
Token ID1972
Feature activation-1.88e+00

Pos logit contributions
 habitats+0.008
esty+0.008
 habitat+0.007
asured+0.007
ind+0.007
Neg logit contributions
 big-0.004
 bite-0.004
 Emily-0.003
!-0.003
 it-0.003
Token close
Token ID1969
Feature activation+5.43e-01

Pos logit contributions
 habitats+0.088
esty+0.083
 habitat+0.081
ities+0.080
ind+0.079
Neg logit contributions
 the-0.031
 big-0.030
 it-0.030
!-0.027
 bite-0.026
Token to
Token ID284
Feature activation+4.77e-01

Pos logit contributions
 the+0.004
!+0.004
 big+0.004
 it+0.004
,+0.004
Neg logit contributions
esty-0.029
 habitats-0.029
uries-0.028
ities-0.027
 habitat-0.027
Token the
Token ID262
Feature activation+3.58e-01

Pos logit contributions
 the+0.002
 it+0.002
 big+0.002
!+0.002
 bite+0.002
Neg logit contributions
 habitats-0.027
esty-0.026
 habitat-0.025
ried-0.025
asured-0.025
Token Gor
Token ID19097
Feature activation-8.99e+00

Pos logit contributions
 big+0.009
 it+0.009
 the+0.009
 bite+0.009
!+0.009
Neg logit contributions
 habitats-0.013
esty-0.012
 habitat-0.012
asured-0.011
ind-0.011
Tokenilla
Token ID5049
Feature activation-1.85e+00

Pos logit contributions
 habitats+0.165
esty+0.157
 habitat+0.142
asured+0.137
ried+0.136
Neg logit contributions
!-0.370
 big-0.362
 bite-0.360
 the-0.360
 it-0.360
Token and
Token ID290
Feature activation-5.03e-01

Pos logit contributions
 habitats+0.094
esty+0.091
 habitat+0.089
asured+0.088
ind+0.088
Neg logit contributions
 big-0.016
 it-0.016
!-0.015
 the-0.015
 bite-0.015
Token not
Token ID407
Feature activation+1.63e+00

Pos logit contributions
 habitats+0.022
esty+0.021
 habitat+0.021
ind+0.021
asured+0.020
Neg logit contributions
 it-0.008
 the-0.008
 big-0.008
!-0.008
 bite-0.008
Token listening
Token ID8680
Feature activation+1.61e+00

Pos logit contributions
 big+0.032
 the+0.032
 it+0.032
!+0.032
 bite+0.032
Neg logit contributions
 habitats-0.067
esty-0.065
 habitat-0.062
ind-0.062
ishes-0.061
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 big+0.014
 bite+0.014
 it+0.013
 Emily+0.013
 the+0.013
Neg logit contributions
 habitats-0.080
esty-0.077
 habitat-0.076
asured-0.074
ind-0.074
Token a
Token ID257
Feature activation+2.58e+00

Pos logit contributions
 habitats+0.055
esty+0.053
 habitat+0.051
ind+0.050
asured+0.050
Neg logit contributions
 big-0.034
 the-0.034
 it-0.034
!-0.033
 bite-0.032
Token big
Token ID1263
Feature activation+3.45e+00

Pos logit contributions
!+0.037
 it+0.037
 Emily+0.037
 Amy+0.036
 Luke+0.036
Neg logit contributions
 habitats-0.107
esty-0.103
ind-0.102
 habitat-0.101
 belongings-0.101
Token box
Token ID3091
Feature activation+1.07e+00

Pos logit contributions
 it+0.075
 the+0.073
!+0.072
 +0.072
 Emily+0.072
Neg logit contributions
 habitats-0.124
esty-0.118
 habitat-0.116
ind-0.114
ishes-0.112
Token of
Token ID286
Feature activation-1.12e+00

Pos logit contributions
 it+0.014
 big+0.014
 +0.014
 Emily+0.014
 bite+0.014
Neg logit contributions
 habitats-0.046
asured-0.045
esty-0.045
ried-0.044
 habitat-0.044
Token ch
Token ID442
Feature activation-1.92e+00

Pos logit contributions
 habitats+0.040
esty+0.039
 habitat+0.037
ried+0.037
asured+0.036
Neg logit contributions
 big-0.028
 it-0.027
 the-0.027
!-0.026
 bite-0.026
Tokenocol
Token ID4668
Feature activation-8.92e+00

Pos logit contributions
 habitats+0.050
esty+0.050
uries+0.047
 habitat+0.045
belt+0.044
Neg logit contributions
 big-0.064
!-0.061
 it-0.061
 the-0.060
 bite-0.060
Tokenates
Token ID689
Feature activation-1.82e+00

Pos logit contributions
 habitats+0.165
esty+0.162
 habitat+0.147
uries+0.138
 property+0.134
Neg logit contributions
!-0.346
 bite-0.335
,-0.332
on-0.327
 big-0.327
Token to
Token ID284
Feature activation-4.56e-01

Pos logit contributions
 habitats+0.089
esty+0.087
 habitat+0.085
ind+0.083
asured+0.083
Neg logit contributions
 it-0.020
 big-0.020
!-0.019
 the-0.019
 bite-0.019
Token share
Token ID2648
Feature activation-1.83e-02

Pos logit contributions
 habitats+0.020
esty+0.019
 habitat+0.019
ind+0.018
asured+0.018
Neg logit contributions
 big-0.008
 it-0.007
 the-0.007
 bite-0.007
!-0.007
Token with
Token ID351
Feature activation+1.65e-01

Pos logit contributions
 habitats+0.001
esty+0.001
 habitat+0.001
ind+0.001
asured+0.001
Neg logit contributions
 big-0.000
 bite-0.000
!-0.000
 Emily-0.000
 it-0.000
Token the
Token ID262
Feature activation+1.04e+00

Pos logit contributions
 big+0.002
 bite+0.002
!+0.002
 it+0.002
 the+0.002
Neg logit contributions
 habitats-0.007
esty-0.007
 habitat-0.007
ind-0.007
asured-0.007
Token wanted
Token ID2227
Feature activation-2.50e+00

Pos logit contributions
 habitats+0.023
esty+0.022
 habitat+0.021
ind+0.021
asured+0.021
Neg logit contributions
 big-0.010
 it-0.010
 the-0.009
!-0.009
 bite-0.009
Token to
Token ID284
Feature activation+7.45e-01

Pos logit contributions
 habitats+0.125
esty+0.120
 habitat+0.118
ind+0.116
asured+0.116
Neg logit contributions
!-0.022
 big-0.022
 bite-0.021
 it-0.021
 -0.019
Token learn
Token ID2193
Feature activation-2.73e-02

Pos logit contributions
 Emily+0.013
 it+0.013
!+0.013
 big+0.012
 +0.012
Neg logit contributions
 habitats-0.029
 habitat-0.028
esty-0.027
asured-0.027
ind-0.027
Token more
Token ID517
Feature activation-7.81e-01

Pos logit contributions
 habitats+0.001
esty+0.001
ind+0.001
 habitat+0.001
asured+0.001
Neg logit contributions
 bite-0.000
 yell-0.000
 Emily-0.000
 big-0.000
!-0.000
Token about
Token ID546
Feature activation-5.97e-01

Pos logit contributions
 habitats+0.033
esty+0.032
 habitat+0.031
asured+0.030
ind+0.030
Neg logit contributions
 big-0.014
 it-0.013
 bite-0.013
 the-0.013
!-0.013
Token veterin
Token ID33647
Feature activation-8.91e+00

Pos logit contributions
 habitats+0.029
esty+0.028
 habitat+0.027
ried+0.027
asured+0.027
Neg logit contributions
 big-0.007
 the-0.007
 it-0.007
!-0.007
 bite-0.006
Tokenarians
Token ID13517
Feature activation-1.30e+00

Pos logit contributions
 habitats+0.193
isher+0.185
asured+0.175
uzz+0.175
esty+0.174
Neg logit contributions
!-0.308
't-0.290
 shout-0.288
 yell-0.288
,-0.287
Token.
Token ID13
Feature activation+3.36e-01

Pos logit contributions
 habitats+0.068
esty+0.066
 habitat+0.065
ind+0.065
ried+0.063
Neg logit contributions
 the-0.013
!-0.012
 it-0.012
,-0.012
 big-0.010
Token She
Token ID1375
Feature activation-6.36e-01

Pos logit contributions
 +0.005
 the+0.005
 big+0.004
 Amy+0.004
 it+0.004
Neg logit contributions
asured-0.016
ried-0.016
ind-0.016
isher-0.016
 habitats-0.016
Token decided
Token ID3066
Feature activation-3.85e+00

Pos logit contributions
 habitats+0.029
esty+0.028
 habitat+0.027
ities+0.027
ind+0.027
Neg logit contributions
 big-0.010
 it-0.010
 the-0.010
!-0.009
 bite-0.009
Token to
Token ID284
Feature activation+1.01e-01

Pos logit contributions
 habitats+0.203
esty+0.197
 habitat+0.192
asured+0.190
ind+0.190
Neg logit contributions
 big-0.028
 it-0.028
 the-0.027
!-0.026
 bite-0.025
Token to
Token ID284
Feature activation-1.66e+00

Pos logit contributions
 habitats+0.149
esty+0.145
 habitat+0.141
years+0.140
ind+0.139
Neg logit contributions
 the-0.030
 it-0.030
!-0.030
 big-0.028
,-0.027
Token be
Token ID307
Feature activation-3.79e+00

Pos logit contributions
 habitats+0.070
esty+0.068
 habitat+0.066
asured+0.064
ind+0.064
Neg logit contributions
 big-0.029
 it-0.029
 the-0.028
!-0.028
 bite-0.028
Token restored
Token ID15032
Feature activation-5.59e-01

Pos logit contributions
 habitats+0.173
esty+0.165
amb+0.161
 habitat+0.160
ind+0.156
Neg logit contributions
 big-0.068
 the-0.067
!-0.064
 it-0.063
,-0.058
Token to
Token ID284
Feature activation-1.25e+00

Pos logit contributions
 habitats+0.028
esty+0.027
 habitat+0.026
years+0.026
ried+0.026
Neg logit contributions
!-0.007
 it-0.007
 the-0.007
 big-0.006
,-0.006
Token its
Token ID663
Feature activation-4.23e+00

Pos logit contributions
 habitats+0.059
esty+0.059
 habitat+0.056
ried+0.055
amb+0.055
Neg logit contributions
 the-0.018
 it-0.018
 big-0.017
!-0.016
 bite-0.015
Token former
Token ID1966
Feature activation-8.85e+00

Pos logit contributions
esty+0.133
amb+0.131
years+0.131
isher+0.130
ried+0.130
Neg logit contributions
 big-0.131
 the-0.121
!-0.118
 long-0.117
,-0.116
Token glory
Token ID13476
Feature activation-2.64e+00

Pos logit contributions
esty+0.231
years+0.220
amb+0.219
 habitats+0.218
ried+0.216
Neg logit contributions
 big-0.297
!-0.290
 the-0.283
,-0.282
 it-0.275
Token."
Token ID526
Feature activation+1.40e+00

Pos logit contributions
 habitats+0.133
esty+0.129
 habitat+0.125
years+0.122
ried+0.122
Neg logit contributions
!-0.031
 the-0.031
 it-0.031
 big-0.030
,-0.027
Token
Token ID220
Feature activation+2.57e+00

Pos logit contributions
 bite+0.013
 big+0.012
 loud+0.010
!+0.010
 noise+0.010
Neg logit contributions
 habitats-0.068
esty-0.067
 habitat-0.065
ried-0.064
amb-0.064
Token
Token ID198
Feature activation+1.10e+00

Pos logit contributions
 bite+0.023
 big+0.021
 it+0.019
!+0.019
 the+0.019
Neg logit contributions
 habitats-0.128
esty-0.125
 habitat-0.122
ried-0.120
asured-0.120
Token
Token ID198
Feature activation+7.43e-01

Pos logit contributions
 bite+0.018
 noise+0.016
 yell+0.016
 video+0.016
 loud+0.015
Neg logit contributions
uzz-0.043
 habitats-0.043
esty-0.042
amb-0.042
years-0.042
Token a
Token ID257
Feature activation+2.96e+00

Pos logit contributions
 big+0.018
 it+0.017
 the+0.017
!+0.016
 bite+0.016
Neg logit contributions
 habitats-0.074
esty-0.073
 habitat-0.070
ind-0.070
asured-0.069
Token large
Token ID1588
Feature activation+2.28e+00

Pos logit contributions
 the+0.041
 it+0.040
!+0.040
 bite+0.040
 Emily+0.039
Neg logit contributions
 habitats-0.129
esty-0.126
 habitat-0.122
ind-0.122
 belongings-0.119
Token selection
Token ID6356
Feature activation-3.45e+00

Pos logit contributions
 big+0.045
 it+0.044
 the+0.044
!+0.044
 bite+0.043
Neg logit contributions
 habitats-0.091
esty-0.088
 habitat-0.085
asured-0.083
ind-0.083
Token of
Token ID286
Feature activation-1.69e+00

Pos logit contributions
 habitats+0.173
esty+0.168
 habitat+0.164
ind+0.161
asured+0.161
Neg logit contributions
 big-0.034
 it-0.034
 the-0.033
!-0.033
 bite-0.031
Token ch
Token ID442
Feature activation-2.34e+00

Pos logit contributions
 habitats+0.067
esty+0.065
 habitat+0.063
ried+0.063
asured+0.061
Neg logit contributions
 big-0.037
 it-0.037
 the-0.037
!-0.035
 bite-0.034
Tokenocol
Token ID4668
Feature activation-8.79e+00

Pos logit contributions
 habitats+0.042
esty+0.040
 habitat+0.037
ried+0.035
asured+0.033
Neg logit contributions
 big-0.101
!-0.099
 the-0.099
 it-0.097
,-0.095
Tokenates
Token ID689
Feature activation-1.82e+00

Pos logit contributions
esty+0.169
 habitats+0.166
 habitat+0.151
 privacy+0.139
 property+0.138
Neg logit contributions
!-0.345
on-0.328
 the-0.325
,-0.324
 bite-0.323
Token.
Token ID13
Feature activation-9.54e-01

Pos logit contributions
 habitats+0.099
esty+0.096
 habitat+0.094
asured+0.092
ind+0.092
Neg logit contributions
!-0.014
 it-0.013
 the-0.012
 big-0.012
,-0.010
Token Ell
Token ID7122
Feature activation-1.46e+00

Pos logit contributions
 habitats+0.043
esty+0.041
ind+0.040
asured+0.040
ities+0.040
Neg logit contributions
 -0.017
 it-0.016
 the-0.016
 big-0.015
 Emily-0.015
Tokena
Token ID64
Feature activation+1.95e+00

Pos logit contributions
 habitats+0.052
 habitat+0.048
esty+0.048
ities+0.045
 property+0.045
Neg logit contributions
!-0.037
 the-0.036
 it-0.036
 big-0.036
 -0.035
Token picked
Token ID6497
Feature activation-5.69e-01

Pos logit contributions
 big+0.020
 the+0.020
!+0.020
 it+0.020
,+0.018
Neg logit contributions
 habitats-0.098
esty-0.095
 habitat-0.092
ities-0.092
ind-0.091
Token dirty
Token ID11841
Feature activation+1.41e+00

Pos logit contributions
 bite+0.021
!+0.021
 it+0.021
 the+0.020
 big+0.020
Neg logit contributions
 habitats-0.063
esty-0.060
 habitat-0.058
ind-0.058
asured-0.058
Token stable
Token ID8245
Feature activation-2.94e+00

Pos logit contributions
 big+0.030
 it+0.029
 the+0.029
!+0.029
 bite+0.029
Neg logit contributions
 habitats-0.054
esty-0.052
 habitat-0.050
asured-0.049
ind-0.049
Token filled
Token ID5901
Feature activation-1.17e+00

Pos logit contributions
 habitats+0.129
esty+0.124
 habitat+0.121
ried+0.121
ind+0.120
Neg logit contributions
 big-0.041
 bite-0.040
 it-0.039
 Emily-0.039
 the-0.038
Token with
Token ID351
Feature activation-1.20e+00

Pos logit contributions
 habitats+0.048
ind+0.046
esty+0.046
 habitat+0.045
asured+0.045
Neg logit contributions
 bite-0.019
 yell-0.019
 big-0.018
 -0.018
 shout-0.017
Token re
Token ID302
Feature activation-7.00e+00

Pos logit contributions
esty+0.053
 habitats+0.052
ried+0.050
amb+0.049
asured+0.049
Neg logit contributions
 big-0.022
 the-0.021
 it-0.020
!-0.019
 bite-0.019
Tokenind
Token ID521
Feature activation-8.79e+00

Pos logit contributions
 habitats+0.079
esty+0.078
ities+0.062
 habitat+0.062
ried+0.061
Neg logit contributions
!-0.330
 it-0.315
 big-0.314
 noise-0.304
 loud-0.304
Tokeneer
Token ID28153
Feature activation-2.33e+00

Pos logit contributions
esty+0.164
 habitats+0.152
 habitat+0.140
asured+0.138
uries+0.134
Neg logit contributions
!-0.368
 the-0.365
 big-0.361
,-0.351
 it-0.350
Token.
Token ID13
Feature activation-5.13e-01

Pos logit contributions
 habitats+0.117
esty+0.114
 habitat+0.111
asured+0.109
ried+0.108
Neg logit contributions
!-0.025
 big-0.024
 it-0.023
 the-0.023
 bite-0.021
Token Emma
Token ID18966
Feature activation+2.54e+00

Pos logit contributions
 habitats+0.022
esty+0.022
 habitat+0.021
years+0.020
asured+0.020
Neg logit contributions
 bite-0.008
 big-0.007
!-0.007
 yell-0.007
 it-0.006
Token gasped
Token ID45236
Feature activation+1.23e+00

Pos logit contributions
 big+0.032
 it+0.031
 the+0.030
 bite+0.030
!+0.030
Neg logit contributions
 habitats-0.119
esty-0.116
 habitat-0.112
asured-0.111
ind-0.111
Token in
Token ID287
Feature activation+3.23e+00

Pos logit contributions
 the+0.017
!+0.017
 it+0.017
 big+0.016
,+0.015
Neg logit contributions
 habitats-0.059
esty-0.056
 habitat-0.055
asured-0.054
ried-0.054
Token started
Token ID2067
Feature activation-2.08e+00

Pos logit contributions
 habitats+0.075
esty+0.072
 habitat+0.070
ind+0.069
asured+0.069
Neg logit contributions
 big-0.029
 it-0.028
 the-0.028
!-0.028
 bite-0.027
Token eating
Token ID6600
Feature activation-8.66e-01

Pos logit contributions
 habitats+0.088
esty+0.084
 habitat+0.083
ried+0.083
asured+0.082
Neg logit contributions
 bite-0.031
 big-0.031
!-0.030
 Emily-0.030
 it-0.029
Token the
Token ID262
Feature activation+3.26e-02

Pos logit contributions
 habitats+0.041
esty+0.039
ishes+0.039
 habitat+0.038
ried+0.038
Neg logit contributions
 Emily-0.009
 bite-0.008
 yell-0.008
 big-0.008
 video-0.007
Token delicious
Token ID12625
Feature activation-9.87e-01

Pos logit contributions
 it+0.001
 big+0.001
!+0.001
 the+0.001
 bite+0.001
Neg logit contributions
 habitats-0.001
esty-0.001
 habitat-0.001
ind-0.001
asured-0.001
Token ch
Token ID442
Feature activation-1.82e+00

Pos logit contributions
 habitats+0.037
esty+0.035
 habitat+0.034
ind+0.034
ishes+0.033
Neg logit contributions
 Emily-0.021
 big-0.020
 it-0.020
 the-0.020
!-0.020
Tokenocol
Token ID4668
Feature activation-8.76e+00

Pos logit contributions
 habitats+0.032
esty+0.029
 habitat+0.027
ried+0.025
asured+0.025
Neg logit contributions
!-0.079
 big-0.079
 bite-0.078
 it-0.078
 the-0.077
Tokenates
Token ID689
Feature activation-1.62e+00

Pos logit contributions
 habitats+0.156
esty+0.153
 habitat+0.142
asured+0.132
 belongings+0.126
Neg logit contributions
!-0.355
 bite-0.341
 the-0.338
on-0.330
,-0.329
Token.
Token ID13
Feature activation-7.71e-01

Pos logit contributions
 habitats+0.076
esty+0.074
ried+0.073
 habitat+0.072
asured+0.072
Neg logit contributions
 big-0.017
 bite-0.016
 Emily-0.015
 it-0.015
 the-0.015
Token She
Token ID1375
Feature activation-1.74e+00

Pos logit contributions
 habitats+0.037
esty+0.036
 habitat+0.035
ried+0.034
uries+0.034
Neg logit contributions
 bite-0.008
!-0.008
 big-0.008
 bubble-0.007
 jugg-0.007
Token was
Token ID373
Feature activation+4.00e-01

Pos logit contributions
 habitats+0.078
esty+0.075
asured+0.074
 habitat+0.074
ried+0.073
Neg logit contributions
 big-0.023
 bite-0.022
 it-0.022
!-0.022
 the-0.021
Token so
Token ID523
Feature activation+8.17e-01

Pos logit contributions
 bite+0.004
 big+0.004
 it+0.004
!+0.004
 Emily+0.004
Neg logit contributions
 habitats-0.019
esty-0.018
asured-0.018
 habitat-0.018
ried-0.018
Token a
Token ID257
Feature activation+3.67e+00

Pos logit contributions
 big+0.001
 it+0.001
 bite+0.001
!+0.001
 the+0.001
Neg logit contributions
 habitats-0.003
esty-0.003
 habitat-0.003
asured-0.003
ind-0.003
Token big
Token ID1263
Feature activation+4.46e+00

Pos logit contributions
 Amy+0.058
 Emily+0.056
 Luke+0.055
't+0.055
 the+0.055
Neg logit contributions
 habitats-0.141
esty-0.140
ind-0.139
 belongings-0.134
 property-0.132
Token box
Token ID3091
Feature activation+1.65e+00

Pos logit contributions
 +0.085
 Emily+0.082
't+0.082
 it+0.080
 Amy+0.079
Neg logit contributions
 habitats-0.162
esty-0.157
ind-0.154
 habitat-0.151
 property-0.150
Token of
Token ID286
Feature activation-9.12e-01

Pos logit contributions
 Emily+0.024
 bite+0.024
 big+0.024
 it+0.024
 +0.023
Neg logit contributions
 habitats-0.069
esty-0.066
asured-0.066
ried-0.066
 habitat-0.066
Token ch
Token ID442
Feature activation-1.64e+00

Pos logit contributions
 habitats+0.033
esty+0.032
 habitat+0.030
asured+0.030
ried+0.030
Neg logit contributions
 big-0.022
 it-0.021
 the-0.021
!-0.021
 bite-0.021
Tokenocol
Token ID4668
Feature activation-8.75e+00

Pos logit contributions
 habitats+0.038
esty+0.035
 habitat+0.034
uries+0.032
asured+0.032
Neg logit contributions
 big-0.063
!-0.062
 the-0.061
 it-0.061
 bite-0.059
Tokenates
Token ID689
Feature activation-1.42e+00

Pos logit contributions
esty+0.154
 habitats+0.153
 habitat+0.132
uries+0.123
asured+0.121
Neg logit contributions
!-0.357
 bite-0.340
,-0.337
 the-0.336
 big-0.332
Token.
Token ID13
Feature activation-7.22e-01

Pos logit contributions
 habitats+0.074
esty+0.072
 habitat+0.070
asured+0.069
ind+0.069
Neg logit contributions
!-0.011
 it-0.011
 big-0.011
 the-0.010
 bite-0.010
Token It
Token ID632
Feature activation+7.21e-01

Pos logit contributions
 habitats+0.031
esty+0.030
ind+0.030
asured+0.029
 habitat+0.029
Neg logit contributions
 it-0.012
 the-0.012
 big-0.012
 -0.012
 Emily-0.012
Token was
Token ID373
Feature activation-5.60e-01

Pos logit contributions
 bite+0.007
 big+0.007
 it+0.007
!+0.007
 yell+0.007
Neg logit contributions
 habitats-0.033
esty-0.033
ried-0.032
asured-0.032
 habitat-0.031
Token so
Token ID523
Feature activation+2.65e-01

Pos logit contributions
 habitats+0.029
esty+0.028
 habitat+0.027
ind+0.027
ried+0.027
Neg logit contributions
 big-0.005
 it-0.005
 the-0.005
!-0.005
 bite-0.005
Token a
Token ID257
Feature activation+3.77e+00

Pos logit contributions
 big+0.002
 it+0.002
 the+0.002
!+0.002
 bite+0.002
Neg logit contributions
 habitats-0.014
esty-0.014
 habitat-0.013
ind-0.013
ried-0.013
Token big
Token ID1263
Feature activation+4.66e+00

Pos logit contributions
 Amy+0.060
 it+0.060
 the+0.058
 Emily+0.058
 Luke+0.058
Neg logit contributions
 habitats-0.140
esty-0.140
 belongings-0.137
 habitat-0.135
 property-0.135
Token box
Token ID3091
Feature activation+1.93e+00

Pos logit contributions
 it+0.063
 the+0.061
 Emily+0.060
 big+0.059
 +0.058
Neg logit contributions
 habitats-0.198
esty-0.192
 habitat-0.188
ind-0.184
asured-0.182
Token of
Token ID286
Feature activation-7.24e-01

Pos logit contributions
 big+0.020
 it+0.019
 the+0.018
 bite+0.018
!+0.018
Neg logit contributions
 habitats-0.095
esty-0.092
 habitat-0.090
asured-0.089
ind-0.088
Token ch
Token ID442
Feature activation-1.46e+00

Pos logit contributions
 habitats+0.029
esty+0.029
ried+0.027
 habitat+0.027
asured+0.027
Neg logit contributions
 big-0.015
 the-0.015
 it-0.015
 bite-0.014
!-0.014
Tokenocol
Token ID4668
Feature activation-8.72e+00

Pos logit contributions
 habitats+0.031
esty+0.031
uries+0.028
 habitat+0.027
asured+0.025
Neg logit contributions
!-0.057
 big-0.056
 the-0.055
 it-0.054
 bite-0.054
Tokenates
Token ID689
Feature activation-1.25e+00

Pos logit contributions
 habitats+0.157
esty+0.156
 habitat+0.134
uries+0.131
asured+0.124
Neg logit contributions
!-0.356
 bite-0.335
,-0.333
 the-0.325
 big-0.325
Token!
Token ID0
Feature activation+3.13e-01

Pos logit contributions
 habitats+0.067
esty+0.065
 habitat+0.063
ind+0.062
asured+0.061
Neg logit contributions
!-0.011
 it-0.011
 the-0.010
 big-0.009
 bite-0.008
Token<|endoftext|>
Token ID50256
Feature activation-3.98e+00

Pos logit contributions
 big+0.003
 bite+0.003
 it+0.003
!+0.003
 the+0.003
Neg logit contributions
 habitats-0.016
esty-0.015
 habitat-0.015
asured-0.014
ried-0.014
TokenOnce
Token ID7454
Feature activation-2.42e+00

Pos logit contributions
 habitats+0.144
esty+0.138
 habitat+0.134
ried+0.131
ind+0.129
Neg logit contributions
 bite-0.088
 big-0.086
 yell-0.083
 it-0.082
 noise-0.081
Token upon
Token ID2402
Feature activation-4.68e+00

Pos logit contributions
 habitats+0.055
esty+0.050
ried+0.049
uzz+0.049
asured+0.048
Neg logit contributions
 bite-0.082
 swirl-0.081
 noise-0.080
 yell-0.080
 video-0.080
Token all
Token ID477
Feature activation-1.17e+00

Pos logit contributions
 big+0.033
 it+0.032
 Emily+0.032
!+0.032
 bite+0.032
Neg logit contributions
 habitats-0.086
esty-0.082
 habitat-0.081
asured-0.080
ind-0.079
Token go
Token ID467
Feature activation-2.58e+00

Pos logit contributions
 habitats+0.047
esty+0.045
ried+0.044
 habitat+0.044
asured+0.044
Neg logit contributions
 Emily-0.020
 big-0.020
 bite-0.018
 Amy-0.018
 it-0.018
Token riding
Token ID10311
Feature activation-3.51e+00

Pos logit contributions
 habitats+0.102
esty+0.099
ind+0.098
ried+0.097
 habitat+0.096
Neg logit contributions
 bite-0.046
 Emily-0.044
 Amy-0.044
 big-0.044
 noise-0.042
Token together
Token ID1978
Feature activation-6.34e-01

Pos logit contributions
 habitats+0.165
esty+0.160
 habitat+0.156
ind+0.153
asured+0.153
Neg logit contributions
 big-0.044
 it-0.042
 the-0.041
 bite-0.041
!-0.041
Token on
Token ID319
Feature activation+1.75e+00

Pos logit contributions
 habitats+0.031
esty+0.030
 habitat+0.029
ind+0.029
asured+0.029
Neg logit contributions
 big-0.007
 it-0.006
 bite-0.006
 the-0.006
!-0.006
Token Tues
Token ID48496
Feature activation-8.71e+00

Pos logit contributions
 bite+0.010
 big+0.009
 Emily+0.008
!+0.008
 yell+0.007
Neg logit contributions
 habitats-0.093
esty-0.089
 habitat-0.088
ind-0.087
asured-0.086
Tokendays
Token ID12545
Feature activation-5.48e+00

Pos logit contributions
 habitats+0.217
esty+0.216
ishes+0.200
ities+0.192
 habitat+0.188
Neg logit contributions
!-0.299
 it-0.297
 -0.295
 the-0.290
't-0.284
Token.
Token ID13
Feature activation-5.34e-01

Pos logit contributions
 habitats+0.288
esty+0.281
 habitat+0.274
asured+0.271
ind+0.270
Neg logit contributions
!-0.042
 big-0.040
 the-0.040
 it-0.040
 bite-0.033
Token That
Token ID1320
Feature activation-1.47e+00

Pos logit contributions
 habitats+0.022
esty+0.022
asured+0.021
ind+0.021
ities+0.021
Neg logit contributions
 -0.010
 it-0.010
 the-0.010
 big-0.010
!-0.009
Token made
Token ID925
Feature activation+2.41e+00

Pos logit contributions
 habitats+0.065
esty+0.063
 habitat+0.061
ind+0.060
asured+0.060
Neg logit contributions
 big-0.021
 bite-0.021
 it-0.020
!-0.020
 Emily-0.020
Token the
Token ID262
Feature activation+1.22e+00

Pos logit contributions
 big+0.049
 it+0.048
 the+0.047
!+0.047
 bite+0.047
Neg logit contributions
 habitats-0.095
esty-0.091
 habitat-0.088
asured-0.087
ind-0.087
Token The
Token ID383
Feature activation-4.17e-01

Pos logit contributions
 habitats+0.051
esty+0.049
 habitat+0.048
asured+0.047
ind+0.047
Neg logit contributions
 big-0.023
 it-0.023
!-0.023
 bite-0.023
 the-0.023
Token big
Token ID1263
Feature activation+3.19e+00

Pos logit contributions
esty+0.019
 habitats+0.018
asured+0.018
ried+0.018
 habitat+0.017
Neg logit contributions
 big-0.010
 the-0.008
 noise-0.008
 loud-0.008
 it-0.008
Token,
Token ID11
Feature activation+7.39e-01

Pos logit contributions
 big+0.078
 the+0.076
 it+0.075
!+0.074
 bite+0.073
Neg logit contributions
 habitats-0.119
esty-0.114
 habitat-0.110
asured-0.109
ried-0.108
Token empty
Token ID6565
Feature activation-1.80e+00

Pos logit contributions
 big+0.011
 it+0.011
 the+0.011
!+0.011
 bite+0.011
Neg logit contributions
 habitats-0.033
esty-0.032
 habitat-0.031
asured-0.030
ind-0.030
Token house
Token ID2156
Feature activation-3.37e+00

Pos logit contributions
 habitats+0.064
esty+0.062
 habitat+0.059
asured+0.058
ried+0.058
Neg logit contributions
 big-0.044
 it-0.043
 the-0.043
!-0.042
 bite-0.042
Token belonged
Token ID19611
Feature activation-8.69e+00

Pos logit contributions
 habitats+0.165
esty+0.159
 habitat+0.155
ried+0.153
asured+0.153
Neg logit contributions
 big-0.034
 bite-0.033
 it-0.032
!-0.032
 the-0.031
Token to
Token ID284
Feature activation-3.14e-01

Pos logit contributions
 habitats+0.457
esty+0.443
 habitat+0.431
ind+0.424
asured+0.423
Neg logit contributions
 big-0.070
!-0.065
 it-0.065
 the-0.065
 bite-0.057
Token a
Token ID257
Feature activation+2.74e+00

Pos logit contributions
esty+0.018
 habitats+0.018
ried+0.017
ishes+0.017
 habitat+0.017
Neg logit contributions
 the-0.002
 it-0.001
 big-0.001
!-0.001
 Amy-0.000
Token family
Token ID1641
Feature activation-4.17e+00

Pos logit contributions
 it+0.039
 bite+0.039
!+0.039
 the+0.037
 Emily+0.037
Neg logit contributions
 habitats-0.119
esty-0.115
ind-0.112
 habitat-0.111
asured-0.108
Token of
Token ID286
Feature activation-1.25e-01

Pos logit contributions
 habitats+0.221
esty+0.216
 habitat+0.210
ried+0.208
ities+0.206
Neg logit contributions
 the-0.034
!-0.034
 big-0.034
 it-0.033
,-0.027
Token four
Token ID1440
Feature activation+1.12e+00

Pos logit contributions
esty+0.005
 habitat+0.005
 habitats+0.005
ried+0.005
amb+0.005
Neg logit contributions
 the-0.003
 big-0.003
 it-0.003
 loud-0.003
 cl-0.003
Token his
Token ID465
Feature activation-5.66e+00

Pos logit contributions
 habitats+0.062
esty+0.058
 habitat+0.058
ind+0.057
asured+0.057
Neg logit contributions
 bite-0.017
!-0.017
 big-0.017
 -0.015
 Emily-0.015
Token parents
Token ID3397
Feature activation-1.66e+00

Pos logit contributions
 habitats+0.196
esty+0.188
 habitat+0.181
asured+0.177
ind+0.177
Neg logit contributions
 big-0.144
 it-0.141
 the-0.140
!-0.139
 bite-0.139
Token about
Token ID546
Feature activation+4.29e-01

Pos logit contributions
 habitats+0.063
esty+0.060
 habitat+0.058
ind+0.058
asured+0.058
Neg logit contributions
 bite-0.034
 big-0.033
!-0.032
 it-0.031
 Emily-0.031
Token the
Token ID262
Feature activation+1.21e+00

Pos logit contributions
 big+0.002
 bite+0.002
!+0.002
 Emily+0.002
 yell+0.002
Neg logit contributions
 habitats-0.023
esty-0.022
 habitat-0.022
ind-0.021
asured-0.021
Token mysterious
Token ID11428
Feature activation+2.04e-01

Pos logit contributions
 it+0.026
 big+0.026
!+0.026
 the+0.026
 bite+0.026
Neg logit contributions
 habitats-0.045
esty-0.043
 habitat-0.042
ind-0.041
asured-0.041
Token Gor
Token ID19097
Feature activation-8.68e+00

Pos logit contributions
 it+0.005
 big+0.005
 Emily+0.005
 the+0.005
!+0.005
Neg logit contributions
 habitats-0.007
esty-0.006
 habitat-0.006
ind-0.006
asured-0.006
Tokenilla
Token ID5049
Feature activation-1.22e+00

Pos logit contributions
 habitats+0.179
esty+0.176
 habitat+0.160
ried+0.145
asured+0.143
Neg logit contributions
!-0.341
 bite-0.328
 it-0.326
 big-0.325
 the-0.325
Token.
Token ID13
Feature activation-1.50e-01

Pos logit contributions
 habitats+0.063
esty+0.062
 habitat+0.060
asured+0.058
ind+0.058
Neg logit contributions
 it-0.013
 the-0.013
!-0.012
 big-0.012
,-0.011
Token His
Token ID2399
Feature activation-5.65e+00

Pos logit contributions
 habitats+0.006
esty+0.006
asured+0.006
ind+0.006
ishes+0.006
Neg logit contributions
 the-0.003
 big-0.003
 it-0.003
 -0.003
!-0.003
Token parents
Token ID3397
Feature activation-6.54e-01

Pos logit contributions
esty+0.169
 habitats+0.165
ried+0.163
isher+0.153
ishes+0.152
Neg logit contributions
 big-0.189
 the-0.175
 it-0.173
!-0.167
 loud-0.166
Token were
Token ID547
Feature activation+8.77e-01

Pos logit contributions
 habitats+0.026
esty+0.025
ried+0.025
years+0.025
asured+0.025
Neg logit contributions
 bite-0.011
 big-0.011
 Emily-0.010
 it-0.010
!-0.010
Token wide
Token ID3094
Feature activation+2.18e+00

Pos logit contributions
 it+0.045
 bite+0.044
!+0.044
 the+0.043
 big+0.043
Neg logit contributions
 habitats-0.127
esty-0.124
ind-0.120
 habitat-0.119
asured-0.117
Token,
Token ID11
Feature activation+2.03e-01

Pos logit contributions
 big+0.039
 it+0.038
 the+0.037
!+0.037
 bite+0.037
Neg logit contributions
 habitats-0.092
esty-0.089
 habitat-0.086
ried-0.085
asured-0.085
Token green
Token ID4077
Feature activation+8.98e-01

Pos logit contributions
 big+0.003
 the+0.003
 it+0.003
!+0.003
 bite+0.003
Neg logit contributions
 habitats-0.009
esty-0.009
 habitat-0.009
asured-0.008
ried-0.008
Token field
Token ID2214
Feature activation-1.59e+00

Pos logit contributions
 big+0.017
 the+0.016
!+0.016
 it+0.016
,+0.015
Neg logit contributions
 habitats-0.038
esty-0.037
asured-0.036
ried-0.036
 habitat-0.036
Token that
Token ID326
Feature activation-8.35e-01

Pos logit contributions
 habitats+0.079
esty+0.076
 habitat+0.074
asured+0.073
ind+0.073
Neg logit contributions
 big-0.016
 it-0.016
 the-0.015
!-0.015
 bite-0.015
Token belonged
Token ID19611
Feature activation-8.68e+00

Pos logit contributions
esty+0.035
 habitats+0.035
ities+0.034
ind+0.033
 habitat+0.033
Neg logit contributions
 the-0.020
 it-0.019
 big-0.018
 -0.017
!-0.017
Token to
Token ID284
Feature activation-3.71e-01

Pos logit contributions
 habitats+0.459
esty+0.446
 habitat+0.432
ities+0.427
ind+0.427
Neg logit contributions
 big-0.070
!-0.065
 the-0.065
 it-0.063
 bite-0.052
Token a
Token ID257
Feature activation+2.52e+00

Pos logit contributions
esty+0.021
 habitats+0.021
ried+0.020
ishes+0.020
ind+0.020
Neg logit contributions
 the-0.003
 it-0.002
 big-0.002
!-0.002
 -0.001
Token special
Token ID2041
Feature activation-2.51e+00

Pos logit contributions
 big+0.041
 it+0.041
 the+0.040
!+0.040
 bite+0.040
Neg logit contributions
 habitats-0.108
esty-0.105
 habitat-0.101
ind-0.100
asured-0.100
Token family
Token ID1641
Feature activation-4.53e+00

Pos logit contributions
 habitats+0.095
esty+0.091
 habitat+0.088
asured+0.087
ried+0.087
Neg logit contributions
 big-0.055
 it-0.054
!-0.054
 the-0.054
 bite-0.053
Token.
Token ID13
Feature activation-1.04e+00

Pos logit contributions
 habitats+0.250
esty+0.245
 habitat+0.239
ried+0.237
asured+0.234
Neg logit contributions
!-0.031
 the-0.030
 it-0.028
 big-0.026
,-0.024
Token her
Token ID607
Feature activation-5.61e+00

Pos logit contributions
 habitats+0.069
esty+0.067
 habitat+0.064
ind+0.064
ried+0.064
Neg logit contributions
 big-0.030
!-0.029
 bite-0.029
 it-0.029
 the-0.028
Token a
Token ID257
Feature activation+3.55e+00

Pos logit contributions
 habitats+0.276
esty+0.271
 habitat+0.264
asured+0.263
ishes+0.261
Neg logit contributions
 bite-0.045
 yell-0.043
!-0.043
 it-0.043
 big-0.042
Token box
Token ID3091
Feature activation+1.82e+00

Pos logit contributions
 Amy+0.057
!+0.053
 it+0.051
 Lisa+0.051
't+0.051
Neg logit contributions
esty-0.132
 belongings-0.129
 habitats-0.129
 permission-0.129
ind-0.127
Token of
Token ID286
Feature activation-6.84e-01

Pos logit contributions
 it+0.021
 big+0.021
 the+0.020
 bite+0.020
!+0.020
Neg logit contributions
 habitats-0.085
esty-0.083
 habitat-0.081
asured-0.080
ried-0.080
Token ch
Token ID442
Feature activation-1.35e+00

Pos logit contributions
 habitats+0.026
esty+0.026
 habitat+0.025
ried+0.024
asured+0.024
Neg logit contributions
 big-0.015
 it-0.015
 the-0.015
!-0.014
 bite-0.014
Tokenocol
Token ID4668
Feature activation-8.67e+00

Pos logit contributions
 habitats+0.036
esty+0.034
uries+0.032
 habitat+0.032
ried+0.031
Neg logit contributions
 big-0.045
 it-0.045
!-0.044
 bite-0.044
 the-0.044
Tokenates
Token ID689
Feature activation-1.39e+00

Pos logit contributions
 habitats+0.155
esty+0.147
 habitat+0.137
uries+0.128
 former+0.124
Neg logit contributions
!-0.344
 bite-0.340
 big-0.332
,-0.332
 the-0.326
Token.
Token ID13
Feature activation-5.18e-01

Pos logit contributions
 habitats+0.072
esty+0.070
 habitat+0.068
ind+0.067
asured+0.067
Neg logit contributions
 it-0.012
 big-0.012
!-0.012
 the-0.012
 bite-0.011
Token Lily
Token ID20037
Feature activation+1.92e+00

Pos logit contributions
 habitats+0.022
esty+0.021
 habitat+0.021
ried+0.020
asured+0.020
Neg logit contributions
 bite-0.008
!-0.008
 big-0.008
 it-0.007
 yell-0.007
Token opened
Token ID4721
Feature activation-1.08e+00

Pos logit contributions
 big+0.023
 bite+0.022
 it+0.022
!+0.022
 the+0.021
Neg logit contributions
 habitats-0.089
esty-0.087
 habitat-0.085
asured-0.084
ried-0.083
Token the
Token ID262
Feature activation-2.89e-01

Pos logit contributions
 habitats+0.052
ind+0.051
esty+0.051
ishes+0.051
ried+0.050
Neg logit contributions
 yell-0.008
 joke-0.008
 noise-0.007
 bite-0.007
 shout-0.007
Token a
Token ID257
Feature activation+3.48e+00

Pos logit contributions
 habitats+0.071
esty+0.069
 habitat+0.067
ind+0.066
ried+0.065
Neg logit contributions
 big-0.014
 it-0.013
 the-0.013
!-0.013
 bite-0.013
Token normal
Token ID3487
Feature activation-1.66e+00

Pos logit contributions
 bite+0.046
!+0.046
 it+0.046
 the+0.045
 Emily+0.045
Neg logit contributions
 habitats-0.150
esty-0.149
ind-0.144
 habitat-0.141
asured-0.139
Token shoe
Token ID17292
Feature activation-1.11e+00

Pos logit contributions
 habitats+0.054
esty+0.051
 habitat+0.049
ried+0.048
ind+0.048
Neg logit contributions
 big-0.046
 it-0.046
!-0.045
 the-0.045
 bite-0.045
Token.
Token ID13
Feature activation-1.45e+00

Pos logit contributions
 habitats+0.053
esty+0.052
 habitat+0.051
ind+0.050
asured+0.050
Neg logit contributions
 big-0.011
 bite-0.011
 it-0.010
 the-0.010
 Emily-0.010
Token It
Token ID632
Feature activation-4.31e-01

Pos logit contributions
 habitats+0.063
esty+0.062
 habitat+0.060
ind+0.059
ried+0.059
Neg logit contributions
 it-0.024
 big-0.024
 the-0.023
!-0.023
 -0.022
Token belonged
Token ID19611
Feature activation-8.64e+00

Pos logit contributions
 habitats+0.023
esty+0.022
ities+0.022
 habitat+0.022
ind+0.021
Neg logit contributions
 big-0.005
 the-0.005
 it-0.005
!-0.004
 bite-0.004
Token to
Token ID284
Feature activation-7.04e-01

Pos logit contributions
 habitats+0.500
esty+0.475
ities+0.470
 habitat+0.469
ind+0.460
Neg logit contributions
 big-0.059
 the-0.054
!-0.048
 it-0.048
,-0.038
Token a
Token ID257
Feature activation+2.72e+00

Pos logit contributions
esty+0.043
 habitats+0.042
ried+0.042
ishes+0.041
isher+0.041
Neg logit contributions
 the-0.004
 it-0.003
 big-0.002
 a-0.001
!-0.001
Token little
Token ID1310
Feature activation-7.05e-01

Pos logit contributions
 it+0.040
 bite+0.040
!+0.039
 the+0.039
 big+0.038
Neg logit contributions
 habitats-0.118
esty-0.115
 habitat-0.111
ind-0.111
asured-0.109
Token boy
Token ID2933
Feature activation+5.00e-01

Pos logit contributions
 habitats+0.029
esty+0.028
 habitat+0.027
asured+0.027
ried+0.027
Neg logit contributions
 big-0.014
 the-0.014
 it-0.014
!-0.014
 bite-0.013
Token.
Token ID13
Feature activation-8.96e-01

Pos logit contributions
,+0.010
 the+0.010
!+0.010
 it+0.010
 big+0.008
Neg logit contributions
 habitats-0.023
esty-0.022
 habitat-0.021
ities-0.021
ried-0.021
TokenI
Token ID40
Feature activation+1.23e+00

Pos logit contributions
 habitats+0.117
 habitat+0.111
ishes+0.111
ities+0.109
asured+0.108
Neg logit contributions
 it-0.066
 the-0.064
 -0.062
 big-0.061
!-0.061
Token'm
Token ID1101
Feature activation-7.48e-01

Pos logit contributions
 big+0.025
 it+0.025
!+0.025
 the+0.025
 bite+0.025
Neg logit contributions
 habitats-0.048
esty-0.046
 habitat-0.045
ind-0.044
asured-0.044
Token glad
Token ID9675
Feature activation+6.97e-02

Pos logit contributions
 habitats+0.028
esty+0.027
 habitat+0.026
ind+0.026
asured+0.026
Neg logit contributions
 big-0.017
 it-0.017
 the-0.016
!-0.016
 bite-0.016
Token I
Token ID314
Feature activation+1.69e+00

Pos logit contributions
 big+0.001
 it+0.001
 the+0.001
!+0.001
 bite+0.001
Neg logit contributions
 habitats-0.003
esty-0.003
 habitat-0.003
asured-0.002
ind-0.002
Token was
Token ID373
Feature activation+6.13e-01

Pos logit contributions
!+0.025
 it+0.024
 the+0.024
 big+0.024
 bite+0.023
Neg logit contributions
 habitats-0.080
esty-0.077
 habitat-0.076
ind-0.075
ities-0.074
Token able
Token ID1498
Feature activation+4.27e+00

Pos logit contributions
 bite+0.012
 big+0.012
 it+0.012
 yell+0.012
 Emily+0.012
Neg logit contributions
 habitats-0.022
esty-0.022
asured-0.021
ind-0.021
 habitat-0.021
Token to
Token ID284
Feature activation+8.17e-02

Pos logit contributions
 big+0.040
 bite+0.039
 Emily+0.038
 it+0.037
 yell+0.037
Neg logit contributions
 habitats-0.204
esty-0.197
 habitat-0.195
asured-0.193
ried-0.193
Token help
Token ID1037
Feature activation+1.80e+00

Pos logit contributions
 big+0.001
 it+0.001
 the+0.001
!+0.001
 bite+0.001
Neg logit contributions
 habitats-0.003
esty-0.003
 habitat-0.003
ind-0.003
asured-0.003
Token and
Token ID290
Feature activation-1.65e+00

Pos logit contributions
 bite+0.037
 big+0.036
 +0.035
 Emily+0.034
 yell+0.034
Neg logit contributions
 habitats-0.066
esty-0.064
ind-0.063
 habitat-0.062
asured-0.061
Token make
Token ID787
Feature activation+3.80e+00

Pos logit contributions
 habitats+0.075
esty+0.073
 habitat+0.070
ind+0.070
asured+0.069
Neg logit contributions
 it-0.025
 big-0.025
 the-0.025
 bite-0.024
!-0.023
Token things
Token ID1243
Feature activation-7.21e-01

Pos logit contributions
 bite+0.065
 yell+0.060
 +0.058
 shout+0.056
!+0.056
Neg logit contributions
 habitats-0.149
ind-0.147
esty-0.146
 belongings-0.144
 habitat-0.143
TokenThe
Token ID464
Feature activation+4.37e-01

Pos logit contributions
 big+0.004
 it+0.004
 the+0.004
!+0.004
 +0.004
Neg logit contributions
 habitats-0.009
esty-0.008
 habitat-0.008
ishes-0.008
ities-0.008
Token scarf
Token ID36953
Feature activation+1.73e+00

Pos logit contributions
 big+0.007
 the+0.006
 Emily+0.005
 bite+0.005
 it+0.005
Neg logit contributions
 habitats-0.021
ishes-0.020
esty-0.020
 habitat-0.020
ind-0.020
Token was
Token ID373
Feature activation-4.00e-01

Pos logit contributions
 big+0.021
 it+0.020
 the+0.020
 bite+0.020
!+0.020
Neg logit contributions
 habitats-0.081
esty-0.079
 habitat-0.077
asured-0.076
ried-0.076
Token more
Token ID517
Feature activation+5.10e-01

Pos logit contributions
 habitats+0.019
esty+0.018
 habitat+0.018
asured+0.018
ind+0.018
Neg logit contributions
 big-0.005
 it-0.005
 bite-0.004
!-0.004
 the-0.004
Token than
Token ID621
Feature activation-2.89e+00

Pos logit contributions
 big+0.009
 it+0.009
!+0.009
 the+0.009
 bite+0.009
Neg logit contributions
 habitats-0.021
esty-0.020
 habitat-0.020
ind-0.019
ried-0.019
Token a
Token ID257
Feature activation+4.11e+00

Pos logit contributions
 habitats+0.127
esty+0.124
 habitat+0.119
ind+0.117
asured+0.117
Neg logit contributions
 it-0.050
 the-0.049
 big-0.048
!-0.046
 bite-0.045
Token m
Token ID285
Feature activation-1.91e+00

Pos logit contributions
 big+0.080
 it+0.079
 the+0.077
!+0.077
 bite+0.077
Neg logit contributions
 habitats-0.165
esty-0.159
 habitat-0.154
asured-0.151
ind-0.151
Tokenement
Token ID972
Feature activation+1.10e+00

Pos logit contributions
 habitats+0.059
esty+0.056
 habitat+0.053
ried+0.052
ishes+0.052
Neg logit contributions
!-0.060
 big-0.057
 bite-0.056
 it-0.056
,-0.053
Tokeno
Token ID78
Feature activation+1.81e+00

Pos logit contributions
 big+0.010
 it+0.010
!+0.010
 the+0.010
 bite+0.010
Neg logit contributions
 habitats-0.055
esty-0.054
 habitat-0.052
ind-0.052
asured-0.052
Token;
Token ID26
Feature activation-2.25e+00

Pos logit contributions
!+0.034
 it+0.034
 the+0.032
 bite+0.031
 big+0.030
Neg logit contributions
 habitats-0.080
 habitat-0.076
esty-0.076
ind-0.074
asured-0.074
Token it
Token ID340
Feature activation+1.24e+00

Pos logit contributions
 habitats+0.127
esty+0.124
 habitat+0.121
ried+0.119
ind+0.119
Neg logit contributions
 it-0.014
 the-0.013
 big-0.012
!-0.010
 bite-0.008
Token Please
Token ID4222
Feature activation+7.45e-01

Pos logit contributions
 +0.008
 Amy+0.007
+0.007
 Emily+0.007
 the+0.007
Neg logit contributions
 habitats-0.014
ind-0.014
asured-0.013
isher-0.013
ried-0.013
Token,
Token ID11
Feature activation+2.13e+00

Pos logit contributions
 big+0.008
!+0.008
 it+0.008
 the+0.008
 bite+0.008
Neg logit contributions
 habitats-0.037
esty-0.035
 habitat-0.035
asured-0.034
ind-0.034
Token Ben
Token ID3932
Feature activation+4.22e+00

Pos logit contributions
 the+0.038
 it+0.038
 big+0.037
 bite+0.034
 Emily+0.034
Neg logit contributions
 habitats-0.099
ind-0.094
 habitat-0.094
asured-0.091
years-0.091
Token,
Token ID11
Feature activation+2.56e+00

Pos logit contributions
 big+0.035
 swirl+0.034
 bite+0.034
 Emily+0.033
 yell+0.032
Neg logit contributions
esty-0.205
 habitats-0.202
ried-0.195
 habitat-0.194
ind-0.193
Token please
Token ID3387
Feature activation-2.51e-01

Pos logit contributions
 it+0.050
 the+0.050
 big+0.047
 bite+0.044
!+0.044
Neg logit contributions
 habitats-0.120
 habitat-0.113
ind-0.112
esty-0.111
asured-0.108
Token,"
Token ID553
Feature activation+4.02e+00

Pos logit contributions
 habitats+0.012
esty+0.012
 habitat+0.011
asured+0.011
ind+0.011
Neg logit contributions
 big-0.003
 it-0.003
 the-0.003
 bite-0.003
!-0.003
Token she
Token ID673
Feature activation+2.35e+00

Pos logit contributions
!+0.075
 bite+0.074
 big+0.071
 swirl+0.070
 jugg+0.066
Neg logit contributions
esty-0.147
 habitats-0.146
 habitat-0.137
uzz-0.137
years-0.135
Token said
Token ID531
Feature activation+1.31e+00

Pos logit contributions
!+0.032
 it+0.030
 bite+0.029
 long+0.029
 big+0.028
Neg logit contributions
esty-0.098
ried-0.095
 habitats-0.095
asured-0.092
ishes-0.092
Token,
Token ID11
Feature activation+2.91e+00

Pos logit contributions
 big+0.015
 bite+0.014
 Emily+0.012
 it+0.012
 jugg+0.012
Neg logit contributions
esty-0.059
 habitats-0.059
ried-0.058
ities-0.057
asured-0.057
Token pleading
Token ID30279
Feature activation+1.04e+00

Pos logit contributions
 big+0.042
 it+0.040
 bite+0.040
!+0.040
 the+0.038
Neg logit contributions
 habitats-0.128
esty-0.125
 habitat-0.121
asured-0.120
ried-0.120
Token.
Token ID13
Feature activation+5.32e-01

Pos logit contributions
 big+0.009
 it+0.009
 bite+0.008
 the+0.008
!+0.008
Neg logit contributions
 habitats-0.052
esty-0.051
 habitat-0.049
asured-0.049
ried-0.049
Token
Token ID198
Feature activation+1.01e+00

Pos logit contributions
 bite+0.012
!+0.011
 big+0.011
 it+0.010
 noise+0.010
Neg logit contributions
 habitats-0.053
esty-0.050
ind-0.050
 habitat-0.049
uzz-0.049
Token"
Token ID1
Feature activation-3.31e+00

Pos logit contributions
 big+0.019
 it+0.018
 the+0.018
!+0.018
 bite+0.018
Neg logit contributions
 habitats-0.043
esty-0.041
 habitat-0.040
asured-0.039
ried-0.039
TokenLook
Token ID8567
Feature activation+9.60e-01

Pos logit contributions
 habitats+0.112
esty+0.106
 habitat+0.104
asured+0.102
ried+0.102
Neg logit contributions
 big-0.087
 it-0.086
 the-0.086
 bite-0.084
!-0.084
Token,
Token ID11
Feature activation+2.00e+00

Pos logit contributions
 big+0.005
 it+0.005
 the+0.005
 bite+0.005
!+0.005
Neg logit contributions
 habitats-0.052
esty-0.050
 habitat-0.049
ind-0.048
asured-0.048
Token mom
Token ID1995
Feature activation+1.06e-01

Pos logit contributions
 big+0.029
 it+0.029
 the+0.028
!+0.027
 bite+0.027
Neg logit contributions
 habitats-0.093
esty-0.089
 habitat-0.088
ind-0.086
asured-0.086
Token,"
Token ID553
Feature activation+4.55e+00

Pos logit contributions
 bite+0.001
 big+0.001
 it+0.001
 yell+0.001
 Emily+0.001
Neg logit contributions
 habitats-0.005
esty-0.005
ishes-0.005
ried-0.005
ind-0.005
Token she
Token ID673
Feature activation+1.54e+00

Pos logit contributions
 bite+0.070
!+0.070
 big+0.067
 it+0.061
 swirl+0.059
Neg logit contributions
 habitats-0.188
esty-0.183
 habitat-0.174
years-0.172
ishes-0.172
Token said
Token ID531
Feature activation+7.23e-01

Pos logit contributions
 bite+0.025
!+0.023
 it+0.022
 popping+0.021
't+0.021
Neg logit contributions
esty-0.060
ried-0.058
 habitats-0.057
asured-0.057
ace-0.055
Token.
Token ID13
Feature activation+4.84e-01

Pos logit contributions
 bite+0.009
 big+0.009
 jugg+0.009
 video+0.008
 noise+0.008
Neg logit contributions
ried-0.031
esty-0.031
 habitats-0.030
ities-0.030
asured-0.030
Token "
Token ID366
Feature activation-2.23e+00

Pos logit contributions
 bite+0.008
!+0.007
 big+0.007
 it+0.007
 jugg+0.007
Neg logit contributions
 habitats-0.020
esty-0.020
 habitat-0.019
ried-0.018
ities-0.018
TokenI
Token ID40
Feature activation+1.85e+00

Pos logit contributions
 habitats+0.089
esty+0.084
 habitat+0.084
ishes+0.083
ried+0.083
Neg logit contributions
 it-0.048
 big-0.047
 the-0.047
 -0.045
 bite-0.044
Token He
Token ID679
Feature activation-4.36e-01

Pos logit contributions
 +0.002
 the+0.001
 it+0.001
 Emily+0.001
 Amy+0.001
Neg logit contributions
 habitats-0.005
ind-0.005
ried-0.005
isher-0.005
asured-0.005
Token is
Token ID318
Feature activation+7.64e-01

Pos logit contributions
 habitats+0.019
esty+0.018
 habitat+0.017
ried+0.017
ind+0.017
Neg logit contributions
 big-0.008
 bite-0.008
!-0.008
 it-0.008
 the-0.008
Token very
Token ID845
Feature activation+2.12e-01

Pos logit contributions
 big+0.011
 it+0.010
 the+0.010
!+0.010
 bite+0.010
Neg logit contributions
 habitats-0.035
esty-0.034
 habitat-0.033
ried-0.033
ind-0.032
Token helpful
Token ID7613
Feature activation+1.71e+00

Pos logit contributions
 big+0.004
 the+0.003
 it+0.003
 loud+0.003
!+0.003
Neg logit contributions
 habitats-0.010
esty-0.009
 habitat-0.009
ities-0.009
ried-0.009
Token,"
Token ID553
Feature activation+4.38e+00

Pos logit contributions
 big+0.016
 it+0.015
!+0.015
 the+0.015
 bite+0.014
Neg logit contributions
 habitats-0.086
esty-0.084
 habitat-0.082
ind-0.080
asured-0.080
Token Ben
Token ID3932
Feature activation+4.78e+00

Pos logit contributions
 bite+0.080
!+0.077
 big+0.071
 noise+0.070
 yell+0.070
Neg logit contributions
 habitats-0.170
esty-0.162
years-0.155
ind-0.155
 habitat-0.154
Token says
Token ID1139
Feature activation+1.62e+00

Pos logit contributions
 bite+0.076
 big+0.076
 it+0.075
!+0.071
 Emily+0.070
Neg logit contributions
 habitats-0.192
esty-0.191
amb-0.183
asured-0.183
 habitat-0.181
Token.
Token ID13
Feature activation+4.46e-01

Pos logit contributions
 jugg+0.023
 bite+0.023
 big+0.021
 video+0.021
 Emily+0.021
Neg logit contributions
ried-0.067
esty-0.066
ace-0.064
asured-0.064
 habitats-0.064
Token
Token ID198
Feature activation+1.70e+00

Pos logit contributions
 bite+0.006
 yell+0.005
!+0.005
 jugg+0.005
 noise+0.005
Neg logit contributions
 habitats-0.019
esty-0.019
 habitat-0.018
uzz-0.018
asured-0.018
Token
Token ID198
Feature activation+1.41e+00

Pos logit contributions
 bite+0.022
 yell+0.018
 jugg+0.018
 swirl+0.018
!+0.018
Neg logit contributions
 habitats-0.076
esty-0.074
ind-0.074
uzz-0.072
asured-0.072
TokenL
Token ID43
Feature activation-1.48e-01

Pos logit contributions
 big+0.026
 it+0.025
 the+0.025
!+0.024
 bite+0.024
Neg logit contributions
 habitats-0.060
esty-0.057
 habitat-0.056
asured-0.055
ind-0.055
Token the
Token ID262
Feature activation+8.40e-01

Pos logit contributions
 big+0.002
 it+0.002
 the+0.002
!+0.002
 bite+0.002
Neg logit contributions
 habitats-0.014
esty-0.014
 habitat-0.014
ind-0.013
asured-0.013
Token crew
Token ID5462
Feature activation-2.02e+00

Pos logit contributions
 big+0.020
 the+0.019
 it+0.019
!+0.019
 bite+0.018
Neg logit contributions
 habitats-0.031
esty-0.030
asured-0.029
ried-0.029
ishes-0.029
Token and
Token ID290
Feature activation-4.35e-01

Pos logit contributions
 habitats+0.098
esty+0.095
 habitat+0.092
asured+0.091
ind+0.091
Neg logit contributions
 big-0.022
 it-0.021
 bite-0.021
 the-0.021
!-0.020
Token Joe
Token ID5689
Feature activation+3.05e+00

Pos logit contributions
 habitats+0.019
esty+0.019
 habitat+0.018
asured+0.018
ind+0.018
Neg logit contributions
 big-0.007
 it-0.007
!-0.006
 the-0.006
 bite-0.006
Token was
Token ID373
Feature activation+8.04e-01

Pos logit contributions
 the+0.038
!+0.038
 big+0.037
 it+0.037
 bite+0.034
Neg logit contributions
 habitats-0.147
esty-0.142
 habitat-0.139
ind-0.137
ities-0.136
Token one
Token ID530
Feature activation-7.87e-01

Pos logit contributions
 bite+0.012
 big+0.012
 it+0.012
!+0.012
 Emily+0.012
Neg logit contributions
 habitats-0.034
esty-0.033
asured-0.032
 habitat-0.032
ried-0.032
Token happy
Token ID3772
Feature activation-1.60e-01

Pos logit contributions
 habitats+0.031
esty+0.030
 habitat+0.029
asured+0.029
ind+0.028
Neg logit contributions
 big-0.015
 it-0.015
 bite-0.015
 the-0.015
!-0.015
Token pirate
Token ID25868
Feature activation+4.57e-01

Pos logit contributions
 habitats+0.008
esty+0.008
 habitat+0.008
ind+0.008
asured+0.008
Neg logit contributions
 big-0.002
 it-0.002
!-0.001
 the-0.001
 bite-0.001
Token!
Token ID0
Feature activation+4.94e-01

Pos logit contributions
!+0.007
 it+0.007
 the+0.007
 big+0.007
,+0.006
Neg logit contributions
 habitats-0.021
esty-0.020
 habitat-0.020
ind-0.019
asured-0.019
Token<|endoftext|>
Token ID50256
Feature activation-3.83e+00

Pos logit contributions
 big+0.005
 bite+0.005
 it+0.004
!+0.004
 the+0.004
Neg logit contributions
 habitats-0.025
esty-0.024
 habitat-0.023
ried-0.023
asured-0.023
TokenOnce
Token ID7454
Feature activation-2.25e+00

Pos logit contributions
 habitats+0.146
esty+0.140
 habitat+0.137
ried+0.134
ind+0.132
Neg logit contributions
 bite-0.077
 big-0.074
 yell-0.072
 noise-0.071
 it-0.070
Token the
Token ID262
Feature activation+7.74e-01

Pos logit contributions
 big+0.001
!+0.001
 bite+0.001
 it+0.001
 +0.001
Neg logit contributions
 habitats-0.005
esty-0.005
 habitat-0.005
asured-0.005
ind-0.005
Token village
Token ID7404
Feature activation-1.83e+00

Pos logit contributions
 big+0.015
 it+0.015
 the+0.015
 bite+0.015
!+0.015
Neg logit contributions
 habitats-0.031
esty-0.030
 habitat-0.029
asured-0.029
ried-0.029
Token meeting
Token ID3249
Feature activation-1.87e+00

Pos logit contributions
 habitats+0.092
esty+0.090
 habitat+0.087
ind+0.086
asured+0.086
Neg logit contributions
 big-0.017
 it-0.017
 the-0.017
!-0.016
 bite-0.016
Token.
Token ID13
Feature activation-9.30e-01

Pos logit contributions
 habitats+0.087
esty+0.083
 habitat+0.082
asured+0.080
ind+0.080
Neg logit contributions
 bite-0.023
 it-0.022
 big-0.022
 -0.021
!-0.021
Token When
Token ID1649
Feature activation-2.79e+00

Pos logit contributions
 habitats+0.037
esty+0.036
 habitat+0.035
asured+0.033
ried+0.033
Neg logit contributions
 bite-0.018
 big-0.018
!-0.018
 it-0.018
 the-0.017
Token Jack
Token ID3619
Feature activation+1.70e+00

Pos logit contributions
 habitats+0.113
 habitat+0.107
esty+0.106
 belongings+0.106
ments+0.104
Neg logit contributions
 bite-0.037
 swirl-0.036
 yell-0.034
 sigh-0.031
 big-0.030
Token arrived
Token ID5284
Feature activation-3.17e+00

Pos logit contributions
 big+0.022
 it+0.021
 bite+0.021
 the+0.021
!+0.021
Neg logit contributions
 habitats-0.078
esty-0.076
 habitat-0.074
ried-0.073
asured-0.073
Token,
Token ID11
Feature activation+6.93e-01

Pos logit contributions
 habitats+0.162
esty+0.155
 habitat+0.152
asured+0.149
ities+0.149
Neg logit contributions
 the-0.034
 big-0.034
 it-0.034
!-0.031
,-0.029
Token he
Token ID339
Feature activation-1.60e+00

Pos logit contributions
 bite+0.009
 yell+0.008
 swirl+0.008
!+0.007
 big+0.007
Neg logit contributions
 habitats-0.030
esty-0.029
 belongings-0.029
asured-0.029
 habitat-0.028
Token saw
Token ID2497
Feature activation-4.85e-01

Pos logit contributions
 habitats+0.069
esty+0.067
 habitat+0.065
asured+0.065
ried+0.065
Neg logit contributions
 bite-0.023
 big-0.023
 it-0.022
!-0.022
 the-0.021
Token that
Token ID326
Feature activation+9.69e-01

Pos logit contributions
 habitats+0.021
 habitat+0.020
esty+0.020
years+0.020
 belongings+0.020
Neg logit contributions
 bite-0.007
 yell-0.006
!-0.006
 shout-0.006
 swirl-0.005
Token box
Token ID3091
Feature activation+1.51e+00

Pos logit contributions
 big+0.018
 it+0.018
!+0.018
 the+0.018
 bite+0.017
Neg logit contributions
 habitats-0.038
esty-0.036
 habitat-0.035
asured-0.035
ried-0.035
Token.
Token ID13
Feature activation-2.93e-01

Pos logit contributions
 big+0.014
 it+0.014
 bite+0.013
 the+0.013
!+0.013
Neg logit contributions
 habitats-0.075
esty-0.073
 habitat-0.071
asured-0.070
ried-0.070
Token They
Token ID1119
Feature activation+1.00e+00

Pos logit contributions
 habitats+0.012
esty+0.012
 habitat+0.011
asured+0.011
ind+0.011
Neg logit contributions
 big-0.005
 it-0.005
 the-0.005
!-0.005
 bite-0.005
Token all
Token ID477
Feature activation-2.79e-01

Pos logit contributions
 big+0.015
 it+0.015
 bite+0.015
!+0.015
 the+0.014
Neg logit contributions
 habitats-0.044
esty-0.042
 habitat-0.041
asured-0.040
ried-0.040
Token loved
Token ID6151
Feature activation-2.83e+00

Pos logit contributions
 habitats+0.011
esty+0.010
ried+0.010
asured+0.010
ishes+0.010
Neg logit contributions
 big-0.005
 it-0.005
 bite-0.005
 Emily-0.005
!-0.005
Token it
Token ID340
Feature activation+1.78e+00

Pos logit contributions
 habitats+0.141
esty+0.138
 habitat+0.134
ind+0.132
ried+0.132
Neg logit contributions
 it-0.030
 big-0.029
 the-0.029
!-0.027
 bite-0.026
Token and
Token ID290
Feature activation-4.99e-01

Pos logit contributions
 bite+0.026
 yell+0.025
 shout+0.023
 jugg+0.022
 +0.022
Neg logit contributions
esty-0.073
 habitats-0.073
asured-0.072
ried-0.069
 habitat-0.069
Token played
Token ID2826
Feature activation-1.25e+00

Pos logit contributions
 habitats+0.021
esty+0.020
 habitat+0.019
asured+0.019
ried+0.019
Neg logit contributions
 big-0.009
 bite-0.009
!-0.009
 it-0.008
 the-0.008
Token with
Token ID351
Feature activation-3.50e-03

Pos logit contributions
 habitats+0.048
esty+0.048
 habitat+0.045
ried+0.045
asured+0.045
Neg logit contributions
 bite-0.023
 yell-0.021
 big-0.021
 Emily-0.021
 -0.020
Token the
Token ID262
Feature activation+8.06e-01

Pos logit contributions
 habitats+0.000
esty+0.000
ishes+0.000
ind+0.000
 belongings+0.000
Neg logit contributions
 bite-0.000
 big-0.000
 yell-0.000
!-0.000
 Emily-0.000
Token toys
Token ID14958
Feature activation-2.90e+00

Pos logit contributions
 big+0.014
 it+0.014
 the+0.014
!+0.014
 bite+0.014
Neg logit contributions
 habitats-0.034
esty-0.032
 habitat-0.031
ind-0.031
asured-0.031
Token the
Token ID262
Feature activation+8.50e-01

Pos logit contributions
 bite+0.016
!+0.015
 yell+0.014
 swirl+0.014
 jugg+0.014
Neg logit contributions
 habitats-0.051
asured-0.050
 permission-0.049
ind-0.049
esty-0.048
Token chamber
Token ID11847
Feature activation-1.22e+00

Pos logit contributions
 big+0.020
 the+0.017
 it+0.017
 noise+0.016
 long+0.016
Neg logit contributions
 habitats-0.038
ried-0.036
amb-0.036
ishes-0.036
beh-0.035
Token and
Token ID290
Feature activation-9.85e-01

Pos logit contributions
 habitats+0.057
esty+0.055
 habitat+0.054
asured+0.053
ind+0.053
Neg logit contributions
 big-0.014
 it-0.014
 bite-0.014
 the-0.013
!-0.013
Token unw
Token ID7379
Feature activation+9.34e-01

Pos logit contributions
 habitats+0.045
esty+0.044
 habitat+0.042
ind+0.042
asured+0.041
Neg logit contributions
 it-0.015
 the-0.015
 big-0.015
!-0.014
 bite-0.014
Tokenra
Token ID430
Feature activation+1.26e+00

Pos logit contributions
!+0.035
 big+0.035
 it+0.035
 the+0.034
 bite+0.034
Neg logit contributions
 habitats-0.022
 habitat-0.019
esty-0.019
ities-0.018
 property-0.018
Tokenpped
Token ID1496
Feature activation+1.51e+00

Pos logit contributions
 it+0.042
!+0.041
 big+0.041
 the+0.041
 bite+0.040
Neg logit contributions
 habitats-0.034
esty-0.032
 habitat-0.031
asured-0.030
ind-0.029
Token the
Token ID262
Feature activation+1.12e+00

Pos logit contributions
 big+0.013
 bite+0.013
 Emily+0.013
 Amy+0.011
 yell+0.011
Neg logit contributions
 habitats-0.071
esty-0.069
ried-0.068
ind-0.068
asured-0.068
Token blanket
Token ID18447
Feature activation-1.21e-01

Pos logit contributions
 big+0.030
 the+0.027
 it+0.027
!+0.026
 bite+0.025
Neg logit contributions
 habitats-0.047
 habitat-0.043
asured-0.042
esty-0.042
beh-0.042
Token.
Token ID13
Feature activation-5.90e-01

Pos logit contributions
 habitats+0.005
esty+0.005
ried+0.005
 habitat+0.005
asured+0.005
Neg logit contributions
 bite-0.001
 big-0.001
 Emily-0.001
 yell-0.001
 it-0.001
Token He
Token ID679
Feature activation-1.48e+00

Pos logit contributions
 habitats+0.025
esty+0.025
ind+0.024
ishes+0.024
ried+0.024
Neg logit contributions
 big-0.011
 it-0.010
 the-0.010
!-0.010
 -0.010
Token grabbed
Token ID13176
Feature activation-1.69e-01

Pos logit contributions
 habitats+0.072
esty+0.070
 habitat+0.068
ind+0.067
ities+0.067
Neg logit contributions
 big-0.018
 it-0.018
 the-0.018
!-0.017
 bite-0.017
Token all
Token ID477
Feature activation-1.05e-01

Pos logit contributions
 the+0.030
!+0.029
 big+0.029
 it+0.028
 bite+0.027
Neg logit contributions
 habitats-0.079
esty-0.078
 habitat-0.076
ind-0.074
asured-0.074
Token celebrated
Token ID13943
Feature activation-9.66e-01

Pos logit contributions
 habitats+0.005
esty+0.004
 habitat+0.004
asured+0.004
ind+0.004
Neg logit contributions
 big-0.002
 it-0.002
 bite-0.002
!-0.002
 the-0.002
Token the
Token ID262
Feature activation-3.73e-01

Pos logit contributions
 habitats+0.047
esty+0.045
 habitat+0.044
ind+0.043
asured+0.043
Neg logit contributions
 bite-0.010
 big-0.010
 it-0.009
!-0.009
 yell-0.009
Token completion
Token ID11939
Feature activation-8.33e-01

Pos logit contributions
 habitats+0.015
esty+0.015
 habitat+0.014
asured+0.014
ried+0.014
Neg logit contributions
 big-0.007
 it-0.007
 the-0.007
!-0.007
 bite-0.007
Token of
Token ID286
Feature activation-5.34e-01

Pos logit contributions
 habitats+0.041
esty+0.040
 habitat+0.039
asured+0.038
ind+0.038
Neg logit contributions
 big-0.008
 it-0.008
 the-0.008
!-0.008
 bite-0.008
Token the
Token ID262
Feature activation-8.71e-01

Pos logit contributions
 habitats+0.029
esty+0.027
 habitat+0.027
asured+0.027
ind+0.027
Neg logit contributions
 big-0.003
!-0.003
 bite-0.003
 it-0.003
 the-0.002
Token house
Token ID2156
Feature activation-2.35e+00

Pos logit contributions
 habitats+0.037
esty+0.036
 habitat+0.035
asured+0.034
ind+0.034
Neg logit contributions
 big-0.015
 it-0.015
 the-0.014
!-0.014
 bite-0.014
Token!
Token ID0
Feature activation-1.18e-01

Pos logit contributions
 habitats+0.121
esty+0.118
 habitat+0.115
ind+0.113
asured+0.113
Neg logit contributions
 big-0.020
 it-0.020
!-0.020
 the-0.019
 bite-0.018
Token
Token ID198
Feature activation+2.40e-01

Pos logit contributions
 habitats+0.006
esty+0.006
 habitat+0.006
ind+0.006
asured+0.006
Neg logit contributions
 it-0.001
 the-0.001
 big-0.001
 -0.001
!-0.001
Token
Token ID198
Feature activation+6.28e-02

Pos logit contributions
 bite+0.003
 big+0.003
!+0.003
 it+0.003
 the+0.003
Neg logit contributions
 habitats-0.011
esty-0.010
 habitat-0.010
ind-0.010
asured-0.010
TokenJack
Token ID14295
Feature activation+3.62e+00

Pos logit contributions
 big+0.001
 it+0.001
 the+0.001
!+0.001
 +0.001
Neg logit contributions
 habitats-0.003
esty-0.003
 habitat-0.003
ishes-0.003
ities-0.003
Token to
Token ID284
Feature activation+7.74e-02

Pos logit contributions
 habitats+0.158
esty+0.152
 habitat+0.149
ind+0.147
asured+0.147
Neg logit contributions
 big-0.028
 it-0.027
!-0.027
 bite-0.026
 the-0.026
Token perform
Token ID1620
Feature activation-1.42e+00

Pos logit contributions
!+0.002
 it+0.002
 Emily+0.002
 +0.001
 big+0.001
Neg logit contributions
 habitats-0.003
esty-0.003
 habitat-0.003
asured-0.003
ities-0.003
Token tricks
Token ID15910
Feature activation+8.62e-01

Pos logit contributions
 habitats+0.059
esty+0.057
ind+0.056
asured+0.055
ities+0.055
Neg logit contributions
 bite-0.023
 -0.022
 yell-0.022
 Emily-0.021
 it-0.021
Token for
Token ID329
Feature activation-7.68e-01

Pos logit contributions
 bite+0.013
 big+0.012
 Emily+0.012
 it+0.012
 yell+0.012
Neg logit contributions
 habitats-0.037
esty-0.036
asured-0.035
ities-0.035
ind-0.034
Token his
Token ID465
Feature activation-6.70e+00

Pos logit contributions
 habitats+0.034
esty+0.032
 habitat+0.032
ind+0.032
asured+0.031
Neg logit contributions
 bite-0.011
!-0.011
 big-0.011
 it-0.011
 -0.011
Token friends
Token ID2460
Feature activation-1.76e+00

Pos logit contributions
esty+0.252
 habitats+0.248
ried+0.244
years+0.235
asured+0.235
Neg logit contributions
 big-0.169
 the-0.152
!-0.149
 it-0.148
 long-0.143
Token.
Token ID13
Feature activation-1.13e+00

Pos logit contributions
 habitats+0.096
esty+0.093
 habitat+0.092
asured+0.090
ried+0.089
Neg logit contributions
!-0.011
 big-0.011
 the-0.011
 it-0.011
 bite-0.009
Token Tim
Token ID5045
Feature activation+8.11e-01

Pos logit contributions
 habitats+0.044
esty+0.043
 habitat+0.041
asured+0.040
ind+0.040
Neg logit contributions
 big-0.022
 bite-0.022
 it-0.022
!-0.022
 the-0.021
Token had
Token ID550
Feature activation+1.32e+00

Pos logit contributions
 big+0.016
 it+0.016
 bite+0.015
!+0.015
 the+0.015
Neg logit contributions
 habitats-0.032
esty-0.031
 habitat-0.030
asured-0.030
ind-0.030
Token a
Token ID257
Feature activation+3.05e+00

Pos logit contributions
 bite+0.018
 yell+0.016
 sigh+0.016
!+0.015
 shout+0.014
Neg logit contributions
 habitats-0.060
 habitat-0.056
esty-0.055
years-0.055
asured-0.054
Token toy
Token ID13373
Feature activation-2.82e+00

Pos logit contributions
!+0.047
 it+0.046
 Amy+0.045
 +0.043
 the+0.043
Neg logit contributions
esty-0.123
 habitats-0.123
ind-0.120
 belongings-0.115
 favorites-0.114
Token to
Token ID284
Feature activation-5.35e-01

Pos logit contributions
 habitats+0.015
esty+0.015
 habitat+0.014
asured+0.014
ind+0.014
Neg logit contributions
 big-0.003
 it-0.002
 the-0.002
!-0.002
 bite-0.002
Token everyone
Token ID2506
Feature activation-1.74e+00

Pos logit contributions
 habitats+0.024
esty+0.023
asured+0.022
 habitat+0.022
ried+0.022
Neg logit contributions
 bite-0.007
 big-0.007
!-0.007
 Emily-0.007
 yell-0.006
Token he
Token ID339
Feature activation-1.06e+00

Pos logit contributions
 habitats+0.083
esty+0.081
 habitat+0.079
asured+0.078
ind+0.078
Neg logit contributions
 big-0.019
 bite-0.018
 it-0.018
 the-0.018
!-0.018
Token knew
Token ID2993
Feature activation-2.74e+00

Pos logit contributions
 habitats+0.052
esty+0.052
ities+0.051
 habitat+0.050
isher+0.050
Neg logit contributions
 big-0.014
 the-0.013
!-0.012
 it-0.012
 said-0.010
Token.
Token ID13
Feature activation-6.24e-01

Pos logit contributions
 habitats+0.144
esty+0.141
 habitat+0.137
ind+0.136
years+0.133
Neg logit contributions
 it-0.031
 the-0.029
!-0.029
 big-0.025
,-0.023
Token His
Token ID2399
Feature activation-5.86e+00

Pos logit contributions
 habitats+0.027
esty+0.026
ind+0.026
 habitat+0.025
asured+0.025
Neg logit contributions
 it-0.011
 big-0.010
 the-0.010
 -0.010
!-0.010
Token mom
Token ID1995
Feature activation+5.31e-01

Pos logit contributions
esty+0.247
ried+0.242
 habitats+0.241
uries+0.239
ities+0.234
Neg logit contributions
 big-0.142
 the-0.116
 it-0.115
,-0.113
 loud-0.108
Token,
Token ID11
Feature activation+2.38e+00

Pos logit contributions
 big+0.008
 bite+0.008
 it+0.007
!+0.007
 the+0.007
Neg logit contributions
 habitats-0.023
esty-0.023
asured-0.022
ind-0.022
ried-0.022
Token dad
Token ID9955
Feature activation+6.34e-01

Pos logit contributions
 bite+0.037
 big+0.035
!+0.035
 it+0.034
 yell+0.032
Neg logit contributions
 habitats-0.103
esty-0.099
 habitat-0.096
asured-0.096
ind-0.095
Token,
Token ID11
Feature activation+2.20e+00

Pos logit contributions
 big+0.006
 bite+0.006
 it+0.006
!+0.005
 the+0.005
Neg logit contributions
 habitats-0.031
esty-0.030
asured-0.029
 habitat-0.029
ried-0.029
Token and
Token ID290
Feature activation-3.45e-02

Pos logit contributions
 bite+0.034
 big+0.034
 it+0.033
!+0.033
 the+0.031
Neg logit contributions
 habitats-0.096
esty-0.092
 habitat-0.090
asured-0.089
ried-0.088
Token to
Token ID284
Feature activation+6.26e-02

Pos logit contributions
 habitats+0.105
esty+0.102
 habitat+0.099
ind+0.098
ried+0.098
Neg logit contributions
 big-0.029
 it-0.029
 bite-0.029
!-0.028
 the-0.028
Token the
Token ID262
Feature activation+7.14e-01

Pos logit contributions
!+0.001
 million+0.001
 swirl+0.001
br+0.001
 jugg+0.001
Neg logit contributions
 permission-0.003
 habitats-0.003
 habitat-0.003
asured-0.003
esty-0.003
Token flower
Token ID15061
Feature activation-1.05e+00

Pos logit contributions
 big+0.016
 the+0.014
 it+0.013
!+0.013
 bite+0.013
Neg logit contributions
 habitats-0.030
asured-0.029
amb-0.028
ried-0.028
esty-0.028
Token shop
Token ID6128
Feature activation-2.34e+00

Pos logit contributions
 habitats+0.050
esty+0.048
 habitat+0.046
ind+0.046
asured+0.046
Neg logit contributions
 the-0.014
 big-0.014
 it-0.014
!-0.014
 bite-0.013
Token and
Token ID290
Feature activation-9.13e-01

Pos logit contributions
 habitats+0.120
esty+0.116
 habitat+0.113
ind+0.111
asured+0.111
Neg logit contributions
 it-0.022
 big-0.022
 the-0.022
!-0.021
 bite-0.019
Token found
Token ID1043
Feature activation-1.38e+00

Pos logit contributions
 habitats+0.041
esty+0.040
 habitat+0.038
ind+0.038
asured+0.038
Neg logit contributions
 it-0.015
 the-0.015
 big-0.014
!-0.014
 bite-0.014
Token the
Token ID262
Feature activation+6.67e-01

Pos logit contributions
 habitats+0.070
esty+0.068
 habitat+0.066
asured+0.065
ind+0.065
Neg logit contributions
 big-0.013
 it-0.013
 the-0.012
!-0.012
 bite-0.012
Token most
Token ID749
Feature activation-1.55e+00

Pos logit contributions
 big+0.014
 the+0.013
 it+0.013
 bite+0.013
!+0.013
Neg logit contributions
 habitats-0.027
esty-0.026
asured-0.026
ried-0.025
 habitat-0.025
Token perfect
Token ID2818
Feature activation-1.86e+00

Pos logit contributions
 habitats+0.069
esty+0.066
 habitat+0.065
asured+0.064
ind+0.064
Neg logit contributions
 big-0.023
 it-0.023
 bite-0.023
 the-0.022
!-0.022
Token bunch
Token ID7684
Feature activation-1.18e-01

Pos logit contributions
 habitats+0.075
esty+0.072
 habitat+0.070
ind+0.068
asured+0.068
Neg logit contributions
 big-0.037
 it-0.036
 the-0.036
!-0.036
 bite-0.035
Token of
Token ID286
Feature activation-1.47e+00

Pos logit contributions
 habitats+0.006
esty+0.006
 habitat+0.005
asured+0.005
ind+0.005
Neg logit contributions
 big-0.001
 it-0.001
 the-0.001
!-0.001
 bite-0.001
Token noise
Token ID7838
Feature activation+2.20e+00

Pos logit contributions
 habitats+0.013
esty+0.012
 habitat+0.012
 belongings+0.012
asured+0.012
Neg logit contributions
 bite-0.004
!-0.004
 it-0.004
 Emily-0.004
 big-0.004
Token and
Token ID290
Feature activation-7.54e-01

Pos logit contributions
 bite+0.027
 big+0.027
 +0.027
 it+0.026
 Emily+0.025
Neg logit contributions
 habitats-0.100
esty-0.096
 habitat-0.094
asured-0.093
ind-0.093
Token whim
Token ID29923
Feature activation-1.10e+00

Pos logit contributions
 habitats+0.035
esty+0.033
 habitat+0.033
asured+0.032
ried+0.032
Neg logit contributions
 big-0.010
 it-0.010
 bite-0.010
!-0.010
 the-0.010
Tokenpered
Token ID13653
Feature activation+1.40e+00

Pos logit contributions
 habitats+0.023
esty+0.020
isher+0.019
 belongings+0.018
 habitat+0.018
Neg logit contributions
!-0.044
 big-0.043
,-0.042
 the-0.042
 noise-0.042
Token.
Token ID13
Feature activation-3.65e-01

Pos logit contributions
!+0.021
 the+0.021
 big+0.021
 it+0.021
,+0.019
Neg logit contributions
 habitats-0.064
esty-0.061
asured-0.060
 habitat-0.060
ishes-0.059
Token She
Token ID1375
Feature activation-1.10e+00

Pos logit contributions
 habitats+0.016
esty+0.016
 habitat+0.015
asured+0.015
ind+0.015
Neg logit contributions
 big-0.005
 it-0.005
 bite-0.005
!-0.005
 the-0.005
Token was
Token ID373
Feature activation+7.59e-01

Pos logit contributions
 habitats+0.049
esty+0.048
 habitat+0.046
ind+0.045
asured+0.045
Neg logit contributions
 big-0.016
 it-0.016
 the-0.016
!-0.016
 bite-0.016
Token very
Token ID845
Feature activation-1.13e-02

Pos logit contributions
 big+0.012
 it+0.011
 the+0.011
!+0.011
 bite+0.011
Neg logit contributions
 habitats-0.034
esty-0.033
 habitat-0.032
ind-0.031
asured-0.031
Token miserable
Token ID22444
Feature activation-1.97e-01

Pos logit contributions
 habitats+0.000
esty+0.000
 habitat+0.000
ind+0.000
asured+0.000
Neg logit contributions
 it-0.000
!-0.000
 bite-0.000
 big-0.000
 the-0.000
Token.
Token ID13
Feature activation-4.59e-01

Pos logit contributions
 habitats+0.010
esty+0.009
 habitat+0.009
asured+0.009
ind+0.009
Neg logit contributions
 big-0.002
 it-0.002
 bite-0.002
 the-0.002
!-0.002
Token
Token ID220
Feature activation+1.82e+00

Pos logit contributions
 habitats+0.024
esty+0.023
 habitat+0.023
asured+0.023
ind+0.023
Neg logit contributions
 big-0.003
 it-0.003
 the-0.003
!-0.003
 bite-0.003
Token
Token ID198
Feature activation+2.59e-01

Pos logit contributions
 swirl+0.008
 bite+0.008
 video+0.007
 bubble+0.007
 noise+0.007
Neg logit contributions
 habitats-0.015
uzz-0.015
amb-0.015
esty-0.014
ind-0.014
TokenThe
Token ID464
Feature activation+2.79e-01

Pos logit contributions
 big+0.004
 it+0.004
!+0.004
 bite+0.004
 the+0.004
Neg logit contributions
 habitats-0.012
esty-0.011
 habitat-0.011
ind-0.011
asured-0.011
Token frightened
Token ID24776
Feature activation+1.32e+00

Pos logit contributions
 big+0.006
 the+0.006
 bite+0.006
 noise+0.005
 it+0.005
Neg logit contributions
 habitats-0.012
 habitat-0.011
ried-0.011
esty-0.011
uzz-0.011
Token squirrel
Token ID33039
Feature activation+6.62e-01

Pos logit contributions
 big+0.028
 the+0.028
 it+0.028
!+0.027
 bite+0.026
Neg logit contributions
 habitats-0.052
esty-0.050
 habitat-0.049
asured-0.048
ried-0.047
Token ran
Token ID4966
Feature activation-1.87e+00

Pos logit contributions
 big+0.010
 Emily+0.009
 it+0.009
 bite+0.009
 Luke+0.009
Neg logit contributions
 habitats-0.028
ried-0.028
esty-0.027
 habitat-0.026
asured-0.026
Token and
Token ID290
Feature activation-1.70e+00

Pos logit contributions
 habitats+0.068
esty+0.067
ried+0.065
 habitat+0.064
ind+0.062
Neg logit contributions
 bite-0.039
 big-0.038
 Emily-0.037
 jugg-0.037
 it-0.037
Token ran
Token ID4966
Feature activation-2.02e+00

Pos logit contributions
 habitats+0.069
esty+0.067
ried+0.066
 habitat+0.065
asured+0.063
Neg logit contributions
 big-0.030
!-0.028
 bite-0.028
 it-0.028
 Emily-0.027
Token,
Token ID11
Feature activation+8.70e-01

Pos logit contributions
 habitats+0.074
esty+0.071
ried+0.071
 habitat+0.070
ace+0.066
Neg logit contributions
 jugg-0.041
 video-0.041
 bubble-0.041
 Emily-0.040
 bite-0.040
Token trying
Token ID2111
Feature activation-5.28e-01

Pos logit contributions
 big+0.014
 bite+0.013
!+0.013
 video+0.013
 Emily+0.013
Neg logit contributions
 habitats-0.036
esty-0.035
ried-0.035
 habitat-0.034
asured-0.033
Token to
Token ID284
Feature activation-7.54e-01

Pos logit contributions
 habitats+0.022
 habitat+0.021
esty+0.021
ried+0.021
 permission+0.021
Neg logit contributions
 bubble-0.008
 video-0.008
 Emily-0.008
 jugg-0.008
-0.007
Token escape
Token ID6654
Feature activation-1.38e+00

Pos logit contributions
 habitats+0.029
esty+0.028
ried+0.028
 habitat+0.028
asured+0.028
Neg logit contributions
!-0.013
 Emily-0.013
 -0.013
 it-0.013
 big-0.013
Token her
Token ID607
Feature activation-6.54e+00

Pos logit contributions
 habitats+0.026
ind+0.025
 belongings+0.024
ishes+0.024
asured+0.024
Neg logit contributions
!-0.019
 -0.018
 bite-0.018
 sigh-0.018
 yell-0.017
Token toys
Token ID14958
Feature activation-4.04e+00

Pos logit contributions
 habitats+0.232
esty+0.223
 habitat+0.215
ried+0.210
asured+0.209
Neg logit contributions
 big-0.171
 the-0.164
 it-0.163
!-0.160
 bite-0.159
Token.
Token ID13
Feature activation-9.26e-01

Pos logit contributions
 habitats+0.198
esty+0.192
 habitat+0.187
asured+0.184
ind+0.184
Neg logit contributions
 big-0.046
 the-0.044
 it-0.044
!-0.044
 bite-0.042
Token One
Token ID1881
Feature activation-6.21e+00

Pos logit contributions
 habitats+0.036
esty+0.034
 habitat+0.033
ind+0.032
asured+0.032
Neg logit contributions
 big-0.019
 bite-0.019
!-0.019
 it-0.019
 the-0.018
Token day
Token ID1110
Feature activation-7.79e-01

Pos logit contributions
 habitats+0.246
esty+0.237
 habitat+0.229
asured+0.225
ind+0.225
Neg logit contributions
 big-0.124
 it-0.123
!-0.121
 the-0.121
 bite-0.121
Token her
Token ID607
Feature activation-6.94e+00

Pos logit contributions
 habitats+0.036
 belongings+0.034
esty+0.034
 habitat+0.034
ind+0.034
Neg logit contributions
 jugg-0.007
 bite-0.006
 video-0.006
 bubble-0.006
 big-0.006
Token mom
Token ID1995
Feature activation-1.64e+00

Pos logit contributions
 habitats+0.293
esty+0.284
 habitat+0.271
ried+0.271
asured+0.269
Neg logit contributions
 big-0.143
 the-0.128
 it-0.124
!-0.124
 bite-0.123
Token gave
Token ID2921
Feature activation-1.80e+00

Pos logit contributions
 habitats+0.063
esty+0.061
asured+0.060
ried+0.059
ind+0.059
Neg logit contributions
 bite-0.031
 it-0.030
 big-0.030
!-0.030
 yell-0.029
Token her
Token ID607
Feature activation-5.95e+00

Pos logit contributions
 habitats+0.069
esty+0.068
ind+0.065
ried+0.065
 habitat+0.065
Neg logit contributions
!-0.034
 big-0.033
 bite-0.033
 yell-0.032
 jugg-0.032
Token a
Token ID257
Feature activation+3.29e+00

Pos logit contributions
 habitats+0.301
esty+0.294
 habitat+0.286
asured+0.284
ind+0.281
Neg logit contributions
 it-0.044
 bite-0.044
!-0.044
 big-0.043
 yell-0.040
Token toy
Token ID13373
Feature activation-2.41e+00

Pos logit contributions
!+0.052
 it+0.052
 Luke+0.050
 Amy+0.050
 the+0.048
Neg logit contributions
esty-0.123
 habitats-0.122
 belongings-0.121
ind-0.120
isher-0.118
Token her
Token ID607
Feature activation-6.05e+00

Pos logit contributions
 habitats+0.005
esty+0.005
 habitat+0.005
asured+0.005
ind+0.005
Neg logit contributions
 big-0.001
 it-0.001
 the-0.001
 bite-0.001
!-0.001
Token.
Token ID13
Feature activation-4.07e-01

Pos logit contributions
 habitats+0.226
esty+0.222
ried+0.211
asured+0.207
 habitat+0.206
Neg logit contributions
 big-0.145
!-0.140
 the-0.138
 it-0.136
 bite-0.135
Token When
Token ID1649
Feature activation-2.38e+00

Pos logit contributions
 habitats+0.016
esty+0.016
 habitat+0.015
ind+0.015
asured+0.015
Neg logit contributions
 big-0.008
 it-0.008
 the-0.008
!-0.008
 bite-0.008
Token she
Token ID673
Feature activation-1.42e+00

Pos logit contributions
 habitats+0.081
years+0.080
ried+0.075
 habitat+0.074
 permission+0.074
Neg logit contributions
 video-0.044
 bite-0.042
 jugg-0.042
 yell-0.040
 trick-0.040
Token opened
Token ID4721
Feature activation-9.83e-01

Pos logit contributions
 habitats+0.063
esty+0.061
 habitat+0.060
asured+0.060
ried+0.059
Neg logit contributions
 big-0.019
 it-0.018
!-0.018
 the-0.018
 bite-0.018
Token her
Token ID607
Feature activation-6.72e+00

Pos logit contributions
 habitats+0.041
ind+0.040
esty+0.040
 habitat+0.039
ishes+0.039
Neg logit contributions
 video-0.013
 bite-0.013
 yell-0.013
 joke-0.013
 jugg-0.012
Token eyes
Token ID2951
Feature activation+1.98e+00

Pos logit contributions
 habitats+0.230
esty+0.219
 habitat+0.206
ried+0.206
asured+0.204
Neg logit contributions
 big-0.185
 the-0.177
 it-0.176
!-0.175
 bite-0.175
Token,
Token ID11
Feature activation+8.67e-01

Pos logit contributions
 bite+0.014
 video+0.014
 Emily+0.014
 yell+0.013
 big+0.013
Neg logit contributions
 habitats-0.094
esty-0.092
 habitat-0.091
ried-0.091
ind-0.089
Token she
Token ID673
Feature activation-1.04e+00

Pos logit contributions
 bite+0.010
 big+0.010
!+0.010
 yell+0.009
 video+0.009
Neg logit contributions
 habitats-0.039
esty-0.038
ried-0.037
years-0.037
 habitat-0.037
Token spotted
Token ID13489
Feature activation-1.39e+00

Pos logit contributions
 habitats+0.046
esty+0.045
 habitat+0.044
ind+0.043
ried+0.043
Neg logit contributions
 big-0.014
 bite-0.014
 it-0.014
 the-0.014
!-0.014
Token a
Token ID257
Feature activation+3.71e+00

Pos logit contributions
 habitats+0.071
esty+0.069
 habitat+0.067
ind+0.066
asured+0.066
Neg logit contributions
 big-0.012
 bite-0.011
 it-0.011
!-0.011
 the-0.010
Token woods
Token ID16479
Feature activation-5.00e+00

Pos logit contributions
 habitats+0.050
esty+0.049
ried+0.047
asured+0.046
 habitat+0.046
Neg logit contributions
 big-0.042
 the-0.039
!-0.038
 it-0.038
 bite-0.037
Token behind
Token ID2157
Feature activation+8.12e-01

Pos logit contributions
 habitats+0.227
esty+0.219
 habitat+0.213
asured+0.210
ind+0.209
Neg logit contributions
 big-0.070
 it-0.069
 bite-0.068
 the-0.067
!-0.067
Token his
Token ID465
Feature activation-7.05e+00

Pos logit contributions
 big+0.009
 bite+0.009
 it+0.009
!+0.009
 Emily+0.008
Neg logit contributions
 habitats-0.039
esty-0.037
 habitat-0.037
ind-0.036
asured-0.036
Token house
Token ID2156
Feature activation-3.53e+00

Pos logit contributions
 habitats+0.240
esty+0.231
ried+0.219
 habitat+0.219
asured+0.219
Neg logit contributions
 big-0.192
!-0.185
 the-0.183
 it-0.181
 bite-0.177
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
esty+0.198
 habitats+0.198
ind+0.190
asured+0.188
 habitat+0.188
Neg logit contributions
 the-0.018
!-0.018
 big-0.016
 it-0.015
,-0.014
Token One
Token ID1881
Feature activation-6.00e+00

Pos logit contributions
 habitats+0.041
esty+0.039
 habitat+0.038
asured+0.037
ind+0.037
Neg logit contributions
 bite-0.024
 big-0.024
!-0.023
 it-0.023
 the-0.022
Token day
Token ID1110
Feature activation-3.43e-01

Pos logit contributions
 habitats+0.237
esty+0.228
 habitat+0.221
asured+0.217
ind+0.217
Neg logit contributions
 big-0.120
 it-0.119
!-0.117
 the-0.117
 bite-0.117
Token,
Token ID11
Feature activation+5.98e-02

Pos logit contributions
 habitats+0.015
esty+0.014
amb+0.014
 belongings+0.014
ishes+0.014
Neg logit contributions
 jugg-0.004
 video-0.003
 Emily-0.003
 swirl-0.003
 bubble-0.003
Token he
Token ID339
Feature activation-1.86e+00

Pos logit contributions
 sigh+0.001
 bite+0.001
 Emily+0.001
 video+0.001
 jugg+0.001
Neg logit contributions
 habitats-0.002
esty-0.002
 belongings-0.002
ishes-0.002
ried-0.002
Token found
Token ID1043
Feature activation-1.74e+00

Pos logit contributions
 habitats+0.085
esty+0.082
 habitat+0.079
ities+0.078
ind+0.078
Neg logit contributions
 big-0.029
 the-0.028
 it-0.028
!-0.027
 bite-0.026
Token a
Token ID257
Feature activation+2.80e+00

Pos logit contributions
 habitats+0.085
esty+0.083
 habitat+0.081
asured+0.081
ind+0.080
Neg logit contributions
 bite-0.015
!-0.015
 yell-0.014
 big-0.013
 Emily-0.013
Token little
Token ID1310
Feature activation-7.56e-01

Pos logit contributions
 it+0.050
!+0.050
 bite+0.050
 +0.048
 yell+0.047
Neg logit contributions
esty-0.125
 habitats-0.125
ind-0.124
 habitat-0.115
asured-0.115
Token girl
Token ID2576
Feature activation+3.20e-01

Pos logit contributions
 habitats+0.026
esty+0.026
ind+0.025
 habitat+0.024
ishes+0.024
Neg logit contributions
 it-0.017
 bite-0.017
!-0.017
 big-0.017
 the-0.017
Token named
Token ID3706
Feature activation-2.70e+00

Pos logit contributions
 big+0.008
 it+0.008
 the+0.008
!+0.008
 bite+0.008
Neg logit contributions
 habitats-0.011
esty-0.010
 habitat-0.010
ind-0.010
asured-0.010
Token Lily
Token ID20037
Feature activation+6.32e-01

Pos logit contributions
 habitats+0.128
esty+0.124
 habitat+0.121
ind+0.119
ried+0.119
Neg logit contributions
 big-0.032
 it-0.031
!-0.031
 bite-0.031
 the-0.030
Token.
Token ID13
Feature activation-7.70e-01

Pos logit contributions
 big+0.006
 it+0.006
 bite+0.006
 the+0.006
!+0.005
Neg logit contributions
 habitats-0.031
esty-0.030
 habitat-0.030
ind-0.029
asured-0.029
Token One
Token ID1881
Feature activation-6.05e+00

Pos logit contributions
 habitats+0.031
esty+0.030
 habitat+0.029
ind+0.028
asured+0.028
Neg logit contributions
 big-0.015
 it-0.015
 the-0.015
!-0.015
 bite-0.015
Token day
Token ID1110
Feature activation-6.38e-01

Pos logit contributions
 habitats+0.225
esty+0.217
 habitat+0.209
asured+0.208
ind+0.207
Neg logit contributions
 it-0.127
!-0.125
 big-0.124
 bite-0.123
 the-0.123
Token,
Token ID11
Feature activation+1.07e-01

Pos logit contributions
 habitats+0.028
esty+0.028
 belongings+0.027
years+0.027
uzz+0.027
Neg logit contributions
 jugg-0.006
 swirl-0.006
 noise-0.006
 bite-0.005
 yell-0.005
Token her
Token ID607
Feature activation-5.59e+00

Pos logit contributions
 bite+0.002
!+0.002
 yell+0.002
 jugg+0.002
 swirl+0.002
Neg logit contributions
 habitats-0.004
esty-0.004
years-0.004
 belongings-0.004
ried-0.004
Token mom
Token ID1995
Feature activation-9.52e-01

Pos logit contributions
 habitats+0.252
esty+0.241
 habitat+0.237
ried+0.234
asured+0.232
Neg logit contributions
 big-0.107
 the-0.091
 bite-0.088
!-0.087
 it-0.086
Token gifted
Token ID22527
Feature activation-2.70e-01

Pos logit contributions
 habitats+0.029
esty+0.028
asured+0.028
ishes+0.027
ind+0.027
Neg logit contributions
 bite-0.025
 big-0.024
 it-0.024
 yell-0.024
!-0.024
Token,
Token ID11
Feature activation+1.72e+00

Pos logit contributions
 habitats+0.051
esty+0.050
 habitat+0.049
ried+0.048
ind+0.048
Neg logit contributions
 big-0.005
 bite-0.005
 it-0.005
!-0.005
 the-0.004
Token
Token ID6184
Feature activation-1.61e+00

Pos logit contributions
 it+0.019
 big+0.019
 the+0.018
!+0.018
 bite+0.017
Neg logit contributions
 habitats-0.085
esty-0.082
 habitat-0.080
asured-0.079
ind-0.079
Token
Token ID95
Feature activation-1.98e+00

Pos logit contributions
 habitats+0.051
esty+0.048
 habitat+0.046
asured+0.046
ried+0.045
Neg logit contributions
 it-0.045
 big-0.045
!-0.045
 the-0.044
 bite-0.044
Token
Token ID26391
Feature activation+1.75e+00

Pos logit contributions
 habitats+0.066
esty+0.063
 habitat+0.061
asured+0.059
ind+0.059
Neg logit contributions
 big-0.052
 it-0.052
 the-0.051
!-0.051
 bite-0.051
Token
Token ID129
Feature activation+6.81e-01

Pos logit contributions
 bite+0.042
 big+0.042
 the+0.041
 it+0.041
 Emily+0.040
Neg logit contributions
 habitats-0.061
esty-0.059
 habitat-0.057
ind-0.055
ried-0.054
Token
Token ID241
Feature activation-5.95e+00

Pos logit contributions
 it+0.021
 big+0.021
 the+0.020
!+0.020
 bite+0.020
Neg logit contributions
 habitats-0.020
esty-0.019
 habitat-0.018
ried-0.018
ishes-0.018
TokenI
Token ID40
Feature activation+7.12e-01

Pos logit contributions
 habitats+0.224
esty+0.214
 habitat+0.208
ind+0.203
asured+0.203
Neg logit contributions
 big-0.130
 bite-0.127
 the-0.125
 it-0.125
!-0.124
Tokenâ
Token ID22940
Feature activation-2.48e+00

Pos logit contributions
 big+0.014
 it+0.014
 bite+0.014
!+0.014
 the+0.014
Neg logit contributions
 habitats-0.028
esty-0.027
 habitat-0.026
ind-0.026
ities-0.026
Token
Token ID26391
Feature activation+2.04e+00

Pos logit contributions
 habitats+0.088
esty+0.082
 habitat+0.081
ind+0.079
asured+0.079
Neg logit contributions
!-0.062
 it-0.061
 the-0.061
 bite-0.060
 big-0.060
Token
Token ID8151
Feature activation+6.21e+00

Pos logit contributions
 big+0.045
 bite+0.044
 it+0.044
 the+0.044
!+0.043
Neg logit contributions
 habitats-0.076
esty-0.073
 habitat-0.070
ind-0.068
asured-0.068
Tokenm
Token ID76
Feature activation-1.50e+00

Pos logit contributions
 bite+0.131
 noise+0.131
 it+0.128
 swirl+0.128
 bubble+0.126
Neg logit contributions
 habitats-0.219
esty-0.219
ried-0.215
years-0.212
uzz-0.211