TOP ACTIVATIONS
MAX = 6.780

. The car does not smell . The car is nice
Suddenly , a loud " bang !" filled the air .
cream . Doesn 't that sound nice ?" J
dry and it didn 't smell bad at all . Sus
on because it didn 't smell good . But his mom
o , bird . Does it hurt ?" The
day . Doesn 't that sound special ?".
Mia do not like the smell . They cover their noses
it makes me happy !" Fro ggy replied , " I
frowned . Something didn 't smell right . It was sm
The lizard does not his ss anymore . It is quiet
not move or make a sound . " I
it because it didn 't smell like a real plant .
. The candy did not taste good anymore . It tasted
, Tom ! Isn 't it cool ? Let 's catch
Tim did not like the smell and he felt sick .
shelter . They did not smell the rot that filled the
fire . She did not smell the smoke . She did
oven . They did not smell the food burning . They
my veil ! Isn 't it lovely ?" Anna and Ben

MIN ACTIVATIONS
MIN = -5.936

, they all counted the days until Lily 's birthday .
spent the rest of his days in the magical land ,
fisherman ! I spend my days catching fish . Do you
anymore . He spent his days playing with his mineral and
on , he spent his days watching the birds and feeling
The girl would spend her days with her net , making
. They would spend their days exercising in the forest ,
said , " I like Tues days too ! It 's
bs . She spent her days spinning her webs around the
was happy to spend his days on the farm , learning
S ue spent her days reading the books in the
Ben . Ben spent his days playing in the park near
on , Spike spent his days running around the yard with
and his friend spent their days visiting people who were doing
apart . They spent their days playing together and enjoying the
He crawled out of his cob web and started to stretch
, they started counting the days on the calendar . Lily
together . They spent their days running and hopping around the
enjoyed the rest of their summer day . <|endoftext|> The little
Joe forgot all about his k ite , and started to

INTERVAL 3.601 - 6.780
CONTAINS 0.241%

. They do not make loud noises . They are quiet
day , the TV made a loud noise and stopped working
does not want to make a mess or make Mom angry
when he couldn 't see anything . " Yes
and why it made that noise . They waved to the

INTERVAL 0.422 - 3.601
CONTAINS 36.399%

went out to search for something special . He went out
dragon breathing fire . He brave ly swung his sword and
want to play with me and my doll ?"
safe and sound ! <|endoftext|> Once there was a generous man
Tom . Tom was 3 years old and he was not

INTERVAL -2.757 - 0.422
CONTAINS 61.985%

indoors to tell dad the news . Dad was so proud
They take the red square block and some other blocks and
and rushed back to the park . He used the tool
The rain stopped , and the sun came out again .
at home . He carefully put the photo

INTERVAL -5.936 - -2.757
CONTAINS 1.376%

M andy cl apped her hands and said , " Let
supplies to play with for hours . He thanked the plastic
man . She thanked him again and thanked him for making
running again . After a while , he started
together . <|endoftext|> L ily loved flowers . She liked to
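
The four INTERVAL ranges above appear to be equal-width bins spanning MIN to MAX. This is inferred from the reported numbers and is not stated in the dump. A minimal Python sketch that reproduces the bin edges under that assumption follows; the helper name `equal_width_edges` is hypothetical and not part of any tool.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced edges from lo to hi (assumed binning)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# MIN and MAX as reported in the sections above.
edges = equal_width_edges(-5.936, 6.780, 4)
rounded = [round(e, 3) for e in edges]
print(rounded)  # -> [-5.936, -2.757, 0.422, 3.601, 6.78]

# CONTAINS percentages, listed from the lowest interval to the highest.
shares = [1.376, 61.985, 36.399, 0.241]
total = sum(shares)  # close to 100, up to rounding in the reported values
```

The rounded edges match the INTERVAL boundaries listed above (-2.757, 0.422, 3.601), which is consistent with the equal-width reading.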
Token.
Token ID13
Feature activation+2.19e+00

Pos logit contributions
ls+0.088
l+0.086
 cor+0.086
 AW+0.078
ah+0.076
Neg logit contributions
 Clean-0.062
 themselves-0.058
 lately-0.058
elt-0.057
 together-0.056
Token The
Token ID383
Feature activation+1.36e+00

Pos logit contributions
ls+0.068
l+0.067
 cor+0.064
 crocod+0.056
osh+0.055
Neg logit contributions
 Clean-0.031
elt-0.028
 lately-0.028
side-0.027
apping-0.026
Token car
Token ID1097
Feature activation+1.90e+00

Pos logit contributions
l+0.027
ls+0.025
 cor+0.023
 crocod+0.021
 dragon+0.019
Neg logit contributions
elt-0.036
 Clean-0.035
apping-0.034
 lately-0.034
 Working-0.034
Token does
Token ID857
Feature activation+4.52e+00

Pos logit contributions
 cor+0.038
ls+0.036
l+0.033
 AW+0.030
 beast+0.030
Neg logit contributions
 then-0.042
 Clean-0.040
 Then-0.038
 together-0.037
 sometimes-0.036
Token not
Token ID407
Feature activation+3.01e+00

Pos logit contributions
ls+0.151
 cor+0.141
l+0.140
 beast+0.130
ah+0.130
Neg logit contributions
 then-0.046
 Clean-0.045
 including-0.041
 themselves-0.039
 cleaning-0.039
Token smell
Token ID8508
Feature activation+6.78e+00

Pos logit contributions
ls+0.085
l+0.078
 cor+0.075
 beast+0.067
 AW+0.067
Neg logit contributions
 Clean-0.050
 including-0.049
 lately-0.048
 birthday-0.046
elt-0.046
Token.
Token ID13
Feature activation+2.35e+00

Pos logit contributions
ls+0.177
 cor+0.164
l+0.161
 AW+0.155
ah+0.153
Neg logit contributions
 Clean-0.115
 lately-0.114
 including-0.108
 themselves-0.106
 then-0.106
Token The
Token ID383
Feature activation+1.22e+00

Pos logit contributions
ls+0.076
l+0.074
 cor+0.072
 crocod+0.062
ah+0.062
Neg logit contributions
 Clean-0.032
elt-0.028
 lately-0.027
apping-0.027
side-0.026
Token car
Token ID1097
Feature activation+1.80e+00

Pos logit contributions
l+0.024
ls+0.023
 cor+0.021
 crocod+0.018
 dragon+0.017
Neg logit contributions
elt-0.032
 Clean-0.031
apping-0.031
 lately-0.031
amed-0.030
Token is
Token ID318
Feature activation+7.55e-01

Pos logit contributions
 cor+0.035
ls+0.033
l+0.032
 AW+0.029
 beast+0.028
Neg logit contributions
 then-0.040
 Clean-0.037
 Then-0.035
 together-0.035
 sometimes-0.034
Token nice
Token ID3621
Feature activation-7.48e-02

Pos logit contributions
ls+0.019
l+0.018
 cor+0.017
 AW+0.014
ah+0.014
Neg logit contributions
 Clean-0.015
 lately-0.014
elt-0.014
amed-0.014
itted-0.014
TokenSuddenly
Token ID38582
Feature activation-1.65e-02

Pos logit contributions
elt+0.015
 lately+0.014
 Clean+0.014
itted+0.014
 organizing+0.013
Neg logit contributions
l-0.017
ls-0.017
 cor-0.016
ia-0.014
ira-0.014
Token,
Token ID11
Feature activation-7.81e-01

Pos logit contributions
 Clean+0.000
 lately+0.000
elt+0.000
wards+0.000
side+0.000
Neg logit contributions
ls-0.000
l-0.000
 cor-0.000
osh-0.000
 AW-0.000
Token a
Token ID257
Feature activation+2.04e+00

Pos logit contributions
 Clean+0.015
 lately+0.015
elt+0.014
wards+0.014
side+0.014
Neg logit contributions
ls-0.019
l-0.019
 cor-0.018
 AW-0.015
ah-0.015
Token loud
Token ID7812
Feature activation+3.29e+00

Pos logit contributions
ls+0.036
l+0.034
 cor+0.032
 AW+0.024
osh+0.024
Neg logit contributions
 Clean-0.055
elt-0.053
 lately-0.052
apping-0.051
side-0.051
Token "
Token ID366
Feature activation-1.96e+00

Pos logit contributions
ls+0.074
l+0.069
 cor+0.066
osh+0.051
 AW+0.051
Neg logit contributions
 Clean-0.077
elt-0.070
 lately-0.069
apping-0.068
oned-0.067
Tokenbang
Token ID36668
Feature activation+6.25e+00

Pos logit contributions
elt+0.037
 Clean+0.036
amed+0.034
oned+0.034
side+0.033
Neg logit contributions
ls-0.055
l-0.050
 cor-0.047
ia-0.042
ira-0.041
Token!"
Token ID2474
Feature activation+1.33e+00

Pos logit contributions
ls+0.152
 cor+0.141
l+0.138
ira+0.117
 crocod+0.115
Neg logit contributions
 Clean-0.128
oned-0.124
apping-0.114
 lately-0.111
 then-0.110
Token filled
Token ID5901
Feature activation+4.72e-01

Pos logit contributions
ls+0.034
l+0.033
 cor+0.031
 crocod+0.026
 beast+0.026
Neg logit contributions
 Clean-0.026
elt-0.024
 lately-0.024
side-0.023
apping-0.023
Token the
Token ID262
Feature activation+1.53e-01

Pos logit contributions
ls+0.011
l+0.011
 cor+0.010
ah+0.009
 AW+0.009
Neg logit contributions
 Clean-0.010
wards-0.009
side-0.009
keeping-0.009
oned-0.009
Token air
Token ID1633
Feature activation+4.67e-01

Pos logit contributions
ls+0.004
l+0.004
 cor+0.004
 crocod+0.003
 beast+0.003
Neg logit contributions
 Clean-0.003
elt-0.003
 lately-0.003
side-0.003
apping-0.003
Token.
Token ID13
Feature activation-4.35e-01

Pos logit contributions
ls+0.013
l+0.013
 cor+0.012
 crocod+0.010
osh+0.010
Neg logit contributions
 Clean-0.008
elt-0.008
 lately-0.007
 Working-0.007
itted-0.007
Token cream
Token ID8566
Feature activation-2.69e-01

Pos logit contributions
 Clean+0.059
elt+0.056
 lately+0.055
 Working+0.054
itted+0.054
Neg logit contributions
ls-0.060
l-0.058
 cor-0.054
 crocod-0.044
ah-0.043
Token.
Token ID13
Feature activation-1.38e+00

Pos logit contributions
 Clean+0.006
elt+0.005
 lately+0.005
itted+0.005
apping+0.005
Neg logit contributions
ls-0.007
l-0.006
 cor-0.006
 crocod-0.005
ah-0.005
Token Doesn
Token ID28048
Feature activation+1.30e+00

Pos logit contributions
amed+0.028
side+0.026
aming+0.025
itted+0.024
 indoors+0.024
Neg logit contributions
l-0.031
 cor-0.030
ls-0.030
 Ro-0.029
osh-0.028
Token't
Token ID470
Feature activation+3.49e+00

Pos logit contributions
ls+0.036
 cor+0.036
l+0.035
 troll+0.030
 beast+0.030
Neg logit contributions
 Clean-0.022
elt-0.020
 lately-0.020
 Working-0.020
side-0.019
Token that
Token ID326
Feature activation+4.32e+00

Pos logit contributions
ls+0.098
l+0.094
 cor+0.091
 AW+0.079
ia+0.079
Neg logit contributions
 Clean-0.060
 lately-0.059
 then-0.055
elt-0.055
 including-0.054
Token sound
Token ID2128
Feature activation+6.19e+00

Pos logit contributions
ls+0.115
 cor+0.109
l+0.104
ira+0.095
 troll+0.092
Neg logit contributions
 Clean-0.079
 then-0.078
 birthday-0.077
 so-0.075
 lately-0.073
Token nice
Token ID3621
Feature activation+1.47e+00

Pos logit contributions
ls+0.171
 cor+0.163
l+0.161
ira+0.139
 slit+0.136
Neg logit contributions
 Clean-0.109
 lately-0.102
 including-0.096
apping-0.094
 then-0.093
Token?"
Token ID1701
Feature activation+2.13e-01

Pos logit contributions
ls+0.032
l+0.032
 cor+0.029
ah+0.024
osh+0.024
Neg logit contributions
 Clean-0.034
elt-0.032
itted-0.031
side-0.031
amed-0.031
Token
Token ID198
Feature activation-2.97e-01

Pos logit contributions
l+0.006
ls+0.006
 cor+0.005
ia+0.004
ira+0.004
Neg logit contributions
 Clean-0.004
elt-0.004
 lately-0.004
amed-0.004
itted-0.003
Token
Token ID198
Feature activation-9.34e-01

Pos logit contributions
 Clean+0.004
elt+0.004
side+0.004
 lately+0.004
apping+0.004
Neg logit contributions
ls-0.009
l-0.009
 cor-0.009
 crocod-0.007
 beast-0.007
TokenJ
Token ID41
Feature activation+2.75e+00

Pos logit contributions
 Clean+0.021
elt+0.021
 lately+0.020
itted+0.019
apping+0.019
Neg logit contributions
l-0.021
ls-0.021
 cor-0.019
ia-0.016
osh-0.015
Token dry
Token ID5894
Feature activation-4.47e-01

Pos logit contributions
ls+0.027
l+0.026
 cor+0.025
 crocod+0.020
osh+0.020
Neg logit contributions
 Clean-0.024
elt-0.023
 lately-0.023
apping-0.022
itted-0.022
Token and
Token ID290
Feature activation+1.04e+00

Pos logit contributions
 Clean+0.009
elt+0.009
 Working+0.009
uggets+0.009
apping+0.009
Neg logit contributions
l-0.011
ls-0.011
 cor-0.009
ches-0.008
 crocod-0.008
Token it
Token ID340
Feature activation+1.75e+00

Pos logit contributions
ls+0.025
l+0.024
 cor+0.023
 crocod+0.019
osh+0.019
Neg logit contributions
 Clean-0.022
elt-0.021
apping-0.020
 lately-0.020
itted-0.020
Token didn
Token ID1422
Feature activation+3.98e+00

Pos logit contributions
ls+0.043
 cor+0.042
l+0.041
 AW+0.035
 beast+0.035
Neg logit contributions
 Clean-0.035
 then-0.032
amed-0.031
side-0.031
 cleaning-0.031
Token't
Token ID470
Feature activation+2.30e+00

Pos logit contributions
ls+0.106
l+0.103
 cor+0.100
 crocod+0.083
 beast+0.082
Neg logit contributions
 Clean-0.073
elt-0.068
 lately-0.068
side-0.066
apping-0.065
Token smell
Token ID8508
Feature activation+6.06e+00

Pos logit contributions
ls+0.047
l+0.046
 cor+0.043
 crocod+0.034
ah+0.034
Neg logit contributions
 Clean-0.056
elt-0.055
 lately-0.053
side-0.053
apping-0.053
Token bad
Token ID2089
Feature activation+9.30e-01

Pos logit contributions
ls+0.146
l+0.140
 cor+0.134
 crocod+0.110
 beast+0.110
Neg logit contributions
 Clean-0.128
 lately-0.121
elt-0.120
side-0.117
apping-0.115
Token at
Token ID379
Feature activation+1.25e+00

Pos logit contributions
ls+0.020
l+0.019
 cor+0.018
 crocod+0.014
ia+0.014
Neg logit contributions
 Clean-0.022
amed-0.021
elt-0.021
side-0.021
apping-0.020
Token all
Token ID477
Feature activation-1.71e-01

Pos logit contributions
ls+0.032
l+0.032
 cor+0.030
 crocod+0.025
ia+0.025
Neg logit contributions
 Clean-0.024
elt-0.023
 lately-0.022
apping-0.022
amed-0.022
Token.
Token ID13
Feature activation-4.49e-01

Pos logit contributions
 Clean+0.004
 lately+0.004
elt+0.003
side+0.003
apping+0.003
Neg logit contributions
ls-0.004
l-0.004
 cor-0.004
 crocod-0.003
osh-0.003
Token Sus
Token ID8932
Feature activation+1.29e+00

Pos logit contributions
apping+0.008
amed+0.008
uggets+0.007
elt+0.007
 Clean+0.007
Neg logit contributions
l-0.012
ls-0.011
 cor-0.010
ira-0.009
ia-0.009
Token on
Token ID319
Feature activation+2.39e+00

Pos logit contributions
ls+0.032
l+0.031
 cor+0.028
ah+0.024
 crocod+0.024
Neg logit contributions
 Clean-0.022
elt-0.021
 Working-0.020
apping-0.020
wards-0.020
Token because
Token ID780
Feature activation+3.73e-01

Pos logit contributions
ls+0.049
l+0.048
 cor+0.045
 crocod+0.036
 beast+0.035
Neg logit contributions
 Clean-0.058
elt-0.057
 lately-0.055
apping-0.055
itted-0.054
Token it
Token ID340
Feature activation+1.92e+00

Pos logit contributions
ls+0.007
l+0.007
 cor+0.006
 crocod+0.005
osh+0.005
Neg logit contributions
 Clean-0.010
elt-0.009
amed-0.009
apping-0.009
itted-0.009
Token didn
Token ID1422
Feature activation+4.56e+00

Pos logit contributions
ls+0.050
 cor+0.048
l+0.048
 AW+0.043
 troll+0.041
Neg logit contributions
 Clean-0.037
elt-0.033
 cleaning-0.033
 then-0.032
 soaking-0.032
Token't
Token ID470
Feature activation+2.18e+00

Pos logit contributions
ls+0.134
 cor+0.131
l+0.128
 troll+0.111
 crocod+0.111
Neg logit contributions
 Clean-0.070
 lately-0.063
side-0.061
 Working-0.061
oned-0.061
Token smell
Token ID8508
Feature activation+6.05e+00

Pos logit contributions
ls+0.055
l+0.052
 cor+0.051
 beast+0.042
 crocod+0.042
Neg logit contributions
 Clean-0.044
 lately-0.042
elt-0.041
itted-0.039
side-0.039
Token good
Token ID922
Feature activation+1.53e-01

Pos logit contributions
ls+0.132
l+0.124
 cor+0.120
ira+0.100
osh+0.098
Neg logit contributions
 Clean-0.140
 lately-0.138
elt-0.129
 including-0.128
side-0.127
Token.
Token ID13
Feature activation+3.77e-01

Pos logit contributions
ls+0.003
l+0.003
 cor+0.003
 crocod+0.002
 happens+0.002
Neg logit contributions
 Clean-0.004
amed-0.004
side-0.004
 Working-0.004
apping-0.004
Token But
Token ID887
Feature activation-7.61e-01

Pos logit contributions
ls+0.011
l+0.011
 cor+0.010
 crocod+0.009
ah+0.009
Neg logit contributions
 Clean-0.006
elt-0.006
 lately-0.006
apping-0.005
amed-0.005
Token his
Token ID465
Feature activation-5.83e-01

Pos logit contributions
amed+0.019
apping+0.019
itted+0.019
elt+0.018
 Clean+0.018
Neg logit contributions
ls-0.015
l-0.015
 cor-0.014
 crocod-0.012
ah-0.011
Token mom
Token ID1995
Feature activation-1.96e+00

Pos logit contributions
 Clean+0.013
elt+0.013
 lately+0.013
side+0.012
apping+0.012
Neg logit contributions
ls-0.013
l-0.013
 cor-0.012
 crocod-0.010
ia-0.010
Tokeno
Token ID78
Feature activation+2.72e+00

Pos logit contributions
ls+0.057
 cor+0.056
l+0.054
 slit+0.049
 AW+0.047
Neg logit contributions
elt-0.030
 Clean-0.029
apping-0.028
itted-0.027
 lately-0.027
Token,
Token ID11
Feature activation+9.41e-01

Pos logit contributions
ls+0.074
 cor+0.070
l+0.070
 slit+0.059
ira+0.058
Neg logit contributions
 Clean-0.048
 lately-0.045
elt-0.044
apping-0.043
side-0.041
Token bird
Token ID6512
Feature activation+1.07e+00

Pos logit contributions
ls+0.026
l+0.024
 cor+0.024
 AW+0.021
osh+0.020
Neg logit contributions
 Clean-0.016
 lately-0.016
 then-0.016
 cleaning-0.015
elt-0.015
Token.
Token ID13
Feature activation-3.34e-01

Pos logit contributions
ls+0.029
 cor+0.028
l+0.027
 AW+0.024
 slit+0.023
Neg logit contributions
 Clean-0.018
 lately-0.017
apping-0.016
ful-0.016
elt-0.016
Token Does
Token ID8314
Feature activation+4.13e+00

Pos logit contributions
 Clean+0.005
side+0.005
amed+0.005
 lately+0.005
elt+0.005
Neg logit contributions
ls-0.009
l-0.009
 cor-0.009
 crocod-0.008
osh-0.008
Token it
Token ID340
Feature activation+5.99e+00

Pos logit contributions
ls+0.106
 cor+0.104
l+0.101
 AW+0.090
ah+0.088
Neg logit contributions
 Clean-0.071
 lately-0.071
 then-0.064
elt-0.064
 cleaning-0.062
Token hurt
Token ID5938
Feature activation+3.15e+00

Pos logit contributions
ls+0.141
 cor+0.140
l+0.126
 troll+0.121
 beast+0.116
Neg logit contributions
elt-0.120
 Clean-0.115
 then-0.113
 including-0.108
 cleaning-0.106
Token?"
Token ID1701
Feature activation+1.21e+00

Pos logit contributions
ls+0.071
l+0.070
 cor+0.065
 crocod+0.053
osh+0.052
Neg logit contributions
 Clean-0.070
elt-0.067
 lately-0.066
side-0.066
itted-0.065
Token
Token ID198
Feature activation+1.31e+00

Pos logit contributions
ls+0.036
l+0.035
 cor+0.034
 crocod+0.029
 beast+0.029
Neg logit contributions
 Clean-0.018
elt-0.017
 lately-0.017
side-0.016
apping-0.016
Token
Token ID198
Feature activation+4.33e-01

Pos logit contributions
ls+0.045
 cor+0.044
l+0.042
 AW+0.040
 beast+0.040
Neg logit contributions
 Clean-0.013
 Then-0.012
elt-0.011
apping-0.011
 then-0.010
TokenThe
Token ID464
Feature activation+1.06e+00

Pos logit contributions
ls+0.012
l+0.012
 cor+0.011
 beast+0.010
 AW+0.010
Neg logit contributions
 Clean-0.007
elt-0.007
side-0.007
 lately-0.007
apping-0.007
Token day
Token ID1110
Feature activation-2.28e+00

Pos logit contributions
elt+0.049
 Clean+0.048
itted+0.046
amed+0.046
 lately+0.045
Neg logit contributions
ls-0.052
l-0.052
 cor-0.051
 crocod-0.042
 beast-0.042
Token.
Token ID13
Feature activation-1.31e+00

Pos logit contributions
elt+0.051
 Clean+0.051
amed+0.050
itted+0.049
oned+0.048
Neg logit contributions
ls-0.048
l-0.047
 cor-0.045
 crocod-0.036
osh-0.036
Token Doesn
Token ID28048
Feature activation+7.48e-01

Pos logit contributions
amed+0.027
side+0.026
itted+0.026
 lately+0.025
elt+0.024
Neg logit contributions
l-0.028
 cor-0.027
ls-0.026
 Ro-0.024
osh-0.024
Token't
Token ID470
Feature activation+3.40e+00

Pos logit contributions
ls+0.021
 cor+0.020
l+0.020
 crocod+0.017
 troll+0.016
Neg logit contributions
 Clean-0.013
elt-0.012
 lately-0.012
 Working-0.012
side-0.011
Token that
Token ID326
Feature activation+4.34e+00

Pos logit contributions
ls+0.091
l+0.089
 cor+0.085
 AW+0.071
 crocod+0.071
Neg logit contributions
 Clean-0.063
 lately-0.059
elt-0.058
side-0.056
apping-0.056
Token sound
Token ID2128
Feature activation+5.99e+00

Pos logit contributions
ls+0.108
l+0.100
 cor+0.100
ira+0.081
 troll+0.081
Neg logit contributions
 Clean-0.092
 lately-0.084
 then-0.083
 birthday-0.083
 including-0.080
Token special
Token ID2041
Feature activation+7.38e-01

Pos logit contributions
ls+0.161
l+0.154
 cor+0.152
 crocod+0.127
 AW+0.125
Neg logit contributions
 Clean-0.110
 lately-0.102
elt-0.099
apping-0.097
 Working-0.096
Token?".
Token ID43634
Feature activation+4.65e-02

Pos logit contributions
ls+0.017
l+0.017
 cor+0.016
 seems+0.013
osh+0.013
Neg logit contributions
 Clean-0.015
elt-0.015
itted-0.015
side-0.015
amed-0.015
Token
Token ID220
Feature activation+1.27e+00

Pos logit contributions
ls+0.001
l+0.001
 cor+0.001
 crocod+0.001
ah+0.001
Neg logit contributions
 Clean-0.001
elt-0.001
 lately-0.001
amed-0.001
side-0.001
Token
Token ID198
Feature activation-7.31e-01

Pos logit contributions
ls+0.041
l+0.040
 cor+0.039
 crocod+0.034
ah+0.034
Neg logit contributions
 Clean-0.017
elt-0.015
apping-0.015
 Working-0.014
itted-0.014
Token
Token ID198
Feature activation-1.34e+00

Pos logit contributions
 Clean+0.009
 Working+0.008
side+0.008
elt+0.008
wards+0.008
Neg logit contributions
ls-0.024
l-0.023
 cor-0.023
 AW-0.020
 crocod-0.020
Token Mia
Token ID32189
Feature activation+6.26e-01

Pos logit contributions
ls+0.114
l+0.108
inkle+0.107
 AW+0.106
 cor+0.104
Neg logit contributions
 Clean-0.054
 then-0.053
 Then-0.049
days-0.048
 cleaning-0.048
Token do
Token ID466
Feature activation+1.99e+00

Pos logit contributions
ls+0.014
l+0.014
 cor+0.013
 crocod+0.011
osh+0.011
Neg logit contributions
 Clean-0.014
elt-0.013
 lately-0.013
side-0.013
apping-0.013
Token not
Token ID407
Feature activation+1.74e+00

Pos logit contributions
ls+0.048
l+0.048
 cor+0.044
ia+0.036
 crocod+0.036
Neg logit contributions
elt-0.041
 Clean-0.041
 lately-0.039
itted-0.039
amed-0.038
Token like
Token ID588
Feature activation+1.60e+00

Pos logit contributions
ls+0.045
l+0.045
 cor+0.042
 crocod+0.036
ah+0.035
Neg logit contributions
 Clean-0.033
elt-0.032
side-0.031
 lately-0.031
amed-0.030
Token the
Token ID262
Feature activation+2.88e+00

Pos logit contributions
ls+0.040
l+0.039
 cor+0.037
 AW+0.033
ah+0.033
Neg logit contributions
 Clean-0.032
 lately-0.030
elt-0.029
 cleaning-0.029
 organizing-0.029
Token smell
Token ID8508
Feature activation+5.98e+00

Pos logit contributions
ls+0.068
l+0.067
 cor+0.063
 crocod+0.052
 beast+0.051
Neg logit contributions
 Clean-0.062
elt-0.059
 lately-0.058
side-0.057
apping-0.056
Token.
Token ID13
Feature activation+1.48e+00

Pos logit contributions
 cor+0.165
ls+0.160
l+0.153
 AW+0.149
 beast+0.145
Neg logit contributions
 Clean-0.086
 lately-0.082
apping-0.080
 then-0.076
 cleaning-0.074
Token They
Token ID1119
Feature activation+4.93e-01

Pos logit contributions
ls+0.052
l+0.051
 cor+0.050
 crocod+0.044
ia+0.044
Neg logit contributions
 Clean-0.014
elt-0.013
 lately-0.013
amed-0.012
side-0.012
Token cover
Token ID3002
Feature activation+9.45e-01

Pos logit contributions
ls+0.012
l+0.011
 cor+0.011
 beast+0.009
 crocod+0.009
Neg logit contributions
 Clean-0.010
 lately-0.009
elt-0.009
 then-0.009
oned-0.009
Token their
Token ID511
Feature activation-6.07e-01

Pos logit contributions
ls+0.025
 cor+0.024
l+0.024
 AW+0.023
 beast+0.022
Neg logit contributions
 themselves-0.016
 Clean-0.016
 then-0.015
 Then-0.015
uggets-0.015
Token noses
Token ID41760
Feature activation+1.15e+00

Pos logit contributions
 Clean+0.011
elt+0.010
 lately+0.010
itted+0.010
side+0.010
Neg logit contributions
ls-0.016
l-0.016
 cor-0.015
ah-0.013
osh-0.013
Token it
Token ID340
Feature activation+1.31e+00

Pos logit contributions
 Clean+0.001
 lately+0.001
elt+0.001
apping+0.001
side+0.001
Neg logit contributions
ls-0.002
l-0.002
 cor-0.002
 AW-0.001
osh-0.001
Token makes
Token ID1838
Feature activation+2.06e+00

Pos logit contributions
l+0.032
ls+0.031
 cor+0.028
ira+0.027
ia+0.026
Neg logit contributions
 Clean-0.025
 then-0.023
 together-0.022
 Always-0.021
 cleaning-0.021
Token me
Token ID502
Feature activation-2.30e-02

Pos logit contributions
ls+0.059
l+0.056
 cor+0.053
ira+0.050
ah+0.050
Neg logit contributions
 Clean-0.034
 then-0.031
 so-0.030
 cleaning-0.030
 themselves-0.030
Token happy
Token ID3772
Feature activation-4.89e-01

Pos logit contributions
 Clean+0.000
apping+0.000
 lately+0.000
elt+0.000
uggets+0.000
Neg logit contributions
ls-0.001
l-0.001
 cor-0.001
 AW-0.001
ira-0.001
Token!"
Token ID2474
Feature activation+4.58e-01

Pos logit contributions
 Clean+0.012
elt+0.011
 lately+0.011
side+0.011
amed+0.011
Neg logit contributions
ls-0.010
l-0.010
 cor-0.009
 crocod-0.007
 beast-0.007
Token Fro
Token ID9734
Feature activation+5.96e+00

Pos logit contributions
ls+0.012
l+0.012
 cor+0.011
 crocod+0.009
ah+0.009
Neg logit contributions
 Clean-0.009
 lately-0.008
side-0.008
amed-0.008
elt-0.008
Tokenggy
Token ID19970
Feature activation+1.70e+00

Pos logit contributions
ls+0.102
l+0.099
 cor+0.098
osh+0.081
 works+0.079
Neg logit contributions
apping-0.163
elt-0.150
 Clean-0.147
amed-0.145
uggets-0.141
Token replied
Token ID8712
Feature activation+8.71e-01

Pos logit contributions
ls+0.042
l+0.039
 cor+0.037
 beast+0.033
 AW+0.031
Neg logit contributions
 Clean-0.036
 then-0.035
elt-0.035
 Then-0.033
amed-0.032
Token,
Token ID11
Feature activation-1.04e+00

Pos logit contributions
ls+0.022
l+0.022
 cor+0.021
 crocod+0.017
ia+0.017
Neg logit contributions
 Clean-0.017
elt-0.016
 lately-0.016
side-0.016
itted-0.015
Token "
Token ID366
Feature activation-2.56e+00

Pos logit contributions
 Clean+0.019
 lately+0.018
elt+0.018
amed+0.017
side+0.017
Neg logit contributions
l-0.027
ls-0.027
 cor-0.026
 crocod-0.022
 slit-0.022
TokenI
Token ID40
Feature activation-2.35e+00

Pos logit contributions
 Clean+0.039
 lately+0.036
elt+0.035
apping+0.035
 Working+0.034
Neg logit contributions
l-0.074
ls-0.073
 cor-0.071
ah-0.062
osh-0.062
Token frowned
Token ID32198
Feature activation-1.06e+00

Pos logit contributions
ls+0.012
l+0.011
 cor+0.011
 AW+0.009
 crocod+0.009
Neg logit contributions
 Clean-0.012
elt-0.011
 lately-0.011
amed-0.010
 then-0.010
Token.
Token ID13
Feature activation-7.63e-01

Pos logit contributions
 Clean+0.018
 Working+0.018
keeping+0.017
side+0.017
oned+0.017
Neg logit contributions
ls-0.028
l-0.028
 cor-0.027
 crocod-0.023
ah-0.023
Token Something
Token ID13742
Feature activation+1.30e+00

Pos logit contributions
 Clean+0.013
 lately+0.012
elt+0.012
side+0.012
amed+0.012
Neg logit contributions
ls-0.021
l-0.021
 cor-0.020
 crocod-0.017
 beast-0.017
Token didn
Token ID1422
Feature activation+3.08e+00

Pos logit contributions
ls+0.028
l+0.027
 cor+0.026
 crocod+0.020
ah+0.020
Neg logit contributions
 Clean-0.031
elt-0.029
 lately-0.029
side-0.028
apping-0.028
Token't
Token ID470
Feature activation+2.20e+00

Pos logit contributions
ls+0.086
l+0.083
 cor+0.083
 crocod+0.070
ah+0.068
Neg logit contributions
 Clean-0.052
 lately-0.047
elt-0.046
 Working-0.046
oned-0.046
Token smell
Token ID8508
Feature activation+5.94e+00

Pos logit contributions
ls+0.055
l+0.052
 cor+0.050
 crocod+0.041
 beast+0.041
Neg logit contributions
 Clean-0.045
 lately-0.043
elt-0.042
apping-0.041
side-0.041
Token right
Token ID826
Feature activation+6.52e-01

Pos logit contributions
ls+0.138
l+0.129
 cor+0.125
ira+0.109
osh+0.107
Neg logit contributions
 Clean-0.128
 lately-0.126
 including-0.119
 cleaning-0.117
elt-0.115
Token.
Token ID13
Feature activation-3.48e-01

Pos logit contributions
ls+0.014
l+0.014
 cor+0.013
 crocod+0.010
ah+0.010
Neg logit contributions
 Clean-0.015
elt-0.015
 lately-0.014
apping-0.014
side-0.014
Token It
Token ID632
Feature activation+1.59e+00

Pos logit contributions
 Clean+0.006
 lately+0.006
elt+0.006
side+0.005
amed+0.005
Neg logit contributions
ls-0.010
l-0.009
 cor-0.009
 crocod-0.008
 beast-0.008
Token was
Token ID373
Feature activation-6.70e-01

Pos logit contributions
ls+0.049
 cor+0.046
l+0.045
 AW+0.043
ira+0.042
Neg logit contributions
 then-0.022
 Clean-0.022
 so-0.019
 sometimes-0.018
 cleaning-0.018
Token sm
Token ID895
Feature activation-1.59e+00

Pos logit contributions
 Clean+0.014
elt+0.013
 lately+0.013
side+0.012
apping+0.012
Neg logit contributions
ls-0.017
l-0.016
 cor-0.015
 AW-0.013
osh-0.013
TokenThe
Token ID464
Feature activation+3.66e-01

Pos logit contributions
ls+0.010
l+0.009
 cor+0.009
 crocod+0.008
 beast+0.008
Neg logit contributions
 Clean-0.005
elt-0.004
apping-0.004
 lately-0.004
amed-0.004
Token lizard
Token ID42406
Feature activation+2.00e+00

Pos logit contributions
ls+0.006
l+0.006
 cor+0.005
 AW+0.005
osh+0.004
Neg logit contributions
 Clean-0.010
 then-0.010
 lately-0.010
apping-0.009
 birthday-0.009
Token does
Token ID857
Feature activation+3.69e+00

Pos logit contributions
l+0.039
 cor+0.037
ls+0.036
 AW+0.035
ah+0.032
Neg logit contributions
 then-0.044
 Clean-0.041
 Then-0.040
 together-0.038
elt-0.037
Token not
Token ID407
Feature activation+2.94e+00

Pos logit contributions
ls+0.117
 cor+0.110
l+0.109
 beast+0.099
ah+0.099
Neg logit contributions
 Clean-0.047
 then-0.044
 lately-0.042
apping-0.042
oned-0.041
Token his
Token ID465
Feature activation+1.07e+00

Pos logit contributions
ls+0.091
 cor+0.085
l+0.083
 AW+0.076
 beast+0.075
Neg logit contributions
 Clean-0.038
 lately-0.037
 including-0.035
 then-0.034
apping-0.034
Tokenss
Token ID824
Feature activation+5.92e+00

Pos logit contributions
ls+0.027
l+0.026
 cor+0.025
 AW+0.022
osh+0.022
Neg logit contributions
 birthday-0.022
 Clean-0.021
 anniversary-0.020
 lately-0.019
apping-0.019
Token anymore
Token ID7471
Feature activation+3.28e-01

Pos logit contributions
 cor+0.135
ls+0.130
 slit+0.121
 crocod+0.118
 swarm+0.115
Neg logit contributions
itted-0.108
elt-0.104
 She-0.102
 Fighting-0.099
ful-0.096
Token.
Token ID13
Feature activation+1.72e+00

Pos logit contributions
ls+0.006
l+0.006
 cor+0.006
ia+0.005
 crocod+0.004
Neg logit contributions
elt-0.008
 Clean-0.008
amed-0.008
side-0.008
itted-0.008
Token It
Token ID632
Feature activation+3.02e+00

Pos logit contributions
ls+0.063
 cor+0.061
l+0.060
ches+0.052
 beast+0.052
Neg logit contributions
 Then-0.017
 Clean-0.016
elt-0.014
 Now-0.013
 Fighting-0.013
Token is
Token ID318
Feature activation+5.89e-01

Pos logit contributions
ls+0.059
l+0.058
 cor+0.056
 AW+0.054
 Entry+0.052
Neg logit contributions
 then-0.065
 Then-0.058
 Clean-0.057
 sometimes-0.056
 Always-0.054
Token quiet
Token ID5897
Feature activation-6.04e-02

Pos logit contributions
ls+0.016
l+0.016
 cor+0.015
 AW+0.013
osh+0.013
Neg logit contributions
 Clean-0.010
 lately-0.010
 then-0.009
amed-0.009
 including-0.009
Token not
Token ID407
Feature activation+2.67e+00

Pos logit contributions
ls+0.129
 cor+0.121
l+0.120
ah+0.114
 beast+0.110
Neg logit contributions
 then-0.034
 Clean-0.030
 including-0.029
 cleaning-0.028
 themselves-0.028
Token move
Token ID1445
Feature activation+3.69e+00

Pos logit contributions
ls+0.083
l+0.075
 cor+0.073
 AW+0.066
 beast+0.066
Neg logit contributions
 Clean-0.038
 including-0.036
 lately-0.035
elt-0.034
 birthday-0.033
Token or
Token ID393
Feature activation+3.03e+00

Pos logit contributions
ls+0.098
 cor+0.097
l+0.093
ira+0.085
 beast+0.083
Neg logit contributions
 Clean-0.061
 lately-0.060
elt-0.055
apping-0.053
 including-0.053
Token make
Token ID787
Feature activation+5.06e+00

Pos logit contributions
ls+0.055
l+0.053
 cor+0.049
ah+0.038
 beast+0.037
Neg logit contributions
 Clean-0.081
elt-0.077
 lately-0.076
side-0.074
apping-0.074
Token a
Token ID257
Feature activation+4.70e+00

Pos logit contributions
ls+0.136
 cor+0.133
l+0.131
ah+0.122
 AW+0.117
Neg logit contributions
 Clean-0.084
 themselves-0.081
 birthday-0.078
 Then-0.076
 cleaning-0.073
Token sound
Token ID2128
Feature activation+5.91e+00

Pos logit contributions
ls+0.098
l+0.093
 cor+0.088
osh+0.078
ches+0.074
Neg logit contributions
 Clean-0.105
 birthday-0.099
elt-0.097
apping-0.095
 cleaning-0.094
Token.
Token ID13
Feature activation+2.20e+00

Pos logit contributions
ls+0.148
l+0.139
 cor+0.138
 AW+0.116
ira+0.114
Neg logit contributions
 Clean-0.118
 lately-0.110
apping-0.108
elt-0.107
 including-0.104
Token
Token ID198
Feature activation+1.74e+00

Pos logit contributions
ls+0.066
l+0.064
 cor+0.062
 crocod+0.053
ah+0.053
Neg logit contributions
 Clean-0.033
elt-0.031
 lately-0.030
side-0.029
apping-0.029
Token
Token ID198
Feature activation+8.25e-01

Pos logit contributions
ls+0.055
 cor+0.053
l+0.053
 AW+0.047
ah+0.047
Neg logit contributions
 Clean-0.023
elt-0.020
 Working-0.019
apping-0.019
 lately-0.018
Token"
Token ID1
Feature activation-4.09e-01

Pos logit contributions
ls+0.022
l+0.022
 cor+0.020
 crocod+0.017
ia+0.017
Neg logit contributions
 Clean-0.015
elt-0.015
 lately-0.015
itted-0.014
side-0.014
TokenI
Token ID40
Feature activation-1.31e+00

Pos logit contributions
elt+0.005
 Clean+0.005
Playing+0.005
amed+0.005
 Fighting+0.005
Neg logit contributions
ls-0.013
l-0.012
 cor-0.012
ches-0.011
ia-0.011
Token it
Token ID340
Feature activation+2.50e+00

Pos logit contributions
ls+0.022
l+0.021
 cor+0.020
ah+0.016
 AW+0.016
Neg logit contributions
 Clean-0.020
 lately-0.019
elt-0.019
side-0.018
itted-0.018
Token because
Token ID780
Feature activation+9.44e-01

Pos logit contributions
ls+0.083
l+0.080
 cor+0.080
 crocod+0.069
osh+0.069
Neg logit contributions
 Clean-0.030
 lately-0.028
elt-0.026
side-0.025
 including-0.024
Token it
Token ID340
Feature activation+2.10e+00

Pos logit contributions
ls+0.022
l+0.022
 cor+0.020
 crocod+0.017
osh+0.016
Neg logit contributions
 Clean-0.020
elt-0.020
apping-0.019
amed-0.019
itted-0.019
Token didn
Token ID1422
Feature activation+4.53e+00

Pos logit contributions
ls+0.055
 cor+0.054
l+0.052
 AW+0.048
haw+0.046
Neg logit contributions
 Clean-0.038
 cleaning-0.034
 lately-0.033
 then-0.033
elt-0.032
Token't
Token ID470
Feature activation+2.13e+00

Pos logit contributions
ls+0.133
 cor+0.131
l+0.127
 crocod+0.112
 troll+0.111
Neg logit contributions
 Clean-0.068
 lately-0.060
oned-0.060
 Working-0.059
side-0.058
Token smell
Token ID8508
Feature activation+5.91e+00

Pos logit contributions
ls+0.052
l+0.049
 cor+0.048
 beast+0.039
 crocod+0.038
Neg logit contributions
 Clean-0.045
 lately-0.043
elt-0.042
itted-0.041
apping-0.040
Token like
Token ID588
Feature activation+1.04e+00

Pos logit contributions
ls+0.125
l+0.118
 cor+0.114
osh+0.095
 beast+0.094
Neg logit contributions
 Clean-0.138
 lately-0.136
elt-0.128
 including-0.127
itted-0.125
Token a
Token ID257
Feature activation+3.27e+00

Pos logit contributions
ls+0.025
l+0.024
 cor+0.023
 crocod+0.019
 beast+0.019
Neg logit contributions
 Clean-0.022
elt-0.021
 lately-0.020
apping-0.020
amed-0.020
Token real
Token ID1103
Feature activation-1.46e-01

Pos logit contributions
ls+0.067
l+0.064
 cor+0.060
osh+0.050
ah+0.048
Neg logit contributions
 Clean-0.079
elt-0.074
 lately-0.074
side-0.073
apping-0.072
Token plant
Token ID4618
Feature activation+2.67e+00

Pos logit contributions
 Clean+0.003
elt+0.003
 lately+0.003
side+0.003
apping+0.003
Neg logit contributions
ls-0.003
l-0.003
 cor-0.003
osh-0.002
 beast-0.002
Token.
Token ID13
Feature activation+4.07e-01

Pos logit contributions
ls+0.062
l+0.060
 cor+0.057
 crocod+0.046
osh+0.046
Neg logit contributions
 Clean-0.058
elt-0.055
 lately-0.055
side-0.053
apping-0.053
Token.
Token ID13
Feature activation+6.83e-01

Pos logit contributions
ls+0.014
l+0.014
 cor+0.013
ia+0.010
 crocod+0.010
Neg logit contributions
 Clean-0.019
elt-0.019
amed-0.019
side-0.019
oned-0.019
Token The
Token ID383
Feature activation+5.01e-01

Pos logit contributions
ls+0.017
l+0.017
 cor+0.016
 crocod+0.014
ia+0.013
Neg logit contributions
 Clean-0.013
elt-0.013
 lately-0.012
amed-0.012
side-0.012
Token candy
Token ID18550
Feature activation+3.60e-01

Pos logit contributions
ls+0.012
l+0.011
 cor+0.010
osh+0.009
 AW+0.009
Neg logit contributions
 Clean-0.011
 lately-0.011
elt-0.011
side-0.010
 cleaning-0.010
Token did
Token ID750
Feature activation+3.74e+00

Pos logit contributions
ls+0.008
l+0.008
 cor+0.008
 AW+0.006
ah+0.006
Neg logit contributions
 Clean-0.008
side-0.007
 lately-0.007
elt-0.007
 then-0.007
Token not
Token ID407
Feature activation+2.66e+00

Pos logit contributions
ls+0.129
 cor+0.124
l+0.121
ah+0.115
 beast+0.112
Neg logit contributions
 then-0.035
 Clean-0.032
 including-0.030
 cleaning-0.029
 lately-0.028
Token taste
Token ID6938
Feature activation+5.91e+00

Pos logit contributions
ls+0.073
l+0.068
 cor+0.065
 AW+0.057
 beast+0.056
Neg logit contributions
 Clean-0.048
 lately-0.047
 including-0.046
 birthday-0.044
elt-0.043
Token good
Token ID922
Feature activation+1.97e-01

Pos logit contributions
ls+0.146
l+0.141
 cor+0.139
 slit+0.131
 beast+0.129
Neg logit contributions
 lately-0.109
 Clean-0.099
 including-0.098
 cleaning-0.096
 soaking-0.094
Token anymore
Token ID7471
Feature activation+5.63e-01

Pos logit contributions
ls+0.004
l+0.004
 cor+0.004
 crocod+0.003
 AW+0.003
Neg logit contributions
 Clean-0.005
amed-0.005
elt-0.004
side-0.004
apping-0.004
Token.
Token ID13
Feature activation+1.35e+00

Pos logit contributions
ls+0.012
l+0.012
 cor+0.011
 crocod+0.009
ia+0.008
Neg logit contributions
 Clean-0.013
elt-0.013
amed-0.013
 lately-0.013
itted-0.013
Token It
Token ID632
Feature activation+2.48e+00

Pos logit contributions
ls+0.039
l+0.038
 cor+0.037
 crocod+0.031
 beast+0.031
Neg logit contributions
 Clean-0.022
elt-0.021
 lately-0.020
side-0.020
apping-0.019
Token tasted
Token ID29187
Feature activation+2.56e+00

Pos logit contributions
ls+0.070
l+0.067
 AW+0.065
 cor+0.063
 Entry+0.060
Neg logit contributions
 then-0.041
 Clean-0.032
 sometimes-0.031
 so-0.029
 Then-0.028
Token,
Token ID11
Feature activation-4.87e-01

Pos logit contributions
 Clean+0.001
 then+0.000
apping+0.000
 birthday+0.000
 together+0.000
Neg logit contributions
l-0.002
 cor-0.002
ls-0.002
 slit-0.002
 beast-0.002
Token Tom
Token ID4186
Feature activation+2.21e+00

Pos logit contributions
 Clean+0.008
 birthday+0.007
 then+0.007
 lately+0.007
elt+0.007
Neg logit contributions
ls-0.014
l-0.014
 cor-0.013
ches-0.012
 AW-0.011
Token!
Token ID0
Feature activation-1.44e+00

Pos logit contributions
ls+0.080
l+0.079
 cor+0.077
 AW+0.070
RO+0.069
Neg logit contributions
elt-0.014
 Clean-0.014
 then-0.013
 together-0.013
apping-0.013
Token Isn
Token ID25110
Feature activation+2.62e-01

Pos logit contributions
 Clean+0.021
elt+0.020
 lately+0.019
apping+0.018
side+0.018
Neg logit contributions
ls-0.044
l-0.043
 cor-0.041
 crocod-0.035
 beast-0.035
Token't
Token ID470
Feature activation+4.10e+00

Pos logit contributions
 cor+0.008
ls+0.008
l+0.008
 troll+0.007
 beast+0.007
Neg logit contributions
 Clean-0.004
 Working-0.003
elt-0.003
Playing-0.003
 then-0.003
Token it
Token ID340
Feature activation+5.90e+00

Pos logit contributions
ls+0.109
l+0.105
 cor+0.100
ches+0.091
 AW+0.090
Neg logit contributions
 Clean-0.073
 including-0.069
 lately-0.069
elt-0.068
 then-0.065
Token cool
Token ID3608
Feature activation+2.59e+00

Pos logit contributions
ls+0.155
 cor+0.155
l+0.150
 AW+0.137
ira+0.137
Neg logit contributions
 birthday-0.113
 so-0.111
 today-0.107
 together-0.103
 sharing-0.101
Token?
Token ID30
Feature activation+1.27e-01

Pos logit contributions
ls+0.055
l+0.054
 cor+0.051
 crocod+0.040
 beast+0.040
Neg logit contributions
 Clean-0.061
elt-0.059
 lately-0.058
side-0.057
apping-0.057
Token Let
Token ID3914
Feature activation+1.82e+00

Pos logit contributions
l+0.003
ls+0.003
 cor+0.003
 crocod+0.003
 Ro+0.003
Neg logit contributions
side-0.003
amed-0.003
aming-0.002
 indoors-0.002
oned-0.002
Token's
Token ID338
Feature activation-1.45e+00

Pos logit contributions
 cor+0.055
ls+0.055
 troll+0.052
l+0.052
 crocod+0.049
Neg logit contributions
 Clean-0.026
 then-0.026
 so-0.023
 outside-0.021
amed-0.020
Token catch
Token ID4929
Feature activation+2.26e+00

Pos logit contributions
elt+0.037
side+0.035
itted+0.035
 Clean+0.035
 lately+0.035
Neg logit contributions
 cor-0.027
ls-0.027
l-0.027
 crocod-0.021
osh-0.021
Token Tim
Token ID5045
Feature activation+1.44e+00

Pos logit contributions
ls+0.027
l+0.027
 cor+0.025
 crocod+0.022
ia+0.021
Neg logit contributions
 Clean-0.023
side-0.023
amed-0.023
 lately-0.023
elt-0.022
Token did
Token ID750
Feature activation+1.96e+00

Pos logit contributions
ls+0.035
l+0.033
 cor+0.032
 AW+0.028
 crocod+0.026
Neg logit contributions
 Clean-0.030
elt-0.028
 then-0.028
amed-0.028
 lately-0.027
Token not
Token ID407
Feature activation+1.51e+00

Pos logit contributions
ls+0.056
l+0.054
 cor+0.053
ah+0.045
 crocod+0.045
Neg logit contributions
 Clean-0.033
 lately-0.030
side-0.029
 Working-0.028
elt-0.028
Token like
Token ID588
Feature activation+1.77e+00

Pos logit contributions
ls+0.043
l+0.042
 cor+0.040
 crocod+0.034
ah+0.034
Neg logit contributions
 Clean-0.026
elt-0.024
 lately-0.023
side-0.023
apping-0.023
Token the
Token ID262
Feature activation+2.20e+00

Pos logit contributions
ls+0.048
l+0.046
 cor+0.043
 AW+0.040
ah+0.039
Neg logit contributions
 Clean-0.032
 lately-0.030
 cleaning-0.030
 organizing-0.029
elt-0.029
Token smell
Token ID8508
Feature activation+5.84e+00

Pos logit contributions
ls+0.053
l+0.043
 cor+0.042
 AW+0.040
ches+0.040
Neg logit contributions
 birthday-0.046
 lately-0.046
 cleaning-0.046
 buying-0.044
 washing-0.043
Token and
Token ID290
Feature activation+3.37e+00

Pos logit contributions
ls+0.176
 cor+0.171
l+0.170
 AW+0.153
ira+0.149
Neg logit contributions
 Clean-0.079
 lately-0.075
 then-0.068
 including-0.067
elt-0.066
Token he
Token ID339
Feature activation+1.80e+00

Pos logit contributions
ls+0.075
l+0.072
 cor+0.066
 AW+0.057
ah+0.056
Neg logit contributions
 Clean-0.076
 lately-0.072
elt-0.072
 then-0.071
 Then-0.068
Token felt
Token ID2936
Feature activation+1.48e+00

Pos logit contributions
ls+0.052
l+0.048
 cor+0.046
 AW+0.043
 crocod+0.040
Neg logit contributions
 then-0.030
 Clean-0.030
elt-0.026
 Then-0.026
 lately-0.026
Token sick
Token ID6639
Feature activation+1.66e+00

Pos logit contributions
ls+0.028
l+0.028
 cor+0.026
 crocod+0.019
 crow+0.018
Neg logit contributions
 Clean-0.039
side-0.038
elt-0.038
apping-0.037
amed-0.037
Token.
Token ID13
Feature activation+1.23e+00

Pos logit contributions
l+0.027
ls+0.027
 cor+0.022
 crocod+0.017
ia+0.017
Neg logit contributions
amed-0.048
uggets-0.048
 Clean-0.047
othes-0.046
 Working-0.046
Token shelter
Token ID11772
Feature activation-8.10e-01

Pos logit contributions
l+0.030
ls+0.030
 cor+0.029
 crocod+0.024
 lands+0.023
Neg logit contributions
apping-0.033
elt-0.032
uggets-0.032
 Clean-0.032
 lately-0.031
Token.
Token ID13
Feature activation+1.28e+00

Pos logit contributions
 Clean+0.019
elt+0.019
 lately+0.018
itted+0.018
amed+0.018
Neg logit contributions
ls-0.017
l-0.017
 cor-0.016
 crocod-0.013
ia-0.012
Token They
Token ID1119
Feature activation+3.06e-01

Pos logit contributions
l+0.035
ls+0.034
 cor+0.033
 crocod+0.030
ia+0.029
Neg logit contributions
amed-0.021
 lately-0.020
side-0.020
oned-0.020
elt-0.020
Token did
Token ID750
Feature activation+2.06e+00

Pos logit contributions
ls+0.009
l+0.008
 cor+0.008
 AW+0.007
 beast+0.007
Neg logit contributions
 Clean-0.005
 then-0.005
 lately-0.005
elt-0.005
 including-0.004
Token not
Token ID407
Feature activation+1.81e+00

Pos logit contributions
ls+0.065
l+0.063
 cor+0.061
ah+0.059
 beast+0.057
Neg logit contributions
 then-0.023
 Clean-0.022
 themselves-0.022
 including-0.022
 cleaning-0.021
Token smell
Token ID8508
Feature activation+5.83e+00

Pos logit contributions
ls+0.052
l+0.049
 cor+0.046
 AW+0.041
osh+0.040
Neg logit contributions
 Clean-0.030
 lately-0.028
elt-0.027
 including-0.026
apping-0.026
Token the
Token ID262
Feature activation+1.35e+00

Pos logit contributions
ls+0.151
l+0.142
 cor+0.137
 AW+0.135
ah+0.135
Neg logit contributions
 lately-0.097
 Clean-0.094
 including-0.094
 themselves-0.087
 then-0.087
Token rot
Token ID5724
Feature activation+1.05e+00

Pos logit contributions
ls+0.030
l+0.029
 cor+0.027
osh+0.023
 AW+0.023
Neg logit contributions
 Clean-0.029
 lately-0.028
elt-0.028
side-0.027
 including-0.026
Token that
Token ID326
Feature activation+2.69e+00

Pos logit contributions
ls+0.026
l+0.025
 cor+0.024
 AW+0.020
 crocod+0.020
Neg logit contributions
 Clean-0.021
elt-0.020
 lately-0.020
side-0.019
apping-0.019
Token filled
Token ID5901
Feature activation+1.83e+00

Pos logit contributions
ls+0.068
l+0.065
 cor+0.062
 AW+0.056
ah+0.054
Neg logit contributions
 Clean-0.053
 lately-0.049
 including-0.047
 then-0.046
 cleaning-0.046
Token the
Token ID262
Feature activation+3.12e-01

Pos logit contributions
ls+0.049
l+0.048
 cor+0.045
 AW+0.042
ah+0.040
Neg logit contributions
 Clean-0.032
 lately-0.030
side-0.030
uggets-0.029
 then-0.029
Token fire
Token ID2046
Feature activation+1.30e+00

Pos logit contributions
ls+0.031
l+0.030
 cor+0.029
osh+0.024
 AW+0.024
Neg logit contributions
 Clean-0.028
 lately-0.026
elt-0.026
side-0.026
 cleaning-0.025
Token.
Token ID13
Feature activation+7.85e-01

Pos logit contributions
ls+0.032
l+0.031
 cor+0.030
 AW+0.026
osh+0.024
Neg logit contributions
 Clean-0.026
elt-0.025
 lately-0.024
side-0.024
apping-0.023
Token She
Token ID1375
Feature activation+9.70e-01

Pos logit contributions
ls+0.025
l+0.025
 cor+0.024
 crocod+0.022
ia+0.021
Neg logit contributions
 Clean-0.009
amed-0.008
side-0.008
 lately-0.008
elt-0.008
Token did
Token ID750
Feature activation+2.58e+00

Pos logit contributions
ls+0.024
l+0.023
 AW+0.021
 cor+0.021
ah+0.019
Neg logit contributions
 then-0.018
 Clean-0.018
elt-0.016
 Then-0.016
amed-0.016
Token not
Token ID407
Feature activation+1.70e+00

Pos logit contributions
ls+0.085
 cor+0.080
l+0.080
ah+0.075
 crocod+0.073
Neg logit contributions
 Clean-0.029
 then-0.028
 lately-0.026
 cleaning-0.026
 including-0.025
Token smell
Token ID8508
Feature activation+5.78e+00

Pos logit contributions
ls+0.050
l+0.046
 cor+0.043
 AW+0.040
 crocod+0.037
Neg logit contributions
 Clean-0.029
 lately-0.025
elt-0.025
days-0.025
 including-0.024
Token the
Token ID262
Feature activation+1.32e+00

Pos logit contributions
ls+0.148
l+0.134
 cor+0.131
 AW+0.129
ah+0.128
Neg logit contributions
 Clean-0.106
 lately-0.104
 cleaning-0.101
 then-0.097
 including-0.097
Token smoke
Token ID7523
Feature activation+4.36e+00

Pos logit contributions
ls+0.031
l+0.028
 cor+0.027
osh+0.023
 AW+0.022
Neg logit contributions
 Clean-0.029
 lately-0.027
elt-0.027
side-0.027
 cleaning-0.026
Token.
Token ID13
Feature activation+1.22e+00

Pos logit contributions
ls+0.111
l+0.106
 cor+0.103
 AW+0.086
ira+0.085
Neg logit contributions
 Clean-0.085
 lately-0.079
elt-0.078
side-0.077
apping-0.076
Token She
Token ID1375
Feature activation+1.05e+00

Pos logit contributions
l+0.040
ls+0.039
 cor+0.037
ia+0.033
ira+0.033
Neg logit contributions
 Clean-0.013
amed-0.013
side-0.013
 lately-0.013
elt-0.012
Token did
Token ID750
Feature activation+2.55e+00

Pos logit contributions
ls+0.027
l+0.026
 AW+0.024
 cor+0.024
ah+0.021
Neg logit contributions
 then-0.019
 Clean-0.017
 Then-0.017
elt-0.016
amed-0.015
Token oven
Token ID14361
Feature activation+1.54e+00

Pos logit contributions
l+0.024
ls+0.023
 cor+0.023
 crocod+0.020
 beast+0.019
Neg logit contributions
 Clean-0.026
elt-0.026
 lately-0.026
apping-0.026
itted-0.026
Token.
Token ID13
Feature activation+1.63e+00

Pos logit contributions
ls+0.037
l+0.036
 cor+0.035
 crocod+0.028
ah+0.028
Neg logit contributions
 Clean-0.032
elt-0.030
 lately-0.030
side-0.029
apping-0.029
Token They
Token ID1119
Feature activation+3.71e-01

Pos logit contributions
ls+0.051
l+0.051
 cor+0.047
 crocod+0.043
ia+0.043
Neg logit contributions
 lately-0.021
amed-0.020
side-0.020
 Clean-0.020
elt-0.019
Token did
Token ID750
Feature activation+2.22e+00

Pos logit contributions
ls+0.010
l+0.009
 cor+0.008
 AW+0.008
ia+0.007
Neg logit contributions
 Clean-0.007
 then-0.007
elt-0.006
 lately-0.006
amed-0.006
Token not
Token ID407
Feature activation+1.93e+00

Pos logit contributions
ls+0.071
l+0.068
 cor+0.066
ah+0.061
osh+0.059
Neg logit contributions
 Clean-0.029
 then-0.026
 cleaning-0.025
 including-0.024
 lately-0.024
Token smell
Token ID8508
Feature activation+5.78e+00

Pos logit contributions
ls+0.048
l+0.046
 cor+0.044
 crocod+0.036
 beast+0.036
Neg logit contributions
 Clean-0.040
elt-0.038
 lately-0.037
apping-0.036
side-0.036
Token the
Token ID262
Feature activation+1.41e+00

Pos logit contributions
ls+0.145
l+0.138
 cor+0.130
ah+0.126
 AW+0.122
Neg logit contributions
 Clean-0.112
 lately-0.104
 including-0.103
 cleaning-0.103
 themselves-0.100
Token food
Token ID2057
Feature activation+1.70e+00

Pos logit contributions
ls+0.030
l+0.029
 cor+0.027
ia+0.022
osh+0.022
Neg logit contributions
 Clean-0.033
elt-0.032
 lately-0.031
side-0.031
amed-0.030
Token burning
Token ID9482
Feature activation+1.96e+00

Pos logit contributions
ls+0.045
l+0.044
 cor+0.042
ah+0.036
 AW+0.036
Neg logit contributions
 Clean-0.031
 lately-0.029
elt-0.028
side-0.028
apping-0.027
Token.
Token ID13
Feature activation+1.87e+00

Pos logit contributions
ls+0.047
l+0.046
 cor+0.044
 crocod+0.036
 beast+0.035
Neg logit contributions
 Clean-0.041
elt-0.039
 lately-0.039
side-0.038
apping-0.038
Token They
Token ID1119
Feature activation+6.10e-01

Pos logit contributions
ls+0.059
l+0.058
 cor+0.054
 crocod+0.049
ia+0.049
Neg logit contributions
 lately-0.024
 Clean-0.023
amed-0.023
side-0.023
elt-0.023
Token my
Token ID616
Feature activation-1.52e+00

Pos logit contributions
 Clean+0.001
elt+0.001
 lately+0.001
side+0.001
apping+0.001
Neg logit contributions
ls-0.001
l-0.001
 cor-0.001
 crocod-0.001
ah-0.001
Token veil
Token ID30615
Feature activation+2.79e-01

Pos logit contributions
 birthday+0.030
 Clean+0.030
 anniversary+0.029
 lately+0.028
 then+0.028
Neg logit contributions
l-0.036
ls-0.034
 cor-0.033
ah-0.031
osh-0.030
Token!
Token ID0
Feature activation-1.06e+00

Pos logit contributions
ls+0.007
l+0.007
 cor+0.007
osh+0.005
 beast+0.005
Neg logit contributions
 Clean-0.005
 lately-0.005
elt-0.005
apping-0.005
side-0.005
Token Isn
Token ID25110
Feature activation+4.10e-01

Pos logit contributions
side+0.019
amed+0.019
 lately+0.018
aming+0.018
 indoors+0.017
Neg logit contributions
ls-0.027
l-0.026
 cor-0.025
osh-0.022
 Ro-0.022
Token't
Token ID470
Feature activation+4.25e+00

Pos logit contributions
 cor+0.013
ls+0.013
l+0.012
 troll+0.012
 crocod+0.011
Neg logit contributions
 Clean-0.005
 Working-0.005
oned-0.005
Playing-0.004
 then-0.004
Token it
Token ID340
Feature activation+5.77e+00

Pos logit contributions
ls+0.104
l+0.101
 cor+0.097
 AW+0.080
ah+0.079
Neg logit contributions
 Clean-0.088
 lately-0.083
elt-0.082
 including-0.079
amed-0.077
Token lovely
Token ID14081
Feature activation+1.77e+00

Pos logit contributions
ls+0.149
 cor+0.148
l+0.141
 beast+0.125
ira+0.123
Neg logit contributions
 birthday-0.112
 Clean-0.108
 including-0.107
 so-0.105
 then-0.102
Token?"
Token ID1701
Feature activation+8.89e-01

Pos logit contributions
ls+0.036
l+0.035
 cor+0.032
 crocod+0.025
ah+0.025
Neg logit contributions
 Clean-0.044
elt-0.042
 lately-0.041
side-0.041
itted-0.041
Token Anna
Token ID11735
Feature activation+1.39e+00

Pos logit contributions
ls+0.023
l+0.023
 cor+0.021
ia+0.018
 crocod+0.018
Neg logit contributions
 Clean-0.016
amed-0.016
elt-0.016
 lately-0.016
side-0.015
Token and
Token ID290
Feature activation+3.10e+00

Pos logit contributions
ls+0.038
l+0.036
 cor+0.034
 AW+0.031
 beast+0.031
Neg logit contributions
 Clean-0.026
 then-0.025
elt-0.025
 Then-0.024
apping-0.023
Token Ben
Token ID3932
Feature activation+1.09e+00

Pos logit contributions
ls+0.082
l+0.080
 cor+0.078
 crocod+0.068
arrow+0.067
Neg logit contributions
 Clean-0.059
 then-0.056
 lately-0.054
 Fighting-0.052
 Then-0.052
Token,
Token ID11
Feature activation-4.75e-01

Pos logit contributions
elt+0.008
amed+0.008
 Clean+0.008
itted+0.008
 lately+0.008
Neg logit contributions
ls-0.010
l-0.009
 cor-0.009
ia-0.007
 crocod-0.007
Token they
Token ID484
Feature activation-1.73e+00

Pos logit contributions
 Clean+0.009
 lately+0.009
elt+0.009
amed+0.009
itted+0.009
Neg logit contributions
ls-0.012
l-0.012
 cor-0.011
 crocod-0.010
osh-0.009
Token all
Token ID477
Feature activation-1.58e+00

Pos logit contributions
 Clean+0.034
elt+0.032
 lately+0.031
side+0.031
amed+0.031
Neg logit contributions
ls-0.044
l-0.043
 cor-0.041
 crocod-0.034
 AW-0.034
Token counted
Token ID14789
Feature activation-3.18e+00

Pos logit contributions
 Clean+0.031
elt+0.029
amed+0.028
 lately+0.028
side+0.028
Neg logit contributions
ls-0.041
l-0.039
 cor-0.038
 beast-0.031
 AW-0.031
Token the
Token ID262
Feature activation-1.69e+00

Pos logit contributions
 Clean+0.049
elt+0.048
side+0.046
 lately+0.046
apping+0.045
Neg logit contributions
ls-0.090
l-0.088
 cor-0.085
 AW-0.078
 beast-0.075
Token days
Token ID1528
Feature activation-5.94e+00

Pos logit contributions
 Clean+0.036
 organizing+0.033
 lately+0.033
side+0.033
 birthday+0.033
Neg logit contributions
ls-0.040
l-0.039
 cor-0.037
 crocod-0.031
 slit-0.031
Token until
Token ID1566
Feature activation-7.53e-01

Pos logit contributions
 Clean+0.093
 lately+0.087
elt+0.086
itted+0.083
uggets+0.082
Neg logit contributions
ls-0.176
l-0.169
 cor-0.162
osh-0.141
 crocod-0.139
Token Lily
Token ID20037
Feature activation+8.51e-01

Pos logit contributions
 Clean+0.016
elt+0.015
 lately+0.015
apping+0.015
itted+0.015
Neg logit contributions
ls-0.018
l-0.018
 cor-0.017
 crocod-0.014
osh-0.013
Token's
Token ID338
Feature activation-6.42e-01

Pos logit contributions
ls+0.023
l+0.022
 cor+0.021
 AW+0.019
 crocod+0.018
Neg logit contributions
 Clean-0.016
elt-0.015
 Then-0.014
 then-0.014
amed-0.014
Token birthday
Token ID10955
Feature activation-1.91e+00

Pos logit contributions
 Clean+0.013
elt+0.012
 lately+0.012
apping+0.012
side+0.012
Neg logit contributions
ls-0.016
l-0.015
 cor-0.014
ah-0.012
 AW-0.012
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
 Clean+0.039
elt+0.035
 birthday+0.034
side+0.034
 lately+0.034
Neg logit contributions
ls-0.049
l-0.048
 cor-0.045
 AW-0.037
 beast-0.037
Token spent
Token ID3377
Feature activation-2.34e+00

Pos logit contributions
ls+0.009
l+0.009
 cor+0.008
 AW+0.007
 crocod+0.007
Neg logit contributions
 Clean-0.007
amed-0.006
elt-0.006
 lately-0.006
side-0.006
Token the
Token ID262
Feature activation-2.27e+00

Pos logit contributions
 Clean+0.056
elt+0.053
 lately+0.053
side+0.052
amed+0.051
Neg logit contributions
ls-0.050
l-0.048
 cor-0.045
 crocod-0.036
 AW-0.035
Token rest
Token ID1334
Feature activation-3.13e+00

Pos logit contributions
 Clean+0.035
side+0.032
amed+0.032
wards+0.031
 lately+0.031
Neg logit contributions
ls-0.069
l-0.064
 cor-0.061
 troll-0.054
 crocod-0.054
Token of
Token ID286
Feature activation-2.64e+00

Pos logit contributions
 Clean+0.066
 Working+0.060
elt+0.060
 Fighting+0.060
First+0.059
Neg logit contributions
ls-0.076
l-0.074
 cor-0.069
osh-0.062
ia-0.060
Token his
Token ID465
Feature activation-2.79e+00

Pos logit contributions
 Clean+0.065
 lately+0.061
wards+0.061
 organizing+0.060
 cleaning+0.060
Neg logit contributions
ls-0.054
l-0.052
 cor-0.047
 AW-0.040
osh-0.039
Token days
Token ID1528
Feature activation-5.79e+00

Pos logit contributions
 Clean+0.045
side+0.040
 birthday+0.040
 lately+0.040
elt+0.039
Neg logit contributions
ls-0.080
l-0.078
 cor-0.074
 AW-0.064
 beast-0.063
Token in
Token ID287
Feature activation-1.09e+00

Pos logit contributions
 Clean+0.095
 Fighting+0.091
 lately+0.085
uggets+0.083
elt+0.083
Neg logit contributions
ls-0.167
l-0.162
 cor-0.157
 crocod-0.132
osh-0.131
Token the
Token ID262
Feature activation+1.39e-01

Pos logit contributions
 Clean+0.030
elt+0.029
 lately+0.028
itted+0.028
apping+0.028
Neg logit contributions
ls-0.019
l-0.018
 cor-0.017
 crocod-0.013
 beast-0.013
Token magical
Token ID10883
Feature activation-1.46e+00

Pos logit contributions
ls+0.004
l+0.003
 cor+0.003
 AW+0.003
osh+0.003
Neg logit contributions
 Clean-0.003
side-0.003
 lately-0.002
elt-0.002
wards-0.002
Token land
Token ID1956
Feature activation-6.22e-01

Pos logit contributions
 Clean+0.031
elt+0.030
 lately+0.029
apping+0.028
amed+0.028
Neg logit contributions
ls-0.035
l-0.034
 cor-0.032
 crocod-0.026
 beast-0.026
Token,
Token ID11
Feature activation-1.15e+00

Pos logit contributions
 Clean+0.010
elt+0.010
 lately+0.010
side+0.009
apping+0.009
Neg logit contributions
ls-0.018
l-0.017
 cor-0.016
 crocod-0.014
 AW-0.014
Token fisherman
Token ID48750
Feature activation-3.39e-02

Pos logit contributions
ls+0.018
l+0.017
 cor+0.014
ches+0.013
ia+0.013
Neg logit contributions
 Clean-0.019
 birthday-0.018
elt-0.018
apping-0.018
 organizing-0.017
Token!
Token ID0
Feature activation-1.12e+00

Pos logit contributions
 Clean+0.001
 lately+0.001
elt+0.001
apping+0.001
 including+0.001
Neg logit contributions
ls-0.001
l-0.001
 cor-0.001
 AW-0.001
ia-0.001
Token I
Token ID314
Feature activation-1.98e+00

Pos logit contributions
side+0.018
 Clean+0.017
amed+0.017
 lately+0.017
itted+0.016
Neg logit contributions
ls-0.031
l-0.031
 cor-0.029
 crocod-0.026
osh-0.026
Token spend
Token ID4341
Feature activation-3.05e+00

Pos logit contributions
 Clean+0.032
elt+0.030
days+0.030
 lately+0.029
itted+0.028
Neg logit contributions
ls-0.057
l-0.054
 cor-0.052
ia-0.046
 beast-0.045
Token my
Token ID616
Feature activation-4.57e+00

Pos logit contributions
 Clean+0.066
elt+0.064
 lately+0.062
apping+0.062
side+0.061
Neg logit contributions
ls-0.071
l-0.069
 cor-0.066
 crocod-0.054
 AW-0.053
Token days
Token ID1528
Feature activation-5.74e+00

Pos logit contributions
 Clean+0.088
elt+0.083
 lately+0.081
itted+0.079
side+0.079
Neg logit contributions
ls-0.118
l-0.115
 cor-0.108
 crocod-0.089
 AW-0.089
Token catching
Token ID16508
Feature activation-4.85e-01

Pos logit contributions
 Clean+0.109
 Fighting+0.107
 Then+0.100
 lately+0.098
elt+0.098
Neg logit contributions
ls-0.148
l-0.139
 cor-0.137
 flies-0.120
 crocod-0.118
Token fish
Token ID5916
Feature activation-4.18e-01

Pos logit contributions
 together+0.009
 including+0.009
 Clean+0.009
 soaking+0.009
 then+0.009
Neg logit contributions
ls-0.010
l-0.010
ira-0.010
ah-0.009
 cor-0.009
Token.
Token ID13
Feature activation-9.48e-01

Pos logit contributions
 Clean+0.008
apping+0.008
elt+0.008
 lately+0.008
uggets+0.008
Neg logit contributions
ls-0.010
l-0.010
 cor-0.009
 AW-0.008
ira-0.008
Token Do
Token ID2141
Feature activation+2.50e-01

Pos logit contributions
side+0.018
amed+0.017
itted+0.016
 indoors+0.016
keeping+0.016
Neg logit contributions
l-0.023
ls-0.023
 cor-0.023
 crocod-0.020
 Ro-0.020
Token you
Token ID345
Feature activation+3.73e-01

Pos logit contributions
ls+0.008
 cor+0.007
l+0.007
 beast+0.006
inkle+0.006
Neg logit contributions
 Clean-0.003
 lately-0.003
 then-0.003
 birthday-0.003
elt-0.003
Token anymore
Token ID7471
Feature activation-4.22e-01

Pos logit contributions
ls+0.036
l+0.035
 cor+0.033
 crocod+0.028
 beast+0.027
Neg logit contributions
amed-0.026
 Clean-0.025
elt-0.024
uggets-0.024
itted-0.024
Token.
Token ID13
Feature activation-7.62e-01

Pos logit contributions
 Clean+0.010
amed+0.010
elt+0.010
side+0.009
itted+0.009
Neg logit contributions
ls-0.009
l-0.009
 cor-0.008
 crocod-0.007
 beast-0.006
Token He
Token ID679
Feature activation-3.33e-01

Pos logit contributions
 Clean+0.010
elt+0.010
 lately+0.009
apping+0.009
side+0.009
Neg logit contributions
ls-0.024
l-0.023
 cor-0.023
 crocod-0.019
 beast-0.019
Token spent
Token ID3377
Feature activation-2.74e+00

Pos logit contributions
 Clean+0.005
 then+0.005
 Always+0.005
elt+0.004
amed+0.004
Neg logit contributions
ls-0.010
l-0.010
 cor-0.009
 AW-0.008
 beast-0.008
Token his
Token ID465
Feature activation-3.32e+00

Pos logit contributions
 Clean+0.066
elt+0.064
 lately+0.062
apping+0.061
side+0.061
Neg logit contributions
ls-0.058
l-0.056
 cor-0.053
 crocod-0.042
 beast-0.041
Token days
Token ID1528
Feature activation-5.70e+00

Pos logit contributions
 Clean+0.052
elt+0.049
 lately+0.048
side+0.046
apping+0.046
Neg logit contributions
ls-0.097
l-0.095
 cor-0.091
 crocod-0.078
 beast-0.077
Token playing
Token ID2712
Feature activation-1.62e+00

Pos logit contributions
 Clean+0.100
 Fighting+0.097
 lately+0.095
othes+0.087
 Then+0.087
Neg logit contributions
ls-0.157
 cor-0.152
l-0.152
 crocod-0.134
 flies-0.130
Token with
Token ID351
Feature activation-1.30e+00

Pos logit contributions
 Clean+0.024
elt+0.024
 lately+0.023
itted+0.022
side+0.022
Neg logit contributions
ls-0.049
l-0.047
 cor-0.046
 crocod-0.040
 slit-0.039
Token his
Token ID465
Feature activation-1.53e+00

Pos logit contributions
 Clean+0.028
 cleaning+0.028
 lately+0.028
 then+0.027
 organizing+0.027
Neg logit contributions
ls-0.028
l-0.026
 AW-0.025
 cor-0.025
ah-0.022
Token mineral
Token ID18352
Feature activation-1.28e+00

Pos logit contributions
 Clean+0.032
elt+0.030
 lately+0.030
 cleaning+0.029
 organizing+0.029
Neg logit contributions
ls-0.036
l-0.035
 cor-0.033
 AW-0.028
osh-0.028
Token and
Token ID290
Feature activation-9.45e-02

Pos logit contributions
 Clean+0.022
elt+0.020
 lately+0.020
amed+0.019
itted+0.019
Neg logit contributions
ls-0.037
l-0.036
 cor-0.034
 crocod-0.029
 beast-0.028
Token on
Token ID319
Feature activation+6.96e-01

Pos logit contributions
keeping+0.042
uggets+0.042
elt+0.040
 anniversary+0.039
side+0.039
Neg logit contributions
ls-0.040
l-0.037
 cor-0.033
 AW-0.031
ah-0.030
Token,
Token ID11
Feature activation-4.03e-01

Pos logit contributions
ls+0.018
l+0.017
 cor+0.016
 crocod+0.013
ia+0.013
Neg logit contributions
elt-0.014
 Clean-0.014
amed-0.014
itted-0.014
uggets-0.013
Token he
Token ID339
Feature activation-3.76e-01

Pos logit contributions
 Clean+0.009
elt+0.009
 lately+0.009
side+0.008
apping+0.008
Neg logit contributions
ls-0.009
l-0.009
 cor-0.008
 beast-0.007
 AW-0.007
Token spent
Token ID3377
Feature activation-2.81e+00

Pos logit contributions
 Clean+0.006
 then+0.005
elt+0.005
 Always+0.005
 Working+0.005
Neg logit contributions
ls-0.011
l-0.011
 cor-0.010
 AW-0.009
 crocod-0.009
Token his
Token ID465
Feature activation-3.11e+00

Pos logit contributions
 Clean+0.067
elt+0.063
 lately+0.062
side+0.062
amed+0.061
Neg logit contributions
ls-0.061
l-0.059
 cor-0.055
 crocod-0.044
 AW-0.043
Token days
Token ID1528
Feature activation-5.62e+00

Pos logit contributions
 Clean+0.056
elt+0.052
side+0.050
 lately+0.050
itted+0.050
Neg logit contributions
ls-0.085
l-0.082
 cor-0.078
 crocod-0.065
 beast-0.064
Token watching
Token ID4964
Feature activation+1.10e+00

Pos logit contributions
 Clean+0.097
 lately+0.095
 Fighting+0.090
uggets+0.084
 Then+0.084
Neg logit contributions
ls-0.155
 cor-0.151
l-0.150
 crocod-0.128
 flies-0.128
Token the
Token ID262
Feature activation+1.57e+00

Pos logit contributions
ls+0.028
l+0.025
 cor+0.024
 AW+0.023
ira+0.023
Neg logit contributions
 Clean-0.020
 then-0.020
 including-0.020
 together-0.019
 lately-0.019
Token birds
Token ID10087
Feature activation-5.43e-01

Pos logit contributions
ls+0.042
 cor+0.038
l+0.037
ah+0.035
ches+0.035
Neg logit contributions
 Clean-0.025
 birthday-0.024
elt-0.024
 then-0.024
 celebrating-0.023
Token and
Token ID290
Feature activation-3.81e-01

Pos logit contributions
 Clean+0.012
elt+0.012
 lately+0.012
side+0.011
apping+0.011
Neg logit contributions
ls-0.012
l-0.012
 cor-0.011
 crocod-0.009
 beast-0.009
Token feeling
Token ID4203
Feature activation-1.78e-01

Pos logit contributions
 Clean+0.006
elt+0.006
 lately+0.006
side+0.006
apping+0.006
Neg logit contributions
ls-0.011
l-0.011
 cor-0.010
 crocod-0.009
 beast-0.009
TokenThe
Token ID464
Feature activation-1.49e-01

Pos logit contributions
elt+0.028
 Clean+0.027
keeping+0.026
 lately+0.025
itted+0.025
Neg logit contributions
l-0.037
ls-0.036
 cor-0.034
ah-0.029
ia-0.029
Token girl
Token ID2576
Feature activation-1.26e+00

Pos logit contributions
 Clean+0.004
elt+0.004
 lately+0.004
itted+0.004
uggets+0.004
Neg logit contributions
ls-0.003
l-0.003
 cor-0.003
 crocod-0.002
ia-0.002
Token would
Token ID561
Feature activation-1.19e+00

Pos logit contributions
 Clean+0.024
elt+0.022
 lately+0.021
amed+0.021
 then+0.021
Neg logit contributions
ls-0.033
l-0.032
 cor-0.030
 AW-0.026
 beast-0.026
Token spend
Token ID4341
Feature activation-2.77e+00

Pos logit contributions
elt+0.028
 Clean+0.027
 lately+0.026
itted+0.026
amed+0.026
Neg logit contributions
l-0.025
 cor-0.024
ls-0.024
ah-0.020
osh-0.020
Token her
Token ID607
Feature activation-4.74e+00

Pos logit contributions
elt+0.069
 Clean+0.069
apping+0.066
 lately+0.065
uggets+0.064
Neg logit contributions
ls-0.056
l-0.055
 cor-0.052
ia-0.040
ah-0.040
Token days
Token ID1528
Feature activation-5.60e+00

Pos logit contributions
 Clean+0.084
elt+0.079
 lately+0.078
side+0.076
apping+0.075
Neg logit contributions
ls-0.130
l-0.126
 cor-0.121
 crocod-0.102
osh-0.101
Token with
Token ID351
Feature activation-1.91e+00

Pos logit contributions
 Clean+0.097
 Fighting+0.092
 lately+0.091
elt+0.084
 Then+0.084
Neg logit contributions
ls-0.158
l-0.151
 cor-0.149
 crocod-0.127
 flies-0.126
Token her
Token ID607
Feature activation-2.53e+00

Pos logit contributions
 Clean+0.033
 lately+0.032
side+0.031
wards+0.031
 cleaning+0.031
Neg logit contributions
ls-0.050
l-0.049
 cor-0.047
 AW-0.044
ah-0.042
Token net
Token ID2010
Feature activation-1.15e+00

Pos logit contributions
 Clean+0.054
elt+0.050
 lately+0.050
side+0.049
apping+0.049
Neg logit contributions
ls-0.059
l-0.059
 cor-0.056
ah-0.046
 AW-0.045
Token,
Token ID11
Feature activation-1.60e+00

Pos logit contributions
 Clean+0.023
elt+0.021
 Working+0.021
 Fighting+0.020
oned+0.020
Neg logit contributions
ls-0.030
l-0.029
 cor-0.027
ia-0.023
ches-0.023
Token making
Token ID1642
Feature activation+4.25e-01

Pos logit contributions
 Clean+0.028
 lately+0.027
elt+0.026
side+0.026
amed+0.026
Neg logit contributions
ls-0.042
l-0.042
 cor-0.042
 crocod-0.035
osh-0.035
Token.
Token ID13
Feature activation-3.77e-01

Pos logit contributions
 Clean+0.008
elt+0.008
uggets+0.007
side+0.007
 lately+0.007
Neg logit contributions
ls-0.009
l-0.009
 cor-0.008
osh-0.007
 crocod-0.007
Token They
Token ID1119
Feature activation-1.84e+00

Pos logit contributions
 lately+0.005
elt+0.005
 Clean+0.005
uggets+0.005
side+0.005
Neg logit contributions
ls-0.011
l-0.011
 cor-0.011
 crocod-0.009
osh-0.009
Token would
Token ID561
Feature activation-1.55e+00

Pos logit contributions
 Clean+0.029
amed+0.026
 then+0.026
 lately+0.026
elt+0.026
Neg logit contributions
ls-0.054
l-0.052
 cor-0.049
 AW-0.043
 beast-0.043
Token spend
Token ID4341
Feature activation-2.60e+00

Pos logit contributions
elt+0.029
 Clean+0.029
 lately+0.027
itted+0.027
amed+0.027
Neg logit contributions
l-0.040
ls-0.039
 cor-0.038
ah-0.032
 crocod-0.032
Token their
Token ID511
Feature activation-4.13e+00

Pos logit contributions
 Clean+0.051
elt+0.050
 lately+0.048
apping+0.047
itted+0.046
Neg logit contributions
ls-0.066
l-0.064
 cor-0.062
 crocod-0.051
osh-0.051
Token days
Token ID1528
Feature activation-5.54e+00

Pos logit contributions
 Clean+0.057
elt+0.051
 lately+0.050
side+0.049
amed+0.048
Neg logit contributions
ls-0.130
l-0.127
 cor-0.121
 crocod-0.106
 beast-0.104
Token exercising
Token ID25352
Feature activation-4.09e+00

Pos logit contributions
 Clean+0.088
 lately+0.084
 Fighting+0.083
uggets+0.080
elt+0.078
Neg logit contributions
ls-0.162
l-0.156
 cor-0.153
osh-0.132
 flies-0.128
Token in
Token ID287
Feature activation-1.14e+00

Pos logit contributions
elt+0.064
 Clean+0.062
itted+0.059
 lately+0.058
 Fighting+0.056
Neg logit contributions
ls-0.120
l-0.116
 cor-0.114
osh-0.098
 slit-0.096
Token the
Token ID262
Feature activation+3.90e-01

Pos logit contributions
elt+0.032
 Clean+0.032
itted+0.030
 lately+0.030
apping+0.030
Neg logit contributions
ls-0.019
l-0.018
 cor-0.018
 crocod-0.013
osh-0.013
Token forest
Token ID8222
Feature activation-6.85e-01

Pos logit contributions
ls+0.010
l+0.010
 cor+0.009
 crocod+0.008
osh+0.008
Neg logit contributions
 Clean-0.008
elt-0.007
 lately-0.007
side-0.007
apping-0.007
Token,
Token ID11
Feature activation-1.22e+00

Pos logit contributions
 Clean+0.011
elt+0.011
 lately+0.010
itted+0.010
amed+0.010
Neg logit contributions
ls-0.019
l-0.019
 cor-0.018
 crocod-0.016
osh-0.015
Token said
Token ID531
Feature activation+3.33e-01

Pos logit contributions
ls+0.044
l+0.043
 cor+0.038
 beast+0.034
 AW+0.033
Neg logit contributions
 Clean-0.047
 then-0.046
 Then-0.044
elt-0.044
wards-0.043
Token,
Token ID11
Feature activation-1.34e+00

Pos logit contributions
ls+0.009
l+0.009
 cor+0.009
 crocod+0.007
 beast+0.007
Neg logit contributions
 Clean-0.006
elt-0.005
 lately-0.005
side-0.005
apping-0.005
Token "
Token ID366
Feature activation-3.35e+00

Pos logit contributions
 Clean+0.026
 lately+0.025
amed+0.025
othes+0.025
keeping+0.025
Neg logit contributions
l-0.033
ls-0.032
 cor-0.031
 crocod-0.026
 slit-0.025
TokenI
Token ID40
Feature activation-2.65e+00

Pos logit contributions
 Clean+0.054
 lately+0.051
apping+0.051
 organizing+0.050
 ago+0.050
Neg logit contributions
l-0.091
 cor-0.087
ls-0.087
ah-0.079
osh-0.076
Token like
Token ID588
Feature activation-1.10e+00

Pos logit contributions
 Clean+0.047
elt+0.043
 lately+0.043
apping+0.042
side+0.041
Neg logit contributions
ls-0.075
l-0.071
 cor-0.069
 beast-0.058
 crocod-0.058
Token Tues
Token ID48496
Feature activation-5.52e+00

Pos logit contributions
amed+0.025
side+0.024
 Clean+0.024
apping+0.024
oned+0.024
Neg logit contributions
ls-0.025
l-0.024
 cor-0.024
 crocod-0.020
 crow-0.019
Tokendays
Token ID12545
Feature activation-1.37e+00

Pos logit contributions
itted+0.070
 including+0.070
 lately+0.065
First+0.064
 Then+0.064
Neg logit contributions
ls-0.148
osh-0.146
ah-0.142
l-0.139
 cor-0.138
Token too
Token ID1165
Feature activation+1.07e+00

Pos logit contributions
amed+0.031
 Clean+0.031
elt+0.031
itted+0.031
oned+0.031
Neg logit contributions
l-0.027
ls-0.027
 cor-0.023
ah-0.021
osh-0.021
Token!
Token ID0
Feature activation-1.77e+00

Pos logit contributions
ls+0.022
l+0.021
 cor+0.019
 crocod+0.015
ia+0.015
Neg logit contributions
 Clean-0.026
elt-0.026
amed-0.025
side-0.025
itted-0.025
Token It
Token ID632
Feature activation+9.67e-01

Pos logit contributions
amed+0.036
side+0.036
keeping+0.034
aming+0.033
 indoors+0.033
Neg logit contributions
l-0.038
 cor-0.036
ls-0.036
osh-0.035
 Ro-0.034
Token's
Token ID338
Feature activation-1.24e+00

Pos logit contributions
ls+0.024
l+0.023
 cor+0.023
 troll+0.020
 crocod+0.019
Neg logit contributions
 Clean-0.019
 Working-0.018
 Always-0.017
elt-0.017
 then-0.017
Tokenbs
Token ID1443
Feature activation-7.53e-01

Pos logit contributions
 Clean+0.004
elt+0.004
 lately+0.003
amed+0.003
itted+0.003
Neg logit contributions
ls-0.004
l-0.004
 cor-0.004
 crocod-0.003
 beast-0.003
Token.
Token ID13
Feature activation-1.08e+00

Pos logit contributions
 Clean+0.015
elt+0.014
 lately+0.013
side+0.013
apping+0.013
Neg logit contributions
ls-0.019
l-0.019
 cor-0.018
 crocod-0.015
osh-0.015
Token She
Token ID1375
Feature activation-5.06e-01

Pos logit contributions
amed+0.023
aming+0.021
 lately+0.021
apping+0.021
keeping+0.020
Neg logit contributions
l-0.025
ls-0.023
 cor-0.022
ia-0.020
ira-0.020
Token spent
Token ID3377
Feature activation-2.92e+00

Pos logit contributions
 Clean+0.010
elt+0.009
amed+0.009
 lately+0.009
side+0.009
Neg logit contributions
ls-0.013
l-0.013
 cor-0.012
 AW-0.011
 beast-0.010
Token her
Token ID607
Feature activation-3.63e+00

Pos logit contributions
 Clean+0.068
elt+0.067
 lately+0.065
apping+0.063
itted+0.063
Neg logit contributions
ls-0.064
l-0.062
 cor-0.059
 crocod-0.047
osh-0.046
Token days
Token ID1528
Feature activation-5.51e+00

Pos logit contributions
 Clean+0.042
elt+0.037
 lately+0.037
side+0.036
amed+0.035
Neg logit contributions
ls-0.122
l-0.119
 cor-0.114
 crocod-0.100
 beast-0.099
Token spinning
Token ID19493
Feature activation-1.73e+00

Pos logit contributions
 lately+0.096
 Clean+0.096
 Fighting+0.094
elt+0.090
keeping+0.085
Neg logit contributions
ls-0.149
l-0.145
 cor-0.143
 crocod-0.127
 flies-0.121
Token her
Token ID607
Feature activation-2.34e+00

Pos logit contributions
 Clean+0.026
elt+0.024
 lately+0.023
side+0.023
amed+0.022
Neg logit contributions
ls-0.052
l-0.051
 cor-0.049
 crocod-0.042
 beast-0.042
Token webs
Token ID2639
Feature activation-1.73e+00

Pos logit contributions
 Clean+0.048
side+0.045
 lately+0.044
 cleaning+0.044
elt+0.044
Neg logit contributions
ls-0.055
l-0.053
 cor-0.052
ah-0.044
 AW-0.043
Token around
Token ID1088
Feature activation-1.28e+00

Pos logit contributions
 Clean+0.028
elt+0.026
uggets+0.024
 lately+0.024
 Working+0.024
Neg logit contributions
ls-0.051
l-0.051
 cor-0.047
osh-0.041
ah-0.040
Token the
Token ID262
Feature activation+9.63e-01

Pos logit contributions
 Clean+0.027
elt+0.026
 lately+0.025
side+0.025
apping+0.025
Neg logit contributions
ls-0.031
l-0.030
 cor-0.028
 crocod-0.023
 beast-0.023
Token was
Token ID373
Feature activation-2.20e+00

Pos logit contributions
ls+0.005
l+0.005
 cor+0.005
 crocod+0.004
 beast+0.004
Neg logit contributions
 Clean-0.004
elt-0.004
 lately-0.004
amed-0.004
side-0.004
Token happy
Token ID3772
Feature activation-2.27e+00

Pos logit contributions
 Clean+0.033
elt+0.032
othes+0.030
 lately+0.030
oned+0.030
Neg logit contributions
l-0.062
 cor-0.062
ls-0.060
osh-0.052
ah-0.052
Token to
Token ID284
Feature activation+8.40e-01

Pos logit contributions
oned+0.048
amed+0.048
othes+0.046
side+0.046
First+0.046
Neg logit contributions
ls-0.052
l-0.050
 cor-0.049
 crocod-0.042
 giant-0.041
Token spend
Token ID4341
Feature activation-2.56e+00

Pos logit contributions
l+0.018
ls+0.018
 cor+0.018
ah+0.015
ia+0.014
Neg logit contributions
elt-0.019
 Clean-0.018
 lately-0.018
itted-0.017
side-0.017
Token his
Token ID465
Feature activation-3.24e+00

Pos logit contributions
elt+0.068
 Clean+0.065
apping+0.063
uggets+0.062
itted+0.062
Neg logit contributions
ls-0.048
l-0.047
 cor-0.045
ia-0.034
ah-0.034
Token days
Token ID1528
Feature activation-5.50e+00

Pos logit contributions
 Clean+0.066
elt+0.063
 lately+0.062
side+0.060
apping+0.060
Neg logit contributions
ls-0.080
l-0.078
 cor-0.074
 crocod-0.061
 beast-0.060
Token on
Token ID319
Feature activation-8.20e-01

Pos logit contributions
 Clean+0.088
 Fighting+0.082
 lately+0.078
othes+0.075
 While+0.075
Neg logit contributions
ls-0.159
l-0.154
 cor-0.152
 crocod-0.129
 flies-0.127
Token the
Token ID262
Feature activation+2.24e-01

Pos logit contributions
 Clean+0.019
side+0.018
 lately+0.018
wards+0.018
 organizing+0.018
Neg logit contributions
ls-0.018
l-0.017
 cor-0.016
ah-0.013
osh-0.013
Token farm
Token ID5318
Feature activation-1.57e+00

Pos logit contributions
ls+0.006
l+0.006
 cor+0.006
osh+0.005
ia+0.005
Neg logit contributions
 Clean-0.004
side-0.004
 cleaning-0.003
 organizing-0.003
 lately-0.003
Token,
Token ID11
Feature activation-1.64e+00

Pos logit contributions
 Clean+0.027
elt+0.027
itted+0.025
 lately+0.025
amed+0.025
Neg logit contributions
ls-0.042
l-0.042
 cor-0.040
 crocod-0.034
 beast-0.033
Token learning
Token ID4673
Feature activation-3.74e-01

Pos logit contributions
 Clean+0.027
elt+0.027
 lately+0.026
oned+0.025
itted+0.025
Neg logit contributions
ls-0.044
l-0.044
 cor-0.043
 crocod-0.037
osh-0.036
Token\n
Token ID198
Feature activation-6.03e-01

Pos logit contributions
ls+0.005
l+0.005
 cor+0.005
 crocod+0.004
 beast+0.004
Neg logit contributions
 Clean-0.002
elt-0.002
 lately-0.002
side-0.002
apping-0.002
TokenS
Token ID50
Feature activation+1.10e+00

Pos logit contributions
 Clean+0.012
elt+0.011
 lately+0.011
uggets+0.011
itted+0.011
Neg logit contributions
l-0.015
ls-0.015
 cor-0.014
ia-0.012
 AW-0.012
Tokenue
Token ID518
Feature activation-4.38e-02

Pos logit contributions
 cor+0.022
ls+0.021
l+0.020
 beast+0.017
 crocod+0.016
Neg logit contributions
elt-0.027
 Clean-0.026
apping-0.026
amed-0.025
side-0.025
Token spent
Token ID3377
Feature activation-2.31e+00

Pos logit contributions
 Clean+0.001
elt+0.001
 then+0.001
amed+0.001
 lately+0.001
Neg logit contributions
ls-0.001
l-0.001
 cor-0.001
 AW-0.001
 crocod-0.001
Token her
Token ID607
Feature activation-3.81e+00

Pos logit contributions
 Clean+0.058
elt+0.058
 lately+0.055
apping+0.055
itted+0.055
Neg logit contributions
ls-0.045
l-0.043
 cor-0.041
 crocod-0.032
ia-0.031
Token days
Token ID1528
Feature activation-5.49e+00

Pos logit contributions
 Clean+0.054
 lately+0.048
side+0.047
elt+0.047
itted+0.045
Neg logit contributions
ls-0.120
l-0.116
 cor-0.112
 crocod-0.096
osh-0.095
Token reading
Token ID3555
Feature activation-7.22e-01

Pos logit contributions
 Fighting+0.095
 Clean+0.092
uggets+0.087
 Acc+0.086
 lately+0.085
Neg logit contributions
ls-0.140
l-0.139
 cor-0.136
 sne-0.119
 per-0.117
Token the
Token ID262
Feature activation-2.67e-01

Pos logit contributions
 Clean+0.015
elt+0.014
apping+0.014
 lately+0.014
itted+0.014
Neg logit contributions
ls-0.018
l-0.017
 cor-0.016
 crocod-0.013
ia-0.013
Token books
Token ID3835
Feature activation-1.39e+00

Pos logit contributions
 Clean+0.004
 birthday+0.004
 downstairs+0.004
side+0.004
 organizing+0.004
Neg logit contributions
ls-0.008
 cor-0.007
l-0.007
 slit-0.007
fish-0.006
Token in
Token ID287
Feature activation+3.57e-02

Pos logit contributions
 Clean+0.025
elt+0.025
uggets+0.023
itted+0.023
 Working+0.023
Neg logit contributions
ls-0.037
l-0.036
 cor-0.032
osh-0.029
ah-0.028
Token the
Token ID262
Feature activation+8.38e-01

Pos logit contributions
ls+0.001
l+0.001
 cor+0.001
 crocod+0.001
ia+0.001
Neg logit contributions
 Clean-0.001
elt-0.001
apping-0.001
 lately-0.001
itted-0.001
Token Ben
Token ID3932
Feature activation+2.55e+00

Pos logit contributions
ls+0.004
l+0.004
 cor+0.004
inkle+0.003
 AW+0.003
Neg logit contributions
 Clean-0.001
 cleaning-0.001
 organizing-0.001
 birthday-0.001
 sorting-0.001
Token.
Token ID13
Feature activation-2.06e+00

Pos logit contributions
ls+0.063
l+0.060
 cor+0.057
 AW+0.049
osh+0.047
Neg logit contributions
 Clean-0.054
 lately-0.050
elt-0.050
apping-0.048
side-0.048
Token Ben
Token ID3932
Feature activation+1.91e+00

Pos logit contributions
 Clean+0.039
 lately+0.037
elt+0.036
side+0.035
apping+0.035
Neg logit contributions
ls-0.056
l-0.055
 cor-0.052
 beast-0.044
 AW-0.043
Token spent
Token ID3377
Feature activation-2.92e+00

Pos logit contributions
l+0.040
ls+0.040
 cor+0.039
 crocod+0.032
ia+0.032
Neg logit contributions
oned-0.044
elt-0.042
side-0.042
wards-0.042
apping-0.042
Token his
Token ID465
Feature activation-2.43e+00

Pos logit contributions
 Clean+0.071
elt+0.069
 lately+0.067
apping+0.066
side+0.065
Neg logit contributions
ls-0.061
l-0.058
 cor-0.056
 crocod-0.044
osh-0.043
Token days
Token ID1528
Feature activation-5.49e+00

Pos logit contributions
 Clean+0.043
elt+0.040
 lately+0.039
side+0.038
apping+0.038
Neg logit contributions
ls-0.067
l-0.066
 cor-0.063
 crocod-0.053
 beast-0.052
Token playing
Token ID2712
Feature activation-1.39e+00

Pos logit contributions
 Fighting+0.090
 Clean+0.089
 lately+0.086
elt+0.084
 Working+0.084
Neg logit contributions
ls-0.151
 cor-0.142
l-0.142
 sne-0.127
 crocod-0.127
Token in
Token ID287
Feature activation-6.97e-01

Pos logit contributions
elt+0.020
 Clean+0.020
 lately+0.019
itted+0.018
side+0.018
Neg logit contributions
ls-0.042
l-0.041
 cor-0.040
 crocod-0.035
 slit-0.034
Token the
Token ID262
Feature activation+6.78e-01

Pos logit contributions
 Clean+0.020
elt+0.020
 lately+0.019
apping+0.019
itted+0.019
Neg logit contributions
ls-0.011
l-0.011
 cor-0.010
 crocod-0.008
 beast-0.007
Token park
Token ID3952
Feature activation+1.76e-01

Pos logit contributions
ls+0.017
l+0.016
 cor+0.016
 crocod+0.014
 beast+0.013
Neg logit contributions
 lately-0.013
 Clean-0.013
elt-0.013
apping-0.013
itted-0.013
Token near
Token ID1474
Feature activation+1.08e+00

Pos logit contributions
ls+0.005
l+0.005
 cor+0.005
 crocod+0.004
ia+0.004
Neg logit contributions
 Clean-0.003
elt-0.003
 Working-0.003
itted-0.003
amed-0.003
Token on
Token ID319
Feature activation+1.06e+00

Pos logit contributions
uggets+0.041
elt+0.041
keeping+0.039
 Clean+0.039
side+0.039
Neg logit contributions
ls-0.051
l-0.049
 cor-0.045
 AW-0.040
ah-0.039
Token,
Token ID11
Feature activation-2.82e-01

Pos logit contributions
ls+0.031
l+0.030
 cor+0.030
 crocod+0.025
osh+0.025
Neg logit contributions
 Clean-0.017
 lately-0.016
side-0.015
elt-0.014
apping-0.014
Token Spike
Token ID26309
Feature activation+1.13e+00

Pos logit contributions
 Clean+0.006
 then+0.006
 lately+0.005
 Working+0.005
 Then+0.005
Neg logit contributions
ls-0.007
l-0.006
 cor-0.006
ira-0.005
ia-0.005
Token spent
Token ID3377
Feature activation-2.51e+00

Pos logit contributions
ls+0.031
l+0.029
 cor+0.027
 AW+0.024
 crocod+0.024
Neg logit contributions
 Clean-0.020
amed-0.019
side-0.018
 lately-0.018
 then-0.018
Token his
Token ID465
Feature activation-2.92e+00

Pos logit contributions
 Clean+0.052
elt+0.049
 lately+0.048
side+0.047
amed+0.047
Neg logit contributions
ls-0.062
l-0.060
 cor-0.057
 crocod-0.047
osh-0.046
Token days
Token ID1528
Feature activation-5.47e+00

Pos logit contributions
 Clean+0.056
elt+0.052
 lately+0.052
side+0.051
itted+0.051
Neg logit contributions
ls-0.076
l-0.074
 cor-0.070
 crocod-0.058
osh-0.057
Token running
Token ID2491
Feature activation-1.01e+00

Pos logit contributions
 Clean+0.096
 lately+0.093
 Fighting+0.092
ework+0.086
 While+0.086
Neg logit contributions
ls-0.145
l-0.143
 cor-0.142
 crocod-0.124
 slit-0.121
Token around
Token ID1088
Feature activation-1.72e+00

Pos logit contributions
elt+0.019
 Clean+0.018
uggets+0.018
 anniversary+0.017
 lately+0.017
Neg logit contributions
ls-0.028
l-0.028
 cor-0.026
 crocod-0.022
ches-0.021
Token the
Token ID262
Feature activation+5.27e-01

Pos logit contributions
 Clean+0.033
 lately+0.031
side+0.030
 organizing+0.030
 including+0.029
Neg logit contributions
ls-0.045
l-0.044
 cor-0.041
 AW-0.035
ah-0.035
Token yard
Token ID12699
Feature activation-8.50e-01

Pos logit contributions
ls+0.015
l+0.014
 cor+0.014
 crocod+0.012
 beast+0.011
Neg logit contributions
 Clean-0.009
elt-0.009
 lately-0.009
side-0.008
apping-0.008
Token with
Token ID351
Feature activation-9.78e-01

Pos logit contributions
 Clean+0.016
elt+0.016
uggets+0.015
othes+0.015
itted+0.015
Neg logit contributions
l-0.021
ls-0.021
 cor-0.020
 crocod-0.017
ah-0.017
Token and
Token ID290
Feature activation+2.14e+00

Pos logit contributions
ls+0.007
l+0.007
 cor+0.007
 crocod+0.006
 AW+0.006
Neg logit contributions
 Clean-0.004
elt-0.004
 lately-0.004
amed-0.004
side-0.004
Token his
Token ID465
Feature activation-6.07e-02

Pos logit contributions
ls+0.054
l+0.049
 cor+0.042
 AW+0.042
inkle+0.040
Neg logit contributions
 Clean-0.048
 organizing-0.045
 Working-0.044
 lately-0.044
 sorting-0.044
Token friend
Token ID1545
Feature activation-1.81e+00

Pos logit contributions
 birthday+0.001
 Clean+0.001
elt+0.001
amed+0.001
itted+0.001
Neg logit contributions
ls-0.002
l-0.002
 cor-0.002
osh-0.002
 AW-0.002
Token spent
Token ID3377
Feature activation-3.37e+00

Pos logit contributions
 Clean+0.034
elt+0.032
 lately+0.031
amed+0.031
side+0.030
Neg logit contributions
ls-0.048
l-0.047
 cor-0.044
 AW-0.038
 beast-0.037
Token their
Token ID511
Feature activation-4.11e+00

Pos logit contributions
 Clean+0.071
side+0.065
 lately+0.064
amed+0.064
 Working+0.063
Neg logit contributions
ls-0.082
l-0.081
 cor-0.074
osh-0.062
 beast-0.062
Token days
Token ID1528
Feature activation-5.43e+00

Pos logit contributions
 Clean+0.086
elt+0.080
 lately+0.079
side+0.078
itted+0.077
Neg logit contributions
ls-0.100
l-0.098
 cor-0.092
 crocod-0.076
 beast-0.074
Token visiting
Token ID10013
Feature activation-7.43e-01

Pos logit contributions
 Clean+0.092
 Fighting+0.091
 lately+0.088
 Acc+0.086
elt+0.085
Neg logit contributions
ls-0.151
l-0.146
 cor-0.142
 crocod-0.118
 sne-0.117
Token people
Token ID661
Feature activation-1.09e+00

Pos logit contributions
 Clean+0.014
 lately+0.013
 including+0.013
 organizing+0.013
side+0.012
Neg logit contributions
ls-0.019
l-0.019
 cor-0.017
ah-0.016
 AW-0.016
Token who
Token ID508
Feature activation-5.98e-01

Pos logit contributions
 Clean+0.018
elt+0.018
 lately+0.017
amed+0.017
itted+0.017
Neg logit contributions
ls-0.031
l-0.030
 cor-0.029
 crocod-0.024
osh-0.024
Token were
Token ID547
Feature activation-2.28e+00

Pos logit contributions
 Clean+0.011
 lately+0.010
elt+0.009
 Working+0.009
side+0.009
Neg logit contributions
ls-0.017
l-0.016
 cor-0.016
 AW-0.014
ia-0.014
Token doing
Token ID1804
Feature activation-1.03e+00

Pos logit contributions
 Clean+0.037
elt+0.034
 lately+0.034
apping+0.033
side+0.033
Neg logit contributions
ls-0.066
l-0.064
 cor-0.061
 AW-0.053
 crocod-0.052
Token apart
Token ID5475
Feature activation-7.69e-01

Pos logit contributions
ls+0.032
l+0.031
 cor+0.029
 crocod+0.024
 AW+0.023
Neg logit contributions
 Clean-0.029
elt-0.028
 lately-0.026
amed-0.026
side-0.026
Token.
Token ID13
Feature activation-6.09e-01

Pos logit contributions
 Clean+0.020
elt+0.019
amed+0.019
itted+0.018
side+0.018
Neg logit contributions
ls-0.015
l-0.015
 cor-0.014
 crocod-0.011
 beast-0.010
Token They
Token ID1119
Feature activation-1.17e+00

Pos logit contributions
amed+0.009
aming+0.008
elt+0.008
 Clean+0.008
side+0.008
Neg logit contributions
ls-0.017
l-0.017
 cor-0.016
 crocod-0.015
ia-0.015
Token spent
Token ID3377
Feature activation-2.76e+00

Pos logit contributions
 Clean+0.019
elt+0.017
 lately+0.017
side+0.017
amed+0.017
Neg logit contributions
ls-0.034
l-0.033
 cor-0.032
 AW-0.027
 beast-0.027
Token their
Token ID511
Feature activation-3.44e+00

Pos logit contributions
 Clean+0.060
elt+0.058
 lately+0.056
apping+0.055
side+0.055
Neg logit contributions
ls-0.064
l-0.063
 cor-0.060
 crocod-0.048
osh-0.048
Token days
Token ID1528
Feature activation-5.39e+00

Pos logit contributions
 Clean+0.057
elt+0.055
 lately+0.054
apping+0.052
side+0.051
Neg logit contributions
ls-0.097
l-0.095
 cor-0.091
 crocod-0.078
osh-0.077
Token playing
Token ID2712
Feature activation-1.49e+00

Pos logit contributions
 Clean+0.086
 lately+0.084
 Fighting+0.081
uggets+0.076
elt+0.076
Neg logit contributions
ls-0.159
l-0.151
 cor-0.147
 flies-0.130
 slit-0.129
Token together
Token ID1978
Feature activation-1.76e+00

Pos logit contributions
 Clean+0.019
elt+0.019
 lately+0.018
itted+0.017
side+0.017
Neg logit contributions
ls-0.048
l-0.047
 cor-0.045
 crocod-0.039
osh-0.039
Token and
Token ID290
Feature activation-5.90e-01

Pos logit contributions
 Clean+0.029
elt+0.029
 lately+0.026
itted+0.026
amed+0.026
Neg logit contributions
ls-0.050
l-0.049
 cor-0.047
 crocod-0.041
osh-0.040
Token enjoying
Token ID13226
Feature activation-2.10e+00

Pos logit contributions
 Clean+0.007
elt+0.007
 lately+0.007
itted+0.007
apping+0.007
Neg logit contributions
ls-0.019
l-0.018
 cor-0.018
 crocod-0.016
 roared-0.016
Token the
Token ID262
Feature activation-8.39e-01

Pos logit contributions
 Clean+0.048
 lately+0.044
elt+0.044
side+0.044
apping+0.043
Neg logit contributions
ls-0.047
l-0.046
 cor-0.042
ah-0.035
 AW-0.035
Token He
Token ID679
Feature activation+6.32e-01

Pos logit contributions
ls+0.004
l+0.004
 cor+0.003
 AW+0.003
 beast+0.003
Neg logit contributions
 Clean-0.002
elt-0.002
 lately-0.002
 Working-0.001
side-0.001
Token crawled
Token ID45668
Feature activation-6.72e-01

Pos logit contributions
ls+0.017
l+0.016
 AW+0.015
 cor+0.015
 troll+0.014
Neg logit contributions
 then-0.012
 Clean-0.010
amed-0.010
elt-0.010
 While-0.009
Token out
Token ID503
Feature activation+1.25e+00

Pos logit contributions
 Clean+0.014
apping+0.013
 Working+0.013
 anniversary+0.013
elt+0.012
Neg logit contributions
ls-0.018
l-0.017
 cor-0.016
ches-0.013
 crocod-0.013
Token of
Token ID286
Feature activation-1.59e+00

Pos logit contributions
ls+0.037
l+0.036
 cor+0.035
 crocod+0.030
 AW+0.030
Neg logit contributions
 Clean-0.019
elt-0.018
 lately-0.017
side-0.017
apping-0.017
Token his
Token ID465
Feature activation-2.33e+00

Pos logit contributions
 Clean+0.043
 lately+0.041
elt+0.041
side+0.040
amed+0.039
Neg logit contributions
ls-0.029
l-0.028
 cor-0.026
 crocod-0.019
 AW-0.019
Token cob
Token ID22843
Feature activation-5.34e+00

Pos logit contributions
 Clean+0.050
elt+0.047
 lately+0.047
side+0.046
apping+0.045
Neg logit contributions
ls-0.056
l-0.054
 cor-0.051
 crocod-0.042
 beast-0.041
Tokenweb
Token ID12384
Feature activation-3.05e+00

Pos logit contributions
 Clean+0.117
 Working+0.110
oned+0.107
wards+0.105
 lately+0.105
Neg logit contributions
ls-0.130
l-0.123
 cor-0.116
ia-0.100
ira-0.099
Token and
Token ID290
Feature activation+7.57e-01

Pos logit contributions
 Clean+0.055
elt+0.053
 Working+0.051
uggets+0.049
amed+0.049
Neg logit contributions
ls-0.083
l-0.080
 cor-0.076
 crocod-0.065
ches-0.065
Token started
Token ID2067
Feature activation-1.70e+00

Pos logit contributions
ls+0.019
l+0.019
 cor+0.017
ia+0.015
 AW+0.015
Neg logit contributions
 Clean-0.015
 then-0.015
elt-0.014
 lately-0.014
side-0.014
Token to
Token ID284
Feature activation-1.05e+00

Pos logit contributions
 Clean+0.024
elt+0.023
 lately+0.021
apping+0.021
itted+0.021
Neg logit contributions
ls-0.053
l-0.052
 cor-0.049
 crocod-0.043
 AW-0.043
Token stretch
Token ID7539
Feature activation-9.73e-01

Pos logit contributions
 Clean+0.024
 then+0.023
 cleaning+0.022
 lately+0.022
 soaking+0.022
Neg logit contributions
ls-0.024
l-0.023
 cor-0.020
 crocod-0.017
ia-0.017
Token,
Token ID11
Feature activation-3.22e-02

Pos logit contributions
ls+0.008
l+0.008
 cor+0.008
 crocod+0.006
ia+0.006
Neg logit contributions
elt-0.007
 Clean-0.007
itted-0.007
amed-0.007
 Working-0.006
Token they
Token ID484
Feature activation-1.15e+00

Pos logit contributions
 Clean+0.001
elt+0.001
 lately+0.001
itted+0.001
apping+0.001
Neg logit contributions
ls-0.001
l-0.001
 cor-0.001
 crocod-0.001
 slit-0.001
Token started
Token ID2067
Feature activation-1.77e+00

Pos logit contributions
 Clean+0.021
elt+0.020
 lately+0.020
side+0.019
amed+0.019
Neg logit contributions
ls-0.031
l-0.030
 cor-0.028
 beast-0.024
 crocod-0.024
Token counting
Token ID14143
Feature activation-2.07e+00

Pos logit contributions
 Clean+0.031
elt+0.029
 lately+0.029
side+0.028
apping+0.028
Neg logit contributions
ls-0.048
l-0.047
 cor-0.045
 crocod-0.038
 beast-0.038
Token the
Token ID262
Feature activation-1.05e+00

Pos logit contributions
 Clean+0.030
side+0.029
 lately+0.029
 then+0.028
elt+0.028
Neg logit contributions
ls-0.059
l-0.057
 cor-0.056
 AW-0.052
 beast-0.050
Token days
Token ID1528
Feature activation-5.33e+00

Pos logit contributions
 Clean+0.022
 lately+0.021
 birthday+0.021
side+0.021
elt+0.021
Neg logit contributions
ls-0.025
l-0.023
 cor-0.022
 AW-0.019
osh-0.019
Token on
Token ID319
Feature activation+5.79e-01

Pos logit contributions
 Clean+0.093
 lately+0.086
elt+0.086
uggets+0.084
amed+0.083
Neg logit contributions
ls-0.148
l-0.144
 cor-0.136
osh-0.118
ia-0.116
Token the
Token ID262
Feature activation+2.88e-01

Pos logit contributions
ls+0.012
l+0.012
 cor+0.011
 AW+0.009
ah+0.009
Neg logit contributions
 Clean-0.014
side-0.013
 lately-0.013
elt-0.013
wards-0.012
Token calendar
Token ID11845
Feature activation-1.06e+00

Pos logit contributions
ls+0.007
l+0.007
 cor+0.007
osh+0.006
 AW+0.006
Neg logit contributions
 Clean-0.005
side-0.005
 lately-0.005
elt-0.005
 organizing-0.005
Token.
Token ID13
Feature activation-4.50e-01

Pos logit contributions
 Clean+0.023
elt+0.022
 lately+0.022
itted+0.021
amed+0.021
Neg logit contributions
ls-0.025
l-0.024
 cor-0.022
 crocod-0.018
osh-0.018
Token Lily
Token ID20037
Feature activation-2.59e-01

Pos logit contributions
 lately+0.008
keeping+0.008
aming+0.008
apping+0.008
 anniversary+0.008
Neg logit contributions
l-0.011
ls-0.011
 cor-0.010
ia-0.009
 crocod-0.009
Token together
Token ID1978
Feature activation-8.95e-01

Pos logit contributions
 Clean+0.028
amed+0.026
elt+0.026
side+0.026
oned+0.026
Neg logit contributions
ls-0.034
l-0.033
 cor-0.031
 crocod-0.026
ia-0.025
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
 Clean+0.021
elt+0.020
 lately+0.019
amed+0.019
itted+0.019
Neg logit contributions
ls-0.020
l-0.019
 cor-0.018
 crocod-0.014
osh-0.014
Token They
Token ID1119
Feature activation-1.30e+00

Pos logit contributions
elt+0.014
amed+0.014
 lately+0.014
 Clean+0.014
itted+0.013
Neg logit contributions
ls-0.030
l-0.029
 cor-0.028
 crocod-0.025
ia-0.025
Token spent
Token ID3377
Feature activation-2.61e+00

Pos logit contributions
 Clean+0.023
 lately+0.021
elt+0.021
amed+0.021
side+0.021
Neg logit contributions
ls-0.036
l-0.034
 cor-0.032
 beast-0.028
 AW-0.028
Token their
Token ID511
Feature activation-3.24e+00

Pos logit contributions
 Clean+0.062
elt+0.058
 lately+0.058
side+0.057
amed+0.057
Neg logit contributions
ls-0.056
l-0.054
 cor-0.051
 crocod-0.041
 beast-0.040
Token days
Token ID1528
Feature activation-5.33e+00

Pos logit contributions
 Clean+0.054
elt+0.051
 lately+0.050
apping+0.048
side+0.048
Neg logit contributions
ls-0.092
l-0.090
 cor-0.086
 crocod-0.074
osh-0.073
Token running
Token ID2491
Feature activation-1.09e+00

Pos logit contributions
 Clean+0.089
 Fighting+0.087
 lately+0.083
elt+0.079
othes+0.078
Neg logit contributions
ls-0.153
l-0.146
 cor-0.145
 slit-0.121
 flies-0.121
Token and
Token ID290
Feature activation-1.05e+00

Pos logit contributions
elt+0.020
 Clean+0.020
uggets+0.019
othes+0.018
 lately+0.018
Neg logit contributions
ls-0.030
l-0.030
 cor-0.027
ches-0.023
ah-0.023
Token hopping
Token ID47153
Feature activation-3.93e-01

Pos logit contributions
 Clean+0.015
elt+0.015
 lately+0.015
itted+0.014
apping+0.014
Neg logit contributions
ls-0.031
l-0.030
 cor-0.030
 crocod-0.026
 roared-0.026
Token around
Token ID1088
Feature activation-1.49e+00

Pos logit contributions
 Clean+0.006
elt+0.005
 lately+0.005
 Working+0.005
apping+0.005
Neg logit contributions
ls-0.012
l-0.012
 cor-0.011
 crocod-0.010
osh-0.010
Token the
Token ID262
Feature activation+6.50e-01

Pos logit contributions
 Clean+0.027
 lately+0.025
elt+0.025
side+0.025
apping+0.025
Neg logit contributions
ls-0.040
l-0.039
 cor-0.037
 AW-0.031
ah-0.031
Token enjoyed
Token ID8359
Feature activation-3.97e+00

Pos logit contributions
 Clean+0.051
elt+0.048
 lately+0.047
side+0.046
apping+0.046
Neg logit contributions
ls-0.068
l-0.067
 cor-0.063
 crocod-0.053
 beast-0.052
Token the
Token ID262
Feature activation-1.56e+00

Pos logit contributions
elt+0.119
 Clean+0.119
 lately+0.118
oned+0.115
itted+0.115
Neg logit contributions
 cor-0.058
l-0.055
ls-0.055
 crocod-0.042
 slit-0.036
Token rest
Token ID1334
Feature activation-2.93e+00

Pos logit contributions
 Clean+0.028
elt+0.027
 cleaning+0.026
amed+0.026
side+0.026
Neg logit contributions
ls-0.042
l-0.039
 cor-0.036
ches-0.033
 AW-0.033
Token of
Token ID286
Feature activation-2.46e+00

Pos logit contributions
 Clean+0.053
elt+0.048
 Working+0.047
oned+0.047
 lately+0.046
Neg logit contributions
ls-0.082
l-0.079
 cor-0.075
osh-0.064
 crocod-0.064
Token their
Token ID511
Feature activation-3.37e+00

Pos logit contributions
 Clean+0.070
elt+0.070
 lately+0.068
itted+0.067
apping+0.067
Neg logit contributions
ls-0.040
l-0.039
 cor-0.038
 crocod-0.029
 slit-0.026
Token summer
Token ID3931
Feature activation-5.29e+00

Pos logit contributions
 Clean+0.065
elt+0.061
side+0.059
 lately+0.058
apping+0.058
Neg logit contributions
ls-0.087
l-0.085
 cor-0.080
 beast-0.068
 AW-0.067
Token day
Token ID1110
Feature activation-3.49e+00

Pos logit contributions
 Clean+0.091
elt+0.087
 lately+0.087
itted+0.084
oned+0.084
Neg logit contributions
ls-0.145
l-0.142
 cor-0.137
 crocod-0.116
osh-0.114
Token.
Token ID13
Feature activation-1.88e+00

Pos logit contributions
 Clean+0.064
elt+0.060
 lately+0.060
amed+0.057
 Working+0.057
Neg logit contributions
ls-0.094
l-0.091
 cor-0.087
 crocod-0.074
osh-0.073
Token<|endoftext|>
Token ID50256
Feature activation-1.49e+00

Pos logit contributions
 lately+0.036
amed+0.035
elt+0.035
side+0.034
 Clean+0.034
Neg logit contributions
l-0.044
ls-0.044
 cor-0.042
osh-0.037
 crocod-0.035
TokenThe
Token ID464
Feature activation+1.72e-01

Pos logit contributions
 Clean+0.032
elt+0.030
 lately+0.029
side+0.029
apping+0.029
Neg logit contributions
ls-0.035
l-0.034
 cor-0.032
 crocod-0.027
 AW-0.026
Token little
Token ID1310
Feature activation-1.36e+00

Pos logit contributions
ls+0.004
l+0.004
 cor+0.003
 crocod+0.003
 beast+0.003
Neg logit contributions
 Clean-0.004
elt-0.004
 lately-0.004
side-0.004
apping-0.004
Token Joe
Token ID5689
Feature activation+1.50e+00

Pos logit contributions
ls+0.020
l+0.019
 cor+0.019
 crocod+0.015
 AW+0.015
Neg logit contributions
 Clean-0.014
elt-0.014
 lately-0.013
side-0.013
 Working-0.012
Token forgot
Token ID16453
Feature activation-1.46e+00

Pos logit contributions
ls+0.038
l+0.036
 cor+0.034
 AW+0.030
 troll+0.029
Neg logit contributions
 Clean-0.030
 then-0.029
elt-0.028
amed-0.027
 lately-0.026
Token all
Token ID477
Feature activation-2.06e+00

Pos logit contributions
 Clean+0.029
elt+0.028
 lately+0.027
side+0.027
apping+0.026
Neg logit contributions
ls-0.036
l-0.035
 cor-0.034
 crocod-0.028
 beast-0.027
Token about
Token ID546
Feature activation-1.58e+00

Pos logit contributions
 Clean+0.041
elt+0.039
 lately+0.038
side+0.038
amed+0.038
Neg logit contributions
ls-0.052
l-0.051
 cor-0.048
 crocod-0.041
osh-0.040
Token his
Token ID465
Feature activation-1.59e+00

Pos logit contributions
 Clean+0.038
 lately+0.036
elt+0.036
side+0.035
apping+0.035
Neg logit contributions
ls-0.033
l-0.032
 cor-0.029
 AW-0.024
ah-0.023
Token k
Token ID479
Feature activation-5.27e+00

Pos logit contributions
 Clean+0.036
 lately+0.034
elt+0.033
apping+0.033
side+0.033
Neg logit contributions
ls-0.034
l-0.033
 cor-0.031
 AW-0.025
ah-0.025
Tokenite
Token ID578
Feature activation-1.86e+00

Pos logit contributions
 Clean+0.110
 lately+0.099
oned+0.098
side+0.098
 Then+0.096
Neg logit contributions
ls-0.141
l-0.131
 cor-0.122
 crocod-0.104
ah-0.104
Token,
Token ID11
Feature activation-1.58e+00

Pos logit contributions
 Clean+0.037
elt+0.035
 lately+0.034
side+0.034
itted+0.033
Neg logit contributions
ls-0.046
l-0.045
 cor-0.043
 crocod-0.036
ah-0.035
Token and
Token ID290
Feature activation+5.39e-01

Pos logit contributions
 Clean+0.028
elt+0.026
oned+0.025
amed+0.025
 lately+0.025
Neg logit contributions
l-0.042
ls-0.042
 cor-0.041
 crocod-0.035
osh-0.034
Token started
Token ID2067
Feature activation-1.44e+00

Pos logit contributions
ls+0.012
l+0.011
 cor+0.011
 AW+0.008
 crocod+0.008
Neg logit contributions
 Clean-0.013
elt-0.012
 lately-0.012
side-0.012
apping-0.012
Token to
Token ID284
Feature activation-9.12e-01

Pos logit contributions
 Clean+0.024
 lately+0.023
elt+0.023
side+0.022
amed+0.022
Neg logit contributions
ls-0.040
l-0.040
 cor-0.038
 crocod-0.032
osh-0.032
Token.
Token ID13
Feature activation+1.72e+00

Pos logit contributions
 cor+0.069
ls+0.066
l+0.060
 troll+0.058
 crow+0.057
Neg logit contributions
 Clean-0.037
apping-0.036
 together-0.036
side-0.035
 lately-0.034
Token They
Token ID1119
Feature activation+1.03e+00

Pos logit contributions
l+0.047
ls+0.045
 cor+0.043
ia+0.040
 crocod+0.040
Neg logit contributions
amed-0.028
oned-0.026
 lately-0.026
aming-0.025
side-0.025
Token do
Token ID466
Feature activation+2.29e+00

Pos logit contributions
ls+0.030
l+0.027
 cor+0.027
 beast+0.023
 troll+0.023
Neg logit contributions
 Clean-0.017
 then-0.015
 lately-0.015
days-0.015
side-0.014
Token not
Token ID407
Feature activation+2.02e+00

Pos logit contributions
ls+0.069
 cor+0.065
l+0.064
 crocod+0.057
 beast+0.057
Neg logit contributions
 Clean-0.034
 then-0.030
 organizing-0.029
side-0.029
 Working-0.029
Token make
Token ID787
Feature activation+4.30e+00

Pos logit contributions
ls+0.045
l+0.045
 cor+0.042
 crocod+0.034
ah+0.034
Neg logit contributions
 Clean-0.045
elt-0.043
 lately-0.042
side-0.042
amed-0.042
Token loud
Token ID7812
Feature activation+4.29e+00

Pos logit contributions
ls+0.116
 cor+0.113
l+0.111
 troll+0.098
ah+0.097
Neg logit contributions
 Clean-0.080
 themselves-0.071
 birthday-0.068
 cleaning-0.067
 then-0.066
Token noises
Token ID26782
Feature activation+4.75e+00

Pos logit contributions
ls+0.097
 cor+0.089
l+0.087
ira+0.071
osh+0.071
Neg logit contributions
 Clean-0.097
elt-0.088
apping-0.088
 lately-0.087
 Working-0.087
Token.
Token ID13
Feature activation+1.76e+00

Pos logit contributions
ls+0.114
l+0.109
 cor+0.106
 crocod+0.086
 AW+0.086
Neg logit contributions
 Clean-0.100
 lately-0.094
elt-0.093
apping-0.092
side-0.091
Token They
Token ID1119
Feature activation+1.11e+00

Pos logit contributions
l+0.047
ls+0.045
 cor+0.043
ia+0.040
 crocod+0.039
Neg logit contributions
amed-0.030
oned-0.028
 lately-0.028
side-0.028
aming-0.027
Token are
Token ID389
Feature activation-4.26e-01

Pos logit contributions
ls+0.032
l+0.029
 cor+0.029
 beast+0.025
 troll+0.025
Neg logit contributions
 Clean-0.019
 then-0.017
 lately-0.016
days-0.016
 anniversary-0.015
Token quiet
Token ID5897
Feature activation-5.19e-01

Pos logit contributions
 Clean+0.009
elt+0.009
side+0.008
 lately+0.008
oned+0.008
Neg logit contributions
ls-0.010
l-0.010
 cor-0.009
 crocod-0.008
ia-0.008
Token day
Token ID1110
Feature activation-1.34e+00

Pos logit contributions
 Clean+0.001
 lately+0.001
elt+0.001
apping+0.001
side+0.001
Neg logit contributions
ls-0.003
l-0.003
 cor-0.003
 AW-0.003
 beast-0.002
Token,
Token ID11
Feature activation-5.79e-01

Pos logit contributions
 Clean+0.024
elt+0.023
 lately+0.022
side+0.022
amed+0.021
Neg logit contributions
ls-0.037
l-0.036
 cor-0.034
 crocod-0.029
osh-0.028
Token the
Token ID262
Feature activation-1.61e-01

Pos logit contributions
 Clean+0.010
elt+0.010
 lately+0.009
side+0.009
apping+0.009
Neg logit contributions
ls-0.016
l-0.016
 cor-0.015
 crocod-0.013
osh-0.012
Token TV
Token ID3195
Feature activation-3.22e-01

Pos logit contributions
 Clean+0.003
elt+0.003
oned+0.003
apping+0.003
amed+0.003
Neg logit contributions
ls-0.004
l-0.004
 cor-0.004
 crocod-0.003
 giant-0.003
Token made
Token ID925
Feature activation+1.89e+00

Pos logit contributions
 Clean+0.008
elt+0.007
 lately+0.007
side+0.007
apping+0.007
Neg logit contributions
ls-0.007
l-0.007
 cor-0.006
 AW-0.005
 crocod-0.005
Token a
Token ID257
Feature activation+3.62e+00

Pos logit contributions
ls+0.055
l+0.052
 cor+0.051
 AW+0.046
 beast+0.045
Neg logit contributions
 Clean-0.031
elt-0.028
wards-0.028
 cleaning-0.027
amed-0.027
Token loud
Token ID7812
Feature activation+3.56e+00

Pos logit contributions
ls+0.071
l+0.065
 cor+0.061
osh+0.049
ah+0.049
Neg logit contributions
 Clean-0.094
 birthday-0.089
 cleaning-0.086
 organizing-0.086
apping-0.084
Token noise
Token ID7838
Feature activation+3.33e+00

Pos logit contributions
ls+0.078
l+0.072
 cor+0.071
ira+0.054
osh+0.054
Neg logit contributions
 Clean-0.086
elt-0.077
apping-0.077
 Working-0.076
 lately-0.076
Token and
Token ID290
Feature activation+2.63e+00

Pos logit contributions
ls+0.090
l+0.087
 cor+0.083
 crocod+0.070
 AW+0.070
Neg logit contributions
 Clean-0.061
elt-0.057
 lately-0.056
side-0.054
apping-0.054
Token stopped
Token ID5025
Feature activation-7.71e-01

Pos logit contributions
ls+0.068
l+0.066
 cor+0.062
 AW+0.053
ah+0.053
Neg logit contributions
 Clean-0.051
elt-0.047
 lately-0.047
 then-0.045
wards-0.045
Token working
Token ID1762
Feature activation+1.12e+00

Pos logit contributions
 Clean+0.014
elt+0.013
 lately+0.013
apping+0.013
itted+0.012
Neg logit contributions
ls-0.021
l-0.021
 cor-0.019
 AW-0.017
ah-0.017
Token does
Token ID857
Feature activation+2.57e+00

Pos logit contributions
ls+0.010
l+0.010
 cor+0.009
 crocod+0.008
 AW+0.008
Neg logit contributions
 Clean-0.012
 then-0.011
elt-0.011
 Then-0.011
 Working-0.011
Token not
Token ID407
Feature activation+1.96e+00

Pos logit contributions
ls+0.070
l+0.067
 cor+0.065
 crocod+0.055
ah+0.055
Neg logit contributions
 Clean-0.046
 lately-0.043
elt-0.042
apping-0.041
side-0.041
Token want
Token ID765
Feature activation+2.81e+00

Pos logit contributions
ls+0.051
l+0.051
 cor+0.048
 crocod+0.040
ah+0.040
Neg logit contributions
 Clean-0.037
elt-0.035
side-0.034
 lately-0.034
amed-0.033
Token to
Token ID284
Feature activation+3.32e+00

Pos logit contributions
ls+0.084
l+0.082
 cor+0.078
 AW+0.068
ah+0.067
Neg logit contributions
 Clean-0.043
elt-0.040
 lately-0.040
side-0.037
itted-0.037
Token make
Token ID787
Feature activation+3.63e+00

Pos logit contributions
ls+0.066
l+0.065
 cor+0.062
 crocod+0.050
ah+0.047
Neg logit contributions
elt-0.081
 Clean-0.081
amed-0.080
itted-0.079
side-0.079
Token a
Token ID257
Feature activation+4.34e+00

Pos logit contributions
ls+0.097
l+0.095
 cor+0.092
 AW+0.083
osh+0.082
Neg logit contributions
 Clean-0.064
 cleaning-0.057
 Then-0.056
 themselves-0.056
 lately-0.056
Token mess
Token ID2085
Feature activation+2.35e+00

Pos logit contributions
ls+0.091
l+0.087
 cor+0.084
osh+0.075
 AW+0.071
Neg logit contributions
 Clean-0.095
 birthday-0.092
 anniversary-0.088
 cleaning-0.087
elt-0.087
Token or
Token ID393
Feature activation+2.43e+00

Pos logit contributions
ls+0.046
l+0.045
 cor+0.035
ia+0.030
 crocod+0.028
Neg logit contributions
elt-0.062
amed-0.061
 While-0.060
 Clean-0.060
itted-0.059
Token make
Token ID787
Feature activation+3.90e+00

Pos logit contributions
ls+0.050
l+0.049
 cor+0.047
 crocod+0.038
 slit+0.036
Neg logit contributions
 Clean-0.059
elt-0.057
apping-0.057
amed-0.056
side-0.056
Token Mom
Token ID11254
Feature activation+1.28e+00

Pos logit contributions
ls+0.100
 cor+0.097
l+0.096
ah+0.086
 AW+0.084
Neg logit contributions
 Clean-0.072
elt-0.064
 cleaning-0.064
itted-0.063
wards-0.062
Token angry
Token ID7954
Feature activation+2.74e+00

Pos logit contributions
ls+0.026
l+0.025
 cor+0.024
 AW+0.019
 beast+0.019
Neg logit contributions
 Clean-0.032
elt-0.030
 lately-0.030
apping-0.029
itted-0.029
Token when
Token ID618
Feature activation+2.26e+00

Pos logit contributions
ls+0.026
l+0.025
 cor+0.024
 crocod+0.020
 beast+0.020
Neg logit contributions
 Clean-0.019
elt-0.018
 lately-0.017
side-0.017
apping-0.017
Token he
Token ID339
Feature activation+1.91e+00

Pos logit contributions
ls+0.046
l+0.044
 cor+0.041
 AW+0.035
ah+0.034
Neg logit contributions
 Clean-0.055
 lately-0.053
elt-0.051
wards-0.051
side-0.051
Token couldn
Token ID3521
Feature activation+2.44e+00

Pos logit contributions
ls+0.051
l+0.048
 AW+0.047
 cor+0.046
 crocod+0.040
Neg logit contributions
 then-0.036
elt-0.030
 Clean-0.030
 lately-0.028
 Then-0.028
Token't
Token ID470
Feature activation+1.41e+00

Pos logit contributions
 cor+0.076
ls+0.075
l+0.072
 troll+0.065
 crocod+0.064
Neg logit contributions
 Clean-0.032
 Working-0.028
oned-0.027
 lately-0.025
 then-0.025
Token see
Token ID766
Feature activation+2.85e+00

Pos logit contributions
ls+0.032
l+0.031
 cor+0.030
 crocod+0.024
 beast+0.024
Neg logit contributions
 Clean-0.031
elt-0.030
 lately-0.029
side-0.029
apping-0.029
Token anything
Token ID1997
Feature activation+3.94e+00

Pos logit contributions
ls+0.068
l+0.065
 cor+0.060
 AW+0.058
 slit+0.053
Neg logit contributions
 Clean-0.057
 lately-0.057
 then-0.054
 cleaning-0.052
elt-0.051
Token.
Token ID13
Feature activation+4.16e-01

Pos logit contributions
ls+0.097
 cor+0.093
l+0.092
ira+0.080
ah+0.079
Neg logit contributions
 Clean-0.076
 lately-0.075
side-0.069
apping-0.068
elt-0.068
Token
Token ID198
Feature activation+5.13e-01

Pos logit contributions
ls+0.011
l+0.011
 cor+0.010
 crocod+0.009
ia+0.009
Neg logit contributions
 Clean-0.007
elt-0.007
 lately-0.007
side-0.007
amed-0.007
Token
Token ID198
Feature activation-3.91e-01

Pos logit contributions
ls+0.016
l+0.015
 cor+0.015
 AW+0.013
 crocod+0.013
Neg logit contributions
 Clean-0.007
elt-0.006
 Working-0.006
side-0.006
apping-0.006
Token"
Token ID1
Feature activation-1.77e+00

Pos logit contributions
 Clean+0.007
elt+0.007
 lately+0.007
itted+0.007
apping+0.007
Neg logit contributions
ls-0.010
l-0.010
 cor-0.010
ia-0.008
 crocod-0.008
TokenYes
Token ID5297
Feature activation+5.65e-01

Pos logit contributions
 Clean+0.024
elt+0.022
 lately+0.021
side+0.021
apping+0.020
Neg logit contributions
ls-0.057
l-0.055
 cor-0.053
 crocod-0.046
 AW-0.046
Token and
Token ID290
Feature activation+1.97e+00

Pos logit contributions
 Clean+0.023
elt+0.022
 lately+0.021
side+0.021
apping+0.020
Neg logit contributions
ls-0.036
l-0.035
 cor-0.034
 crocod-0.028
 beast-0.028
Token why
Token ID1521
Feature activation+2.63e+00

Pos logit contributions
ls+0.045
l+0.045
 cor+0.042
 crocod+0.035
osh+0.034
Neg logit contributions
 Clean-0.043
elt-0.041
 lately-0.040
apping-0.040
side-0.040
Token it
Token ID340
Feature activation+3.51e+00

Pos logit contributions
ls+0.074
l+0.071
 cor+0.071
ah+0.064
 AW+0.062
Neg logit contributions
 Clean-0.043
 lately-0.039
 cleaning-0.038
 organizing-0.038
 buying-0.037
Token made
Token ID925
Feature activation+3.17e+00

Pos logit contributions
ls+0.094
l+0.090
 AW+0.087
 cor+0.087
ia+0.079
Neg logit contributions
 Clean-0.061
 then-0.054
elt-0.053
 cleaning-0.052
 including-0.052
Token that
Token ID326
Feature activation+2.51e+00

Pos logit contributions
ls+0.082
l+0.078
 cor+0.076
 AW+0.064
ah+0.064
Neg logit contributions
 Clean-0.062
 lately-0.057
elt-0.056
 cleaning-0.054
side-0.054
Token noise
Token ID7838
Feature activation+3.81e+00

Pos logit contributions
ls+0.059
l+0.057
 cor+0.055
ah+0.045
 AW+0.045
Neg logit contributions
 Clean-0.054
 lately-0.050
elt-0.050
side-0.048
apping-0.048
Token.
Token ID13
Feature activation+1.25e+00

Pos logit contributions
ls+0.094
l+0.090
 cor+0.090
 AW+0.075
ah+0.075
Neg logit contributions
 Clean-0.077
 lately-0.071
elt-0.069
apping-0.067
 including-0.067
Token They
Token ID1119
Feature activation+1.54e-01

Pos logit contributions
l+0.031
ls+0.030
 cor+0.027
ia+0.026
 Ray+0.026
Neg logit contributions
amed-0.023
 lately-0.022
side-0.022
aming-0.022
oned-0.022
Token waved
Token ID26834
Feature activation+9.00e-01

Pos logit contributions
ls+0.004
l+0.004
 cor+0.004
 AW+0.003
 beast+0.003
Neg logit contributions
 Clean-0.003
elt-0.003
 lately-0.003
amed-0.003
 then-0.003
Token to
Token ID284
Feature activation+2.39e+00

Pos logit contributions
ls+0.024
l+0.024
 cor+0.023
 crocod+0.019
 beast+0.019
Neg logit contributions
 Clean-0.016
elt-0.015
 lately-0.015
apping-0.015
side-0.014
Token the
Token ID262
Feature activation+5.01e-01

Pos logit contributions
ls+0.073
l+0.068
 cor+0.065
 AW+0.058
ah+0.057
Neg logit contributions
 Clean-0.040
 then-0.034
 themselves-0.034
 Then-0.033
wards-0.033
Token went
Token ID1816
Feature activation-1.55e+00

Pos logit contributions
ls+0.048
l+0.046
 cor+0.044
 AW+0.039
 beast+0.038
Neg logit contributions
 Clean-0.020
elt-0.017
 lately-0.017
side-0.016
amed-0.016
Token out
Token ID503
Feature activation+2.19e-01

Pos logit contributions
 Clean+0.030
side+0.030
amed+0.030
 lately+0.029
keeping+0.029
Neg logit contributions
ls-0.039
l-0.037
 cor-0.035
 crocod-0.030
ches-0.028
Token to
Token ID284
Feature activation-2.54e-01

Pos logit contributions
ls+0.006
l+0.006
 cor+0.005
 crocod+0.005
osh+0.004
Neg logit contributions
 Clean-0.004
elt-0.004
side-0.004
amed-0.004
 lately-0.004
Token search
Token ID2989
Feature activation+1.16e+00

Pos logit contributions
 Clean+0.004
 cleaning+0.003
 organizing+0.003
 birthday+0.003
wards+0.003
Neg logit contributions
ls-0.008
l-0.008
 cor-0.007
 AW-0.007
ah-0.006
Token for
Token ID329
Feature activation-1.58e+00

Pos logit contributions
ls+0.032
l+0.031
 cor+0.029
 crocod+0.025
osh+0.024
Neg logit contributions
 Clean-0.021
elt-0.020
side-0.019
 lately-0.019
amed-0.019
Token something
Token ID1223
Feature activation+2.67e+00

Pos logit contributions
 Clean+0.028
 lately+0.026
 cleaning+0.026
 including+0.026
 organizing+0.026
Neg logit contributions
ls-0.042
l-0.041
 cor-0.037
ah-0.036
 AW-0.036
Token special
Token ID2041
Feature activation-1.30e+00

Pos logit contributions
ls+0.063
l+0.062
 cor+0.058
osh+0.049
ah+0.049
Neg logit contributions
 Clean-0.057
elt-0.053
 lately-0.053
apping-0.051
side-0.051
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
amed+0.035
 Clean+0.034
elt+0.034
apping+0.034
oned+0.034
Neg logit contributions
ls-0.022
l-0.021
 cor-0.020
 crocod-0.016
 trick-0.016
Token He
Token ID679
Feature activation+4.68e-01

Pos logit contributions
amed+0.020
 Clean+0.020
elt+0.019
apping+0.019
 lately+0.019
Neg logit contributions
l-0.023
ls-0.023
 cor-0.022
 crocod-0.019
osh-0.018
Token went
Token ID1816
Feature activation-1.28e+00

Pos logit contributions
ls+0.012
l+0.012
 cor+0.011
 AW+0.009
 beast+0.009
Neg logit contributions
 Clean-0.009
elt-0.009
 lately-0.009
side-0.008
amed-0.008
Token out
Token ID503
Feature activation+3.99e-01

Pos logit contributions
 Clean+0.027
elt+0.027
itted+0.027
apping+0.026
 Fighting+0.026
Neg logit contributions
ls-0.031
l-0.030
 cor-0.028
 crocod-0.023
 trick-0.023
Token dragon
Token ID10441
Feature activation+6.34e-01

Pos logit contributions
ls+0.000
l+0.000
 AW+0.000
 cor+0.000
osh+0.000
Neg logit contributions
 birthday-0.000
 lately-0.000
 cleaning-0.000
 outside-0.000
 celebrating-0.000
Token breathing
Token ID12704
Feature activation+1.83e+00

Pos logit contributions
ls+0.017
l+0.016
 cor+0.016
 AW+0.014
ira+0.014
Neg logit contributions
 Clean-0.011
apping-0.010
 then-0.010
elt-0.010
 lately-0.010
Token fire
Token ID2046
Feature activation+1.09e+00

Pos logit contributions
ls+0.048
l+0.045
 cor+0.044
ira+0.039
 AW+0.038
Neg logit contributions
 Clean-0.033
side-0.031
 lately-0.031
amed-0.030
 including-0.030
Token.
Token ID13
Feature activation+5.10e-01

Pos logit contributions
ls+0.028
l+0.027
 cor+0.026
 AW+0.022
ira+0.022
Neg logit contributions
 Clean-0.021
elt-0.020
side-0.019
 lately-0.019
apping-0.019
Token He
Token ID679
Feature activation+5.25e-01

Pos logit contributions
ls+0.014
l+0.014
 cor+0.013
 crocod+0.011
 beast+0.011
Neg logit contributions
 Clean-0.009
elt-0.008
 lately-0.008
side-0.008
apping-0.008
Token brave
Token ID14802
Feature activation+7.00e-01

Pos logit contributions
ls+0.014
l+0.013
 AW+0.012
 cor+0.012
inkle+0.010
Neg logit contributions
 then-0.010
 Clean-0.009
amed-0.009
 Then-0.008
elt-0.008
Tokenly
Token ID306
Feature activation-7.29e-02

Pos logit contributions
ls+0.019
l+0.019
 cor+0.018
 AW+0.015
osh+0.015
Neg logit contributions
 Clean-0.012
elt-0.012
 lately-0.011
itted-0.011
apping-0.011
Token swung
Token ID28507
Feature activation-7.62e-01

Pos logit contributions
 Clean+0.002
elt+0.001
 then+0.001
 lately+0.001
amed+0.001
Neg logit contributions
ls-0.002
l-0.002
 cor-0.002
 AW-0.001
ia-0.001
Token his
Token ID465
Feature activation-1.75e+00

Pos logit contributions
 Clean+0.017
elt+0.016
 lately+0.016
side+0.015
amed+0.015
Neg logit contributions
ls-0.018
l-0.017
 cor-0.016
 crocod-0.013
 beast-0.013
Token sword
Token ID8429
Feature activation-1.92e-01

Pos logit contributions
 birthday+0.042
 Clean+0.042
itted+0.041
 cleaning+0.041
 then+0.041
Neg logit contributions
l-0.031
ls-0.030
 cor-0.026
ah-0.022
 AW-0.022
Token and
Token ID290
Feature activation+9.92e-01

Pos logit contributions
 Clean+0.003
elt+0.003
 lately+0.003
side+0.002
apping+0.002
Neg logit contributions
ls-0.006
l-0.006
 cor-0.006
 crocod-0.005
 beast-0.005
Token want
Token ID765
Feature activation+4.14e-01

Pos logit contributions
l+0.008
ls+0.008
 cor+0.007
ah+0.006
 crocod+0.006
Neg logit contributions
 Clean-0.007
elt-0.006
side-0.006
 lately-0.006
wards-0.006
Token to
Token ID284
Feature activation+1.77e+00

Pos logit contributions
ls+0.011
l+0.010
 cor+0.010
 crocod+0.008
 crow+0.008
Neg logit contributions
 Clean-0.008
apping-0.008
itted-0.008
side-0.008
elt-0.007
Token play
Token ID711
Feature activation+1.29e+00

Pos logit contributions
ls+0.049
l+0.047
 cor+0.045
 crocod+0.038
 beast+0.038
Neg logit contributions
 Clean-0.031
elt-0.029
 lately-0.029
side-0.028
apping-0.028
Token with
Token ID351
Feature activation+6.59e-02

Pos logit contributions
ls+0.037
l+0.037
 cor+0.034
 crocod+0.030
 beast+0.029
Neg logit contributions
 Clean-0.020
elt-0.019
itted-0.019
side-0.019
 lately-0.019
Token me
Token ID502
Feature activation-9.91e-01

Pos logit contributions
ls+0.002
l+0.002
 cor+0.002
 AW+0.001
ah+0.001
Neg logit contributions
 Clean-0.001
 lately-0.001
elt-0.001
 then-0.001
 cleaning-0.001
Token and
Token ID290
Feature activation+1.26e+00

Pos logit contributions
 Clean+0.027
elt+0.026
itted+0.025
side+0.025
wards+0.025
Neg logit contributions
ls-0.018
l-0.017
 cor-0.015
ah-0.012
ia-0.012
Token my
Token ID616
Feature activation-1.38e+00

Pos logit contributions
ls+0.029
l+0.028
 cor+0.027
 crocod+0.022
 beast+0.021
Neg logit contributions
 Clean-0.028
elt-0.027
 lately-0.026
side-0.026
apping-0.026
Token doll
Token ID3654
Feature activation+5.74e-02

Pos logit contributions
 Clean+0.026
elt+0.024
 lately+0.024
 birthday+0.023
 anniversary+0.023
Neg logit contributions
ls-0.036
l-0.035
 cor-0.033
osh-0.028
 AW-0.028
Token?"
Token ID1701
Feature activation+1.25e+00

Pos logit contributions
ls+0.001
l+0.001
 cor+0.001
 crocod+0.001
 AW+0.001
Neg logit contributions
 Clean-0.001
elt-0.001
 lately-0.001
side-0.001
apping-0.001
Token
Token ID198
Feature activation+6.47e-01

Pos logit contributions
l+0.033
ls+0.033
 cor+0.030
 crocod+0.026
 crow+0.025
Neg logit contributions
amed-0.023
 Clean-0.022
 lately-0.022
itted-0.022
side-0.021
Token
Token ID198
Feature activation-1.67e-01

Pos logit contributions
ls+0.020
l+0.020
 cor+0.019
 crocod+0.017
 beast+0.017
Neg logit contributions
 Clean-0.009
elt-0.008
side-0.008
 lately-0.008
apping-0.007
Token safe
Token ID3338
Feature activation-1.75e+00

Pos logit contributions
 Clean+0.039
elt+0.038
 lately+0.036
side+0.035
itted+0.035
Neg logit contributions
ls-0.059
l-0.058
 cor-0.056
 crocod-0.047
osh-0.046
Token and
Token ID290
Feature activation-1.55e-02

Pos logit contributions
 Clean+0.036
elt+0.035
oned+0.034
uggets+0.034
itted+0.034
Neg logit contributions
ls-0.042
l-0.041
 cor-0.039
 beast-0.032
 crocod-0.031
Token sound
Token ID2128
Feature activation+2.45e+00

Pos logit contributions
elt+0.000
uggets+0.000
 Clean+0.000
oned+0.000
apping+0.000
Neg logit contributions
l-0.000
ls-0.000
 cor-0.000
osh-0.000
 flies-0.000
Token!
Token ID0
Feature activation-1.10e+00

Pos logit contributions
ls+0.060
l+0.058
 cor+0.056
 crocod+0.047
 beast+0.046
Neg logit contributions
 Clean-0.050
 lately-0.048
elt-0.047
side-0.046
apping-0.045
Token<|endoftext|>
Token ID50256
Feature activation-1.32e+00

Pos logit contributions
 lately+0.022
amed+0.021
uggets+0.021
 Clean+0.021
side+0.021
Neg logit contributions
l-0.026
ls-0.025
 cor-0.023
ia-0.020
ira-0.020
TokenOnce
Token ID7454
Feature activation+1.83e+00

Pos logit contributions
 Clean+0.028
elt+0.026
 lately+0.026
side+0.025
apping+0.025
Neg logit contributions
ls-0.032
l-0.030
 cor-0.029
 crocod-0.024
 AW-0.024
Token there
Token ID612
Feature activation+2.00e+00

Pos logit contributions
 cor+0.047
ls+0.044
l+0.043
 beast+0.039
 AW+0.039
Neg logit contributions
 Clean-0.034
 lately-0.033
apping-0.032
elt-0.031
 ago-0.031
Token was
Token ID373
Feature activation-7.99e-01

Pos logit contributions
ls+0.047
l+0.044
 cor+0.043
 AW+0.035
 beast+0.035
Neg logit contributions
 Clean-0.045
elt-0.042
 lately-0.042
side-0.040
wards-0.040
Token a
Token ID257
Feature activation+2.11e+00

Pos logit contributions
 Clean+0.021
side+0.021
oned+0.020
apping+0.020
 Working+0.020
Neg logit contributions
l-0.013
 cor-0.013
ls-0.013
 crocod-0.010
 flies-0.010
Token generous
Token ID14431
Feature activation-3.90e-02

Pos logit contributions
ls+0.050
l+0.047
 cor+0.042
 AW+0.038
 weighs+0.037
Neg logit contributions
 Clean-0.048
apping-0.043
 cleaning-0.042
 birthday-0.042
 organizing-0.041
Token man
Token ID582
Feature activation+7.58e-01

Pos logit contributions
 Clean+0.001
elt+0.001
side+0.001
 lately+0.001
amed+0.001
Neg logit contributions
ls-0.001
l-0.001
 cor-0.001
 crocod-0.001
 beast-0.000
Token Tom
Token ID4186
Feature activation+2.56e+00

Pos logit contributions
ls+0.015
l+0.015
 cor+0.013
 AW+0.012
inkle+0.012
Neg logit contributions
 Clean-0.010
 lately-0.009
elt-0.009
 organizing-0.009
uggets-0.008
Token.
Token ID13
Feature activation-1.04e+00

Pos logit contributions
ls+0.065
l+0.062
 cor+0.060
 AW+0.052
 beast+0.049
Neg logit contributions
 Clean-0.053
 lately-0.049
elt-0.049
apping-0.047
side-0.046
Token Tom
Token ID4186
Feature activation+2.12e+00

Pos logit contributions
 Clean+0.021
elt+0.019
 lately+0.019
 Then+0.019
side+0.018
Neg logit contributions
ls-0.028
l-0.026
 cor-0.026
 beast-0.021
 crow-0.021
Token was
Token ID373
Feature activation-1.47e+00

Pos logit contributions
ls+0.055
l+0.053
 cor+0.051
 crocod+0.043
osh+0.042
Neg logit contributions
 Clean-0.040
elt-0.039
 lately-0.038
side-0.037
apping-0.037
Token 3
Token ID513
Feature activation-1.95e+00

Pos logit contributions
 Clean+0.027
side+0.027
keeping+0.026
oned+0.026
elt+0.025
Neg logit contributions
ls-0.036
l-0.036
 cor-0.035
 crocod-0.031
osh-0.029
Token years
Token ID812
Feature activation+8.91e-01

Pos logit contributions
 Clean+0.022
 lately+0.019
elt+0.018
apping+0.018
side+0.017
Neg logit contributions
ls-0.066
l-0.064
 cor-0.062
 AW-0.056
osh-0.055
Token old
Token ID1468
Feature activation-2.89e+00

Pos logit contributions
ls+0.017
l+0.017
 cor+0.015
 crocod+0.013
ia+0.012
Neg logit contributions
 Clean-0.023
elt-0.023
amed-0.022
oned-0.022
side-0.022
Token and
Token ID290
Feature activation+1.73e+00

Pos logit contributions
oned+0.063
amed+0.061
side+0.061
 Clean+0.061
itted+0.060
Neg logit contributions
ls-0.065
l-0.065
 cor-0.062
 crocod-0.053
 beast-0.051
Token he
Token ID339
Feature activation+7.34e-01

Pos logit contributions
ls+0.049
l+0.048
 cor+0.046
 crocod+0.039
 AW+0.039
Neg logit contributions
 Clean-0.029
elt-0.027
 lately-0.027
side-0.026
apping-0.026
Token was
Token ID373
Feature activation-1.86e+00

Pos logit contributions
ls+0.024
l+0.024
 cor+0.023
 AW+0.020
 crocod+0.019
Neg logit contributions
 Clean-0.010
elt-0.008
 lately-0.008
amed-0.008
 Working-0.008
Token not
Token ID407
Feature activation+8.35e-01

Pos logit contributions
side+0.029
 Clean+0.029
elt+0.028
oned+0.028
 lately+0.027
Neg logit contributions
 cor-0.050
l-0.050
ls-0.050
 crocod-0.044
ah-0.042
Token indoors
Token ID31797
Feature activation-1.76e+00

Pos logit contributions
elt+0.017
 lately+0.017
 Clean+0.016
apping+0.016
Playing+0.016
Neg logit contributions
ls-0.019
l-0.019
 cor-0.017
ches-0.014
 trick-0.014
Token to
Token ID284
Feature activation+2.92e-01

Pos logit contributions
elt+0.044
 Clean+0.044
amed+0.042
itted+0.042
 lately+0.041
Neg logit contributions
l-0.034
ls-0.033
 cor-0.031
 crocod-0.024
ia-0.023
Token tell
Token ID1560
Feature activation-1.54e-01

Pos logit contributions
ls+0.007
l+0.007
 cor+0.006
 AW+0.005
ah+0.005
Neg logit contributions
 Clean-0.006
 lately-0.006
wards-0.006
side-0.006
 organizing-0.006
Token dad
Token ID9955
Feature activation-1.06e+00

Pos logit contributions
 Clean+0.003
 lately+0.003
wards+0.002
elt+0.002
side+0.002
Neg logit contributions
ls-0.004
l-0.004
 cor-0.004
 AW-0.003
ah-0.003
Token the
Token ID262
Feature activation+5.22e-01

Pos logit contributions
 Clean+0.017
 lately+0.016
side+0.015
elt+0.015
 Working+0.015
Neg logit contributions
ls-0.031
l-0.029
 cor-0.029
 AW-0.026
 slit-0.025
Token news
Token ID1705
Feature activation-5.62e-01

Pos logit contributions
ls+0.012
l+0.011
 cor+0.011
osh+0.009
 AW+0.009
Neg logit contributions
 Clean-0.011
 birthday-0.011
 Working-0.011
 anniversary-0.011
 cleaning-0.010
Token.
Token ID13
Feature activation-1.16e+00

Pos logit contributions
 Clean+0.011
 lately+0.010
elt+0.010
apping+0.010
side+0.010
Neg logit contributions
ls-0.014
l-0.014
 cor-0.013
 AW-0.011
 crocod-0.011
Token Dad
Token ID17415
Feature activation-8.61e-01

Pos logit contributions
 Clean+0.018
amed+0.018
 lately+0.018
elt+0.017
side+0.017
Neg logit contributions
ls-0.032
l-0.031
 cor-0.031
 crocod-0.026
ia-0.026
Token was
Token ID373
Feature activation-1.63e+00

Pos logit contributions
 Clean+0.019
elt+0.018
amed+0.017
 then+0.017
 Working+0.017
Neg logit contributions
ls-0.020
l-0.019
 cor-0.019
 AW-0.016
 crocod-0.015
Token so
Token ID523
Feature activation+9.49e-01

Pos logit contributions
elt+0.028
 Clean+0.027
othes+0.026
 lately+0.026
side+0.026
Neg logit contributions
l-0.043
 cor-0.042
ls-0.041
 crow-0.035
 giant-0.035
Token proud
Token ID6613
Feature activation-8.85e-01

Pos logit contributions
ls+0.029
l+0.028
 cor+0.027
ah+0.024
 AW+0.024
Neg logit contributions
 Clean-0.014
apping-0.013
elt-0.013
amed-0.012
 lately-0.012
Token They
Token ID1119
Feature activation-1.47e-01

Pos logit contributions
ls+0.012
l+0.012
 cor+0.011
 says+0.010
ia+0.010
Neg logit contributions
amed-0.009
oned-0.008
aming-0.008
keeping-0.007
 lately-0.007
Token take
Token ID1011
Feature activation+1.20e+00

Pos logit contributions
 Clean+0.003
 then+0.003
 lately+0.002
amed+0.002
 including+0.002
Neg logit contributions
ls-0.004
l-0.004
 cor-0.003
 beast-0.003
 crocod-0.003
Token the
Token ID262
Feature activation+8.83e-01

Pos logit contributions
ls+0.027
l+0.027
 cor+0.025
 beast+0.022
 crocod+0.022
Neg logit contributions
 Clean-0.027
elt-0.024
amed-0.024
side-0.024
 lately-0.024
Token red
Token ID2266
Feature activation-3.53e-01

Pos logit contributions
ls+0.022
l+0.021
 cor+0.020
ah+0.017
ia+0.017
Neg logit contributions
 Clean-0.017
elt-0.016
 cleaning-0.016
 lately-0.016
 organizing-0.016
Token square
Token ID6616
Feature activation-1.77e+00

Pos logit contributions
 Clean+0.009
amed+0.009
 lately+0.009
elt+0.009
apping+0.009
Neg logit contributions
ls-0.007
l-0.007
 cor-0.006
 crocod-0.005
ah-0.005
Token block
Token ID2512
Feature activation-2.32e-01

Pos logit contributions
 Clean+0.036
elt+0.034
 lately+0.034
itted+0.032
 Working+0.032
Neg logit contributions
ls-0.046
l-0.044
 cor-0.042
 crocod-0.037
ia-0.035
Token and
Token ID290
Feature activation+1.69e+00

Pos logit contributions
 Clean+0.004
elt+0.004
 lately+0.004
 Working+0.003
itted+0.003
Neg logit contributions
ls-0.007
l-0.007
 cor-0.006
 crocod-0.005
osh-0.005
Token some
Token ID617
Feature activation+4.92e-01

Pos logit contributions
ls+0.038
l+0.037
 cor+0.035
 beast+0.028
ah+0.028
Neg logit contributions
 Clean-0.038
elt-0.035
 lately-0.035
side-0.034
wards-0.033
Token other
Token ID584
Feature activation-1.51e+00

Pos logit contributions
ls+0.011
l+0.011
 cor+0.010
 crocod+0.009
osh+0.008
Neg logit contributions
 Clean-0.011
elt-0.010
 lately-0.010
side-0.010
apping-0.010
Token blocks
Token ID7021
Feature activation-7.95e-01

Pos logit contributions
 Clean+0.029
elt+0.027
 lately+0.026
side+0.026
apping+0.026
Neg logit contributions
ls-0.039
l-0.038
 cor-0.036
 AW-0.030
 beast-0.030
Token and
Token ID290
Feature activation+9.95e-01

Pos logit contributions
 Clean+0.018
 lately+0.018
elt+0.017
 Working+0.017
oned+0.016
Neg logit contributions
ls-0.019
l-0.018
 cor-0.016
 crocod-0.013
osh-0.013
Token and
Token ID290
Feature activation+3.71e-01

Pos logit contributions
 Clean+0.012
elt+0.011
itted+0.011
 Working+0.011
Playing+0.011
Neg logit contributions
l-0.013
ls-0.013
 cor-0.012
 crocod-0.011
 beast-0.010
Token rushed
Token ID13999
Feature activation-1.61e+00

Pos logit contributions
ls+0.008
l+0.008
 cor+0.007
 crocod+0.006
 slit+0.006
Neg logit contributions
 Clean-0.008
apping-0.008
elt-0.008
itted-0.008
 lately-0.008
Token back
Token ID736
Feature activation-1.97e+00

Pos logit contributions
apping+0.034
elt+0.034
itted+0.033
Playing+0.033
 lately+0.033
Neg logit contributions
l-0.036
ls-0.036
 cor-0.035
ches-0.027
 trick-0.027
Token to
Token ID284
Feature activation-1.02e-01

Pos logit contributions
 Clean+0.033
elt+0.032
 lately+0.032
itted+0.031
apping+0.031
Neg logit contributions
ls-0.056
l-0.053
 cor-0.052
 crocod-0.043
 beast-0.042
Token the
Token ID262
Feature activation+1.79e-01

Pos logit contributions
 Clean+0.003
elt+0.003
 lately+0.003
side+0.003
apping+0.003
Neg logit contributions
ls-0.002
l-0.002
 cor-0.002
 crocod-0.001
 beast-0.001
Token park
Token ID3952
Feature activation+4.06e-01

Pos logit contributions
l+0.004
 cor+0.004
ls+0.004
 crocod+0.003
 beast+0.003
Neg logit contributions
 lately-0.004
elt-0.004
itted-0.004
 Clean-0.004
apping-0.004
Token.
Token ID13
Feature activation-7.86e-01

Pos logit contributions
ls+0.010
l+0.009
 cor+0.009
 crocod+0.007
 beast+0.007
Neg logit contributions
 Clean-0.009
elt-0.008
 lately-0.008
itted-0.008
amed-0.008
Token He
Token ID679
Feature activation+1.24e-02

Pos logit contributions
amed+0.015
apping+0.014
aming+0.014
 lately+0.014
side+0.014
Neg logit contributions
l-0.018
ls-0.018
 cor-0.017
 crocod-0.015
ia-0.015
Token used
Token ID973
Feature activation-1.54e+00

Pos logit contributions
ls+0.000
l+0.000
 cor+0.000
 AW+0.000
 beast+0.000
Neg logit contributions
 Clean-0.000
 then-0.000
amed-0.000
elt-0.000
 cleaning-0.000
Token the
Token ID262
Feature activation-8.03e-01

Pos logit contributions
 Clean+0.046
elt+0.045
apping+0.045
 lately+0.045
side+0.044
Neg logit contributions
ls-0.023
l-0.023
 cor-0.021
 crocod-0.015
osh-0.014
Token tool
Token ID2891
Feature activation-2.21e+00

Pos logit contributions
 Clean+0.017
elt+0.016
 lately+0.016
side+0.016
apping+0.016
Neg logit contributions
ls-0.019
l-0.019
 cor-0.018
 crocod-0.014
osh-0.014
TokenThe
Token ID464
Feature activation-6.46e-02

Pos logit contributions
elt+0.037
 lately+0.035
keeping+0.034
 Clean+0.034
itted+0.034
Neg logit contributions
l-0.043
ls-0.039
 cor-0.039
osh-0.033
ia-0.033
Token rain
Token ID6290
Feature activation-2.23e+00

Pos logit contributions
 Clean+0.001
elt+0.001
 lately+0.001
itted+0.001
apping+0.001
Neg logit contributions
ls-0.002
l-0.002
 cor-0.001
 crocod-0.001
 beast-0.001
Token stopped
Token ID5025
Feature activation-9.58e-01

Pos logit contributions
 Clean+0.053
 lately+0.050
elt+0.050
 Working+0.049
apping+0.049
Neg logit contributions
ls-0.047
l-0.046
 cor-0.042
osh-0.034
ah-0.034
Token,
Token ID11
Feature activation-3.35e-01

Pos logit contributions
 Clean+0.017
 lately+0.016
amed+0.016
elt+0.016
side+0.016
Neg logit contributions
ls-0.026
l-0.025
 cor-0.025
 crocod-0.021
 slit-0.020
Token and
Token ID290
Feature activation+1.35e+00

Pos logit contributions
 Clean+0.006
elt+0.006
uggets+0.006
itted+0.006
apping+0.006
Neg logit contributions
ls-0.009
l-0.009
 cor-0.008
 crocod-0.007
osh-0.007
Token the
Token ID262
Feature activation+3.17e-01

Pos logit contributions
ls+0.029
l+0.028
 cor+0.026
 AW+0.022
ah+0.021
Neg logit contributions
 Clean-0.032
 lately-0.030
elt-0.030
side-0.030
wards-0.029
Token sun
Token ID4252
Feature activation-2.21e-01

Pos logit contributions
ls+0.007
l+0.007
 cor+0.007
 crocod+0.006
 beast+0.006
Neg logit contributions
 Clean-0.007
elt-0.007
 lately-0.006
apping-0.006
itted-0.006
Token came
Token ID1625
Feature activation+2.45e-01

Pos logit contributions
 Clean+0.005
elt+0.005
side+0.004
amed+0.004
 lately+0.004
Neg logit contributions
ls-0.005
l-0.005
 cor-0.005
 AW-0.004
 beast-0.004
Token out
Token ID503
Feature activation+7.65e-01

Pos logit contributions
ls+0.007
l+0.006
 cor+0.006
 crocod+0.005
osh+0.005
Neg logit contributions
 Clean-0.004
elt-0.004
 lately-0.004
side-0.004
apping-0.004
Token again
Token ID757
Feature activation-1.02e+00

Pos logit contributions
ls+0.019
l+0.018
 cor+0.017
 crocod+0.014
osh+0.014
Neg logit contributions
 Clean-0.016
elt-0.015
 lately-0.015
side-0.014
itted-0.014
Token.
Token ID13
Feature activation-1.26e+00

Pos logit contributions
 Clean+0.023
elt+0.022
 lately+0.022
amed+0.021
itted+0.021
Neg logit contributions
ls-0.022
l-0.022
 cor-0.021
 crocod-0.017
ah-0.016
Token at
Token ID379
Feature activation-1.23e+00

Pos logit contributions
 Clean+0.058
elt+0.054
 Fighting+0.054
 Working+0.054
wards+0.053
Neg logit contributions
l-0.064
ls-0.063
 cor-0.055
ches-0.045
 trick-0.045
Token home
Token ID1363
Feature activation-1.22e+00

Pos logit contributions
 Clean+0.022
side+0.020
 lately+0.020
elt+0.019
 Working+0.019
Neg logit contributions
ls-0.035
l-0.033
 cor-0.032
 AW-0.027
osh-0.027
Token.
Token ID13
Feature activation-6.52e-01

Pos logit contributions
 Clean+0.028
elt+0.028
itted+0.027
amed+0.026
 Working+0.026
Neg logit contributions
ls-0.026
l-0.026
 cor-0.024
 crocod-0.019
ia-0.019
Token␣
Token ID220
Feature activation+5.28e-01

Pos logit contributions
 Clean+0.010
elt+0.010
 lately+0.010
amed+0.010
apping+0.010
Neg logit contributions
ls-0.019
l-0.018
 cor-0.017
 crocod-0.015
osh-0.015
Token\n
Token ID198
Feature activation-2.90e-01

Pos logit contributions
ls+0.015
l+0.015
 cor+0.014
 crocod+0.012
osh+0.012
Neg logit contributions
 Clean-0.008
elt-0.008
 lately-0.008
side-0.007
apping-0.007
Token\n
Token ID198
Feature activation-9.86e-01

Pos logit contributions
 Clean+0.004
side+0.003
apping+0.003
elt+0.003
 lately+0.003
Neg logit contributions
ls-0.009
 cor-0.009
l-0.009
 AW-0.008
 beast-0.008
TokenHe
Token ID1544
Feature activation-2.12e-01

Pos logit contributions
 Clean+0.013
side+0.012
 lately+0.012
elt+0.012
amed+0.011
Neg logit contributions
ls-0.032
l-0.030
 cor-0.030
 crocod-0.026
 beast-0.026
Token carefully
Token ID7773
Feature activation-1.96e+00

Pos logit contributions
 Clean+0.004
elt+0.004
 lately+0.004
amed+0.004
side+0.004
Neg logit contributions
ls-0.005
l-0.005
 cor-0.005
 AW-0.004
 beast-0.004
Token put
Token ID1234
Feature activation-1.56e+00

Pos logit contributions
 Clean+0.043
 lately+0.041
elt+0.041
apping+0.040
side+0.039
Neg logit contributions
ls-0.045
l-0.044
 cor-0.042
 crocod-0.034
osh-0.033
Token the
Token ID262
Feature activation-1.16e+00

Pos logit contributions
 Clean+0.038
elt+0.036
 lately+0.035
side+0.035
amed+0.035
Neg logit contributions
ls-0.033
l-0.032
 cor-0.030
 crocod-0.024
 beast-0.024
Token photo
Token ID4590
Feature activation-7.10e-01

Pos logit contributions
 Clean+0.026
elt+0.026
 lately+0.025
apping+0.025
 Working+0.025
Neg logit contributions
ls-0.026
l-0.026
 cor-0.023
 crocod-0.019
 giant-0.019
TokenM
Token ID44
Feature activation-1.36e-01

Pos logit contributions
 Clean+0.032
elt+0.031
 lately+0.031
itted+0.030
side+0.029
Neg logit contributions
l-0.025
ls-0.025
 cor-0.023
ia-0.018
 crocod-0.018
Tokenandy
Token ID10757
Feature activation+7.06e-01

Pos logit contributions
 Clean+0.003
elt+0.003
apping+0.003
 lately+0.003
itted+0.003
Neg logit contributions
ls-0.003
 cor-0.003
l-0.003
 AW-0.002
 roared-0.002
Token cl
Token ID537
Feature activation-1.78e+00

Pos logit contributions
ls+0.017
l+0.016
 cor+0.015
 AW+0.013
 crocod+0.012
Neg logit contributions
 Clean-0.015
elt-0.015
amed-0.014
 then-0.014
 Working-0.014
Tokenapped
Token ID6320
Feature activation-1.05e+00

Pos logit contributions
 lately+0.017
side+0.017
 Clean+0.016
 including+0.016
keeping+0.014
Neg logit contributions
ls-0.060
l-0.058
 cor-0.056
osh-0.053
ia-0.052
Token her
Token ID607
Feature activation-2.63e+00

Pos logit contributions
 Clean+0.024
elt+0.023
side+0.022
 Working+0.022
 lately+0.022
Neg logit contributions
l-0.024
ls-0.023
 cor-0.022
ia-0.018
 crocod-0.017
Token hands
Token ID2832
Feature activation-2.87e+00

Pos logit contributions
 Clean+0.044
 lately+0.042
elt+0.041
side+0.040
apping+0.039
Neg logit contributions
ls-0.076
l-0.073
 cor-0.071
 crocod-0.059
 slit-0.059
Token and
Token ID290
Feature activation+6.61e-01

Pos logit contributions
 Clean+0.058
othes+0.056
uggets+0.054
 anniversary+0.054
elt+0.053
Neg logit contributions
l-0.072
ls-0.069
 cor-0.063
ah-0.054
ches-0.052
Token said
Token ID531
Feature activation-6.57e-01

Pos logit contributions
ls+0.013
l+0.013
 cor+0.012
 beast+0.010
 crocod+0.010
Neg logit contributions
 Clean-0.017
elt-0.016
 lately-0.015
amed-0.015
side-0.015
Token,
Token ID11
Feature activation-1.85e+00

Pos logit contributions
 Clean+0.013
elt+0.012
 lately+0.012
side+0.011
itted+0.011
Neg logit contributions
ls-0.017
l-0.017
 cor-0.016
 crocod-0.013
osh-0.013
Token "
Token ID366
Feature activation-3.77e+00

Pos logit contributions
 Clean+0.037
keeping+0.037
 lately+0.036
othes+0.036
amed+0.035
Neg logit contributions
l-0.043
ls-0.042
 cor-0.041
 crocod-0.033
 slit-0.033
TokenLet
Token ID5756
Feature activation+7.39e-01

Pos logit contributions
 Clean+0.064
apping+0.061
 lately+0.061
 organizing+0.061
 indoors+0.059
Neg logit contributions
l-0.096
 cor-0.094
ls-0.088
ah-0.083
osh-0.082
Token supplies
Token ID9416
Feature activation-1.38e+00

Pos logit contributions
amed+0.056
elt+0.055
 Clean+0.053
 Fighting+0.052
itted+0.051
Neg logit contributions
ls-0.070
l-0.065
 cor-0.064
 crocod-0.052
 flies-0.051
Token to
Token ID284
Feature activation+2.59e-01

Pos logit contributions
 Clean+0.028
elt+0.026
oned+0.025
itted+0.025
amed+0.025
Neg logit contributions
ls-0.034
l-0.034
 cor-0.032
 crocod-0.026
ah-0.026
Token play
Token ID711
Feature activation+2.37e-01

Pos logit contributions
ls+0.007
l+0.006
 cor+0.006
 crocod+0.005
 beast+0.005
Neg logit contributions
 Clean-0.005
elt-0.005
 lately-0.005
side-0.005
apping-0.005
Token with
Token ID351
Feature activation-8.48e-01

Pos logit contributions
ls+0.007
l+0.007
 cor+0.006
 crocod+0.006
 beast+0.005
Neg logit contributions
elt-0.004
 Clean-0.004
itted-0.004
amed-0.003
 lately-0.003
Token for
Token ID329
Feature activation-1.25e+00

Pos logit contributions
 Clean+0.015
 lately+0.015
side+0.015
 then+0.014
 including+0.014
Neg logit contributions
ls-0.022
l-0.021
 cor-0.020
 AW-0.019
ah-0.018
Token hours
Token ID2250
Feature activation-3.00e+00

Pos logit contributions
 Clean+0.028
elt+0.028
 lately+0.027
apping+0.027
itted+0.027
Neg logit contributions
ls-0.028
l-0.027
 cor-0.026
 crocod-0.021
 beast-0.020
Token.
Token ID13
Feature activation-1.35e+00

Pos logit contributions
 Clean+0.073
elt+0.072
amed+0.068
itted+0.068
othes+0.068
Neg logit contributions
ls-0.062
l-0.061
 cor-0.056
 crocod-0.045
ah-0.044
Token He
Token ID679
Feature activation-6.78e-01

Pos logit contributions
 Clean+0.023
elt+0.022
 lately+0.022
amed+0.022
side+0.022
Neg logit contributions
ls-0.036
l-0.036
 cor-0.034
osh-0.029
 crocod-0.029
Token thanked
Token ID26280
Feature activation+6.36e-01

Pos logit contributions
 Clean+0.013
 then+0.012
amed+0.012
elt+0.012
 lately+0.011
Neg logit contributions
ls-0.018
l-0.017
 cor-0.016
 AW-0.014
 beast-0.014
Token the
Token ID262
Feature activation+5.15e-01

Pos logit contributions
ls+0.015
l+0.014
 cor+0.013
 AW+0.011
ah+0.010
Neg logit contributions
 Clean-0.015
 then-0.014
 lately-0.014
 including-0.013
wards-0.013
Token plastic
Token ID7309
Feature activation-2.51e+00

Pos logit contributions
l+0.010
ls+0.010
 cor+0.009
 crocod+0.008
 giant+0.008
Neg logit contributions
elt-0.013
 Clean-0.013
apping-0.012
 lately-0.012
itted-0.012
Token man
Token ID582
Feature activation+1.64e-01

Pos logit contributions
 Clean+0.018
elt+0.017
 lately+0.017
side+0.016
apping+0.016
Neg logit contributions
ls-0.021
l-0.020
 cor-0.019
 crocod-0.016
osh-0.016
Token.
Token ID13
Feature activation-1.41e+00

Pos logit contributions
ls+0.004
l+0.004
 cor+0.004
 crocod+0.003
 AW+0.003
Neg logit contributions
 Clean-0.003
elt-0.003
 lately-0.003
side-0.003
apping-0.003
Token She
Token ID1375
Feature activation-9.17e-01

Pos logit contributions
 Clean+0.017
amed+0.017
 lately+0.017
aming+0.016
side+0.016
Neg logit contributions
ls-0.042
l-0.042
 cor-0.040
ia-0.035
osh-0.035
Token thanked
Token ID26280
Feature activation+3.27e-02

Pos logit contributions
 Clean+0.019
 then+0.018
amed+0.017
elt+0.017
 lately+0.017
Neg logit contributions
ls-0.022
l-0.022
 cor-0.020
 AW-0.018
 beast-0.017
Token him
Token ID683
Feature activation+5.22e-02

Pos logit contributions
ls+0.001
l+0.001
 cor+0.001
 AW+0.001
osh+0.001
Neg logit contributions
 Clean-0.001
 lately-0.001
 then-0.001
wards-0.001
side-0.001
Token again
Token ID757
Feature activation-3.22e+00

Pos logit contributions
ls+0.002
l+0.001
 cor+0.001
 crocod+0.001
 beast+0.001
Neg logit contributions
 Clean-0.001
elt-0.001
 lately-0.001
side-0.001
apping-0.001
Token and
Token ID290
Feature activation-4.11e-01

Pos logit contributions
 Clean+0.056
elt+0.054
amed+0.052
itted+0.051
 lately+0.050
Neg logit contributions
ls-0.087
l-0.087
 cor-0.083
 crocod-0.071
ia-0.071
Token thanked
Token ID26280
Feature activation+7.77e-02

Pos logit contributions
 Clean+0.008
elt+0.008
 lately+0.008
side+0.007
amed+0.007
Neg logit contributions
ls-0.010
l-0.010
 cor-0.010
 AW-0.008
 crocod-0.008
Token him
Token ID683
Feature activation+8.67e-02

Pos logit contributions
ls+0.002
l+0.002
 cor+0.002
 AW+0.002
osh+0.002
Neg logit contributions
 Clean-0.001
 lately-0.001
 then-0.001
wards-0.001
side-0.001
Token for
Token ID329
Feature activation-1.74e+00

Pos logit contributions
ls+0.002
l+0.002
 cor+0.002
 crocod+0.002
 AW+0.002
Neg logit contributions
 Clean-0.001
 lately-0.001
apping-0.001
elt-0.001
side-0.001
Token making
Token ID1642
Feature activation+6.40e-01

Pos logit contributions
elt+0.038
 Clean+0.036
itted+0.035
 lately+0.035
apping+0.034
Neg logit contributions
 cor-0.039
l-0.038
ls-0.038
 crocod-0.032
 beast-0.031
Token running
Token ID2491
Feature activation-5.26e-01

Pos logit contributions
 Clean+0.027
elt+0.025
 lately+0.025
side+0.024
apping+0.024
Neg logit contributions
ls-0.047
l-0.046
 cor-0.044
 crocod-0.037
 beast-0.037
Token again
Token ID757
Feature activation-2.55e+00

Pos logit contributions
 Clean+0.011
elt+0.011
 lately+0.011
itted+0.010
apping+0.010
Neg logit contributions
ls-0.012
l-0.012
 cor-0.011
 crocod-0.009
 AW-0.009
Token.
Token ID13
Feature activation-8.20e-01

Pos logit contributions
 Clean+0.060
elt+0.057
 lately+0.057
side+0.056
apping+0.056
Neg logit contributions
ls-0.054
l-0.053
 cor-0.050
 crocod-0.040
osh-0.039
Token\n
Token ID198
Feature activation-6.69e-01

Pos logit contributions
 Clean+0.014
elt+0.013
 lately+0.013
amed+0.013
side+0.013
Neg logit contributions
ls-0.022
l-0.022
 cor-0.021
osh-0.018
 beast-0.018
Token\n
Token ID198
Feature activation-1.28e+00

Pos logit contributions
 Clean+0.010
elt+0.009
 lately+0.009
side+0.009
apping+0.008
Neg logit contributions
ls-0.020
l-0.020
 cor-0.019
 crocod-0.017
 AW-0.016
TokenAfter
Token ID3260
Feature activation-2.98e+00

Pos logit contributions
 Clean+0.022
elt+0.021
 lately+0.021
itted+0.020
apping+0.020
Neg logit contributions
ls-0.035
l-0.035
 cor-0.033
osh-0.028
 beast-0.028
Token a
Token ID257
Feature activation+1.37e+00

Pos logit contributions
 Clean+0.069
elt+0.069
amed+0.067
itted+0.067
 lately+0.067
Neg logit contributions
ls-0.062
l-0.062
 cor-0.059
 crocod-0.049
 slit-0.048
Token while
Token ID981
Feature activation-1.50e+00

Pos logit contributions
ls+0.043
l+0.041
 cor+0.040
osh+0.035
ia+0.035
Neg logit contributions
 Clean-0.019
side-0.017
 lately-0.016
elt-0.016
wards-0.016
Token,
Token ID11
Feature activation+3.92e-03

Pos logit contributions
 Clean+0.028
elt+0.027
 lately+0.026
side+0.026
apping+0.026
Neg logit contributions
ls-0.039
l-0.038
 cor-0.037
 crocod-0.031
 beast-0.030
Token he
Token ID339
Feature activation+9.13e-01

Pos logit contributions
ls+0.000
l+0.000
 cor+0.000
 AW+0.000
ah+0.000
Neg logit contributions
 Clean-0.000
 lately-0.000
wards-0.000
side-0.000
 then-0.000
Token started
Token ID2067
Feature activation-1.28e+00

Pos logit contributions
ls+0.024
l+0.022
 cor+0.021
 AW+0.020
ira+0.019
Neg logit contributions
 then-0.017
 Clean-0.016
amed-0.016
elt-0.015
side-0.015
Token together
Token ID1978
Feature activation-3.82e-01

Pos logit contributions
ls+0.029
l+0.028
 cor+0.027
 crocod+0.023
 beast+0.022
Neg logit contributions
 Clean-0.017
elt-0.016
 lately-0.016
side-0.015
apping-0.015
Token.
Token ID13
Feature activation+6.62e-01

Pos logit contributions
 Clean+0.009
elt+0.009
amed+0.008
itted+0.008
 Working+0.008
Neg logit contributions
ls-0.008
l-0.008
 cor-0.008
 crocod-0.006
ia-0.006
Token<|endoftext|>
Token ID50256
Feature activation-1.17e+00

Pos logit contributions
l+0.015
ls+0.015
 cor+0.014
 Ray+0.013
ia+0.013
Neg logit contributions
amed-0.014
aming-0.013
itted-0.013
elt-0.013
side-0.013
TokenL
Token ID43
Feature activation+6.44e-02

Pos logit contributions
 Clean+0.023
elt+0.021
 Working+0.020
apping+0.020
side+0.020
Neg logit contributions
ls-0.030
l-0.029
 cor-0.028
 AW-0.024
 crocod-0.024
Tokenily
Token ID813
Feature activation-1.55e-01

Pos logit contributions
ls+0.001
l+0.001
 cor+0.001
 AW+0.001
 crocod+0.001
Neg logit contributions
 Clean-0.001
elt-0.001
 lately-0.001
side-0.001
amed-0.001
Token loved
Token ID6151
Feature activation-3.15e+00

Pos logit contributions
 Clean+0.003
elt+0.003
 lately+0.003
amed+0.003
side+0.003
Neg logit contributions
ls-0.004
l-0.004
 cor-0.004
 crocod-0.003
 AW-0.003
Token flowers
Token ID12734
Feature activation-5.86e-01

Pos logit contributions
 Clean+0.057
elt+0.053
 lately+0.053
side+0.052
amed+0.052
Neg logit contributions
ls-0.085
l-0.082
 cor-0.079
 crocod-0.067
 beast-0.065
Token.
Token ID13
Feature activation-1.62e+00

Pos logit contributions
 Clean+0.015
elt+0.014
oned+0.014
 Working+0.013
amed+0.013
Neg logit contributions
ls-0.013
l-0.013
 cor-0.011
 crocod-0.009
ah-0.009
Token She
Token ID1375
Feature activation+1.56e-01

Pos logit contributions
amed+0.031
wards+0.027
side+0.027
keeping+0.026
aming+0.026
Neg logit contributions
l-0.039
ia-0.036
ls-0.034
 cor-0.034
 crocod-0.033
Token liked
Token ID8288
Feature activation-2.47e+00

Pos logit contributions
ls+0.004
l+0.004
 cor+0.004
 crocod+0.003
osh+0.003
Neg logit contributions
 Clean-0.003
elt-0.003
 lately-0.003
side-0.003
apping-0.003
Token to
Token ID284
Feature activation+6.55e-01

Pos logit contributions
 Clean+0.050
amed+0.047
 lately+0.047
apping+0.047
oned+0.047
Neg logit contributions
ls-0.062
l-0.060
 cor-0.059
 crocod-0.050
 crow-0.048