TOP ACTIVATIONS
MAX = 0.052

sorry . She put the extra piece back and they continued
cone . He had different flavors . He asked them what
. His mom wrapped up a sandwich for him . It
It was enormous ! A little girl named Lily loved to
, Tom . Tom loved Sara and gave her good food
bored . She spent the day talking to the v ase
Tom spent the rest of his day playing with
top . She loved spending time with him and they both
said Lily . She wrapped a scarf around his neck and
very organized . They put each block in a box that
ite . She spent the day flying the black k ite
. She spent the whole day collecting more match sticks ,
Tim my saw a cooler full of ice cream and asked
ice . She liked to hold it in her hand and
time , there was a chip m unk who lived in
it . She spent the morning trying to yield the cord
his animals . He puts each toy in the right place
and drink some water . Sara , you need to wash
ite . He spent the day playing with it , but
Emma took a smooth ice pack and put it on Lily

MIN ACTIVATIONS
MIN = -0.069

if she could talk about microsc opes . Her
had to know all about microsc opes .
the stars and planets in outer space . So one day
to be restored to its former glory ."
the tree to its once former beauty . He was happy
big book with pictures of volcan oes . Vol can oes
" What is famous ?" she asks . Ben
Do you like d aff od ils ?" Ben asks .
, what 's that ?" she asked , pointing at the
big book with pictures of volcan oes . Vol can oes
" What 's that ?" she said . She put her
What do you want ?" she asks . "
kinds of ear rings and brace lets , and he really
did you find this ?" she asks . "
is fun . I like aer opl anes ."
Emma who loved learning about outer space . Everyday , she
and that lady separate ?" he asked , pointing at two
big , white pup with floppy ears and a curly tail
" What is that ?" he asks . "
What did you create ?" she asks , looking at the

INTERVAL 0.022 - 0.052
CONTAINS 1.552%

At the bar ber , the bar ber said
big bisc uit on a plate and placed it in front
any money . They stayed friends for a long time afterwards
Spot found a hidden spot under a big tree .
Max 's Mom sat on the nearby bench ,

INTERVAL -0.009 - 0.022
CONTAINS 79.962%

! There 's a big storm coming !"
ladder , please ?" Lily asked her mom .
house and share the fruits with their families . They are
sat on the egg to keep it warm . She waited
moment . Then she says , " What do you call

INTERVAL -0.039 - -0.009
CONTAINS 18.457%

take a walk along the shore line . But
. But then , a friendly dolphin came and helped the
knew his mom would help him manage it .
come and play again soon !" <|endoftext|> Once upon a time
THE <|endoftext|> Once upon a time there was a little

INTERVAL -0.069 - -0.039
CONTAINS 0.029%

jumped in the ocean and sw am around until he saw
that all of the other sw immers were much bigger than
pearl followed her as she sw am back to the surface
tree could see lots of av oc ados that the supplier
sometimes pretend to be a mer maid .
Token sorry
Token ID7926
Feature activation+5.95e-03

Pos logit contributions
ross+0.011
andra+0.010
rina+0.010
iet+0.010
ation+0.010
Neg logit contributions
 sad-0.003
 very-0.002
 so-0.002
 happy-0.001
 jealous-0.001
Token.
Token ID13
Feature activation-3.04e-03

Pos logit contributions
 sad+0.006
 very+0.005
 so+0.005
 boy+0.004
 but+0.004
Neg logit contributions
ross-0.016
rina-0.015
andra-0.014
ation-0.014
ressive-0.014
Token She
Token ID1375
Feature activation-1.60e-03

Pos logit contributions
ross+0.009
rina+0.008
andra+0.008
ation+0.008
ressive+0.008
Neg logit contributions
 sad-0.002
 so-0.002
 very-0.002
 boy-0.002
 but-0.002
Token put
Token ID1234
Feature activation+1.63e-02

Pos logit contributions
ross+0.004
rina+0.004
ation+0.004
ressive+0.004
andra+0.003
Neg logit contributions
 sad-0.002
 very-0.002
 so-0.002
 were-0.002
 boy-0.002
Token the
Token ID262
Feature activation+3.31e-02

Pos logit contributions
 sad+0.016
 very+0.013
 so+0.013
 boy+0.011
 but+0.011
Neg logit contributions
ross-0.045
rina-0.042
andra-0.040
ation-0.040
ressive-0.039
Token extra
Token ID3131
Feature activation+5.20e-02

Pos logit contributions
 sad+0.042
 so+0.035
 very+0.035
 were+0.033
 but+0.033
Neg logit contributions
ross-0.075
rina-0.069
andra-0.067
ation-0.067
ressive-0.065
Token piece
Token ID3704
Feature activation+2.52e-02

Pos logit contributions
 sad+0.063
 very+0.052
 so+0.052
 boy+0.049
 but+0.048
Neg logit contributions
ross-0.126
rina-0.117
andra-0.113
ation-0.110
ressive-0.108
Token back
Token ID736
Feature activation+2.73e-02

Pos logit contributions
 sad+0.024
 very+0.019
 so+0.018
 boy+0.017
 but+0.016
Neg logit contributions
ross-0.068
rina-0.064
andra-0.062
ation-0.061
ressive-0.060
Token and
Token ID290
Feature activation-8.35e-03

Pos logit contributions
 sad+0.024
 very+0.019
 so+0.019
 but+0.016
 boy+0.015
Neg logit contributions
ross-0.077
rina-0.073
andra-0.071
ation-0.069
ressive-0.068
Token they
Token ID484
Feature activation-8.10e-03

Pos logit contributions
ross+0.020
rina+0.019
iet+0.017
eh+0.017
andra+0.017
Neg logit contributions
 sad-0.011
 so-0.010
 boy-0.010
 very-0.009
 were-0.009
Token continued
Token ID3767
Feature activation-1.15e-02

Pos logit contributions
ross+0.020
rina+0.018
ation+0.018
andra+0.018
ressive+0.017
Neg logit contributions
 sad-0.010
 very-0.009
 were-0.009
 so-0.009
 boy-0.008
Token cone
Token ID27763
Feature activation+2.62e-02

Pos logit contributions
 sad+0.021
 so+0.017
 very+0.016
 but+0.016
 boy+0.015
Neg logit contributions
ross-0.041
rina-0.038
andra-0.038
eh-0.037
ation-0.036
Token.
Token ID13
Feature activation+2.93e-03

Pos logit contributions
 sad+0.040
 so+0.033
 very+0.032
 boy+0.032
 but+0.031
Neg logit contributions
ross-0.054
rina-0.048
andra-0.046
ation-0.045
eh-0.044
Token He
Token ID679
Feature activation+1.82e-02

Pos logit contributions
 sad+0.002
 very+0.002
 so+0.002
 boy+0.001
 penn+0.001
Neg logit contributions
ross-0.008
andra-0.007
rina-0.007
ation-0.007
ressive-0.007
Token had
Token ID550
Feature activation+8.45e-04

Pos logit contributions
 sad+0.023
 very+0.020
 so+0.020
 boy+0.019
 were+0.019
Neg logit contributions
ross-0.045
rina-0.041
ation-0.040
andra-0.039
ressive-0.038
Token different
Token ID1180
Feature activation-1.67e-02

Pos logit contributions
 sad+0.001
 so+0.001
 very+0.001
 cones+0.001
 were+0.001
Neg logit contributions
ross-0.002
rina-0.002
iet-0.002
ation-0.002
andra-0.002
Token flavors
Token ID17361
Feature activation+5.07e-02

Pos logit contributions
ross+0.036
rina+0.033
andra+0.032
ation+0.031
ressive+0.030
Neg logit contributions
 sad-0.025
 very-0.021
 so-0.021
 boy-0.020
 but-0.020
Token.
Token ID13
Feature activation+6.71e-03

Pos logit contributions
 sad+0.065
 boy+0.050
 ladder+0.048
 je+0.047
nam+0.046
Neg logit contributions
ross-0.109
rina-0.092
andra-0.090
et-0.090
at-0.088
Token He
Token ID679
Feature activation+2.06e-02

Pos logit contributions
 sad+0.005
 very+0.004
 so+0.003
 boy+0.003
 were+0.003
Neg logit contributions
ross-0.019
rina-0.017
andra-0.017
ressive-0.016
ation-0.016
Token asked
Token ID1965
Feature activation+2.16e-02

Pos logit contributions
 sad+0.025
 very+0.022
 so+0.021
 boy+0.020
 were+0.020
Neg logit contributions
ross-0.052
rina-0.047
ation-0.046
andra-0.045
ressive-0.044
Token them
Token ID606
Feature activation+8.23e-03

Pos logit contributions
 sad+0.029
 very+0.025
 so+0.025
 boy+0.023
 but+0.023
Neg logit contributions
ross-0.051
rina-0.047
andra-0.045
ation-0.044
ressive-0.043
Token what
Token ID644
Feature activation+2.96e-03

Pos logit contributions
 sad+0.008
 very+0.006
 so+0.006
 boy+0.005
 but+0.005
Neg logit contributions
ross-0.022
rina-0.021
andra-0.020
ation-0.020
ressive-0.019
Token.
Token ID13
Feature activation+4.59e-03

Pos logit contributions
 sad+0.003
 very+0.003
 so+0.003
 but+0.003
 boy+0.002
Neg logit contributions
ross-0.008
rina-0.007
andra-0.007
ation-0.007
ressive-0.007
Token His
Token ID2399
Feature activation+3.97e-03

Pos logit contributions
 sad+0.005
 penn+0.004
 very+0.004
 kay+0.003
 ladder+0.003
Neg logit contributions
ross-0.011
andra-0.010
ressive-0.010
ation-0.010
rina-0.010
Token mom
Token ID1995
Feature activation+6.20e-03

Pos logit contributions
 sad+0.004
 so+0.003
 ladder+0.003
 but+0.003
 very+0.003
Neg logit contributions
ross-0.009
rina-0.009
andra-0.009
ation-0.008
iet-0.008
Token wrapped
Token ID12908
Feature activation+2.54e-02

Pos logit contributions
 sad+0.008
 very+0.006
 so+0.006
 boy+0.006
 but+0.006
Neg logit contributions
ross-0.016
rina-0.014
ation-0.014
andra-0.014
ressive-0.013
Token up
Token ID510
Feature activation+2.96e-02

Pos logit contributions
 sad+0.033
 very+0.026
 ladder+0.026
 boy+0.025
 so+0.025
Neg logit contributions
ross-0.057
rina-0.053
andra-0.051
ation-0.051
ressive-0.049
Token a
Token ID257
Feature activation+5.03e-02

Pos logit contributions
 sad+0.033
 very+0.027
 so+0.027
 boy+0.025
 but+0.024
Neg logit contributions
ross-0.075
rina-0.070
andra-0.067
ation-0.066
ressive-0.065
Token sandwich
Token ID20433
Feature activation+2.56e-02

Pos logit contributions
 sad+0.061
 so+0.051
 were+0.050
 but+0.047
 boy+0.047
Neg logit contributions
ross-0.106
eh-0.101
rina-0.097
andra-0.097
iet-0.097
Token for
Token ID329
Feature activation+1.41e-02

Pos logit contributions
 sad+0.023
 very+0.018
 so+0.018
 boy+0.016
 but+0.015
Neg logit contributions
ross-0.071
rina-0.067
andra-0.064
ation-0.063
ressive-0.062
Token him
Token ID683
Feature activation+1.57e-02

Pos logit contributions
 sad+0.021
 very+0.018
 so+0.018
 boy+0.017
 but+0.017
Neg logit contributions
ross-0.030
rina-0.028
andra-0.027
ation-0.026
ressive-0.025
Token.
Token ID13
Feature activation+6.50e-03

Pos logit contributions
 sad+0.018
 very+0.015
 so+0.015
 but+0.013
 boy+0.013
Neg logit contributions
ross-0.040
rina-0.037
andra-0.036
ation-0.035
ressive-0.035
Token It
Token ID632
Feature activation-6.30e-05

Pos logit contributions
 sad+0.007
 very+0.005
 so+0.005
 penn+0.004
 boy+0.004
Neg logit contributions
ross-0.016
andra-0.015
rina-0.015
ation-0.014
ressive-0.014
Token It
Token ID632
Feature activation+5.30e-03

Pos logit contributions
 sad+0.005
 very+0.004
 so+0.004
 boy+0.004
 but+0.003
Neg logit contributions
ross-0.014
rina-0.013
andra-0.012
ation-0.012
ressive-0.012
Token was
Token ID373
Feature activation+1.90e-02

Pos logit contributions
 sad+0.005
 so+0.004
 penn+0.004
 cones+0.004
 but+0.004
Neg logit contributions
ross-0.013
rina-0.012
eh-0.012
ation-0.012
andra-0.011
Token enormous
Token ID9812
Feature activation+8.34e-03

Pos logit contributions
 sad+0.016
s+0.015
 penn+0.013
 boy+0.012
 cones+0.011
Neg logit contributions
ross-0.049
eh-0.045
rina-0.044
ation-0.043
andra-0.043
Token!
Token ID0
Feature activation+4.29e-03

Pos logit contributions
 sad+0.010
 very+0.008
 so+0.008
 boy+0.008
 but+0.008
Neg logit contributions
ross-0.020
rina-0.018
andra-0.018
ation-0.017
ressive-0.017
Token A
Token ID317
Feature activation+1.42e-02

Pos logit contributions
 sad+0.005
 very+0.004
 so+0.003
 boy+0.003
s+0.003
Neg logit contributions
ross-0.011
rina-0.010
andra-0.010
ation-0.010
ressive-0.009
Token little
Token ID1310
Feature activation+5.02e-02

Pos logit contributions
 sad+0.012
 so+0.010
 penn+0.010
s+0.009
 were+0.009
Neg logit contributions
ross-0.034
eh-0.032
andra-0.031
iet-0.031
rina-0.031
Token girl
Token ID2576
Feature activation+2.62e-02

Pos logit contributions
 sad+0.043
 penn+0.041
nam+0.040
s+0.039
 Some+0.038
Neg logit contributions
eh-0.096
ross-0.093
et-0.091
iet-0.086
ab-0.085
Token named
Token ID3706
Feature activation+3.60e-03

Pos logit contributions
 sad+0.039
 very+0.034
 so+0.034
 boy+0.032
 but+0.031
Neg logit contributions
ross-0.057
rina-0.053
andra-0.050
ation-0.049
ressive-0.048
Token Lily
Token ID20037
Feature activation+7.88e-03

Pos logit contributions
 sad+0.004
 very+0.003
 so+0.003
 boy+0.003
 but+0.003
Neg logit contributions
ross-0.009
rina-0.009
andra-0.008
ation-0.008
ressive-0.008
Token loved
Token ID6151
Feature activation+2.21e-02

Pos logit contributions
 sad+0.007
 very+0.006
 were+0.006
 so+0.006
 boy+0.005
Neg logit contributions
ross-0.023
rina-0.021
andra-0.021
ation-0.020
ressive-0.020
Token to
Token ID284
Feature activation+4.10e-03

Pos logit contributions
 sad+0.021
s+0.014
 boy+0.013
 very+0.013
 but+0.013
Neg logit contributions
ross-0.055
rina-0.051
ation-0.050
andra-0.050
eh-0.049
Token,
Token ID11
Feature activation-6.30e-03

Pos logit contributions
 sad+0.022
 very+0.015
 so+0.015
 boy+0.014
 but+0.013
Neg logit contributions
ross-0.076
rina-0.072
andra-0.069
ation-0.068
ressive-0.067
Token Tom
Token ID4186
Feature activation+2.16e-02

Pos logit contributions
ross+0.016
rina+0.015
ation+0.014
andra+0.014
ressive+0.014
Neg logit contributions
 sad-0.007
 so-0.006
 very-0.006
 but-0.005
 boy-0.005
Token.
Token ID13
Feature activation+1.61e-03

Pos logit contributions
 sad+0.026
 very+0.023
 so+0.023
 but+0.021
 boy+0.020
Neg logit contributions
ross-0.054
rina-0.050
andra-0.049
ation-0.048
ressive-0.047
Token Tom
Token ID4186
Feature activation+2.45e-02

Pos logit contributions
 sad+0.002
 very+0.001
 so+0.001
 boy+0.001
 but+0.001
Neg logit contributions
ross-0.004
rina-0.004
andra-0.004
ation-0.004
ressive-0.004
Token loved
Token ID6151
Feature activation+2.69e-02

Pos logit contributions
 sad+0.026
 very+0.020
 so+0.020
 boy+0.019
 but+0.018
Neg logit contributions
ross-0.063
rina-0.059
andra-0.057
ation-0.056
ressive-0.055
Token Sara
Token ID24799
Feature activation+4.96e-02

Pos logit contributions
 sad+0.022
 very+0.015
 so+0.014
 boy+0.013
 but+0.013
Neg logit contributions
ross-0.075
rina-0.070
andra-0.068
ation-0.067
ressive-0.066
Token and
Token ID290
Feature activation-8.40e-03

Pos logit contributions
 ladder+0.042
 sad+0.041
 boy+0.034
 kay+0.034
 penn+0.034
Neg logit contributions
rina-0.110
ross-0.104
end-0.102
ita-0.101
andra-0.099
Token gave
Token ID2921
Feature activation+1.72e-02

Pos logit contributions
ross+0.024
rina+0.022
ation+0.021
ressive+0.021
andra+0.021
Neg logit contributions
 sad-0.008
 very-0.007
 so-0.007
 boy-0.006
 but-0.006
Token her
Token ID607
Feature activation+1.09e-02

Pos logit contributions
 sad+0.024
 very+0.021
 so+0.020
 boy+0.019
 but+0.019
Neg logit contributions
ross-0.039
rina-0.036
andra-0.035
ation-0.034
ressive-0.033
Token good
Token ID922
Feature activation-1.16e-02

Pos logit contributions
 sad+0.012
 very+0.009
 so+0.009
 boy+0.009
 but+0.008
Neg logit contributions
ross-0.028
rina-0.026
andra-0.025
ation-0.025
ressive-0.024
Token food
Token ID2057
Feature activation+4.62e-03

Pos logit contributions
ross+0.028
rina+0.026
andra+0.024
ation+0.024
ressive+0.024
Neg logit contributions
 sad-0.015
 very-0.013
 boy-0.013
 so-0.013
 but-0.012
Token bored
Token ID17533
Feature activation+6.93e-03

Pos logit contributions
ross+0.031
rina+0.029
andra+0.028
eh+0.028
ation+0.027
Neg logit contributions
 sad-0.009
 so-0.007
 very-0.006
 but-0.006
 cones-0.006
Token.
Token ID13
Feature activation+1.27e-02

Pos logit contributions
 sad+0.007
 so+0.007
 very+0.006
 but+0.006
 boy+0.006
Neg logit contributions
ross-0.019
rina-0.018
andra-0.017
ressive-0.017
ation-0.017
Token She
Token ID1375
Feature activation+8.78e-03

Pos logit contributions
 sad+0.007
 so+0.005
 very+0.004
 but+0.004
 boy+0.004
Neg logit contributions
ross-0.041
rina-0.038
ation-0.037
andra-0.037
ressive-0.036
Token spent
Token ID3377
Feature activation+1.14e-02

Pos logit contributions
 sad+0.010
 very+0.008
 were+0.008
 so+0.008
 can+0.007
Neg logit contributions
ross-0.025
rina-0.022
ressive-0.022
ation-0.022
andra-0.022
Token the
Token ID262
Feature activation+1.03e-02

Pos logit contributions
 sad+0.011
 so+0.009
 very+0.009
 but+0.007
 many+0.007
Neg logit contributions
ross-0.033
rina-0.031
andra-0.030
ressive-0.029
ation-0.028
Token day
Token ID1110
Feature activation+4.94e-02

Pos logit contributions
 sad+0.013
 very+0.012
 boy+0.011
 so+0.010
 ladder+0.010
Neg logit contributions
ross-0.026
rina-0.025
andra-0.024
ressive-0.023
isdom-0.023
Token talking
Token ID3375
Feature activation+1.05e-02

Pos logit contributions
 sad+0.058
 very+0.048
 so+0.047
 but+0.043
 boy+0.042
Neg logit contributions
ross-0.126
rina-0.119
andra-0.115
ation-0.111
ressive-0.109
Token to
Token ID284
Feature activation-1.10e-02

Pos logit contributions
 sad+0.007
 so+0.006
 very+0.005
 but+0.005
 boy+0.004
Neg logit contributions
ross-0.033
rina-0.031
andra-0.031
ation-0.030
ician-0.029
Token the
Token ID262
Feature activation+3.67e-04

Pos logit contributions
ross+0.032
rina+0.028
andra+0.028
eh+0.027
int+0.027
Neg logit contributions
 sad-0.011
 very-0.010
 so-0.010
 many-0.009
 but-0.008
Token v
Token ID410
Feature activation-2.62e-03

Pos logit contributions
 sad+0.000
 very+0.000
 boy+0.000
 so+0.000
 but+0.000
Neg logit contributions
ross-0.001
rina-0.001
andra-0.001
ation-0.001
ressive-0.001
Tokenase
Token ID589
Feature activation-1.14e-03

Pos logit contributions
ross+0.005
rina+0.005
andra+0.004
ation+0.004
ressive+0.004
Neg logit contributions
 sad-0.004
 very-0.004
 so-0.004
 boy-0.004
s-0.004
Token
Token ID198
Feature activation+8.53e-03

Pos logit contributions
 sad+0.008
 so+0.007
 very+0.006
 but+0.006
 boy+0.006
Neg logit contributions
ross-0.035
rina-0.033
ation-0.032
andra-0.032
iet-0.031
Token
Token ID198
Feature activation+3.42e-03

Pos logit contributions
 sad+0.008
 very+0.006
 so+0.006
 boy+0.006
 but+0.006
Neg logit contributions
ross-0.023
rina-0.021
andra-0.021
ation-0.020
ressive-0.020
TokenTom
Token ID13787
Feature activation+5.74e-03

Pos logit contributions
 sad+0.004
 very+0.004
 so+0.004
 boy+0.003
+0.003
Neg logit contributions
ross-0.008
rina-0.008
andra-0.007
ressive-0.007
ation-0.007
Token spent
Token ID3377
Feature activation+1.11e-02

Pos logit contributions
 sad+0.006
 so+0.005
 very+0.005
 boy+0.005
 were+0.005
Neg logit contributions
ross-0.015
rina-0.014
andra-0.013
ation-0.013
ressive-0.013
Token the
Token ID262
Feature activation+5.80e-03

Pos logit contributions
 sad+0.009
 so+0.008
 very+0.008
 but+0.006
 boy+0.006
Neg logit contributions
ross-0.032
rina-0.031
andra-0.030
ressive-0.029
ation-0.028
Token rest
Token ID1334
Feature activation+4.91e-02

Pos logit contributions
 sad+0.007
 very+0.007
 so+0.006
 boy+0.006
 but+0.006
Neg logit contributions
ross-0.014
rina-0.014
andra-0.013
ressive-0.013
isdom-0.013
Token of
Token ID286
Feature activation+6.07e-03

Pos logit contributions
 sad+0.034
 very+0.022
 so+0.021
 boy+0.020
 ladder+0.017
Neg logit contributions
ross-0.146
rina-0.135
andra-0.130
ation-0.129
eh-0.127
Token his
Token ID465
Feature activation+1.52e-02

Pos logit contributions
 sad+0.005
 very+0.004
 so+0.004
 boy+0.004
 but+0.003
Neg logit contributions
ross-0.017
rina-0.016
andra-0.015
ation-0.015
ressive-0.015
Token day
Token ID1110
Feature activation+3.68e-02

Pos logit contributions
 sad+0.022
 very+0.019
 so+0.019
 boy+0.018
 but+0.017
Neg logit contributions
ross-0.034
rina-0.031
andra-0.030
ation-0.029
ressive-0.029
Token playing
Token ID2712
Feature activation+2.58e-02

Pos logit contributions
 sad+0.037
 very+0.032
 so+0.031
 but+0.027
 were+0.025
Neg logit contributions
ross-0.102
rina-0.099
andra-0.094
ation-0.091
ressive-0.091
Token with
Token ID351
Feature activation+1.03e-02

Pos logit contributions
 sad+0.017
 so+0.015
 very+0.015
 but+0.013
 boy+0.010
Neg logit contributions
ross-0.080
rina-0.078
andra-0.077
ation-0.072
ician-0.071
Token top
Token ID1353
Feature activation-1.68e-02

Pos logit contributions
 sad+0.001
 very+0.001
 so+0.001
 boy+0.001
 but+0.001
Neg logit contributions
ross-0.001
rina-0.001
andra-0.001
ation-0.001
iet-0.001
Token.
Token ID13
Feature activation+3.32e-03

Pos logit contributions
ross+0.043
rina+0.041
andra+0.041
ation+0.038
ician+0.038
Neg logit contributions
 sad-0.019
 so-0.017
 very-0.017
 but-0.015
 boy-0.015
Token She
Token ID1375
Feature activation+3.88e-03

Pos logit contributions
 sad+0.003
 very+0.002
 so+0.002
 boy+0.002
 ladder+0.002
Neg logit contributions
ross-0.009
rina-0.008
andra-0.008
ation-0.008
end-0.008
Token loved
Token ID6151
Feature activation+1.45e-02

Pos logit contributions
 sad+0.005
 very+0.004
 so+0.004
 boy+0.004
 but+0.004
Neg logit contributions
ross-0.009
rina-0.009
andra-0.008
ation-0.008
ressive-0.008
Token spending
Token ID4581
Feature activation+2.50e-03

Pos logit contributions
 sad+0.018
 very+0.014
 so+0.014
 boy+0.014
 ladder+0.013
Neg logit contributions
ross-0.034
rina-0.032
andra-0.031
ation-0.030
ressive-0.030
Token time
Token ID640
Feature activation+4.83e-02

Pos logit contributions
 sad+0.004
 very+0.004
 so+0.004
 boy+0.004
 but+0.004
Neg logit contributions
ross-0.005
rina-0.004
andra-0.004
ation-0.004
ressive-0.004
Token with
Token ID351
Feature activation-5.48e-03

Pos logit contributions
 sad+0.045
 so+0.034
 very+0.034
 boy+0.033
 but+0.030
Neg logit contributions
ross-0.129
rina-0.119
andra-0.116
ation-0.114
ressive-0.112
Token him
Token ID683
Feature activation+1.04e-02

Pos logit contributions
ross+0.011
rina+0.010
andra+0.010
ation+0.009
ressive+0.009
Neg logit contributions
 sad-0.009
 very-0.008
 so-0.008
 boy-0.008
 ladder-0.007
Token and
Token ID290
Feature activation-3.99e-03

Pos logit contributions
 sad+0.010
 very+0.008
 so+0.008
 boy+0.007
 but+0.007
Neg logit contributions
ross-0.028
rina-0.026
andra-0.025
ation-0.025
ressive-0.024
Token they
Token ID484
Feature activation-5.66e-03

Pos logit contributions
ross+0.009
rina+0.008
andra+0.008
ation+0.008
ressive+0.008
Neg logit contributions
 sad-0.005
 very-0.004
 so-0.004
 boy-0.004
 but-0.004
Token both
Token ID1111
Feature activation+1.05e-02

Pos logit contributions
ross+0.014
rina+0.013
andra+0.012
ation+0.012
ressive+0.012
Neg logit contributions
 sad-0.007
 very-0.006
 so-0.006
 were-0.005
 boy-0.005
Token said
Token ID531
Feature activation-7.43e-03

Pos logit contributions
ross+0.012
rina+0.012
andra+0.011
ation+0.011
ressive+0.011
Neg logit contributions
 sad-0.005
 very-0.004
 so-0.004
 boy-0.004
 but-0.003
Token Lily
Token ID20037
Feature activation-9.39e-03

Pos logit contributions
ross+0.021
rina+0.019
ation+0.018
andra+0.018
iet+0.018
Neg logit contributions
 sad-0.008
 so-0.006
 very-0.006
 boy-0.005
 but-0.005
Token.
Token ID13
Feature activation+2.95e-03

Pos logit contributions
ross+0.025
rina+0.023
ation+0.022
andra+0.022
iet+0.022
Neg logit contributions
 sad-0.011
 so-0.009
 very-0.009
 but-0.008
s-0.008
Token She
Token ID1375
Feature activation+2.17e-03

Pos logit contributions
 sad+0.002
 very+0.002
 so+0.002
 penn+0.002
 boy+0.002
Neg logit contributions
ross-0.008
rina-0.007
andra-0.007
end-0.007
ation-0.007
Token wrapped
Token ID12908
Feature activation+2.11e-02

Pos logit contributions
 sad+0.002
 very+0.002
 so+0.002
 were+0.002
 boy+0.002
Neg logit contributions
ross-0.006
rina-0.005
ation-0.005
ressive-0.005
andra-0.005
Token a
Token ID257
Feature activation+4.83e-02

Pos logit contributions
 sad+0.029
 very+0.023
 so+0.023
 boy+0.022
 but+0.022
Neg logit contributions
ross-0.047
rina-0.044
andra-0.042
ation-0.042
ressive-0.040
Token scarf
Token ID36953
Feature activation+3.28e-02

Pos logit contributions
 sad+0.056
 so+0.045
 very+0.044
 boy+0.042
 but+0.041
Neg logit contributions
ross-0.117
rina-0.109
andra-0.105
ation-0.103
eh-0.102
Token around
Token ID1088
Feature activation+1.33e-02

Pos logit contributions
 sad+0.032
 boy+0.024
 very+0.024
 so+0.024
 were+0.021
Neg logit contributions
ross-0.086
rina-0.080
andra-0.076
ation-0.075
iet-0.074
Token his
Token ID465
Feature activation+1.66e-02

Pos logit contributions
 sad+0.017
 very+0.015
 so+0.015
 boy+0.014
 but+0.013
Neg logit contributions
ross-0.031
rina-0.029
andra-0.028
ation-0.027
ressive-0.027
Token neck
Token ID7393
Feature activation+2.86e-02

Pos logit contributions
 sad+0.022
 so+0.019
 very+0.018
 boy+0.017
 were+0.017
Neg logit contributions
ross-0.038
rina-0.035
andra-0.034
ation-0.033
ressive-0.032
Token and
Token ID290
Feature activation+3.65e-03

Pos logit contributions
 sad+0.024
 very+0.018
 so+0.017
 boy+0.017
 but+0.015
Neg logit contributions
ross-0.079
rina-0.074
andra-0.072
ation-0.070
ressive-0.069
Token very
Token ID845
Feature activation+8.76e-03

Pos logit contributions
 sad+0.003
s+0.002
 boy+0.002
 but+0.001
 so+0.001
Neg logit contributions
ross-0.017
rina-0.016
andra-0.016
ation-0.016
eh-0.015
Token organized
Token ID8389
Feature activation-4.84e-04

Pos logit contributions
 sad+0.007
 so+0.006
 boy+0.005
 but+0.005
 very+0.005
Neg logit contributions
ross-0.024
rina-0.022
andra-0.021
ation-0.021
eh-0.021
Token.
Token ID13
Feature activation+7.32e-03

Pos logit contributions
ross+0.001
rina+0.001
andra+0.001
ation+0.001
eh+0.001
Neg logit contributions
 sad-0.001
 so-0.001
 very-0.000
 but-0.000
 boy-0.000
Token They
Token ID1119
Feature activation+2.12e-02

Pos logit contributions
 sad+0.007
 very+0.005
 so+0.005
 penn+0.005
 ladder+0.005
Neg logit contributions
ross-0.018
andra-0.018
rina-0.018
end-0.017
ife-0.016
Token put
Token ID1234
Feature activation+2.59e-02

Pos logit contributions
 sad+0.028
 so+0.023
 very+0.023
 boy+0.022
 but+0.022
Neg logit contributions
ross-0.048
rina-0.045
andra-0.043
ation-0.042
ressive-0.041
Token each
Token ID1123
Feature activation+4.81e-02

Pos logit contributions
 sad+0.031
 so+0.023
 but+0.023
s+0.023
 boy+0.023
Neg logit contributions
ross-0.058
rina-0.056
andra-0.054
ation-0.053
iet-0.052
Token block
Token ID2512
Feature activation+9.75e-03

Pos logit contributions
 sad+0.056
 so+0.044
 very+0.040
 but+0.040
 boy+0.039
Neg logit contributions
ross-0.111
rina-0.103
eh-0.102
ation-0.100
andra-0.098
Token in
Token ID287
Feature activation+1.59e-02

Pos logit contributions
 sad+0.011
 so+0.008
 but+0.008
 boy+0.008
 very+0.008
Neg logit contributions
ross-0.022
rina-0.021
ation-0.020
andra-0.020
iet-0.020
Token a
Token ID257
Feature activation+1.20e-02

Pos logit contributions
 sad+0.019
 very+0.015
 so+0.015
 boy+0.014
 but+0.014
Neg logit contributions
ross-0.039
rina-0.036
andra-0.035
ation-0.034
ressive-0.033
Token box
Token ID3091
Feature activation+5.95e-03

Pos logit contributions
 sad+0.012
 so+0.009
 but+0.008
 were+0.008
 boy+0.008
Neg logit contributions
ross-0.030
rina-0.028
eh-0.027
andra-0.027
iet-0.026
Token that
Token ID326
Feature activation+1.08e-02

Pos logit contributions
 sad+0.005
 so+0.004
 very+0.004
 but+0.004
 boy+0.003
Neg logit contributions
ross-0.017
rina-0.016
andra-0.015
ation-0.015
ressive-0.015
Tokenite
Token ID578
Feature activation-1.59e-03

Pos logit contributions
 sad+0.033
 so+0.028
 very+0.027
 but+0.026
 boy+0.025
Neg logit contributions
ross-0.050
rina-0.046
eh-0.045
andra-0.044
end-0.043
Token.
Token ID13
Feature activation+1.08e-02

Pos logit contributions
ross+0.004
rina+0.004
andra+0.004
ation+0.004
ressive+0.004
Neg logit contributions
 sad-0.002
 so-0.002
 very-0.002
 but-0.001
s-0.001
Token She
Token ID1375
Feature activation+1.25e-02

Pos logit contributions
 sad+0.006
 so+0.004
 very+0.004
 but+0.004
 boy+0.004
Neg logit contributions
ross-0.034
rina-0.032
ation-0.031
andra-0.030
ressive-0.030
Token spent
Token ID3377
Feature activation+1.26e-02

Pos logit contributions
 sad+0.016
 very+0.013
 so+0.013
 were+0.012
 boy+0.012
Neg logit contributions
ross-0.031
rina-0.028
andra-0.027
ation-0.027
ressive-0.027
Token the
Token ID262
Feature activation+7.54e-03

Pos logit contributions
 sad+0.011
 very+0.009
 so+0.008
 boy+0.007
 but+0.007
Neg logit contributions
ross-0.035
rina-0.033
andra-0.032
ation-0.031
ressive-0.031
Token day
Token ID1110
Feature activation+4.79e-02

Pos logit contributions
 sad+0.010
 very+0.009
 so+0.008
 boy+0.008
 but+0.007
Neg logit contributions
ross-0.018
rina-0.018
andra-0.017
isdom-0.016
ressive-0.016
Token flying
Token ID7348
Feature activation+1.03e-02

Pos logit contributions
 sad+0.061
 very+0.052
 so+0.051
 but+0.047
 boy+0.046
Neg logit contributions
ross-0.115
rina-0.108
andra-0.104
ation-0.101
ressive-0.099
Token the
Token ID262
Feature activation+6.50e-03

Pos logit contributions
 sad+0.010
 very+0.008
 so+0.008
 boy+0.007
 but+0.007
Neg logit contributions
ross-0.027
rina-0.025
andra-0.024
ation-0.024
ressive-0.023
Token black
Token ID2042
Feature activation+2.34e-04

Pos logit contributions
 sad+0.007
 very+0.005
 so+0.005
 boy+0.005
 but+0.005
Neg logit contributions
ross-0.017
rina-0.016
andra-0.016
ation-0.015
ressive-0.015
Token k
Token ID479
Feature activation+2.60e-02

Pos logit contributions
 sad+0.000
 very+0.000
 so+0.000
 boy+0.000
 but+0.000
Neg logit contributions
ross-0.001
rina-0.001
andra-0.001
ation-0.001
iet-0.001
Tokenite
Token ID578
Feature activation+9.36e-03

Pos logit contributions
 sad+0.035
 so+0.028
 very+0.028
 but+0.027
 boy+0.026
Neg logit contributions
ross-0.059
rina-0.054
eh-0.054
andra-0.052
ressive-0.052
Token.
Token ID13
Feature activation+1.98e-02

Pos logit contributions
 sad+0.016
 so+0.014
 very+0.014
 were+0.012
 but+0.012
Neg logit contributions
ross-0.041
andra-0.037
rina-0.037
ation-0.035
wen-0.035
Token She
Token ID1375
Feature activation+1.29e-02

Pos logit contributions
 sad+0.012
 so+0.008
 very+0.008
 but+0.007
 boy+0.007
Neg logit contributions
ross-0.062
rina-0.058
andra-0.056
ation-0.056
iet-0.055
Token spent
Token ID3377
Feature activation+1.39e-02

Pos logit contributions
 sad+0.014
 so+0.012
 very+0.012
 were+0.012
 boy+0.010
Neg logit contributions
ross-0.036
rina-0.031
andra-0.031
ation-0.031
ressive-0.031
Token the
Token ID262
Feature activation+1.14e-02

Pos logit contributions
 sad+0.011
 so+0.009
 very+0.009
 but+0.006
 boy+0.006
Neg logit contributions
ross-0.041
rina-0.039
andra-0.038
ressive-0.037
ation-0.037
Token whole
Token ID2187
Feature activation+1.56e-02

Pos logit contributions
 sad+0.014
 very+0.013
 boy+0.012
 so+0.012
 ladder+0.011
Neg logit contributions
ross-0.028
rina-0.027
andra-0.026
ressive-0.025
isdom-0.025
Token day
Token ID1110
Feature activation+4.79e-02

Pos logit contributions
 sad+0.021
 very+0.018
 so+0.018
 boy+0.017
 but+0.016
Neg logit contributions
ross-0.038
rina-0.036
andra-0.034
ressive-0.033
ation-0.033
Token collecting
Token ID13157
Feature activation+1.83e-02

Pos logit contributions
 sad+0.055
 very+0.046
 so+0.046
 but+0.041
 were+0.039
Neg logit contributions
ross-0.123
rina-0.118
andra-0.113
ation-0.110
ressive-0.108
Token more
Token ID517
Feature activation+9.18e-03

Pos logit contributions
 sad+0.019
 so+0.016
 very+0.016
 but+0.014
 boy+0.014
Neg logit contributions
ross-0.049
rina-0.046
andra-0.045
ation-0.043
ician-0.043
Token match
Token ID2872
Feature activation+7.02e-03

Pos logit contributions
 sad+0.009
 very+0.008
 so+0.008
 but+0.007
 boy+0.007
Neg logit contributions
ross-0.025
rina-0.023
andra-0.023
ation-0.022
ressive-0.022
Tokensticks
Token ID34810
Feature activation+8.02e-03

Pos logit contributions
 sad+0.006
 so+0.005
 very+0.005
 but+0.004
 were+0.004
Neg logit contributions
ross-0.020
rina-0.019
andra-0.018
ation-0.018
ician-0.018
Token,
Token ID11
Feature activation+1.07e-02

Pos logit contributions
 sad+0.005
 so+0.004
 very+0.004
 but+0.003
 were+0.003
Neg logit contributions
ross-0.025
rina-0.024
andra-0.023
ation-0.023
ician-0.022
TokenTim
Token ID14967
Feature activation-3.88e-03

Pos logit contributions
ross+0.012
rina+0.011
andra+0.011
ation+0.010
ressive+0.010
Neg logit contributions
 sad-0.009
 very-0.008
 so-0.008
 boy-0.008
 but-0.008
Tokenmy
Token ID1820
Feature activation+1.01e-02

Pos logit contributions
ross+0.008
rina+0.008
andra+0.007
ation+0.007
ressive+0.007
Neg logit contributions
 sad-0.006
 very-0.005
 so-0.005
 boy-0.005
 but-0.005
Token saw
Token ID2497
Feature activation-3.29e-03

Pos logit contributions
 sad+0.014
 cones+0.012
 penn+0.011
 very+0.011
 ladder+0.011
Neg logit contributions
ross-0.020
andra-0.020
rina-0.019
eh-0.018
ation-0.018
Token a
Token ID257
Feature activation+1.32e-02

Pos logit contributions
ross+0.006
rina+0.006
ation+0.006
andra+0.006
eh+0.006
Neg logit contributions
 sad-0.005
s-0.004
 ladder-0.004
 but-0.004
 very-0.004
Token cooler
Token ID19346
Feature activation+2.13e-02

Pos logit contributions
 sad+0.012
 were+0.012
 Some+0.012
 so+0.012
s+0.011
Neg logit contributions
ross-0.027
eh-0.026
andra-0.026
rina-0.026
ician-0.025
Token full
Token ID1336
Feature activation+4.74e-02

Pos logit contributions
 sad+0.022
 so+0.017
 very+0.016
 Some+0.015
 were+0.015
Neg logit contributions
ross-0.055
rina-0.048
andra-0.047
ation-0.047
eh-0.046
Token of
Token ID286
Feature activation+1.89e-02

Pos logit contributions
 sad+0.049
nam+0.040
s+0.038
 maybe+0.036
 cannot+0.034
Neg logit contributions
ross-0.100
eh-0.089
ilt-0.086
ife-0.086
andra-0.085
Token ice
Token ID4771
Feature activation+1.70e-02

Pos logit contributions
 sad+0.025
s+0.021
 but+0.019
 ladder+0.019
ho+0.018
Neg logit contributions
ross-0.032
eh-0.032
ation-0.031
asted-0.031
bet-0.030
Token cream
Token ID8566
Feature activation+1.62e-02

Pos logit contributions
 sad+0.035
 so+0.029
 Some+0.029
 very+0.028
 boy+0.028
Neg logit contributions
ross-0.026
eh-0.022
rina-0.022
at-0.021
ation-0.021
Token and
Token ID290
Feature activation-1.62e-02

Pos logit contributions
 sad+0.019
 very+0.014
 so+0.013
 boy+0.013
 but+0.012
Neg logit contributions
ross-0.040
rina-0.037
andra-0.036
ation-0.035
ressive-0.034
Token asked
Token ID1965
Feature activation+6.47e-03

Pos logit contributions
ross+0.042
rina+0.039
andra+0.038
ation+0.037
ressive+0.036
Neg logit contributions
 sad-0.017
 very-0.014
 so-0.013
 boy-0.013
 but-0.012
Token ice
Token ID4771
Feature activation+1.47e-02

Pos logit contributions
 sad+0.001
 very+0.001
 so+0.001
 boy+0.001
 but+0.001
Neg logit contributions
ross-0.002
rina-0.002
andra-0.002
ation-0.002
ressive-0.002
Token.
Token ID13
Feature activation+3.14e-03

Pos logit contributions
 sad+0.027
 so+0.023
 but+0.022
 very+0.022
 Some+0.022
Neg logit contributions
ross-0.025
rina-0.022
ation-0.022
eh-0.021
iet-0.020
Token She
Token ID1375
Feature activation+2.79e-02

Pos logit contributions
 sad+0.003
 very+0.002
 so+0.002
 boy+0.002
s+0.002
Neg logit contributions
ross-0.008
rina-0.008
andra-0.007
ation-0.007
ife-0.007
Token liked
Token ID8288
Feature activation+3.23e-02

Pos logit contributions
 sad+0.032
 so+0.026
 very+0.026
 boy+0.024
 but+0.024
Neg logit contributions
ross-0.069
rina-0.064
andra-0.062
ation-0.060
eh-0.059
Token to
Token ID284
Feature activation+1.05e-02

Pos logit contributions
 sad+0.040
 boy+0.028
s+0.026
 pocket+0.026
 slam+0.026
Neg logit contributions
ross-0.061
rina-0.059
ation-0.057
eh-0.055
andra-0.055
Token hold
Token ID1745
Feature activation+4.74e-02

Pos logit contributions
 sad+0.012
 very+0.010
 so+0.010
 boy+0.009
 but+0.009
Neg logit contributions
ross-0.027
rina-0.025
andra-0.024
ation-0.024
ressive-0.023
Token it
Token ID340
Feature activation+3.25e-03

Pos logit contributions
 sad+0.066
 hill+0.056
 boy+0.053
 ladder+0.052
 but+0.051
Neg logit contributions
ross-0.084
eh-0.075
rina-0.075
at-0.075
ation-0.074
Token in
Token ID287
Feature activation+1.09e-02

Pos logit contributions
 sad+0.004
 very+0.003
 so+0.003
 boy+0.003
 but+0.003
Neg logit contributions
ross-0.008
rina-0.008
andra-0.007
ation-0.007
ressive-0.007
Token her
Token ID607
Feature activation+1.88e-02

Pos logit contributions
 sad+0.018
 very+0.015
 so+0.015
 boy+0.015
s+0.014
Neg logit contributions
ross-0.021
rina-0.019
andra-0.019
ation-0.018
iet-0.017
Token hand
Token ID1021
Feature activation+1.45e-02

Pos logit contributions
 sad+0.021
 so+0.019
 but+0.018
 were+0.017
s+0.017
Neg logit contributions
ross-0.039
andra-0.037
iet-0.036
eh-0.036
rina-0.036
Token and
Token ID290
Feature activation-5.96e-03

Pos logit contributions
 sad+0.019
 very+0.016
 so+0.016
 but+0.015
 boy+0.015
Neg logit contributions
ross-0.034
rina-0.032
andra-0.031
ation-0.030
ressive-0.029
Token time
Token ID640
Feature activation-2.86e-03

Pos logit contributions
ross+0.048
rina+0.045
ation+0.043
ressive+0.043
andra+0.042
Neg logit contributions
 sad-0.049
 very-0.045
 so-0.043
 but-0.042
 boy-0.041
Token,
Token ID11
Feature activation-4.13e-03

Pos logit contributions
ross+0.009
rina+0.008
andra+0.008
ation+0.008
ressive+0.007
Neg logit contributions
 sad-0.002
 very-0.001
 so-0.001
 boy-0.001
 but-0.001
Token there
Token ID612
Feature activation-1.82e-03

Pos logit contributions
ross+0.012
rina+0.011
andra+0.011
ation+0.011
ressive+0.011
Neg logit contributions
 sad-0.003
 very-0.002
 so-0.002
 boy-0.002
 but-0.002
Token was
Token ID373
Feature activation-1.93e-02

Pos logit contributions
ross+0.005
rina+0.005
andra+0.004
ation+0.004
ressive+0.004
Neg logit contributions
 sad-0.002
 very-0.001
 so-0.001
 boy-0.001
 but-0.001
Token a
Token ID257
Feature activation+7.48e-03

Pos logit contributions
ross+0.053
rina+0.050
andra+0.048
ation+0.047
ressive+0.046
Neg logit contributions
 sad-0.019
 very-0.016
 so-0.016
 boy-0.013
 but-0.013
Token chip
Token ID11594
Feature activation+4.73e-02

Pos logit contributions
 sad+0.007
 so+0.006
 very+0.005
 were+0.005
 but+0.005
Neg logit contributions
ross-0.019
rina-0.018
andra-0.018
eh-0.017
iet-0.017
Tokenm
Token ID76
Feature activation+4.75e-03

Pos logit contributions
 sad+0.053
 so+0.046
 ladder+0.045
 rusty+0.044
 round+0.044
Neg logit contributions
ross-0.100
eh-0.098
rina-0.098
at-0.093
iet-0.089
Tokenunk
Token ID2954
Feature activation+1.23e-02

Pos logit contributions
 sad+0.009
 so+0.008
 very+0.008
 but+0.008
 boy+0.008
Neg logit contributions
ross-0.008
rina-0.007
andra-0.007
ation-0.006
iet-0.006
Token who
Token ID508
Feature activation-1.28e-03

Pos logit contributions
 sad+0.016
 very+0.015
 so+0.014
 boy+0.013
 but+0.013
Neg logit contributions
ross-0.029
rina-0.027
andra-0.027
ation-0.026
ressive-0.025
Token lived
Token ID5615
Feature activation+2.40e-03

Pos logit contributions
ross+0.004
rina+0.003
andra+0.003
ation+0.003
ressive+0.003
Neg logit contributions
 sad-0.001
 very-0.001
 were-0.001
 so-0.001
 boy-0.001
Token in
Token ID287
Feature activation-1.02e-02

Pos logit contributions
 sad+0.002
 very+0.002
 so+0.001
 boy+0.001
 but+0.001
Neg logit contributions
ross-0.007
andra-0.007
rina-0.007
ation-0.007
ressive-0.007
Token it
Token ID340
Feature activation-7.97e-03

Pos logit contributions
 sad+0.013
 so+0.010
 very+0.010
 but+0.008
 boy+0.008
Neg logit contributions
ross-0.046
rina-0.043
andra-0.042
ation-0.040
iet-0.040
Token.
Token ID13
Feature activation+1.55e-02

Pos logit contributions
ross+0.024
andra+0.024
rina+0.023
wen+0.023
iet+0.023
Neg logit contributions
 sad-0.006
 so-0.006
 very-0.006
 to-0.004
 but-0.004
Token She
Token ID1375
Feature activation+1.17e-02

Pos logit contributions
 sad+0.010
 so+0.007
 very+0.007
 boy+0.006
 but+0.006
Neg logit contributions
ross-0.047
rina-0.044
andra-0.042
ation-0.042
iet-0.041
Token spent
Token ID3377
Feature activation+1.20e-02

Pos logit contributions
 sad+0.013
 very+0.011
 so+0.011
 were+0.011
 boy+0.010
Neg logit contributions
ross-0.032
rina-0.028
andra-0.027
ation-0.027
ressive-0.027
Token the
Token ID262
Feature activation+8.30e-03

Pos logit contributions
 sad+0.009
 very+0.007
 so+0.007
 boy+0.005
 but+0.005
Neg logit contributions
ross-0.036
rina-0.034
andra-0.033
ation-0.032
ressive-0.032
Token morning
Token ID3329
Feature activation+4.73e-02

Pos logit contributions
 sad+0.011
 very+0.010
 boy+0.009
 so+0.009
 round+0.008
Neg logit contributions
ross-0.020
rina-0.020
andra-0.019
ressive-0.018
isdom-0.018
Token trying
Token ID2111
Feature activation+7.08e-03

Pos logit contributions
 sad+0.056
 very+0.048
 so+0.047
 but+0.041
 were+0.039
Neg logit contributions
ross-0.122
rina-0.117
andra-0.112
ation-0.109
ressive-0.107
Token to
Token ID284
Feature activation-1.60e-04

Pos logit contributions
 sad+0.005
 very+0.004
 so+0.004
 but+0.003
 boy+0.002
Neg logit contributions
ross-0.021
rina-0.020
andra-0.020
ation-0.019
ressive-0.019
Token yield
Token ID7800
Feature activation+1.44e-02

Pos logit contributions
ross+0.000
rina+0.000
andra+0.000
iet+0.000
anne+0.000
Neg logit contributions
 sad-0.000
 so-0.000
 very-0.000
 but-0.000
 were-0.000
Token the
Token ID262
Feature activation+8.69e-03

Pos logit contributions
 sad+0.012
 so+0.009
 very+0.009
 but+0.008
 boy+0.007
Neg logit contributions
ross-0.042
rina-0.039
andra-0.038
ation-0.037
ressive-0.037
Token cord
Token ID15050
Feature activation-3.43e-03

Pos logit contributions
 sad+0.011
 very+0.009
 so+0.009
 boy+0.008
 but+0.008
Neg logit contributions
ross-0.022
rina-0.020
andra-0.019
ation-0.019
ressive-0.018
Token his
Token ID465
Feature activation+1.25e-03

Pos logit contributions
 sad+0.022
 very+0.019
 so+0.018
 boy+0.017
 but+0.017
Neg logit contributions
ross-0.039
rina-0.036
andra-0.035
ation-0.034
ressive-0.033
Token animals
Token ID4695
Feature activation-1.26e-02

Pos logit contributions
 sad+0.001
 very+0.001
 so+0.001
 boy+0.001
 but+0.001
Neg logit contributions
ross-0.003
rina-0.003
andra-0.003
ation-0.003
ressive-0.003
Token.
Token ID13
Feature activation-3.47e-03

Pos logit contributions
ross+0.035
rina+0.032
andra+0.032
ation+0.031
iet+0.030
Neg logit contributions
 sad-0.012
 very-0.012
 so-0.012
 but-0.011
s-0.010
Token He
Token ID679
Feature activation+1.28e-02

Pos logit contributions
ross+0.009
rina+0.009
andra+0.009
ation+0.008
ressive+0.008
Neg logit contributions
 sad-0.003
 very-0.002
 so-0.002
 boy-0.002
 penn-0.002
Token puts
Token ID7584
Feature activation+2.25e-02

Pos logit contributions
 sad+0.015
 very+0.012
 so+0.012
 boy+0.011
 were+0.011
Neg logit contributions
ross-0.033
rina-0.030
andra-0.029
ation-0.029
ressive-0.028
Token each
Token ID1123
Feature activation+4.70e-02

Pos logit contributions
 sad+0.029
 very+0.024
 so+0.024
 boy+0.023
 but+0.022
Neg logit contributions
ross-0.052
rina-0.048
andra-0.046
ation-0.045
ressive-0.044
Token toy
Token ID13373
Feature activation+3.67e-03

Pos logit contributions
 sad+0.055
 so+0.044
 but+0.040
 very+0.040
 cones+0.040
Neg logit contributions
ross-0.103
eh-0.097
rina-0.095
ation-0.094
end-0.091
Token in
Token ID287
Feature activation+1.76e-02

Pos logit contributions
 sad+0.003
 so+0.003
 very+0.003
 boy+0.002
 but+0.002
Neg logit contributions
ross-0.010
rina-0.009
andra-0.009
ation-0.009
ressive-0.009
Token the
Token ID262
Feature activation+1.61e-02

Pos logit contributions
 sad+0.022
 boy+0.017
 very+0.017
s+0.017
 ladder+0.017
Neg logit contributions
ross-0.038
rina-0.036
andra-0.035
ation-0.035
ressive-0.033
Token right
Token ID826
Feature activation-7.09e-03

Pos logit contributions
 sad+0.018
 so+0.014
 very+0.013
 but+0.013
 were+0.012
Neg logit contributions
ross-0.038
rina-0.035
andra-0.034
ation-0.034
eh-0.033
Token place
Token ID1295
Feature activation+1.99e-02

Pos logit contributions
ross+0.017
rina+0.016
andra+0.015
ation+0.015
ressive+0.014
Neg logit contributions
 sad-0.009
 very-0.007
 so-0.007
 boy-0.007
 but-0.007
Token and
Token ID290
Feature activation-9.65e-04

Pos logit contributions
 sad+0.024
 very+0.019
 so+0.019
 boy+0.017
 but+0.017
Neg logit contributions
ross-0.064
rina-0.060
andra-0.058
ation-0.057
ressive-0.056
Token drink
Token ID4144
Feature activation+2.88e-02

Pos logit contributions
ross+0.002
rina+0.002
andra+0.002
ation+0.002
ressive+0.002
Neg logit contributions
 sad-0.001
 very-0.001
 so-0.001
 boy-0.001
 but-0.001
Token some
Token ID617
Feature activation+2.09e-02

Pos logit contributions
 sad+0.032
 very+0.026
 so+0.026
 boy+0.024
 but+0.023
Neg logit contributions
ross-0.072
rina-0.067
andra-0.065
ation-0.064
ressive-0.062
Token water
Token ID1660
Feature activation+6.14e-03

Pos logit contributions
 sad+0.025
 very+0.020
 so+0.020
 boy+0.019
 but+0.018
Neg logit contributions
ross-0.051
rina-0.048
andra-0.046
ation-0.045
ressive-0.044
Token.
Token ID13
Feature activation+1.09e-03

Pos logit contributions
 sad+0.006
 so+0.005
 very+0.005
 but+0.005
 boy+0.004
Neg logit contributions
ross-0.017
rina-0.016
andra-0.016
ation-0.015
ressive-0.015
Token Sara
Token ID24799
Feature activation+4.69e-02

Pos logit contributions
 sad+0.001
 very+0.001
 so+0.001
 ladder+0.001
 boy+0.001
Neg logit contributions
ross-0.003
rina-0.002
andra-0.002
ation-0.002
ressive-0.002
Token,
Token ID11
Feature activation+2.21e-02

Pos logit contributions
 sad+0.042
nam+0.037
 ladder+0.034
 so+0.033
 round+0.033
Neg logit contributions
ross-0.112
rina-0.107
ation-0.101
andra-0.100
end-0.098
Token you
Token ID345
Feature activation+2.01e-02

Pos logit contributions
 sad+0.027
s+0.021
 very+0.021
 ladder+0.021
 round+0.021
Neg logit contributions
ross-0.048
rina-0.044
andra-0.043
ation-0.041
ife-0.041
Token need
Token ID761
Feature activation+1.94e-02

Pos logit contributions
 round+0.024
 ladder+0.024
nam+0.023
 cones+0.023
 sad+0.023
Neg logit contributions
rina-0.034
end-0.033
ross-0.033
ils-0.031
eded-0.031
Token to
Token ID284
Feature activation-2.05e-03

Pos logit contributions
 sad+0.013
 very+0.009
 so+0.009
 boy+0.008
 but+0.007
Neg logit contributions
ross-0.057
rina-0.054
andra-0.052
ation-0.051
ressive-0.050
Token wash
Token ID13502
Feature activation+2.07e-02

Pos logit contributions
ross+0.005
rina+0.005
andra+0.005
ressive+0.004
ation+0.004
Neg logit contributions
 sad-0.003
 very-0.002
 so-0.002
 but-0.002
 boy-0.002
Tokenite
Token ID578
Feature activation-7.76e-03

Pos logit contributions
 sad+0.028
 so+0.022
 very+0.022
 but+0.020
 boy+0.019
Neg logit contributions
ross-0.060
rina-0.056
eh-0.053
andra-0.053
ation-0.053
Token.
Token ID13
Feature activation+7.39e-03

Pos logit contributions
ross+0.021
andra+0.020
rina+0.020
ation+0.019
iet+0.019
Neg logit contributions
 so-0.008
 sad-0.008
 very-0.007
 but-0.007
s-0.006
Token He
Token ID679
Feature activation+4.88e-03

Pos logit contributions
 sad+0.004
 so+0.003
 very+0.003
 but+0.003
 boy+0.003
Neg logit contributions
ross-0.023
rina-0.022
ation-0.021
andra-0.021
ressive-0.020
Token spent
Token ID3377
Feature activation+9.37e-03

Pos logit contributions
 sad+0.006
 very+0.005
 so+0.005
 were+0.005
 boy+0.005
Neg logit contributions
ross-0.013
rina-0.011
andra-0.011
ation-0.011
ressive-0.011
Token the
Token ID262
Feature activation+8.45e-03

Pos logit contributions
 sad+0.007
 so+0.006
 very+0.006
 but+0.004
 boy+0.004
Neg logit contributions
ross-0.028
rina-0.026
andra-0.025
ressive-0.024
ation-0.024
Token day
Token ID1110
Feature activation+4.68e-02

Pos logit contributions
 sad+0.011
 very+0.010
 so+0.010
 boy+0.009
 but+0.009
Neg logit contributions
ross-0.020
rina-0.019
andra-0.018
ressive-0.017
isdom-0.017
Token playing
Token ID2712
Feature activation+2.57e-02

Pos logit contributions
 sad+0.063
 very+0.054
 so+0.053
 boy+0.049
 but+0.049
Neg logit contributions
ross-0.109
rina-0.102
andra-0.098
ation-0.095
ressive-0.093
Token with
Token ID351
Feature activation+8.85e-03

Pos logit contributions
 sad+0.019
 very+0.015
 so+0.015
 but+0.013
 boy+0.012
Neg logit contributions
ross-0.077
rina-0.074
andra-0.072
ation-0.069
ressive-0.067
Token it
Token ID340
Feature activation+5.14e-03

Pos logit contributions
 sad+0.010
 so+0.008
 very+0.008
 boy+0.007
 but+0.007
Neg logit contributions
ross-0.024
rina-0.022
andra-0.021
ation-0.021
ressive-0.020
Token,
Token ID11
Feature activation+9.88e-03

Pos logit contributions
 sad+0.004
 so+0.004
 very+0.004
 but+0.003
 were+0.003
Neg logit contributions
ross-0.015
andra-0.015
rina-0.015
ician-0.014
ation-0.014
Token but
Token ID475
Feature activation+1.35e-02

Pos logit contributions
 sad+0.011
 so+0.009
 very+0.009
 but+0.008
 boy+0.008
Neg logit contributions
ross-0.026
rina-0.024
andra-0.023
ation-0.023
ressive-0.022
Token Emma
Token ID18966
Feature activation-3.38e-03

Pos logit contributions
 sad+0.005
 very+0.004
 so+0.004
 boy+0.004
 but+0.004
Neg logit contributions
ross-0.006
rina-0.005
andra-0.005
ation-0.005
ressive-0.005
Token took
Token ID1718
Feature activation+6.10e-04

Pos logit contributions
ross+0.009
rina+0.008
ation+0.007
andra+0.007
ressive+0.007
Neg logit contributions
 sad-0.004
 very-0.004
 so-0.004
 boy-0.003
 were-0.003
Token a
Token ID257
Feature activation+7.71e-03

Pos logit contributions
 sad+0.001
 very+0.001
 so+0.001
 boy+0.001
 but+0.001
Neg logit contributions
ross-0.001
rina-0.001
andra-0.001
ation-0.001
ressive-0.001
Token smooth
Token ID7209
Feature activation+3.44e-03

Pos logit contributions
 sad+0.009
 cones+0.009
 so+0.009
 were+0.008
 very+0.008
Neg logit contributions
ross-0.015
eh-0.015
rina-0.014
andra-0.014
iet-0.014
Token ice
Token ID4771
Feature activation+1.93e-02

Pos logit contributions
 sad+0.005
 very+0.004
 so+0.004
 were+0.004
 cones+0.004
Neg logit contributions
ross-0.007
ation-0.007
rina-0.006
eh-0.006
andra-0.006
Token pack
Token ID2353
Feature activation+4.66e-02

Pos logit contributions
 sad+0.035
 Some+0.029
 so+0.028
 were+0.028
 very+0.028
Neg logit contributions
ross-0.035
eh-0.029
ation-0.028
rina-0.028
at-0.027
Token and
Token ID290
Feature activation-1.36e-02

Pos logit contributions
 sad+0.045
 boy+0.032
 ladder+0.030
 very+0.030
 cones+0.028
Neg logit contributions
ross-0.121
rina-0.108
andra-0.106
ation-0.104
eh-0.102
Token put
Token ID1234
Feature activation+1.63e-02

Pos logit contributions
ross+0.035
rina+0.033
andra+0.031
iet+0.030
ation+0.030
Neg logit contributions
 sad-0.017
 so-0.015
 very-0.015
 were-0.013
 but-0.013
Token it
Token ID340
Feature activation+8.03e-04

Pos logit contributions
 sad+0.010
 very+0.007
 so+0.007
 boy+0.005
 but+0.005
Neg logit contributions
ross-0.050
rina-0.048
andra-0.046
ation-0.045
ressive-0.045
Token on
Token ID319
Feature activation+3.49e-03

Pos logit contributions
 sad+0.001
 very+0.001
 so+0.001
 but+0.000
 were+0.000
Neg logit contributions
ross-0.002
andra-0.002
rina-0.002
ation-0.002
ressive-0.002
Token Lily
Token ID20037
Feature activation+8.78e-03

Pos logit contributions
 sad+0.004
 very+0.004
 so+0.004
 but+0.003
 boy+0.003
Neg logit contributions
ross-0.009
rina-0.009
andra-0.008
ation-0.008
ressive-0.008
Token if
Token ID611
Feature activation+3.94e-03

Pos logit contributions
ross+0.005
rina+0.004
andra+0.004
ation+0.004
eh+0.004
Neg logit contributions
 sad-0.002
 very-0.002
 so-0.002
 but-0.002
 boy-0.002
Token she
Token ID673
Feature activation-1.24e-02

Pos logit contributions
 sad+0.003
 very+0.002
 so+0.002
 boy+0.002
 but+0.002
Neg logit contributions
ross-0.012
rina-0.011
andra-0.011
ation-0.011
ressive-0.010
Token could
Token ID714
Feature activation+1.46e-03

Pos logit contributions
ross+0.037
rina+0.035
andra+0.034
ation+0.033
ressive+0.033
Neg logit contributions
 sad-0.008
 very-0.006
 so-0.006
 boy-0.005
 but-0.004
Token talk
Token ID1561
Feature activation+7.16e-03

Pos logit contributions
 sad+0.001
 very+0.001
 so+0.001
 but+0.001
 were+0.001
Neg logit contributions
ross-0.004
rina-0.004
andra-0.004
eh-0.004
ation-0.004
Token about
Token ID546
Feature activation-1.58e-02

Pos logit contributions
 sad+0.005
 so+0.004
 very+0.004
 but+0.003
 boy+0.003
Neg logit contributions
ross-0.022
rina-0.020
andra-0.020
ation-0.020
eh-0.019
Token microsc
Token ID21145
Feature activation-6.94e-02

Pos logit contributions
ross+0.047
rina+0.043
andra+0.042
ation+0.041
iet+0.040
Neg logit contributions
 sad-0.013
 very-0.011
 so-0.011
 but-0.009
 boy-0.008
Tokenopes
Token ID13920
Feature activation+1.46e-03

Pos logit contributions
ross+0.088
eh+0.076
andra+0.075
rina+0.070
ation+0.067
Neg logit contributions
 so-0.145
 sad-0.145
 very-0.141
 old-0.135
 round-0.135
Token.
Token ID13
Feature activation+1.02e-03

Pos logit contributions
 sad+0.001
 so+0.001
 very+0.001
 but+0.001
 were+0.001
Neg logit contributions
ross-0.004
rina-0.004
andra-0.004
ation-0.004
iet-0.003
Token
Token ID198
Feature activation+3.83e-03

Pos logit contributions
 sad+0.001
 very+0.001
 so+0.001
 boy+0.001
 but+0.001
Neg logit contributions
ross-0.003
rina-0.003
andra-0.003
ation-0.002
ressive-0.002
Token
Token ID198
Feature activation+1.17e-04

Pos logit contributions
 sad+0.003
 very+0.002
 so+0.002
 boy+0.002
 but+0.001
Neg logit contributions
ross-0.011
rina-0.011
andra-0.010
ation-0.010
ressive-0.010
TokenHer
Token ID9360
Feature activation-2.84e-02

Pos logit contributions
 sad+0.000
 very+0.000
 so+0.000
 boy+0.000
 but+0.000
Neg logit contributions
ross-0.000
rina-0.000
andra-0.000
ation-0.000
ressive-0.000
Token had
Token ID550
Feature activation-2.64e-02

Pos logit contributions
ross+0.035
rina+0.031
andra+0.030
ressive+0.030
ation+0.030
Neg logit contributions
 sad-0.014
 very-0.012
 so-0.011
 were-0.011
 boy-0.010
Token to
Token ID284
Feature activation-7.52e-03

Pos logit contributions
ross+0.074
rina+0.068
aghetti+0.066
ician+0.065
iet+0.064
Neg logit contributions
 so-0.026
 sad-0.026
 very-0.025
 many-0.021
 were-0.021
Token know
Token ID760
Feature activation-3.42e-03

Pos logit contributions
ross+0.020
rina+0.019
eh+0.018
andra+0.018
ressive+0.017
Neg logit contributions
 sad-0.008
 so-0.007
 very-0.007
 but-0.006
 were-0.006
Token all
Token ID477
Feature activation-1.25e-02

Pos logit contributions
ross+0.009
rina+0.008
andra+0.008
ation+0.008
ressive+0.007
Neg logit contributions
 sad-0.004
 very-0.003
 so-0.003
 but-0.003
 boy-0.003
Token about
Token ID546
Feature activation-7.60e-03

Pos logit contributions
ross+0.035
rina+0.033
andra+0.032
ation+0.031
ressive+0.031
Neg logit contributions
 sad-0.011
 so-0.009
 very-0.009
 but-0.007
 boy-0.007
Token microsc
Token ID21145
Feature activation-6.89e-02

Pos logit contributions
ross+0.021
rina+0.019
andra+0.019
ation+0.018
eh+0.018
Neg logit contributions
 sad-0.008
 very-0.007
 so-0.007
 but-0.006
 were-0.005
Tokenopes
Token ID13920
Feature activation-6.97e-03

Pos logit contributions
ross+0.090
andra+0.076
rina+0.072
eh+0.072
ician+0.069
Neg logit contributions
 so-0.145
 very-0.141
 sad-0.141
s-0.138
 cones-0.135
Token.
Token ID13
Feature activation+3.44e-03

Pos logit contributions
ross+0.020
rina+0.019
andra+0.019
ation+0.018
ician+0.018
Neg logit contributions
 sad-0.006
 so-0.005
 very-0.005
 but-0.004
 were-0.004
Token
Token ID220
Feature activation-4.73e-03

Pos logit contributions
 sad+0.003
 very+0.002
 so+0.002
 boy+0.002
 cones+0.002
Neg logit contributions
ross-0.009
rina-0.009
andra-0.008
ation-0.008
ressive-0.008
Token
Token ID198
Feature activation+5.60e-03

Pos logit contributions
ross+0.015
rina+0.014
andra+0.014
ation+0.014
ressive+0.013
Neg logit contributions
 sad-0.002
 very-0.001
 so-0.001
 boy-0.001
 but-0.001
Token
Token ID198
Feature activation-5.75e-04

Pos logit contributions
 sad+0.004
 very+0.003
 so+0.003
 boy+0.003
 cones+0.003
Neg logit contributions
ross-0.016
rina-0.015
andra-0.014
ation-0.014
end-0.014
Token the
Token ID262
Feature activation-1.52e-02

Pos logit contributions
ross+0.036
rina+0.034
andra+0.032
ation+0.032
ressive+0.031
Neg logit contributions
 sad-0.013
 very-0.010
 so-0.010
 boy-0.009
 but-0.009
Token stars
Token ID5788
Feature activation-6.89e-03

Pos logit contributions
ross+0.028
rina+0.026
andra+0.024
ation+0.024
ressive+0.023
Neg logit contributions
 sad-0.026
 so-0.022
 very-0.022
 cones-0.021
 boy-0.021
Token and
Token ID290
Feature activation-2.14e-02

Pos logit contributions
ross+0.019
rina+0.018
andra+0.018
ation+0.017
ressive+0.017
Neg logit contributions
 sad-0.006
 very-0.004
 so-0.004
 boy-0.004
 but-0.004
Token planets
Token ID14705
Feature activation-1.88e-02

Pos logit contributions
ross+0.048
rina+0.045
andra+0.044
ation+0.043
ressive+0.041
Neg logit contributions
 sad-0.028
 very-0.023
 boy-0.023
 so-0.022
 ladder-0.021
Token in
Token ID287
Feature activation-1.00e-02

Pos logit contributions
ross+0.048
rina+0.045
andra+0.044
ation+0.043
ressive+0.042
Neg logit contributions
 sad-0.022
 very-0.019
 so-0.019
 but-0.017
 were-0.017
Token outer
Token ID12076
Feature activation-6.60e-02

Pos logit contributions
ross+0.024
rina+0.023
andra+0.022
ation+0.022
iet+0.021
Neg logit contributions
 sad-0.012
 very-0.009
 so-0.009
 boy-0.009
s-0.008
Token space
Token ID2272
Feature activation-1.35e-02

Pos logit contributions
ross+0.162
rina+0.151
andra+0.145
ation+0.142
ressive+0.140
Neg logit contributions
 sad-0.080
 very-0.069
 so-0.068
 boy-0.062
 but-0.061
Token.
Token ID13
Feature activation+1.59e-03

Pos logit contributions
ross+0.034
rina+0.032
andra+0.031
ressive+0.030
ation+0.030
Neg logit contributions
 sad-0.016
 very-0.014
 so-0.014
 but-0.013
 were-0.012
Token So
Token ID1406
Feature activation+4.28e-03

Pos logit contributions
 sad+0.002
 very+0.001
 penn+0.001
 boy+0.001
 so+0.001
Neg logit contributions
ross-0.004
andra-0.004
rina-0.004
anne-0.004
end-0.004
Token one
Token ID530
Feature activation-6.35e-04

Pos logit contributions
 sad+0.004
 very+0.003
 so+0.003
 boy+0.002
 but+0.002
Neg logit contributions
ross-0.012
rina-0.011
andra-0.011
ation-0.011
ressive-0.011
Token day
Token ID1110
Feature activation+3.07e-03

Pos logit contributions
ross+0.002
rina+0.001
andra+0.001
ation+0.001
ressive+0.001
Neg logit contributions
 sad-0.001
 very-0.001
 so-0.001
 boy-0.001
 but-0.001
Token to
Token ID284
Feature activation-3.46e-03

Pos logit contributions
ross+0.014
rina+0.013
andra+0.013
ation+0.013
iet+0.012
Neg logit contributions
 sad-0.005
 so-0.004
 very-0.004
 but-0.003
 boy-0.003
Token be
Token ID307
Feature activation-1.51e-02

Pos logit contributions
ross+0.008
rina+0.007
andra+0.007
ation+0.007
ressive+0.007
Neg logit contributions
 sad-0.005
 very-0.004
 so-0.004
 boy-0.004
s-0.004
Token restored
Token ID15032
Feature activation-7.96e-03

Pos logit contributions
ross+0.042
rina+0.039
andra+0.038
iet+0.037
ife+0.036
Neg logit contributions
 sad-0.019
 very-0.018
 so-0.017
 old-0.013
 but-0.013
Token to
Token ID284
Feature activation-1.65e-02

Pos logit contributions
ross+0.021
andra+0.019
rina+0.019
ation+0.018
iet+0.018
Neg logit contributions
 sad-0.009
 so-0.008
 very-0.008
 but-0.008
 boy-0.007
Token its
Token ID663
Feature activation-2.91e-02

Pos logit contributions
ross+0.044
rina+0.041
andra+0.040
iet+0.039
eh+0.038
Neg logit contributions
 sad-0.018
 so-0.015
 very-0.015
 but-0.013
 boy-0.013
Token former
Token ID1966
Feature activation-6.17e-02

Pos logit contributions
ross+0.065
rina+0.059
andra+0.058
ation+0.055
iet+0.054
Neg logit contributions
 sad-0.046
 very-0.040
 so-0.039
 boy-0.037
 but-0.036
Token glory
Token ID13476
Feature activation-1.46e-02

Pos logit contributions
ross+0.102
rina+0.090
andra+0.087
ressive+0.081
ation+0.081
Neg logit contributions
 sad-0.124
 very-0.112
 so-0.112
 but-0.105
 old-0.103
Token."
Token ID526
Feature activation-4.02e-04

Pos logit contributions
ross+0.035
rina+0.032
andra+0.032
ation+0.031
iet+0.030
Neg logit contributions
 sad-0.019
 so-0.017
 very-0.017
 but-0.016
 boy-0.014
Token
Token ID220
Feature activation-9.84e-03

Pos logit contributions
ross+0.001
rina+0.001
andra+0.001
ation+0.001
end+0.001
Neg logit contributions
 sad-0.000
 cones-0.000
 very-0.000
 ladder-0.000
 boy-0.000
Token
Token ID198
Feature activation+3.53e-03

Pos logit contributions
ross+0.031
rina+0.029
andra+0.028
ation+0.028
ressive+0.027
Neg logit contributions
 sad-0.005
 very-0.003
 so-0.003
 boy-0.002
 but-0.002
Token
Token ID198
Feature activation+8.11e-04

Pos logit contributions
 sad+0.003
 cones+0.002
 boy+0.002
 very+0.002
 ladder+0.002
Neg logit contributions
ross-0.009
rina-0.009
andra-0.009
eh-0.008
end-0.008
Token the
Token ID262
Feature activation+7.86e-03

Pos logit contributions
ross+0.005
rina+0.004
andra+0.004
ation+0.004
ressive+0.004
Neg logit contributions
 sad-0.001
 so-0.001
 very-0.001
 but-0.001
 boy-0.001
Token tree
Token ID5509
Feature activation+3.30e-03

Pos logit contributions
 sad+0.007
 so+0.006
 very+0.005
 but+0.005
s+0.005
Neg logit contributions
ross-0.021
rina-0.019
andra-0.019
ation-0.019
eh-0.018
Token to
Token ID284
Feature activation-1.45e-02

Pos logit contributions
 sad+0.003
 very+0.002
 so+0.002
 but+0.002
 boy+0.002
Neg logit contributions
ross-0.009
rina-0.009
andra-0.008
ation-0.008
ressive-0.008
Token its
Token ID663
Feature activation-2.41e-02

Pos logit contributions
ross+0.039
rina+0.036
andra+0.035
iet+0.034
ation+0.034
Neg logit contributions
 sad-0.015
 very-0.013
 so-0.013
 but-0.012
 were-0.011
Token once
Token ID1752
Feature activation+4.77e-03

Pos logit contributions
ross+0.060
rina+0.055
andra+0.054
ation+0.051
iet+0.051
Neg logit contributions
 sad-0.031
 very-0.027
 so-0.026
 boy-0.023
 but-0.023
Token former
Token ID1966
Feature activation-6.04e-02

Pos logit contributions
 sad+0.006
 very+0.005
 so+0.005
 but+0.004
 boy+0.004
Neg logit contributions
ross-0.012
rina-0.011
andra-0.011
ation-0.011
ressive-0.011
Token beauty
Token ID8737
Feature activation-4.36e-03

Pos logit contributions
ross+0.101
rina+0.090
andra+0.088
ician+0.082
ressive+0.081
Neg logit contributions
 sad-0.119
 so-0.108
 very-0.108
 old-0.101
 but-0.100
Token.
Token ID13
Feature activation+1.41e-03

Pos logit contributions
ross+0.012
rina+0.011
andra+0.011
ation+0.010
ician+0.010
Neg logit contributions
 sad-0.004
 so-0.004
 very-0.004
 but-0.004
 were-0.003
Token He
Token ID679
Feature activation-2.52e-04

Pos logit contributions
 sad+0.001
 very+0.001
 so+0.001
 boy+0.001
 but+0.001
Neg logit contributions
ross-0.004
rina-0.004
andra-0.004
ation-0.003
ressive-0.003
Token was
Token ID373
Feature activation+6.63e-03

Pos logit contributions
ross+0.001
rina+0.001
andra+0.001
ation+0.001
iet+0.001
Neg logit contributions
 sad-0.000
 very-0.000
 so-0.000
 were-0.000
 boy-0.000
Token happy
Token ID3772
Feature activation+4.38e-03

Pos logit contributions
 sad+0.004
 so+0.003
 very+0.003
 but+0.001
 were+0.001
Neg logit contributions
ross-0.022
rina-0.020
andra-0.020
ation-0.020
iet-0.019
Token big
Token ID1263
Feature activation+6.61e-03

Pos logit contributions
 sad+0.001
 were+0.001
 so+0.001
 cones+0.001
s+0.001
Neg logit contributions
ross-0.004
rina-0.004
andra-0.003
eh-0.003
iet-0.003
Token book
Token ID1492
Feature activation-7.82e-03

Pos logit contributions
 sad+0.008
s+0.007
 so+0.006
 were+0.006
 Some+0.006
Neg logit contributions
ross-0.014
rina-0.013
andra-0.013
eh-0.012
ation-0.012
Token with
Token ID351
Feature activation-1.87e-02

Pos logit contributions
ross+0.021
rina+0.020
andra+0.020
ation+0.019
ressive+0.019
Neg logit contributions
 sad-0.007
 very-0.006
 so-0.006
 but-0.005
 boy-0.005
Token pictures
Token ID5986
Feature activation-8.01e-03

Pos logit contributions
ross+0.049
rina+0.046
andra+0.044
ation+0.044
ressive+0.043
Neg logit contributions
 sad-0.018
 very-0.014
 so-0.014
 boy-0.013
 but-0.013
Token of
Token ID286
Feature activation-5.84e-03

Pos logit contributions
ross+0.023
rina+0.022
andra+0.021
ation+0.021
iet+0.020
Neg logit contributions
 sad-0.006
 so-0.006
 very-0.005
 but-0.005
 were-0.004
Token volcan
Token ID17516
Feature activation-5.73e-02

Pos logit contributions
ross+0.013
rina+0.013
andra+0.012
ation+0.012
ressive+0.012
Neg logit contributions
 sad-0.007
 so-0.006
 very-0.006
 boy-0.005
 ladder-0.005
Tokenoes
Token ID3028
Feature activation-1.93e-02

Pos logit contributions
ross+0.131
rina+0.119
andra+0.116
ation+0.114
ressive+0.110
Neg logit contributions
 sad-0.077
 very-0.068
 so-0.067
 boy-0.062
 but-0.061
Token.
Token ID13
Feature activation+7.71e-03

Pos logit contributions
ross+0.051
rina+0.047
andra+0.046
ation+0.045
iet+0.044
Neg logit contributions
 sad-0.021
 so-0.019
 very-0.019
 but-0.018
 were-0.017
Token Vol
Token ID4709
Feature activation-1.82e-02

Pos logit contributions
 sad+0.010
 very+0.009
 penn+0.008
 kay+0.007
 so+0.007
Neg logit contributions
andra-0.016
ross-0.015
ife-0.014
rina-0.013
ation-0.013
Tokencan
Token ID5171
Feature activation+7.44e-03

Pos logit contributions
ross+0.040
rina+0.037
andra+0.036
eh+0.035
ation+0.034
Neg logit contributions
 sad-0.023
s-0.021
 were-0.021
 boy-0.020
 very-0.020
Tokenoes
Token ID3028
Feature activation-2.39e-03

Pos logit contributions
 sad+0.009
 very+0.007
 so+0.007
 boy+0.007
 but+0.007
Neg logit contributions
ross-0.018
rina-0.017
andra-0.017
ation-0.016
ressive-0.016
Token"
Token ID1
Feature activation-2.09e-03

Pos logit contributions
ross+0.015
rina+0.014
andra+0.013
ation+0.013
ressive+0.012
Neg logit contributions
 sad-0.009
 very-0.008
 so-0.008
 boy-0.008
 but-0.007
TokenWhat
Token ID2061
Feature activation-2.67e-02

Pos logit contributions
ross+0.005
rina+0.005
andra+0.004
ation+0.004
eh+0.004
Neg logit contributions
 sad-0.003
 very-0.002
 boy-0.002
 so-0.002
 but-0.002
Token is
Token ID318
Feature activation-2.34e-02

Pos logit contributions
ross+0.074
rina+0.069
andra+0.067
ation+0.066
ressive+0.064
Neg logit contributions
 sad-0.024
 very-0.019
 so-0.018
 boy-0.017
 but-0.016
Token famous
Token ID5863
Feature activation-1.48e-02

Pos logit contributions
ross+0.072
rina+0.068
andra+0.067
ation+0.064
iet+0.063
Neg logit contributions
 sad-0.016
 very-0.013
 so-0.012
 round-0.010
 but-0.009
Token?"
Token ID1701
Feature activation-1.81e-02

Pos logit contributions
ross+0.034
rina+0.032
andra+0.032
eh+0.030
wen+0.029
Neg logit contributions
 sad-0.022
 so-0.020
 but-0.019
 very-0.018
 boy-0.018
Token she
Token ID673
Feature activation-5.70e-02

Pos logit contributions
ross+0.046
andra+0.043
rina+0.042
ation+0.040
iet+0.039
Neg logit contributions
 sad-0.017
 very-0.014
 so-0.013
 ladder-0.012
 boy-0.012
Token asks
Token ID7893
Feature activation-3.30e-03

Pos logit contributions
ross+0.143
rina+0.123
ressive+0.120
eh+0.119
andra+0.119
Neg logit contributions
 sad-0.081
 very-0.066
 so-0.065
 boy-0.063
 but-0.060
Token.
Token ID13
Feature activation-2.63e-03

Pos logit contributions
ross+0.007
rina+0.007
andra+0.006
ation+0.006
ressive+0.006
Neg logit contributions
 sad-0.005
 very-0.004
 so-0.004
 boy-0.004
 but-0.004
Token
Token ID198
Feature activation-2.88e-03

Pos logit contributions
ross+0.008
rina+0.007
andra+0.007
ation+0.007
ressive+0.007
Neg logit contributions
 sad-0.002
 very-0.001
 so-0.001
 boy-0.001
 were-0.001
Token
Token ID198
Feature activation-2.62e-03

Pos logit contributions
ross+0.009
rina+0.008
andra+0.008
ation+0.008
ressive+0.008
Neg logit contributions
 sad-0.002
 very-0.001
 so-0.001
 boy-0.001
 but-0.001
TokenBen
Token ID11696
Feature activation-1.75e-02

Pos logit contributions
ross+0.006
rina+0.006
ation+0.005
andra+0.005
ressive+0.005
Neg logit contributions
 sad-0.004
 very-0.003
 so-0.003
 boy-0.003
s-0.003
Token Do
Token ID2141
Feature activation+2.54e-03

Pos logit contributions
 sad+0.001
 very+0.000
 so+0.000
 boy+0.000
 but+0.000
Neg logit contributions
ross-0.002
rina-0.002
andra-0.002
ation-0.002
ressive-0.002
Token you
Token ID345
Feature activation-1.22e-02

Pos logit contributions
 sad+0.004
 boy+0.003
 ladder+0.003
 very+0.003
 so+0.003
Neg logit contributions
ross-0.005
rina-0.005
andra-0.005
end-0.005
ation-0.005
Token like
Token ID588
Feature activation+6.61e-04

Pos logit contributions
ross+0.034
rina+0.031
ation+0.030
andra+0.030
ressive+0.029
Neg logit contributions
 sad-0.012
 very-0.010
 so-0.010
 but-0.009
 were-0.008
Token d
Token ID288
Feature activation-1.86e-02

Pos logit contributions
 sad+0.001
 very+0.000
 so+0.000
 boy+0.000
 but+0.000
Neg logit contributions
ross-0.002
rina-0.002
andra-0.002
ation-0.002
ressive-0.002
Tokenaff
Token ID2001
Feature activation-4.64e-02

Pos logit contributions
ross+0.032
rina+0.029
andra+0.027
ation+0.027
ressive+0.026
Neg logit contributions
 sad-0.036
 very-0.032
 so-0.032
 boy-0.030
 but-0.030
Tokenod
Token ID375
Feature activation-5.69e-02

Pos logit contributions
ross+0.086
andra+0.071
ation+0.070
rina+0.070
icol+0.067
Neg logit contributions
 sad-0.088
 very-0.079
 so-0.077
s-0.077
 were-0.074
Tokenils
Token ID4487
Feature activation-8.47e-03

Pos logit contributions
ross+0.110
andra+0.086
ation+0.084
icol+0.084
esh+0.083
Neg logit contributions
 sad-0.104
s-0.098
 very-0.096
 so-0.093
 were-0.090
Token?"
Token ID1701
Feature activation-1.76e-02

Pos logit contributions
ross+0.017
rina+0.016
andra+0.015
ation+0.015
ressive+0.014
Neg logit contributions
 sad-0.014
 very-0.012
 so-0.012
 boy-0.011
 but-0.011
Token Ben
Token ID3932
Feature activation-3.20e-02

Pos logit contributions
ross+0.039
andra+0.037
rina+0.036
ation+0.034
eh+0.034
Neg logit contributions
 sad-0.018
 so-0.016
 very-0.016
 ladder-0.014
 were-0.014
Token asks
Token ID7893
Feature activation+7.63e-04

Pos logit contributions
ross+0.078
rina+0.069
andra+0.067
ation+0.066
ressive+0.065
Neg logit contributions
 sad-0.044
 so-0.037
 very-0.037
 boy-0.035
 but-0.035
Token.
Token ID13
Feature activation+1.66e-03

Pos logit contributions
 sad+0.001
 so+0.001
 very+0.001
 boy+0.001
 ladder+0.001
Neg logit contributions
ross-0.002
rina-0.001
andra-0.001
ation-0.001
ressive-0.001
Token,
Token ID11
Feature activation-4.92e-03

Pos logit contributions
 sad+0.001
 so+0.001
 boy+0.001
 very+0.001
 but+0.001
Neg logit contributions
ross-0.010
rina-0.009
ation-0.009
iet-0.009
eh-0.008
Token what
Token ID644
Feature activation-1.87e-02

Pos logit contributions
ross+0.011
rina+0.010
andra+0.010
ressive+0.010
ation+0.009
Neg logit contributions
 sad-0.006
s-0.005
 very-0.005
 cones-0.005
 so-0.005
Token's
Token ID338
Feature activation-1.83e-02

Pos logit contributions
ross+0.049
rina+0.046
andra+0.045
ation+0.044
ressive+0.043
Neg logit contributions
 sad-0.018
 so-0.013
 very-0.013
 boy-0.012
 but-0.012
Token that
Token ID326
Feature activation-3.44e-03

Pos logit contributions
ross+0.057
rina+0.054
andra+0.053
ation+0.051
ressive+0.050
Neg logit contributions
 sad-0.011
 very-0.008
 so-0.008
 boy-0.006
 but-0.005
Token?"
Token ID1701
Feature activation-1.28e-02

Pos logit contributions
ross+0.007
rina+0.007
andra+0.006
ation+0.006
ressive+0.006
Neg logit contributions
 sad-0.006
 so-0.005
 very-0.005
 boy-0.005
 but-0.005
Token she
Token ID673
Feature activation-5.66e-02

Pos logit contributions
ross+0.031
rina+0.029
andra+0.028
ation+0.027
ressive+0.026
Neg logit contributions
 sad-0.016
 very-0.013
 so-0.013
 boy-0.012
 but-0.012
Token asked
Token ID1965
Feature activation-2.63e-03

Pos logit contributions
ross+0.156
ressive+0.137
rina+0.135
ation+0.134
aghetti+0.133
Neg logit contributions
 sad-0.069
 very-0.057
 so-0.056
 but-0.048
 boy-0.048
Token,
Token ID11
Feature activation-6.16e-03

Pos logit contributions
ross+0.007
rina+0.006
andra+0.006
ation+0.006
iet+0.006
Neg logit contributions
 sad-0.003
 very-0.003
 so-0.003
 but-0.002
 boy-0.002
Token pointing
Token ID10609
Feature activation-2.11e-02

Pos logit contributions
ross+0.015
rina+0.014
ation+0.014
andra+0.013
iet+0.013
Neg logit contributions
 sad-0.009
 so-0.007
 very-0.007
 but-0.007
 boy-0.006
Token at
Token ID379
Feature activation-8.20e-03

Pos logit contributions
ross+0.061
rina+0.057
ation+0.054
andra+0.054
ressive+0.053
Neg logit contributions
 sad-0.020
 very-0.016
 so-0.015
 but-0.013
 boy-0.013
Token the
Token ID262
Feature activation+3.27e-03

Pos logit contributions
ross+0.021
rina+0.020
andra+0.019
ation+0.019
ressive+0.019
Neg logit contributions
 sad-0.008
 very-0.006
 so-0.006
 boy-0.005
s-0.005
Token big
Token ID1263
Feature activation+6.50e-03

Pos logit contributions
 sad+0.001
 were+0.001
 so+0.001
 cones+0.001
s+0.001
Neg logit contributions
ross-0.002
rina-0.002
andra-0.002
eh-0.002
ation-0.002
Token book
Token ID1492
Feature activation-7.36e-03

Pos logit contributions
 sad+0.008
s+0.007
 so+0.006
 were+0.006
 Some+0.006
Neg logit contributions
ross-0.013
rina-0.013
andra-0.012
eh-0.012
ation-0.012
Token with
Token ID351
Feature activation-2.04e-02

Pos logit contributions
ross+0.020
rina+0.019
andra+0.019
ation+0.018
ressive+0.018
Neg logit contributions
 sad-0.007
 very-0.006
 so-0.006
 but-0.005
 boy-0.005
Token pictures
Token ID5986
Feature activation-7.26e-03

Pos logit contributions
ross+0.053
rina+0.050
andra+0.048
ation+0.047
ressive+0.046
Neg logit contributions
 sad-0.021
 very-0.016
 so-0.016
 boy-0.015
 but-0.015
Token of
Token ID286
Feature activation-5.82e-03

Pos logit contributions
ross+0.021
rina+0.020
andra+0.019
ation+0.019
iet+0.018
Neg logit contributions
 sad-0.006
 so-0.005
 very-0.005
 but-0.004
 were-0.004
Token volcan
Token ID17516
Feature activation-5.66e-02

Pos logit contributions
ross+0.013
rina+0.013
andra+0.012
ation+0.012
ressive+0.012
Neg logit contributions
 sad-0.007
 so-0.006
 very-0.006
 boy-0.005
 ladder-0.005
Tokenoes
Token ID3028
Feature activation-1.92e-02

Pos logit contributions
ross+0.128
rina+0.116
andra+0.113
ation+0.110
ressive+0.107
Neg logit contributions
 sad-0.078
 very-0.068
 so-0.068
 boy-0.063
 but-0.063
Token.
Token ID13
Feature activation+9.89e-03

Pos logit contributions
ross+0.050
rina+0.046
andra+0.046
ation+0.044
iet+0.043
Neg logit contributions
 sad-0.022
 so-0.019
 very-0.019
 but-0.018
 were-0.017
Token Vol
Token ID4709
Feature activation-1.83e-02

Pos logit contributions
 sad+0.013
 very+0.011
 penn+0.010
 kay+0.009
 so+0.009
Neg logit contributions
andra-0.020
ross-0.019
ife-0.017
rina-0.017
anne-0.017
Tokencan
Token ID5171
Feature activation+7.98e-03

Pos logit contributions
ross+0.040
rina+0.038
andra+0.036
eh+0.035
ation+0.035
Neg logit contributions
 sad-0.023
 were-0.021
s-0.021
 boy-0.020
 very-0.020
Tokenoes
Token ID3028
Feature activation-3.91e-03

Pos logit contributions
 sad+0.010
 very+0.009
 so+0.009
 boy+0.008
 but+0.008
Neg logit contributions
ross-0.019
rina-0.017
andra-0.017
ation-0.016
ressive-0.016
Token"
Token ID1
Feature activation-8.04e-03

Pos logit contributions
ross+0.001
rina+0.001
ation+0.001
andra+0.001
ressive+0.001
Neg logit contributions
 sad-0.000
 very-0.000
 so-0.000
 boy-0.000
 were-0.000
TokenWhat
Token ID2061
Feature activation-2.70e-02

Pos logit contributions
ross+0.020
rina+0.019
isdom+0.018
ation+0.018
inkle+0.017
Neg logit contributions
 sad-0.008
 boy-0.008
 were-0.008
 so-0.008
 but-0.007
Token's
Token ID338
Feature activation-1.97e-02

Pos logit contributions
ross+0.077
rina+0.072
andra+0.070
ation+0.069
ressive+0.067
Neg logit contributions
 sad-0.022
 very-0.017
 so-0.016
 boy-0.015
 but-0.014
Token that
Token ID326
Feature activation-8.56e-03

Pos logit contributions
ross+0.051
rina+0.048
andra+0.047
rene+0.045
ita+0.044
Neg logit contributions
 sad-0.027
 very-0.025
 so-0.025
 that-0.020
 round-0.019
Token?"
Token ID1701
Feature activation-1.37e-02

Pos logit contributions
ross+0.019
rina+0.018
andra+0.017
ation+0.016
ressive+0.016
Neg logit contributions
 sad-0.013
 so-0.012
 very-0.012
 boy-0.011
 but-0.011
Token she
Token ID673
Feature activation-5.65e-02

Pos logit contributions
ross+0.031
rina+0.028
andra+0.028
ation+0.027
ressive+0.026
Neg logit contributions
 sad-0.018
 very-0.015
 so-0.015
 boy-0.014
 cones-0.014
Token said
Token ID531
Feature activation-1.41e-02

Pos logit contributions
ross+0.136
rina+0.122
ation+0.117
ressive+0.116
andra+0.116
Neg logit contributions
 sad-0.079
 very-0.067
 so-0.067
 boy-0.062
 but-0.060
Token.
Token ID13
Feature activation-1.14e-03

Pos logit contributions
ross+0.037
rina+0.034
andra+0.033
ation+0.033
ressive+0.032
Neg logit contributions
 sad-0.016
 very-0.013
 so-0.013
 but-0.012
 boy-0.012
Token She
Token ID1375
Feature activation-2.82e-02

Pos logit contributions
ross+0.003
rina+0.003
andra+0.003
ation+0.003
ressive+0.003
Neg logit contributions
 sad-0.001
 very-0.001
 so-0.001
 boy-0.000
 cones-0.000
Token put
Token ID1234
Feature activation+1.36e-02

Pos logit contributions
ross+0.081
ation+0.073
rina+0.071
ife+0.071
ressive+0.071
Neg logit contributions
 sad-0.028
 very-0.026
 so-0.024
 were-0.021
 boy-0.020
Token her
Token ID607
Feature activation+1.61e-02

Pos logit contributions
 sad+0.014
 very+0.011
 so+0.011
 boy+0.010
 but+0.010
Neg logit contributions
ross-0.037
rina-0.035
andra-0.033
ation-0.033
ressive-0.032
Token What
Token ID1867
Feature activation-1.19e-02

Pos logit contributions
 sad+0.012
 very+0.009
 so+0.009
 boy+0.009
 but+0.008
Neg logit contributions
ross-0.032
rina-0.030
andra-0.029
ation-0.028
ressive-0.028
Token do
Token ID466
Feature activation+3.91e-04

Pos logit contributions
ross+0.035
rina+0.033
andra+0.032
ation+0.031
ressive+0.031
Neg logit contributions
 sad-0.008
 very-0.006
 so-0.006
 boy-0.005
 but-0.005
Token you
Token ID345
Feature activation-1.40e-02

Pos logit contributions
 sad+0.000
 very+0.000
 so+0.000
 boy+0.000
 but+0.000
Neg logit contributions
ross-0.001
rina-0.001
andra-0.001
ation-0.001
ressive-0.001
Token want
Token ID765
Feature activation-1.95e-03

Pos logit contributions
ross+0.042
rina+0.039
andra+0.038
ation+0.037
ician+0.037
Neg logit contributions
 sad-0.013
 very-0.010
 so-0.009
 were-0.009
 are-0.009
Token?"
Token ID1701
Feature activation-1.31e-02

Pos logit contributions
ross+0.006
rina+0.006
andra+0.006
ation+0.006
ician+0.005
Neg logit contributions
 sad-0.001
 so-0.001
 very-0.001
 but-0.001
 were-0.001
Token she
Token ID673
Feature activation-5.64e-02

Pos logit contributions
andra+0.026
ross+0.025
rina+0.024
anne+0.023
ita+0.022
Neg logit contributions
 sad-0.013
 so-0.012
 cones-0.012
ocular-0.012
 very-0.012
Token asks
Token ID7893
Feature activation-6.75e-03

Pos logit contributions
ross+0.128
rina+0.113
andra+0.109
ation+0.107
ressive+0.106
Neg logit contributions
 sad-0.086
 very-0.073
 so-0.072
 boy-0.070
 but-0.067
Token.
Token ID13
Feature activation-6.18e-03

Pos logit contributions
ross+0.013
rina+0.012
andra+0.012
ation+0.011
ressive+0.011
Neg logit contributions
 sad-0.010
 so-0.009
 very-0.009
 boy-0.009
 ladder-0.008
Token
Token ID198
Feature activation+2.86e-03

Pos logit contributions
ross+0.017
andra+0.016
rina+0.016
ation+0.014
end+0.014
Neg logit contributions
 sad-0.005
 so-0.003
 very-0.003
 boy-0.003
 cones-0.003
Token
Token ID198
Feature activation+7.04e-03

Pos logit contributions
 sad+0.002
 so+0.002
 very+0.002
 boy+0.002
 but+0.002
Neg logit contributions
ross-0.008
rina-0.007
andra-0.007
ation-0.007
end-0.007
Token"
Token ID1
Feature activation+1.78e-02

Pos logit contributions
 sad+0.011
 so+0.010
 very+0.010
 but+0.009
 boy+0.009
Neg logit contributions
ross-0.014
rina-0.013
andra-0.012
ation-0.012
eh-0.012
Token kinds
Token ID6982
Feature activation+3.01e-02

Pos logit contributions
ross+0.001
rina+0.001
ation+0.001
andra+0.001
eh+0.001
Neg logit contributions
 sad-0.001
 but-0.001
 boy-0.001
 so-0.001
 very-0.001
Token of
Token ID286
Feature activation-4.72e-03

Pos logit contributions
 sad+0.035
 ladder+0.033
 boy+0.031
 hill+0.030
 kay+0.027
Neg logit contributions
ross-0.063
end-0.058
eh-0.055
ils-0.053
isdom-0.052
Token ear
Token ID1027
Feature activation-1.44e-02

Pos logit contributions
ross+0.008
rina+0.008
eh+0.008
isdom+0.008
ation+0.008
Neg logit contributions
 sad-0.006
 ladder-0.006
 hill-0.006
s-0.005
 boy-0.005
Tokenrings
Token ID33173
Feature activation-9.26e-03

Pos logit contributions
rina+0.025
ross+0.025
andra+0.024
ita+0.021
ation+0.021
Neg logit contributions
 sad-0.024
 boy-0.021
 ladder-0.020
 hill-0.019
 very-0.019
Token and
Token ID290
Feature activation-4.57e-03

Pos logit contributions
ross+0.024
rina+0.022
andra+0.021
ation+0.020
ressive+0.020
Neg logit contributions
 sad-0.009
 boy-0.007
 ladder-0.006
 cones-0.005
 very-0.005
Token brace
Token ID22581
Feature activation-5.59e-02

Pos logit contributions
ross+0.009
rina+0.009
andra+0.009
ation+0.008
ita+0.008
Neg logit contributions
 sad-0.006
 ladder-0.005
s-0.005
 boy-0.004
 hill-0.004
Tokenlets
Token ID5289
Feature activation-1.21e-02

Pos logit contributions
ross+0.094
rina+0.083
andra+0.079
ation+0.078
ressive+0.074
Neg logit contributions
 sad-0.110
 very-0.099
 so-0.099
s-0.095
 boy-0.094
Token,
Token ID11
Feature activation+5.90e-03

Pos logit contributions
ross+0.030
rina+0.028
andra+0.026
ation+0.026
ressive+0.025
Neg logit contributions
 sad-0.014
 boy-0.011
 very-0.011
 so-0.011
 ladder-0.010
Token and
Token ID290
Feature activation+7.31e-03

Pos logit contributions
 sad+0.005
s+0.004
 ladder+0.004
 boy+0.004
 cones+0.003
Neg logit contributions
ross-0.014
rina-0.014
andra-0.013
eh-0.013
ressive-0.013
Token he
Token ID339
Feature activation+8.57e-03

Pos logit contributions
 sad+0.008
 very+0.005
 ladder+0.005
s+0.005
 so+0.005
Neg logit contributions
ross-0.018
rina-0.017
andra-0.016
ation-0.016
ressive-0.016
Token really
Token ID1107
Feature activation+3.25e-03

Pos logit contributions
 sad+0.011
 so+0.009
 very+0.009
 but+0.009
 boy+0.009
Neg logit contributions
ross-0.019
rina-0.018
andra-0.017
ation-0.017
ressive-0.016
Token did
Token ID750
Feature activation-5.49e-03

Pos logit contributions
ross+0.046
rina+0.043
andra+0.041
ation+0.040
ressive+0.040
Neg logit contributions
 sad-0.019
 so-0.015
 very-0.015
 boy-0.014
 but-0.014
Token you
Token ID345
Feature activation-2.05e-02

Pos logit contributions
ross+0.017
rina+0.016
andra+0.016
iet+0.015
eh+0.015
Neg logit contributions
 sad-0.004
 so-0.003
 very-0.003
 but-0.002
 boy-0.002
Token find
Token ID1064
Feature activation-2.87e-02

Pos logit contributions
ross+0.057
rina+0.053
andra+0.050
anne+0.049
ician+0.049
Neg logit contributions
 sad-0.022
 so-0.019
 very-0.018
 were-0.017
 but-0.016
Token this
Token ID428
Feature activation-1.45e-02

Pos logit contributions
ross+0.087
rina+0.080
andra+0.077
ation+0.076
iet+0.075
Neg logit contributions
 sad-0.021
 so-0.018
 very-0.018
 but-0.014
 were-0.012
Token?"
Token ID1701
Feature activation-1.71e-02

Pos logit contributions
ross+0.028
rina+0.026
andra+0.024
ation+0.023
iet+0.023
Neg logit contributions
 sad-0.026
 very-0.023
 so-0.023
 boy-0.022
 but-0.022
Token she
Token ID673
Feature activation-5.54e-02

Pos logit contributions
ross+0.041
rina+0.039
andra+0.038
ation+0.036
iet+0.035
Neg logit contributions
 sad-0.019
 very-0.016
 so-0.016
 boy-0.014
 but-0.014
Token asks
Token ID7893
Feature activation-2.35e-03

Pos logit contributions
ross+0.133
rina+0.118
andra+0.113
ressive+0.113
ation+0.112
Neg logit contributions
 sad-0.081
 very-0.066
 so-0.066
 boy-0.065
 but-0.062
Token.
Token ID13
Feature activation-7.39e-03

Pos logit contributions
ross+0.005
rina+0.005
andra+0.005
ation+0.004
ressive+0.004
Neg logit contributions
 sad-0.003
 very-0.003
 so-0.003
 boy-0.003
 but-0.003
Token
Token ID198
Feature activation-2.30e-03

Pos logit contributions
ross+0.020
rina+0.018
andra+0.018
end+0.017
anne+0.017
Neg logit contributions
 sad-0.006
 so-0.005
 very-0.004
 penn-0.004
 cones-0.004
Token
Token ID198
Feature activation-5.31e-04

Pos logit contributions
ross+0.007
rina+0.006
andra+0.006
ation+0.006
end+0.006
Neg logit contributions
 sad-0.002
 so-0.001
 very-0.001
 boy-0.001
 but-0.001
Token"
Token ID1
Feature activation+9.87e-03

Pos logit contributions
ross+0.001
rina+0.001
andra+0.001
ation+0.001
ressive+0.001
Neg logit contributions
 sad-0.001
 very-0.001
 so-0.001
 boy-0.001
 but-0.001
Token is
Token ID318
Feature activation-3.79e-03

Pos logit contributions
ross+0.028
rina+0.026
andra+0.025
ressive+0.024
ation+0.024
Neg logit contributions
 sad-0.006
 very-0.005
 so-0.005
 boy-0.005
 were-0.004
Token fun
Token ID1257
Feature activation+5.56e-04

Pos logit contributions
ross+0.012
andra+0.011
rina+0.011
ation+0.010
eman+0.010
Neg logit contributions
 so-0.003
 very-0.003
 sad-0.003
 but-0.002
 old-0.002
Token.
Token ID13
Feature activation-4.03e-04

Pos logit contributions
 so+0.001
 sad+0.001
 very+0.001
 but+0.001
 boy+0.001
Neg logit contributions
ross-0.001
andra-0.001
rina-0.001
eh-0.001
ation-0.001
Token I
Token ID314
Feature activation+7.19e-03

Pos logit contributions
ross+0.001
rina+0.001
andra+0.001
ation+0.001
ressive+0.001
Neg logit contributions
 sad-0.000
 very-0.000
 so-0.000
 boy-0.000
 but-0.000
Token like
Token ID588
Feature activation+1.43e-02

Pos logit contributions
 sad+0.006
 very+0.005
 were+0.004
 so+0.004
 boy+0.004
Neg logit contributions
ross-0.022
rina-0.020
andra-0.019
ation-0.019
ressive-0.019
Token aer
Token ID9551
Feature activation-5.53e-02

Pos logit contributions
 sad+0.012
 so+0.010
 very+0.010
 boy+0.008
 but+0.008
Neg logit contributions
ross-0.040
rina-0.038
andra-0.037
ation-0.036
ressive-0.035
Tokenopl
Token ID20106
Feature activation-3.11e-02

Pos logit contributions
ross+0.073
rina+0.063
andra+0.056
ation+0.055
iet+0.052
Neg logit contributions
 sad-0.123
 very-0.115
 so-0.114
 cones-0.111
s-0.110
Tokenanes
Token ID7305
Feature activation-1.67e-02

Pos logit contributions
ross+0.064
rina+0.060
andra+0.056
ation+0.055
eh+0.054
Neg logit contributions
 sad-0.047
 very-0.041
 so-0.041
s-0.040
 cones-0.039
Token."
Token ID526
Feature activation+4.72e-03

Pos logit contributions
ross+0.045
rina+0.042
andra+0.041
ation+0.040
ressive+0.039
Neg logit contributions
 sad-0.017
 very-0.014
 so-0.014
 but-0.012
 boy-0.012
Token
Token ID198
Feature activation+5.21e-03

Pos logit contributions
 sad+0.005
 very+0.004
 cones+0.004
 penn+0.004
 kay+0.004
Neg logit contributions
andra-0.010
ross-0.010
rina-0.009
anne-0.009
ife-0.009
Token
Token ID198
Feature activation+3.45e-03

Pos logit contributions
 sad+0.005
 so+0.003
 very+0.003
 cones+0.003
 boy+0.003
Neg logit contributions
ross-0.013
rina-0.013
andra-0.013
end-0.012
ation-0.012
Token Emma
Token ID18966
Feature activation+4.94e-03

Pos logit contributions
ross+0.006
rina+0.006
andra+0.005
ation+0.005
ressive+0.005
Neg logit contributions
 sad-0.002
 very-0.001
 so-0.001
 boy-0.001
 but-0.001
Token who
Token ID508
Feature activation-4.32e-05

Pos logit contributions
 sad+0.005
 very+0.005
 so+0.005
 were+0.004
 but+0.004
Neg logit contributions
ross-0.014
rina-0.012
andra-0.012
ation-0.012
ressive-0.012
Token loved
Token ID6151
Feature activation+8.25e-03

Pos logit contributions
ross+0.000
rina+0.000
andra+0.000
ation+0.000
ressive+0.000
Neg logit contributions
 sad-0.000
 very-0.000
 so-0.000
 boy-0.000
 were-0.000
Token learning
Token ID4673
Feature activation-1.52e-02

Pos logit contributions
 sad+0.005
 very+0.003
 so+0.003
 boy+0.003
 but+0.002
Neg logit contributions
ross-0.026
rina-0.024
andra-0.023
ation-0.023
ressive-0.023
Token about
Token ID546
Feature activation-1.82e-02

Pos logit contributions
ross+0.041
rina+0.039
andra+0.038
ation+0.037
ressive+0.036
Neg logit contributions
 sad-0.015
 very-0.012
 so-0.012
 but-0.010
 boy-0.010
Token outer
Token ID12076
Feature activation-5.53e-02

Pos logit contributions
ross+0.048
rina+0.044
andra+0.043
ation+0.042
ressive+0.041
Neg logit contributions
 sad-0.020
 very-0.016
 so-0.016
 but-0.014
 boy-0.014
Token space
Token ID2272
Feature activation-2.16e-02

Pos logit contributions
ross+0.135
rina+0.125
andra+0.121
ation+0.118
ressive+0.115
Neg logit contributions
 sad-0.067
 very-0.057
 so-0.056
 boy-0.053
 but-0.051
Token.
Token ID13
Feature activation+2.73e-04

Pos logit contributions
ross+0.057
andra+0.054
rina+0.053
ation+0.050
ressive+0.049
Neg logit contributions
 sad-0.023
 very-0.021
 so-0.020
 but-0.018
 boy-0.017
Token Everyday
Token ID48154
Feature activation+2.00e-02

Pos logit contributions
 sad+0.000
 very+0.000
 so+0.000
 boy+0.000
 cones+0.000
Neg logit contributions
ross-0.001
rina-0.001
andra-0.001
ation-0.001
ressive-0.001
Token,
Token ID11
Feature activation-3.60e-03

Pos logit contributions
 sad+0.018
 so+0.013
 cones+0.012
 very+0.012
 ladder+0.012
Neg logit contributions
ross-0.052
rina-0.049
andra-0.047
ation-0.046
ressive-0.045
Token she
Token ID673
Feature activation-5.88e-04

Pos logit contributions
ross+0.007
rina+0.007
andra+0.007
ation+0.006
ressive+0.006
Neg logit contributions
 sad-0.004
 cones-0.004
s-0.004
 ladder-0.004
 kay-0.004
Token and
Token ID290
Feature activation-1.73e-02

Pos logit contributions
ross+0.023
rina+0.021
andra+0.020
iet+0.019
ation+0.019
Neg logit contributions
 sad-0.013
 so-0.011
 very-0.011
 boy-0.010
 were-0.010
Token that
Token ID326
Feature activation-1.17e-02

Pos logit contributions
ross+0.046
rina+0.042
andra+0.041
eh+0.039
ation+0.039
Neg logit contributions
 sad-0.020
 so-0.018
 very-0.017
 boy-0.016
 but-0.015
Token lady
Token ID10846
Feature activation-7.23e-03

Pos logit contributions
ross+0.030
rina+0.027
andra+0.027
iet+0.025
ation+0.025
Neg logit contributions
 sad-0.016
 so-0.013
 very-0.013
 boy-0.012
 but-0.012
Token separate
Token ID4553
Feature activation-5.63e-03

Pos logit contributions
ross+0.017
andra+0.015
rina+0.015
iet+0.015
ician+0.014
Neg logit contributions
 sad-0.011
 so-0.009
 very-0.009
 were-0.008
 boy-0.008
Token?"
Token ID1701
Feature activation-2.43e-02

Pos logit contributions
ross+0.015
rina+0.013
andra+0.013
ation+0.013
ressive+0.012
Neg logit contributions
 sad-0.006
 so-0.005
 very-0.005
 boy-0.005
 but-0.005
Token he
Token ID339
Feature activation-5.50e-02

Pos logit contributions
ross+0.055
rina+0.051
andra+0.049
ation+0.048
ressive+0.046
Neg logit contributions
 sad-0.033
 very-0.028
 so-0.027
 boy-0.026
 but-0.025
Token asked
Token ID1965
Feature activation-7.82e-03

Pos logit contributions
ross+0.146
ressive+0.128
rina+0.127
ation+0.126
aghetti+0.124
Neg logit contributions
 sad-0.071
 very-0.059
 so-0.057
 boy-0.049
 but-0.049
Token,
Token ID11
Feature activation-3.47e-03

Pos logit contributions
ross+0.020
rina+0.018
andra+0.018
ation+0.018
iet+0.017
Neg logit contributions
 sad-0.010
 very-0.008
 so-0.008
 but-0.007
 boy-0.007
Token pointing
Token ID10609
Feature activation-1.75e-02

Pos logit contributions
ross+0.009
rina+0.008
ation+0.008
andra+0.008
iet+0.008
Neg logit contributions
 sad-0.005
 so-0.004
 very-0.004
 but-0.004
 boy-0.003
Token at
Token ID379
Feature activation-8.00e-03

Pos logit contributions
ross+0.048
rina+0.045
andra+0.043
ation+0.043
ressive+0.042
Neg logit contributions
 sad-0.018
 very-0.015
 so-0.015
 but-0.013
 boy-0.012
Token two
Token ID734
Feature activation-1.36e-02

Pos logit contributions
ross+0.020
rina+0.019
andra+0.018
ation+0.018
ressive+0.018
Neg logit contributions
 sad-0.008
 very-0.007
 so-0.006
 boy-0.006
 ladder-0.006
Token big
Token ID1263
Feature activation+7.07e-03

Pos logit contributions
 sad+0.020
 so+0.017
 were+0.017
 cones+0.017
s+0.017
Neg logit contributions
ross-0.050
rina-0.046
eh-0.045
andra-0.045
iet-0.044
Token,
Token ID11
Feature activation+2.47e-02

Pos logit contributions
 sad+0.006
s+0.006
 Some+0.005
 were+0.005
 so+0.005
Neg logit contributions
ross-0.016
andra-0.015
rina-0.015
iet-0.014
ation-0.014
Token white
Token ID2330
Feature activation+7.43e-03

Pos logit contributions
 sad+0.024
s+0.022
 were+0.021
 penn+0.021
 cones+0.021
Neg logit contributions
ross-0.054
eh-0.050
andra-0.049
rina-0.049
ation-0.048
Token pup
Token ID15552
Feature activation+2.16e-02

Pos logit contributions
 sad+0.009
 so+0.007
 very+0.007
 were+0.006
s+0.006
Neg logit contributions
ross-0.018
rina-0.016
andra-0.016
ation-0.016
iet-0.016
Token with
Token ID351
Feature activation-1.45e-02

Pos logit contributions
 sad+0.028
 very+0.024
 so+0.024
 but+0.022
 boy+0.022
Neg logit contributions
ross-0.053
rina-0.049
andra-0.048
ation-0.046
ressive-0.045
Token floppy
Token ID49415
Feature activation-5.47e-02

Pos logit contributions
ross+0.036
rina+0.033
andra+0.032
ation+0.032
iet+0.031
Neg logit contributions
 sad-0.015
 boy-0.012
 ladder-0.011
 very-0.011
 so-0.011
Token ears
Token ID11368
Feature activation-1.50e-02

Pos logit contributions
ross+0.139
rina+0.127
andra+0.121
ation+0.120
ressive+0.118
Neg logit contributions
 sad-0.068
 so-0.056
 very-0.056
 but-0.053
 boy-0.051
Token and
Token ID290
Feature activation-2.64e-02

Pos logit contributions
ross+0.043
rina+0.040
andra+0.039
ation+0.039
ressive+0.038
Neg logit contributions
 sad-0.013
 very-0.010
 so-0.010
 but-0.008
 were-0.008
Token a
Token ID257
Feature activation-1.22e-02

Pos logit contributions
ross+0.070
rina+0.066
andra+0.063
ation+0.062
ressive+0.061
Neg logit contributions
 sad-0.028
 very-0.023
 so-0.023
 boy-0.021
 but-0.020
Token curly
Token ID45731
Feature activation-3.86e-02

Pos logit contributions
ross+0.030
rina+0.028
andra+0.027
ation+0.026
iet+0.026
Neg logit contributions
 sad-0.012
 so-0.010
 very-0.010
 boy-0.009
 were-0.009
Token tail
Token ID7894
Feature activation-6.57e-03

Pos logit contributions
ross+0.092
rina+0.087
ation+0.080
andra+0.080
ressive+0.080
Neg logit contributions
 sad-0.055
 very-0.047
 so-0.045
 but-0.044
 boy-0.042
Token"
Token ID1
Feature activation-1.13e-03

Pos logit contributions
 sad+0.000
 so+0.000
 very+0.000
 but+0.000
 boy+0.000
Neg logit contributions
ross-0.000
rina-0.000
andra-0.000
ation-0.000
iet-0.000
TokenWhat
Token ID2061
Feature activation-2.69e-02

Pos logit contributions
ross+0.003
rina+0.003
andra+0.002
ation+0.002
eh+0.002
Neg logit contributions
 sad-0.001
 boy-0.001
 very-0.001
 so-0.001
 but-0.001
Token is
Token ID318
Feature activation-2.29e-02

Pos logit contributions
ross+0.072
rina+0.067
andra+0.065
ation+0.064
ressive+0.063
Neg logit contributions
 sad-0.025
 so-0.019
 very-0.019
 boy-0.017
 but-0.017
Token that
Token ID326
Feature activation-7.29e-03

Pos logit contributions
ross+0.064
rina+0.060
andra+0.058
ation+0.057
iet+0.056
Neg logit contributions
 sad-0.021
 very-0.017
 so-0.017
 boy-0.014
 but-0.014
Token?"
Token ID1701
Feature activation-1.17e-02

Pos logit contributions
ross+0.016
rina+0.015
andra+0.014
ation+0.014
eh+0.014
Neg logit contributions
 sad-0.011
 so-0.010
 very-0.010
 boy-0.010
 but-0.009
Token he
Token ID339
Feature activation-5.45e-02

Pos logit contributions
ross+0.027
andra+0.025
rina+0.025
ation+0.023
eh+0.023
Neg logit contributions
 sad-0.012
 very-0.010
 so-0.010
 cones-0.010
s-0.010
Token asks
Token ID7893
Feature activation+4.37e-03

Pos logit contributions
ross+0.131
rina+0.116
andra+0.111
ressive+0.110
ation+0.109
Neg logit contributions
 sad-0.079
 very-0.066
 so-0.065
 boy-0.063
 but-0.060
Token.
Token ID13
Feature activation-5.14e-03

Pos logit contributions
 sad+0.006
 very+0.005
 so+0.005
 boy+0.005
 but+0.005
Neg logit contributions
ross-0.010
rina-0.009
andra-0.009
ation-0.009
ressive-0.009
Token
Token ID198
Feature activation+1.76e-03

Pos logit contributions
ross+0.014
rina+0.013
andra+0.013
eh+0.012
ation+0.012
Neg logit contributions
 sad-0.004
 very-0.003
 so-0.003
 penn-0.003
 were-0.003
Token
Token ID198
Feature activation+2.14e-03

Pos logit contributions
 sad+0.001
 so+0.001
 very+0.001
 boy+0.001
 but+0.001
Neg logit contributions
ross-0.005
rina-0.005
andra-0.005
ation-0.005
iet-0.004
Token"
Token ID1
Feature activation+7.87e-03

Pos logit contributions
 sad+0.003
 very+0.003
 so+0.003
 boy+0.003
 but+0.003
Neg logit contributions
ross-0.005
rina-0.004
andra-0.004
ation-0.004
ressive-0.004
Token What
Token ID1867
Feature activation-1.28e-02

Pos logit contributions
 sad+0.014
 very+0.011
 so+0.011
 boy+0.010
 but+0.010
Neg logit contributions
ross-0.029
rina-0.027
andra-0.026
ation-0.026
ressive-0.026
Token did
Token ID750
Feature activation-3.07e-03

Pos logit contributions
ross+0.035
rina+0.033
andra+0.032
ation+0.031
ressive+0.031
Neg logit contributions
 sad-0.011
 so-0.008
 very-0.008
 boy-0.007
 but-0.007
Token you
Token ID345
Feature activation-1.65e-02

Pos logit contributions
ross+0.009
rina+0.008
andra+0.008
ation+0.008
iet+0.008
Neg logit contributions
 sad-0.003
 very-0.002
 so-0.002
 boy-0.002
 but-0.002
Token create
Token ID2251
Feature activation-2.36e-02

Pos logit contributions
ross+0.052
rina+0.049
andra+0.047
ation+0.046
anne+0.046
Neg logit contributions
 sad-0.012
 very-0.009
 so-0.008
 were-0.006
 but-0.006
Token?"
Token ID1701
Feature activation-1.93e-02

Pos logit contributions
ross+0.053
rina+0.049
andra+0.047
anne+0.046
ician+0.044
Neg logit contributions
 sad-0.037
 so-0.036
 very-0.035
 but-0.032
 round-0.029
Token she
Token ID673
Feature activation-5.45e-02

Pos logit contributions
ross+0.039
andra+0.039
rina+0.038
ita+0.036
iet+0.035
Neg logit contributions
 sad-0.020
 cones-0.020
 so-0.019
 very-0.019
 ladder-0.018
Token asks
Token ID7893
Feature activation-5.91e-03

Pos logit contributions
ross+0.118
rina+0.106
andra+0.102
ation+0.100
ressive+0.098
Neg logit contributions
 sad-0.085
 very-0.073
 so-0.073
 boy-0.070
 but-0.068
Token,
Token ID11
Feature activation-6.74e-03

Pos logit contributions
ross+0.012
rina+0.011
andra+0.011
ation+0.010
ressive+0.010
Neg logit contributions
 sad-0.009
 so-0.008
 very-0.008
 boy-0.007
 ladder-0.007
Token looking
Token ID2045
Feature activation-2.27e-02

Pos logit contributions
ross+0.017
rina+0.016
andra+0.015
ation+0.015
ressive+0.015
Neg logit contributions
 sad-0.008
 very-0.006
 so-0.006
 but-0.006
 boy-0.006
Token at
Token ID379
Feature activation-7.31e-03

Pos logit contributions
ross+0.052
rina+0.048
andra+0.046
ation+0.045
iet+0.044
Neg logit contributions
 sad-0.029
 so-0.025
 very-0.025
 boy-0.024
 but-0.023
Token the
Token ID262
Feature activation-6.03e-03

Pos logit contributions
ross+0.019
rina+0.018
andra+0.018
ation+0.017
ressive+0.017
Neg logit contributions
 sad-0.007
 very-0.005
 so-0.005
 boy-0.005
 but-0.005
Token
Token ID198
Feature activation+2.14e-03

Pos logit contributions
ross+0.021
andra+0.020
rina+0.020
ation+0.019
ressive+0.019
Neg logit contributions
 sad-0.008
 very-0.006
 so-0.005
 boy-0.005
 penn-0.005
Token
Token ID198
Feature activation-1.18e-02

Pos logit contributions
 sad+0.002
 very+0.001
 so+0.001
 boy+0.001
 penn+0.001
Neg logit contributions
ross-0.006
rina-0.006
andra-0.006
ation-0.005
iet-0.005
TokenAt
Token ID2953
Feature activation-1.35e-02

Pos logit contributions
ross+0.026
rina+0.025
ressive+0.022
ation+0.022
andra+0.022
Neg logit contributions
 sad-0.017
 so-0.015
 very-0.015
 but-0.015
 were-0.014
Token the
Token ID262
Feature activation+6.52e-03

Pos logit contributions
ross+0.036
rina+0.034
andra+0.034
ation+0.033
iet+0.032
Neg logit contributions
 sad-0.012
 very-0.009
 so-0.008
 boy-0.008
 but-0.008
Token bar
Token ID2318
Feature activation+1.81e-02

Pos logit contributions
 sad+0.008
 so+0.006
 very+0.006
 but+0.006
 boy+0.006
Neg logit contributions
ross-0.016
rina-0.014
andra-0.014
ation-0.014
ressive-0.013
Tokenber
Token ID527
Feature activation+2.37e-02

Pos logit contributions
 sad+0.030
 so+0.026
 very+0.025
 round+0.024
 but+0.024
Neg logit contributions
ross-0.034
rina-0.031
andra-0.030
ation-0.029
ician-0.028
Token,
Token ID11
Feature activation-9.40e-03

Pos logit contributions
 sad+0.025
 very+0.020
 so+0.020
 boy+0.018
 but+0.017
Neg logit contributions
ross-0.062
rina-0.058
andra-0.056
ation-0.055
ressive-0.053
Token the
Token ID262
Feature activation+1.46e-02

Pos logit contributions
ross+0.024
rina+0.022
andra+0.021
ation+0.021
ressive+0.021
Neg logit contributions
 sad-0.010
 very-0.008
 so-0.008
 boy-0.007
 but-0.007
Token bar
Token ID2318
Feature activation+1.43e-02

Pos logit contributions
 sad+0.017
 so+0.014
 very+0.014
 were+0.013
s+0.013
Neg logit contributions
ross-0.034
rina-0.032
andra-0.030
ation-0.030
ressive-0.029
Tokenber
Token ID527
Feature activation+2.17e-02

Pos logit contributions
 sad+0.021
 so+0.018
 very+0.018
 but+0.017
 boy+0.017
Neg logit contributions
ross-0.031
rina-0.028
andra-0.027
ation-0.026
ressive-0.026
Token said
Token ID531
Feature activation+1.03e-02

Pos logit contributions
 sad+0.025
 very+0.022
 so+0.021
 boy+0.021
 were+0.020
Neg logit contributions
ross-0.057
rina-0.052
ation-0.049
andra-0.049
iet-0.049
Token big
Token ID1263
Feature activation+2.04e-02

Pos logit contributions
 sad+0.038
 so+0.031
 but+0.029
 boy+0.029
 were+0.029
Neg logit contributions
ross-0.078
rina-0.073
eh-0.071
andra-0.071
iet-0.068
Token bisc
Token ID36228
Feature activation+2.40e-02

Pos logit contributions
 sad+0.024
 so+0.020
 very+0.020
 boy+0.019
 were+0.018
Neg logit contributions
ross-0.047
rina-0.043
andra-0.042
eh-0.041
iet-0.040
Tokenuit
Token ID5013
Feature activation+2.12e-02

Pos logit contributions
 sad+0.055
 very+0.051
 so+0.051
 but+0.050
 boy+0.050
Neg logit contributions
ross-0.031
rina-0.028
ation-0.026
andra-0.025
ressive-0.024
Token on
Token ID319
Feature activation+3.29e-03

Pos logit contributions
 sad+0.017
 very+0.014
 so+0.014
 but+0.012
 boy+0.011
Neg logit contributions
ross-0.061
rina-0.058
andra-0.056
ation-0.055
ressive-0.054
Token a
Token ID257
Feature activation+1.46e-02

Pos logit contributions
 sad+0.003
 very+0.002
 so+0.002
 boy+0.002
 but+0.002
Neg logit contributions
ross-0.009
rina-0.008
andra-0.008
ation-0.008
ressive-0.008
Token plate
Token ID7480
Feature activation+2.99e-02

Pos logit contributions
 sad+0.019
 so+0.016
 very+0.015
 boy+0.015
 were+0.015
Neg logit contributions
ross-0.033
rina-0.030
andra-0.029
eh-0.029
ation-0.028
Token and
Token ID290
Feature activation-1.62e-02

Pos logit contributions
 sad+0.016
 so+0.013
 very+0.012
 but+0.010
 were+0.007
Neg logit contributions
ross-0.093
rina-0.090
andra-0.087
ation-0.086
ressive-0.085
Token placed
Token ID4624
Feature activation+1.17e-02

Pos logit contributions
ross+0.044
rina+0.040
ation+0.039
ressive+0.039
ician+0.037
Neg logit contributions
 sad-0.018
 so-0.018
 very-0.017
 were-0.016
 boy-0.015
Token it
Token ID340
Feature activation+1.28e-03

Pos logit contributions
 sad+0.009
 very+0.006
 so+0.006
 boy+0.005
 but+0.005
Neg logit contributions
ross-0.034
rina-0.032
andra-0.031
ation-0.031
ressive-0.030
Token in
Token ID287
Feature activation+5.77e-03

Pos logit contributions
 very+0.001
 sad+0.001
 so+0.001
 round+0.000
 were+0.000
Neg logit contributions
ross-0.004
andra-0.004
rina-0.004
isdom-0.004
ation-0.004
Token front
Token ID2166
Feature activation-7.64e-04

Pos logit contributions
 sad+0.006
 very+0.005
 so+0.005
 boy+0.004
 but+0.004
Neg logit contributions
ross-0.015
rina-0.014
andra-0.014
ation-0.014
ressive-0.013
Token any
Token ID597
Feature activation-1.13e-02

Pos logit contributions
ross+0.071
rina+0.065
iet+0.062
andra+0.062
ician+0.061
Neg logit contributions
 sad-0.029
 so-0.027
 very-0.025
 were-0.021
 many-0.020
Token money
Token ID1637
Feature activation+7.18e-03

Pos logit contributions
ross+0.028
rina+0.026
andra+0.025
ation+0.024
ressive+0.024
Neg logit contributions
 sad-0.015
 very-0.012
 so-0.012
 boy-0.011
 but-0.011
Token.
Token ID13
Feature activation-3.17e-03

Pos logit contributions
 sad+0.006
 so+0.006
 very+0.005
 but+0.005
 were+0.005
Neg logit contributions
ross-0.020
rina-0.020
andra-0.019
iet-0.018
eh-0.018
Token They
Token ID1119
Feature activation+3.27e-03

Pos logit contributions
ross+0.009
rina+0.009
andra+0.008
ation+0.008
ressive+0.008
Neg logit contributions
 sad-0.002
 very-0.002
 so-0.002
 boy-0.002
 but-0.001
Token stayed
Token ID9658
Feature activation+9.79e-03

Pos logit contributions
 sad+0.003
 very+0.003
 so+0.003
 were+0.003
 boy+0.002
Neg logit contributions
ross-0.009
rina-0.008
andra-0.008
ation-0.008
ressive-0.008
Token friends
Token ID2460
Feature activation+2.48e-02

Pos logit contributions
 sad+0.010
 very+0.008
 so+0.008
 but+0.007
 boy+0.007
Neg logit contributions
ross-0.026
rina-0.025
andra-0.024
ation-0.023
ressive-0.023
Token for
Token ID329
Feature activation-4.15e-03

Pos logit contributions
 sad+0.022
 so+0.017
 very+0.017
 but+0.015
 boy+0.014
Neg logit contributions
ross-0.070
rina-0.065
andra-0.063
ation-0.062
ressive-0.061
Token a
Token ID257
Feature activation-1.12e-04

Pos logit contributions
ross+0.010
rina+0.010
andra+0.009
ation+0.009
ressive+0.009
Neg logit contributions
 sad-0.005
 very-0.004
 so-0.004
 but-0.004
 boy-0.004
Token long
Token ID890
Feature activation-1.05e-02

Pos logit contributions
ross+0.000
rina+0.000
andra+0.000
ation+0.000
ressive+0.000
Neg logit contributions
 sad-0.000
 so-0.000
 very-0.000
 boy-0.000
 but-0.000
Token time
Token ID640
Feature activation+1.39e-02

Pos logit contributions
ross+0.020
rina+0.018
ressive+0.017
andra+0.016
ation+0.016
Neg logit contributions
 sad-0.019
 so-0.017
 very-0.017
 but-0.017
 boy-0.017
Token afterwards
Token ID12979
Feature activation+2.96e-03

Pos logit contributions
 sad+0.009
 so+0.007
 very+0.006
 but+0.006
 were+0.005
Neg logit contributions
ross-0.043
rina-0.041
andra-0.040
ation-0.040
iet-0.039
Token
Token ID198
Feature activation+1.20e-03

Pos logit contributions
 sad+0.003
 very+0.002
 so+0.002
 boy+0.001
 but+0.001
Neg logit contributions
ross-0.013
rina-0.012
andra-0.012
ation-0.012
ressive-0.011
TokenSpot
Token ID32565
Feature activation+4.94e-03

Pos logit contributions
 sad+0.001
 very+0.001
 so+0.001
 boy+0.001
+0.001
Neg logit contributions
ross-0.003
rina-0.003
andra-0.003
ressive-0.003
ation-0.003
Token found
Token ID1043
Feature activation-3.86e-03

Pos logit contributions
 sad+0.005
 very+0.004
 so+0.004
 boy+0.004
 but+0.004
Neg logit contributions
ross-0.013
rina-0.012
ation-0.012
andra-0.012
ressive-0.012
Token a
Token ID257
Feature activation+1.42e-02

Pos logit contributions
ross+0.009
rina+0.008
ation+0.007
andra+0.007
eh+0.007
Neg logit contributions
 sad-0.005
 ladder-0.004
s-0.004
 very-0.004
 boy-0.004
Token hidden
Token ID7104
Feature activation+4.28e-03

Pos logit contributions
 sad+0.014
 so+0.012
 were+0.011
s+0.011
 very+0.011
Neg logit contributions
ross-0.034
rina-0.031
andra-0.031
eh-0.030
ation-0.029
Token spot
Token ID4136
Feature activation+2.63e-02

Pos logit contributions
 sad+0.006
 very+0.005
 so+0.005
 boy+0.005
 but+0.005
Neg logit contributions
ross-0.010
rina-0.009
andra-0.009
ation-0.008
ressive-0.008
Token under
Token ID739
Feature activation+4.58e-03

Pos logit contributions
 sad+0.027
 very+0.022
 so+0.021
 boy+0.020
 but+0.019
Neg logit contributions
ross-0.068
rina-0.063
andra-0.061
ation-0.060
ressive-0.059
Token a
Token ID257
Feature activation-6.79e-04

Pos logit contributions
 sad+0.004
 so+0.003
 very+0.003
 boy+0.003
 but+0.003
Neg logit contributions
ross-0.013
rina-0.012
andra-0.012
ation-0.011
ressive-0.011
Token big
Token ID1263
Feature activation-3.14e-03

Pos logit contributions
ross+0.002
rina+0.002
eh+0.002
andra+0.002
iet+0.002
Neg logit contributions
 sad-0.001
 so-0.000
 were-0.000
 boy-0.000
 but-0.000
Token tree
Token ID5509
Feature activation+1.03e-02

Pos logit contributions
ross+0.008
eh+0.007
rina+0.007
andra+0.007
ation+0.007
Neg logit contributions
 sad-0.003
 very-0.002
 so-0.002
s-0.002
 were-0.002
Token.
Token ID13
Feature activation+7.86e-03

Pos logit contributions
 sad+0.013
 very+0.011
 so+0.011
 boy+0.011
 but+0.010
Neg logit contributions
ross-0.024
rina-0.022
andra-0.022
ation-0.021
ressive-0.021
Token
Token ID198
Feature activation+9.93e-03

Pos logit contributions
 sad+0.002
 very+0.001
 so+0.001
 but+0.001
 boy+0.001
Neg logit contributions
ross-0.017
rina-0.017
andra-0.016
ation-0.016
ressive-0.016
Token
Token ID198
Feature activation+5.87e-03

Pos logit contributions
 sad+0.005
 very+0.003
 so+0.003
 boy+0.003
 but+0.002
Neg logit contributions
ross-0.031
rina-0.030
andra-0.028
ation-0.028
ressive-0.028
TokenMax
Token ID11518
Feature activation-7.08e-03

Pos logit contributions
 sad+0.007
 very+0.006
 so+0.006
 boy+0.006
 but+0.006
Neg logit contributions
ross-0.014
rina-0.013
ation-0.012
andra-0.012
ressive-0.012
Token's
Token ID338
Feature activation-2.90e-03

Pos logit contributions
ross+0.019
rina+0.017
ation+0.016
ife+0.016
andra+0.016
Neg logit contributions
 sad-0.007
 so-0.007
 very-0.007
 were-0.006
 but-0.006
Token Mom
Token ID11254
Feature activation+2.61e-03

Pos logit contributions
ross+0.007
rina+0.007
andra+0.006
ation+0.006
ressive+0.006
Neg logit contributions
 sad-0.003
 very-0.003
 so-0.003
 boy-0.003
 but-0.002
Token sat
Token ID3332
Feature activation+2.18e-02

Pos logit contributions
 sad+0.003
 so+0.003
 very+0.003
 were+0.003
 but+0.003
Neg logit contributions
ross-0.007
ation-0.006
rina-0.006
eh-0.006
iet-0.006
Token on
Token ID319
Feature activation+1.92e-04

Pos logit contributions
 sad+0.025
 very+0.020
 so+0.020
 boy+0.019
 but+0.018
Neg logit contributions
ross-0.055
rina-0.051
andra-0.049
ation-0.048
ressive-0.047
Token the
Token ID262
Feature activation+1.35e-02

Pos logit contributions
 sad+0.000
 very+0.000
 so+0.000
 but+0.000
 boy+0.000
Neg logit contributions
ross-0.001
rina-0.001
andra-0.001
ation-0.001
ressive-0.001
Token nearby
Token ID6716
Feature activation+1.40e-02

Pos logit contributions
 sad+0.015
 so+0.012
 very+0.012
 boy+0.011
 were+0.011
Neg logit contributions
ross-0.034
rina-0.031
andra-0.030
ation-0.030
ressive-0.029
Token bench
Token ID7624
Feature activation+2.51e-02

Pos logit contributions
 sad+0.014
 very+0.011
 so+0.011
 boy+0.010
 but+0.010
Neg logit contributions
ross-0.038
rina-0.035
andra-0.034
ation-0.033
ressive-0.033
Token,
Token ID11
Feature activation+1.16e-02

Pos logit contributions
 sad+0.015
 so+0.011
 very+0.011
 but+0.009
 boy+0.008
Neg logit contributions
ross-0.077
rina-0.073
andra-0.071
ation-0.070
ressive-0.068
Token!
Token ID0
Feature activation+1.30e-02

Pos logit contributions
 sad+0.010
 so+0.009
 very+0.009
 but+0.008
 boy+0.008
Neg logit contributions
ross-0.018
rina-0.017
andra-0.016
ation-0.015
ressive-0.015
Token There
Token ID1318
Feature activation+5.62e-03

Pos logit contributions
 sad+0.015
 very+0.012
 so+0.012
 boy+0.011
 but+0.011
Neg logit contributions
ross-0.032
rina-0.029
andra-0.028
ation-0.028
ressive-0.028
Token's
Token ID338
Feature activation-2.04e-02

Pos logit contributions
 sad+0.005
 very+0.004
 so+0.004
 boy+0.004
 but+0.003
Neg logit contributions
ross-0.015
rina-0.014
ation-0.014
andra-0.013
ressive-0.013
Token a
Token ID257
Feature activation-3.56e-03

Pos logit contributions
ross+0.056
rina+0.052
andra+0.051
ation+0.049
ressive+0.048
Neg logit contributions
 sad-0.021
 very-0.017
 so-0.017
 but-0.015
 boy-0.014
Token big
Token ID1263
Feature activation-5.59e-03

Pos logit contributions
ross+0.009
rina+0.009
andra+0.009
ation+0.008
ressive+0.008
Neg logit contributions
 sad-0.004
 very-0.003
 so-0.003
 boy-0.003
 ladder-0.003
Token storm
Token ID6388
Feature activation-4.08e-03

Pos logit contributions
ross+0.012
rina+0.011
andra+0.011
ation+0.010
ressive+0.010
Neg logit contributions
 sad-0.008
 very-0.007
 so-0.007
 boy-0.006
 but-0.006
Token coming
Token ID2406
Feature activation+3.18e-03

Pos logit contributions
ross+0.010
andra+0.009
rina+0.009
isdom+0.009
wen+0.008
Neg logit contributions
 sad-0.005
 so-0.005
 very-0.005
 but-0.005
 boy-0.004
Token!"
Token ID2474
Feature activation+4.00e-03

Pos logit contributions
 sad+0.005
 so+0.005
 very+0.005
 but+0.004
 boy+0.004
Neg logit contributions
ross-0.007
andra-0.006
rina-0.006
ation-0.006
anne-0.006
Token
Token ID220
Feature activation-5.67e-03

Pos logit contributions
 sad+0.004
 very+0.002
 penn+0.002
 so+0.002
 round+0.002
Neg logit contributions
ross-0.010
rina-0.010
andra-0.010
ation-0.009
ressive-0.009
Token
Token ID198
Feature activation+9.08e-03

Pos logit contributions
ross+0.017
rina+0.016
andra+0.015
ation+0.015
ressive+0.015
Neg logit contributions
 sad-0.004
 very-0.002
 so-0.002
 boy-0.002
 but-0.002
Token
Token ID198
Feature activation+3.29e-03

Pos logit contributions
 sad+0.010
 penn+0.008
 cones+0.007
 rusty+0.007
 ladder+0.007
Neg logit contributions
ross-0.021
rina-0.020
andra-0.020
end-0.019
ation-0.019
Token ladder
Token ID18002
Feature activation-1.58e-02

Pos logit contributions
 sad+0.005
 so+0.004
 very+0.003
s+0.003
 were+0.003
Neg logit contributions
ross-0.021
rina-0.020
eh-0.019
ation-0.019
andra-0.019
Token,
Token ID11
Feature activation-1.12e-02

Pos logit contributions
ross+0.033
rina+0.031
andra+0.029
ation+0.029
ressive+0.028
Neg logit contributions
 sad-0.023
 very-0.020
 so-0.019
 boy-0.019
 were-0.018
Token please
Token ID3387
Feature activation-1.37e-02

Pos logit contributions
ross+0.025
rina+0.024
andra+0.023
iet+0.022
ffic+0.022
Neg logit contributions
 sad-0.012
s-0.010
 very-0.010
 penn-0.010
 cones-0.009
Token?"
Token ID1701
Feature activation-9.58e-03

Pos logit contributions
ross+0.030
rina+0.027
andra+0.026
ation+0.025
iet+0.025
Neg logit contributions
 sad-0.022
 so-0.020
 very-0.019
 but-0.019
 boy-0.019
Token Lily
Token ID20037
Feature activation-2.51e-02

Pos logit contributions
ross+0.021
andra+0.020
rina+0.020
ation+0.019
anne+0.018
Neg logit contributions
 sad-0.011
 very-0.009
 so-0.009
 ladder-0.009
 cones-0.008
Token asked
Token ID1965
Feature activation-4.01e-03

Pos logit contributions
ross+0.056
rina+0.052
andra+0.049
ation+0.048
ressive+0.047
Neg logit contributions
 sad-0.037
 so-0.032
 very-0.032
 boy-0.030
 but-0.030
Token her
Token ID607
Feature activation-6.53e-03

Pos logit contributions
ross+0.011
rina+0.010
ation+0.009
andra+0.009
iet+0.009
Neg logit contributions
 sad-0.004
 very-0.004
 so-0.004
 but-0.003
 boy-0.003
Token mom
Token ID1995
Feature activation-7.32e-03

Pos logit contributions
ross+0.017
rina+0.016
andra+0.016
ation+0.015
ressive+0.015
Neg logit contributions
 sad-0.006
 so-0.005
 very-0.005
 were-0.004
 but-0.004
Token.
Token ID13
Feature activation-3.91e-04

Pos logit contributions
ross+0.017
rina+0.016
andra+0.015
ation+0.015
ressive+0.014
Neg logit contributions
 sad-0.010
 very-0.009
 so-0.008
 boy-0.008
 but-0.008
Token
Token ID198
Feature activation+2.77e-03

Pos logit contributions
ross+0.001
andra+0.001
rina+0.001
ation+0.001
anne+0.001
Neg logit contributions
 sad-0.000
 so-0.000
 very-0.000
 penn-0.000
 boy-0.000
Token
Token ID198
Feature activation-2.55e-04

Pos logit contributions
 sad+0.002
 very+0.001
 so+0.001
 boy+0.001
 but+0.001
Neg logit contributions
ross-0.009
rina-0.008
andra-0.008
ation-0.008
ressive-0.008
Token house
Token ID2156
Feature activation+1.25e-03

Pos logit contributions
ross+0.000
rina+0.000
ation+0.000
andra+0.000
ressive+0.000
Neg logit contributions
 sad-0.000
 so-0.000
 very-0.000
 but-0.000
 were-0.000
Token and
Token ID290
Feature activation-1.26e-03

Pos logit contributions
 sad+0.001
 very+0.001
 so+0.001
 boy+0.001
 but+0.001
Neg logit contributions
ross-0.003
rina-0.003
andra-0.003
ation-0.003
ressive-0.003
Token share
Token ID2648
Feature activation+1.15e-02

Pos logit contributions
ross+0.003
rina+0.003
andra+0.003
ation+0.002
ressive+0.002
Neg logit contributions
 sad-0.002
 so-0.001
 very-0.001
 boy-0.001
s-0.001
Token the
Token ID262
Feature activation+8.56e-03

Pos logit contributions
 sad+0.014
 very+0.011
 so+0.011
 boy+0.011
 but+0.010
Neg logit contributions
ross-0.028
rina-0.026
andra-0.025
ation-0.024
ressive-0.024
Token fruits
Token ID15921
Feature activation+9.92e-03

Pos logit contributions
 sad+0.010
 so+0.008
 very+0.008
s+0.008
 but+0.007
Neg logit contributions
ross-0.020
rina-0.018
andra-0.018
ation-0.018
eh-0.018
Token with
Token ID351
Feature activation-2.25e-03

Pos logit contributions
 sad+0.015
 boy+0.012
 very+0.012
 so+0.012
 ladder+0.011
Neg logit contributions
ross-0.020
rina-0.019
andra-0.018
ation-0.017
eh-0.017
Token their
Token ID511
Feature activation+1.18e-03

Pos logit contributions
ross+0.006
rina+0.006
andra+0.005
ation+0.005
ressive+0.005
Neg logit contributions
 sad-0.002
 very-0.002
 so-0.002
 were-0.002
 boy-0.002
Token families
Token ID4172
Feature activation-1.52e-04

Pos logit contributions
 sad+0.001
 so+0.001
 very+0.001
 but+0.001
 boy+0.001
Neg logit contributions
ross-0.003
rina-0.003
andra-0.003
ation-0.002
ressive-0.002
Token.
Token ID13
Feature activation-4.38e-03

Pos logit contributions
ross+0.000
rina+0.000
andra+0.000
ation+0.000
ressive+0.000
Neg logit contributions
 sad-0.000
 very-0.000
 so-0.000
 boy-0.000
 but-0.000
Token They
Token ID1119
Feature activation+1.48e-02

Pos logit contributions
ross+0.012
rina+0.011
andra+0.011
ation+0.011
ressive+0.010
Neg logit contributions
 sad-0.004
 very-0.003
 so-0.003
 boy-0.002
 ladder-0.002
Token are
Token ID389
Feature activation+1.11e-02

Pos logit contributions
 sad+0.014
 so+0.010
 very+0.010
 boy+0.010
 but+0.009
Neg logit contributions
ross-0.039
rina-0.037
andra-0.036
ation-0.035
ressive-0.035
Token sat
Token ID3332
Feature activation+2.40e-02

Pos logit contributions
 sad+0.017
 very+0.014
 so+0.013
 boy+0.013
 were+0.012
Neg logit contributions
ross-0.038
rina-0.034
andra-0.033
ation-0.033
ressive-0.033
Token on
Token ID319
Feature activation+1.52e-03

Pos logit contributions
 sad+0.028
 cones+0.023
 so+0.022
 penn+0.022
 ladder+0.022
Neg logit contributions
ross-0.056
rina-0.051
eh-0.048
andra-0.047
iet-0.047
Token the
Token ID262
Feature activation-4.24e-03

Pos logit contributions
 sad+0.001
 very+0.001
 so+0.001
 but+0.001
 boy+0.001
Neg logit contributions
ross-0.005
rina-0.004
andra-0.004
ation-0.004
ressive-0.004
Token egg
Token ID5935
Feature activation+8.91e-03

Pos logit contributions
ross+0.009
rina+0.009
andra+0.008
ation+0.008
eh+0.008
Neg logit contributions
 sad-0.006
 were-0.005
 so-0.005
 boy-0.005
 very-0.005
Token to
Token ID284
Feature activation-6.77e-04

Pos logit contributions
 sad+0.008
 very+0.006
 so+0.006
 boy+0.005
 but+0.005
Neg logit contributions
ross-0.024
rina-0.023
andra-0.022
ation-0.022
ressive-0.021
Token keep
Token ID1394
Feature activation+8.05e-03

Pos logit contributions
ross+0.002
rina+0.002
andra+0.002
ation+0.002
ressive+0.001
Neg logit contributions
 sad-0.001
 very-0.001
 so-0.001
 but-0.001
 boy-0.001
Token it
Token ID340
Feature activation-4.69e-03

Pos logit contributions
 sad+0.008
 very+0.007
 so+0.007
 boy+0.006
 but+0.006
Neg logit contributions
ross-0.022
rina-0.020
andra-0.019
ation-0.019
ressive-0.019
Token warm
Token ID5814
Feature activation+5.69e-03

Pos logit contributions
ross+0.012
rina+0.011
andra+0.011
ation+0.010
ressive+0.010
Neg logit contributions
 sad-0.006
 very-0.005
 so-0.005
 but-0.004
 boy-0.004
Token.
Token ID13
Feature activation-1.30e-03

Pos logit contributions
 sad+0.008
 very+0.006
 so+0.006
 boy+0.006
 but+0.006
Neg logit contributions
ross-0.013
rina-0.012
andra-0.012
ation-0.011
ressive-0.011
Token She
Token ID1375
Feature activation+1.38e-02

Pos logit contributions
ross+0.004
rina+0.003
andra+0.003
ation+0.003
ressive+0.003
Neg logit contributions
 sad-0.001
 very-0.001
 so-0.001
 boy-0.001
 but-0.001
Token waited
Token ID13488
Feature activation-3.15e-03

Pos logit contributions
 sad+0.017
 very+0.014
 so+0.014
 boy+0.013
 but+0.012
Neg logit contributions
ross-0.034
rina-0.032
andra-0.030
ation-0.030
ressive-0.029
Token moment
Token ID2589
Feature activation+4.00e-03

Pos logit contributions
ross+0.004
rina+0.004
andra+0.004
ation+0.004
eh+0.004
Neg logit contributions
 sad-0.003
 so-0.003
 very-0.003
 cones-0.003
 were-0.003
Token.
Token ID13
Feature activation-6.46e-03

Pos logit contributions
 sad+0.005
 very+0.005
 so+0.005
 but+0.004
 boy+0.004
Neg logit contributions
ross-0.009
rina-0.009
andra-0.008
ation-0.008
ressive-0.008
Token Then
Token ID3244
Feature activation-5.30e-03

Pos logit contributions
ross+0.017
rina+0.016
andra+0.016
end+0.015
ressive+0.015
Neg logit contributions
 sad-0.005
 so-0.004
 very-0.004
 penn-0.004
 boy-0.003
Token she
Token ID673
Feature activation-2.03e-02

Pos logit contributions
ross+0.017
rina+0.016
andra+0.015
ation+0.015
eh+0.015
Neg logit contributions
 sad-0.003
 very-0.002
 so-0.002
 boy-0.002
 but-0.002
Token says
Token ID1139
Feature activation+4.19e-03

Pos logit contributions
ross+0.056
andra+0.048
ation+0.047
rina+0.047
ressive+0.046
Neg logit contributions
 sad-0.024
 can-0.022
 very-0.022
 so-0.019
 boy-0.018
Token,
Token ID11
Feature activation-2.84e-03

Pos logit contributions
 sad+0.003
 very+0.002
 so+0.002
 boy+0.002
 but+0.002
Neg logit contributions
ross-0.012
rina-0.012
andra-0.011
ation-0.011
ressive-0.011
Token "
Token ID366
Feature activation+5.96e-03

Pos logit contributions
ross+0.009
rina+0.008
andra+0.008
ation+0.008
ressive+0.008
Neg logit contributions
 sad-0.002
 very-0.001
 so-0.001
 but-0.001
 boy-0.001
TokenWhat
Token ID2061
Feature activation-2.18e-02

Pos logit contributions
 sad+0.006
 boy+0.006
 very+0.006
 so+0.005
s+0.005
Neg logit contributions
ross-0.015
rina-0.014
andra-0.013
ation-0.013
eh-0.013
Token do
Token ID466
Feature activation-9.12e-03

Pos logit contributions
ross+0.061
rina+0.058
andra+0.056
ation+0.055
ressive+0.053
Neg logit contributions
 sad-0.019
 very-0.015
 so-0.015
 boy-0.013
 but-0.013
Token you
Token ID345
Feature activation-1.19e-02

Pos logit contributions
ross+0.026
rina+0.024
andra+0.023
ation+0.023
ressive+0.022
Neg logit contributions
 sad-0.008
 very-0.006
 so-0.006
 boy-0.005
 but-0.005
Token call
Token ID869
Feature activation-3.19e-03

Pos logit contributions
ross+0.033
rina+0.031
andra+0.029
ation+0.029
wen+0.028
Neg logit contributions
 sad-0.013
 very-0.011
 so-0.010
 but-0.009
 were-0.009
Token take
Token ID1011
Feature activation+2.08e-03

Pos logit contributions
ross+0.008
rina+0.008
andra+0.007
ressive+0.007
eh+0.007
Neg logit contributions
 sad-0.004
 so-0.004
 very-0.004
 were-0.003
 but-0.003
Token a
Token ID257
Feature activation-6.42e-03

Pos logit contributions
 sad+0.002
 very+0.002
 so+0.002
 many+0.001
 round+0.001
Neg logit contributions
ross-0.006
rina-0.006
andra-0.005
ician-0.005
ation-0.005
Token walk
Token ID2513
Feature activation+1.02e-02

Pos logit contributions
ross+0.013
rina+0.013
andra+0.012
iet+0.012
ation+0.012
Neg logit contributions
 sad-0.008
 so-0.007
 boy-0.007
 very-0.007
 were-0.007
Token along
Token ID1863
Feature activation+1.80e-03

Pos logit contributions
 sad+0.012
 very+0.010
 so+0.010
 boy+0.009
 but+0.009
Neg logit contributions
ross-0.025
rina-0.023
andra-0.023
ation-0.022
ressive-0.022
Token the
Token ID262
Feature activation-7.05e-03

Pos logit contributions
 sad+0.002
 so+0.002
 very+0.002
 but+0.001
 boy+0.001
Neg logit contributions
ross-0.005
rina-0.004
andra-0.004
ation-0.004
iet-0.004
Token shore
Token ID15191
Feature activation-1.04e-02

Pos logit contributions
ross+0.017
rina+0.016
andra+0.015
ressive+0.014
ation+0.014
Neg logit contributions
 sad-0.009
 very-0.008
 so-0.008
 boy-0.007
 but-0.007
Tokenline
Token ID1370
Feature activation-2.20e-02

Pos logit contributions
ross+0.027
rina+0.025
andra+0.024
ation+0.023
iet+0.023
Neg logit contributions
 sad-0.012
 so-0.011
 very-0.011
 but-0.010
 boy-0.009
Token.
Token ID13
Feature activation+5.99e-03

Pos logit contributions
ross+0.056
rina+0.053
andra+0.051
ation+0.050
iet+0.050
Neg logit contributions
 sad-0.025
 so-0.023
 very-0.022
 but-0.021
 were-0.019
Token
Token ID198
Feature activation+6.09e-03

Pos logit contributions
 sad+0.006
 penn+0.004
 very+0.004
 boy+0.004
 so+0.004
Neg logit contributions
ross-0.015
andra-0.015
rina-0.014
anne-0.014
end-0.014
Token
Token ID198
Feature activation+2.47e-04

Pos logit contributions
 sad+0.007
 penn+0.005
nam+0.005
 cones+0.005
 boy+0.005
Neg logit contributions
ross-0.014
rina-0.014
andra-0.014
end-0.014
anne-0.013
TokenBut
Token ID1537
Feature activation+7.18e-03

Pos logit contributions
 sad+0.000
 so+0.000
 very+0.000
 but+0.000
 were+0.000
Neg logit contributions
ross-0.001
rina-0.001
andra-0.000
ation-0.000
ressive-0.000
Token.
Token ID13
Feature activation+3.65e-03

Pos logit contributions
 sad+0.002
 very+0.002
 so+0.002
 but+0.002
 boy+0.001
Neg logit contributions
ross-0.003
rina-0.003
andra-0.003
ation-0.003
ressive-0.003
Token But
Token ID887
Feature activation+1.16e-02

Pos logit contributions
 sad+0.004
 penn+0.003
 very+0.003
 are+0.003
 cones+0.003
Neg logit contributions
ross-0.008
andra-0.008
rina-0.008
eh-0.008
ation-0.008
Token then
Token ID788
Feature activation-6.24e-03

Pos logit contributions
 sad+0.015
 cones+0.014
s+0.014
 ladder+0.014
 penn+0.013
Neg logit contributions
ross-0.023
rina-0.021
ation-0.020
andra-0.020
eh-0.020
Token,
Token ID11
Feature activation-7.67e-03

Pos logit contributions
ross+0.017
rina+0.016
andra+0.016
ation+0.016
ressive+0.015
Neg logit contributions
 sad-0.005
 so-0.003
 very-0.003
 ladder-0.003
 boy-0.003
Token a
Token ID257
Feature activation-1.57e-02

Pos logit contributions
ross+0.020
rina+0.018
andra+0.018
ation+0.017
ressive+0.017
Neg logit contributions
 sad-0.007
 very-0.006
 so-0.006
s-0.005
 boy-0.005
Token friendly
Token ID8030
Feature activation-1.10e-02

Pos logit contributions
ross+0.039
rina+0.036
andra+0.035
ation+0.034
eh+0.034
Neg logit contributions
 sad-0.016
 so-0.013
 very-0.012
s-0.012
 cones-0.012
Token dolphin
Token ID44136
Feature activation-1.44e-02

Pos logit contributions
ross+0.029
rina+0.027
andra+0.026
ation+0.025
ressive+0.025
Neg logit contributions
 sad-0.011
 very-0.009
 so-0.009
 but-0.008
s-0.008
Token came
Token ID1625
Feature activation-1.34e-02

Pos logit contributions
ross+0.039
rina+0.036
andra+0.035
ation+0.034
wen+0.034
Neg logit contributions
 sad-0.017
 were-0.016
 very-0.015
 so-0.015
 but-0.014
Token and
Token ID290
Feature activation-1.18e-02

Pos logit contributions
ross+0.039
rina+0.037
andra+0.036
ation+0.035
ressive+0.034
Neg logit contributions
 sad-0.010
 very-0.008
 so-0.008
 boy-0.007
 but-0.006
Token helped
Token ID4193
Feature activation-1.52e-03

Pos logit contributions
ross+0.031
rina+0.028
ation+0.027
andra+0.027
iet+0.026
Neg logit contributions
 sad-0.015
 very-0.013
 so-0.013
 boy-0.013
s-0.012
Token the
Token ID262
Feature activation-3.94e-03

Pos logit contributions
ross+0.004
rina+0.004
andra+0.004
ation+0.004
iet+0.003
Neg logit contributions
 sad-0.002
 very-0.001
 so-0.001
 but-0.001
 boy-0.001
Token knew
Token ID2993
Feature activation-4.25e-03

Pos logit contributions
ross+0.020
ation+0.018
rina+0.018
andra+0.018
ressive+0.017
Neg logit contributions
 sad-0.008
 very-0.007
 so-0.007
 were-0.006
 boy-0.006
Token his
Token ID465
Feature activation-2.73e-03

Pos logit contributions
ross+0.013
rina+0.012
andra+0.012
ation+0.012
iet+0.012
Neg logit contributions
 sad-0.003
 very-0.002
 so-0.002
 but-0.002
 boy-0.001
Token mom
Token ID1995
Feature activation-2.07e-03

Pos logit contributions
ross+0.008
rina+0.007
andra+0.007
ation+0.007
ressive+0.007
Neg logit contributions
 sad-0.003
 very-0.002
 so-0.002
 boy-0.002
 but-0.002
Token would
Token ID561
Feature activation+7.16e-03

Pos logit contributions
ross+0.005
rina+0.005
andra+0.005
ation+0.005
iet+0.004
Neg logit contributions
 sad-0.002
 very-0.002
 so-0.002
 boy-0.002
 but-0.002
Token help
Token ID1037
Feature activation-4.66e-03

Pos logit contributions
 sad+0.008
 very+0.007
 so+0.007
 but+0.006
 were+0.006
Neg logit contributions
ross-0.019
rina-0.017
andra-0.017
ation-0.016
eh-0.016
Token him
Token ID683
Feature activation-9.23e-03

Pos logit contributions
ross+0.011
rina+0.010
andra+0.009
ation+0.009
ressive+0.009
Neg logit contributions
 sad-0.007
 very-0.006
 so-0.006
 but-0.005
 boy-0.005
Token manage
Token ID6687
Feature activation-4.52e-03

Pos logit contributions
ross+0.025
andra+0.023
rina+0.023
ation+0.021
ressive+0.021
Neg logit contributions
 sad-0.010
 very-0.009
 so-0.009
 but-0.008
 were-0.007
Token it
Token ID340
Feature activation-2.99e-02

Pos logit contributions
ross+0.014
rina+0.013
andra+0.013
iet+0.012
ation+0.012
Neg logit contributions
 sad-0.004
 very-0.003
 so-0.003
 but-0.003
 round-0.002
Token.
Token ID13
Feature activation-2.82e-03

Pos logit contributions
ross+0.087
andra+0.083
rina+0.079
iet+0.077
ation+0.075
Neg logit contributions
 so-0.030
 very-0.029
 sad-0.028
 but-0.026
 were-0.021
Token
Token ID220
Feature activation-4.64e-03

Pos logit contributions
ross+0.008
rina+0.007
andra+0.007
ation+0.007
ician+0.007
Neg logit contributions
 sad-0.003
 very-0.002
 so-0.002
 boy-0.001
 round-0.001
Token
Token ID198
Feature activation-2.56e-04

Pos logit contributions
ross+0.015
rina+0.014
andra+0.013
ation+0.013
ressive+0.013
Neg logit contributions
 sad-0.002
 very-0.001
 so-0.001
 boy-0.001
 but-0.001
Token come
Token ID1282
Feature activation+2.40e-03

Pos logit contributions
 sad+0.001
 so+0.001
 very+0.001
 were+0.001
 are+0.001
Neg logit contributions
ross-0.006
rina-0.005
andra-0.005
ation-0.005
ician-0.005
Token and
Token ID290
Feature activation-9.96e-03

Pos logit contributions
 sad+0.003
 so+0.002
 very+0.002
 but+0.002
 boy+0.002
Neg logit contributions
ross-0.007
andra-0.006
rina-0.006
ation-0.006
eh-0.006
Token play
Token ID711
Feature activation+4.36e-04

Pos logit contributions
ross+0.028
rina+0.026
andra+0.026
ation+0.025
iet+0.025
Neg logit contributions
 sad-0.010
 so-0.008
 very-0.008
 boy-0.007
 were-0.007
Token again
Token ID757
Feature activation-1.30e-02

Pos logit contributions
 sad+0.000
 so+0.000
 very+0.000
 but+0.000
 boy+0.000
Neg logit contributions
ross-0.001
andra-0.001
rina-0.001
wen-0.001
anne-0.001
Token soon
Token ID2582
Feature activation-1.16e-02

Pos logit contributions
ross+0.030
andra+0.028
rina+0.027
ation+0.026
eh+0.026
Neg logit contributions
 so-0.018
 sad-0.018
 very-0.017
 but-0.017
,-0.015
Token!"
Token ID2474
Feature activation-1.79e-02

Pos logit contributions
ross+0.027
rina+0.026
andra+0.025
iet+0.025
eh+0.024
Neg logit contributions
 so-0.016
 sad-0.016
 very-0.014
 but-0.014
,-0.014
Token<|endoftext|>
Token ID50256
Feature activation-5.40e-03

Pos logit contributions
ross+0.057
rina+0.054
andra+0.052
ation+0.051
iet+0.050
Neg logit contributions
 sad-0.009
 so-0.006
 very-0.006
 boy-0.005
 but-0.005
TokenOnce
Token ID7454
Feature activation-1.92e-03

Pos logit contributions
ross+0.014
rina+0.013
andra+0.013
ation+0.012
ician+0.012
Neg logit contributions
 sad-0.005
 so-0.005
 very-0.005
 boy-0.005
 were-0.005
Token upon
Token ID2402
Feature activation-9.92e-03

Pos logit contributions
ross+0.004
rina+0.004
andra+0.004
anne+0.003
iet+0.003
Neg logit contributions
 sad-0.002
 cones-0.002
 so-0.002
 very-0.002
 penn-0.002
Token a
Token ID257
Feature activation-3.65e-02

Pos logit contributions
ross+0.024
rina+0.023
andra+0.022
ation+0.021
ressive+0.021
Neg logit contributions
 sad-0.011
 very-0.009
 so-0.009
 boy-0.009
 but-0.008
Token time
Token ID640
Feature activation-8.43e-03

Pos logit contributions
ross+0.076
rina+0.069
ation+0.068
ressive+0.067
andra+0.064
Neg logit contributions
 sad-0.061
 very-0.057
 so-0.054
 but-0.054
 boy-0.050
Token
Token ID198
Feature activation+3.93e-03

Pos logit contributions
 sad+0.008
 very+0.005
 so+0.005
 boy+0.005
 ladder+0.005
Neg logit contributions
ross-0.031
rina-0.030
andra-0.029
ation-0.028
eh-0.028
TokenTHE
Token ID10970
Feature activation+3.63e-04

Pos logit contributions
 sad+0.005
 very+0.004
 so+0.004
 boy+0.004
 were+0.004
Neg logit contributions
ross-0.009
rina-0.009
andra-0.008
ation-0.008
ressive-0.008
Token<|endoftext|>
Token ID50256
Feature activation-1.66e-03

Pos logit contributions
 sad+0.000
 very+0.000
 so+0.000
 boy+0.000
 but+0.000
Neg logit contributions
ross-0.001
rina-0.001
andra-0.001
ation-0.001
ressive-0.001
TokenOnce
Token ID7454
Feature activation+6.98e-03

Pos logit contributions
ross+0.005
rina+0.005
andra+0.004
ation+0.004
ressive+0.004
Neg logit contributions
 sad-0.001
 very-0.001
 so-0.001
 boy-0.001
 were-0.001
Token upon
Token ID2402
Feature activation-4.45e-03

Pos logit contributions
 sad+0.008
 cones+0.008
 so+0.008
 very+0.007
 penn+0.007
Neg logit contributions
ross-0.015
rina-0.015
andra-0.014
anne-0.013
iet-0.013
Token a
Token ID257
Feature activation-3.27e-02

Pos logit contributions
ross+0.011
rina+0.010
andra+0.010
ation+0.010
ressive+0.010
Neg logit contributions
 sad-0.005
 very-0.004
 so-0.004
 boy-0.004
 but-0.004
Token time
Token ID640
Feature activation-3.06e-03

Pos logit contributions
ross+0.070
rina+0.064
ressive+0.064
ation+0.063
andra+0.060
Neg logit contributions
 sad-0.053
 very-0.049
 so-0.046
 but-0.045
 boy-0.042
Token there
Token ID612
Feature activation-2.92e-03

Pos logit contributions
ross+0.008
rina+0.008
andra+0.007
end+0.007
eh+0.007
Neg logit contributions
 sad-0.003
 boy-0.002
 very-0.002
 cones-0.002
 so-0.002
Token was
Token ID373
Feature activation-2.13e-02

Pos logit contributions
ross+0.007
rina+0.007
ation+0.006
andra+0.006
ressive+0.006
Neg logit contributions
 sad-0.003
 cones-0.003
 very-0.002
 so-0.002
 penn-0.002
Token a
Token ID257
Feature activation-4.92e-03

Pos logit contributions
ross+0.060
rina+0.057
andra+0.054
ation+0.054
ressive+0.053
Neg logit contributions
 sad-0.020
 very-0.017
 so-0.017
 boy-0.014
 but-0.013
Token little
Token ID1310
Feature activation+2.12e-02

Pos logit contributions
ross+0.012
rina+0.012
andra+0.011
iet+0.011
eh+0.011
Neg logit contributions
 sad-0.005
 so-0.004
 were-0.004
 very-0.003
 cones-0.003
Token jumped
Token ID11687
Feature activation-3.34e-03

Pos logit contributions
ross+0.030
rina+0.027
andra+0.026
ation+0.026
ressive+0.025
Neg logit contributions
 sad-0.017
 very-0.015
 so-0.015
 boy-0.014
 were-0.013
Token in
Token ID287
Feature activation+4.66e-03

Pos logit contributions
ross+0.008
rina+0.007
andra+0.007
iet+0.007
ation+0.007
Neg logit contributions
 sad-0.004
 boy-0.003
 very-0.003
 so-0.003
 were-0.003
Token the
Token ID262
Feature activation-2.60e-03

Pos logit contributions
 sad+0.005
s+0.004
 boy+0.004
 very+0.003
 ladder+0.003
Neg logit contributions
ross-0.011
andra-0.010
rina-0.010
ation-0.010
iet-0.010
Token ocean
Token ID9151
Feature activation-1.29e-03

Pos logit contributions
ross+0.006
rina+0.006
andra+0.005
ation+0.005
iet+0.005
Neg logit contributions
 sad-0.003
 so-0.003
 very-0.003
 but-0.003
 boy-0.002
Token and
Token ID290
Feature activation-9.07e-03

Pos logit contributions
ross+0.004
rina+0.004
andra+0.004
ation+0.003
ressive+0.003
Neg logit contributions
 sad-0.001
 very-0.001
 so-0.001
 boy-0.001
 but-0.000
Token sw
Token ID1509
Feature activation-4.64e-02

Pos logit contributions
ross+0.023
rina+0.022
andra+0.021
ation+0.020
ressive+0.020
Neg logit contributions
 sad-0.011
 very-0.009
 so-0.009
 boy-0.008
 but-0.008
Tokenam
Token ID321
Feature activation+8.12e-04

Pos logit contributions
ross+0.101
rina+0.095
andra+0.090
ressive+0.090
ician+0.085
Neg logit contributions
 sad-0.064
s-0.057
 very-0.056
 so-0.055
 were-0.050
Token around
Token ID1088
Feature activation-1.62e-03

Pos logit contributions
 sad+0.001
 very+0.001
 so+0.001
 boy+0.001
 ladder+0.001
Neg logit contributions
ross-0.002
rina-0.002
andra-0.002
ation-0.002
iet-0.002
Token until
Token ID1566
Feature activation-8.55e-03

Pos logit contributions
ross+0.004
rina+0.004
andra+0.004
ation+0.004
ressive+0.004
Neg logit contributions
 sad-0.002
 very-0.001
 so-0.001
 boy-0.001
 ladder-0.001
Token he
Token ID339
Feature activation-1.17e-03

Pos logit contributions
ross+0.026
rina+0.024
andra+0.024
ation+0.023
ressive+0.023
Neg logit contributions
 sad-0.005
 very-0.004
 so-0.004
 boy-0.003
 but-0.003
Token saw
Token ID2497
Feature activation-1.89e-02

Pos logit contributions
ross+0.003
rina+0.002
andra+0.002
ation+0.002
ressive+0.002
Neg logit contributions
 sad-0.002
 very-0.001
 so-0.001
 boy-0.001
 but-0.001
Token that
Token ID326
Feature activation+5.01e-03

Pos logit contributions
ross+0.035
rina+0.033
andra+0.031
ation+0.031
ressive+0.030
Neg logit contributions
 sad-0.014
 very-0.011
 so-0.010
 boy-0.010
 but-0.009
Token all
Token ID477
Feature activation-4.16e-03

Pos logit contributions
 sad+0.004
 very+0.003
 so+0.003
 boy+0.003
 but+0.003
Neg logit contributions
ross-0.014
rina-0.013
andra-0.012
ation-0.012
ressive-0.012
Token of
Token ID286
Feature activation+2.09e-03

Pos logit contributions
ross+0.011
rina+0.011
andra+0.010
ation+0.010
iet+0.010
Neg logit contributions
 sad-0.003
 very-0.002
 so-0.002
 boy-0.002
 but-0.002
Token the
Token ID262
Feature activation+5.20e-03

Pos logit contributions
 sad+0.002
 very+0.002
 so+0.002
 boy+0.002
 but+0.002
Neg logit contributions
ross-0.005
rina-0.005
andra-0.005
ation-0.005
ressive-0.005
Token other
Token ID584
Feature activation-5.62e-03

Pos logit contributions
 sad+0.006
 so+0.005
 very+0.005
 but+0.004
 ladder+0.004
Neg logit contributions
ross-0.012
rina-0.012
ation-0.011
andra-0.011
iet-0.011
Token sw
Token ID1509
Feature activation-4.41e-02

Pos logit contributions
ross+0.015
rina+0.014
andra+0.014
ation+0.013
iet+0.013
Neg logit contributions
 sad-0.005
 very-0.004
 so-0.004
 boy-0.003
 but-0.003
Tokenimmers
Token ID36904
Feature activation+3.88e-03

Pos logit contributions
ross+0.078
rina+0.072
andra+0.069
ressive+0.067
ician+0.063
Neg logit contributions
 sad-0.080
s-0.070
 very-0.070
 cones-0.069
 kay-0.068
Token were
Token ID547
Feature activation+4.08e-03

Pos logit contributions
 sad+0.003
 very+0.002
 so+0.002
 were+0.002
 but+0.002
Neg logit contributions
ross-0.012
rina-0.011
andra-0.011
ation-0.011
ician-0.010
Token much
Token ID881
Feature activation-5.85e-04

Pos logit contributions
 sad+0.005
 very+0.004
 so+0.004
 boy+0.003
 but+0.003
Neg logit contributions
ross-0.010
rina-0.010
andra-0.009
ation-0.009
ressive-0.009
Token bigger
Token ID5749
Feature activation-1.64e-02

Pos logit contributions
ross+0.001
rina+0.001
ation+0.001
andra+0.001
iet+0.001
Neg logit contributions
 ladder-0.001
 sad-0.001
 It-0.001
 -0.001
 She-0.001
Token than
Token ID621
Feature activation+1.00e-03

Pos logit contributions
ross+0.038
rina+0.035
andra+0.033
ation+0.033
ressive+0.032
Neg logit contributions
 sad-0.022
 very-0.019
 so-0.019
 boy-0.018
 but-0.017
Token pearl
Token ID43836
Feature activation-8.35e-03

Pos logit contributions
 sad+0.007
 very+0.006
 so+0.006
 boy+0.006
 but+0.005
Neg logit contributions
ross-0.014
rina-0.013
andra-0.012
ation-0.012
ressive-0.012
Token followed
Token ID3940
Feature activation-1.69e-03

Pos logit contributions
ross+0.022
rina+0.020
andra+0.020
iet+0.020
ife+0.019
Neg logit contributions
 sad-0.009
 were-0.008
 so-0.008
 very-0.008
 but-0.007
Token her
Token ID607
Feature activation-1.27e-02

Pos logit contributions
ross+0.005
rina+0.005
andra+0.004
ation+0.004
ressive+0.004
Neg logit contributions
 sad-0.002
 so-0.001
 very-0.001
 but-0.001
 boy-0.001
Token as
Token ID355
Feature activation-6.26e-03

Pos logit contributions
ross+0.032
rina+0.030
andra+0.029
ation+0.028
eh+0.028
Neg logit contributions
 sad-0.015
 very-0.013
 so-0.013
 but-0.012
 boy-0.011
Token she
Token ID673
Feature activation-4.42e-03

Pos logit contributions
ross+0.019
rina+0.018
andra+0.018
ation+0.018
iet+0.017
Neg logit contributions
 sad-0.004
 very-0.003
 so-0.003
 but-0.002
 boy-0.002
Token sw
Token ID1509
Feature activation-3.91e-02

Pos logit contributions
ross+0.012
rina+0.011
isdom+0.010
ation+0.010
ife+0.010
Neg logit contributions
 were-0.005
 sad-0.005
s-0.005
 could-0.004
 so-0.004
Tokenam
Token ID321
Feature activation+3.06e-03

Pos logit contributions
ross+0.094
rina+0.091
andra+0.088
ressive+0.084
anne+0.083
Neg logit contributions
 sad-0.042
s-0.037
 kay-0.036
 very-0.033
 cones-0.033
Token back
Token ID736
Feature activation-4.67e-03

Pos logit contributions
 sad+0.003
 very+0.003
 so+0.003
 but+0.003
s+0.003
Neg logit contributions
ross-0.008
rina-0.008
andra-0.007
ressive-0.007
ation-0.007
Token to
Token ID284
Feature activation-1.76e-02

Pos logit contributions
ross+0.015
rina+0.014
andra+0.014
ation+0.013
ressive+0.013
Neg logit contributions
 sad-0.003
 very-0.002
 so-0.002
 but-0.002
s-0.002
Token the
Token ID262
Feature activation-1.11e-02

Pos logit contributions
ross+0.056
rina+0.054
andra+0.052
iet+0.051
anne+0.050
Neg logit contributions
 sad-0.011
 so-0.010
 very-0.009
 but-0.007
 many-0.006
Token surface
Token ID4417
Feature activation+5.71e-04

Pos logit contributions
ross+0.025
rina+0.023
andra+0.023
ation+0.022
ressive+0.021
Neg logit contributions
 sad-0.015
 very-0.013
 so-0.013
 boy-0.012
 but-0.012
Token tree
Token ID5509
Feature activation-7.86e-03

Pos logit contributions
ross+0.011
rina+0.010
andra+0.010
ation+0.010
iet+0.010
Neg logit contributions
 sad-0.005
 very-0.004
 so-0.004
 boy-0.004
 were-0.004
Token could
Token ID714
Feature activation+1.47e-02

Pos logit contributions
ross+0.020
rina+0.018
andra+0.018
ation+0.018
iet+0.018
Neg logit contributions
 sad-0.009
 very-0.008
 so-0.008
 were-0.007
s-0.007
Token see
Token ID766
Feature activation-1.56e-02

Pos logit contributions
 sad+0.018
 very+0.015
 so+0.015
 but+0.014
 boy+0.014
Neg logit contributions
ross-0.036
rina-0.033
andra-0.032
ation-0.032
ressive-0.031
Token lots
Token ID6041
Feature activation-6.16e-03

Pos logit contributions
ross+0.044
andra+0.041
rina+0.041
ation+0.039
iet+0.039
Neg logit contributions
 sad-0.014
 so-0.013
 very-0.012
 were-0.010
 but-0.010
Token of
Token ID286
Feature activation-1.55e-02

Pos logit contributions
ross+0.019
rina+0.017
andra+0.017
ation+0.017
ressive+0.016
Neg logit contributions
 sad-0.004
 very-0.002
 so-0.002
 boy-0.002
 but-0.002
Token av
Token ID1196
Feature activation-4.37e-02

Pos logit contributions
ross+0.034
rina+0.032
andra+0.030
ation+0.030
ressive+0.029
Neg logit contributions
 sad-0.020
 very-0.017
 so-0.017
 boy-0.016
 ladder-0.016
Tokenoc
Token ID420
Feature activation-1.55e-02

Pos logit contributions
ross+0.078
rina+0.070
andra+0.067
ation+0.064
ressive+0.062
Neg logit contributions
 sad-0.082
 very-0.072
 so-0.072
 boy-0.069
 but-0.068
Tokenados
Token ID22484
Feature activation-2.88e-02

Pos logit contributions
ross+0.032
rina+0.029
andra+0.028
ation+0.027
iet+0.027
Neg logit contributions
 sad-0.024
 so-0.021
 very-0.021
 boy-0.020
 cones-0.020
Token that
Token ID326
Feature activation-6.86e-03

Pos logit contributions
ross+0.074
rina+0.068
andra+0.067
iet+0.065
ation+0.065
Neg logit contributions
 sad-0.033
 so-0.031
 very-0.030
 but-0.028
 were-0.026
Token the
Token ID262
Feature activation-4.44e-03

Pos logit contributions
ross+0.019
rina+0.018
andra+0.017
ation+0.017
iet+0.017
Neg logit contributions
 sad-0.006
 very-0.005
 so-0.005
 were-0.005
 boy-0.005
Token supplier
Token ID22693
Feature activation-1.66e-02

Pos logit contributions
ross+0.011
rina+0.010
andra+0.010
ation+0.010
ressive+0.010
Neg logit contributions
 sad-0.005
 very-0.004
 so-0.004
 boy-0.004
 but-0.004
Token sometimes
Token ID3360
Feature activation+2.29e-03

Pos logit contributions
ross+0.006
rina+0.006
andra+0.005
ation+0.005
iet+0.005
Neg logit contributions
 sad-0.004
 very-0.003
 so-0.003
 boy-0.003
s-0.003
Token pretend
Token ID16614
Feature activation-2.79e-03

Pos logit contributions
 sad+0.002
 very+0.001
 so+0.001
 but+0.001
 boy+0.001
Neg logit contributions
ross-0.007
rina-0.006
andra-0.006
ation-0.006
ressive-0.006
Token to
Token ID284
Feature activation+6.94e-03

Pos logit contributions
ross+0.008
rina+0.008
andra+0.008
ation+0.007
ressive+0.007
Neg logit contributions
 sad-0.002
 very-0.001
 so-0.001
 boy-0.001
 but-0.001
Token be
Token ID307
Feature activation-1.50e-02

Pos logit contributions
 sad+0.008
 so+0.007
 very+0.007
 boy+0.007
 cones+0.006
Neg logit contributions
ross-0.016
rina-0.015
ation-0.015
andra-0.014
iet-0.014
Token a
Token ID257
Feature activation+4.93e-03

Pos logit contributions
ross+0.042
andra+0.041
rina+0.040
ation+0.038
isdom+0.038
Neg logit contributions
 sad-0.016
 very-0.015
 so-0.014
 but-0.011
 boy-0.011
Token mer
Token ID4017
Feature activation-5.03e-02

Pos logit contributions
 sad+0.006
 very+0.005
 boy+0.005
 so+0.005
 but+0.004
Neg logit contributions
ross-0.013
rina-0.012
andra-0.012
ation-0.011
ressive-0.011
Tokenmaid
Token ID23151
Feature activation-1.50e-02

Pos logit contributions
ross+0.083
rina+0.071
andra+0.070
ation+0.068
eh+0.065
Neg logit contributions
 sad-0.103
 very-0.091
 so-0.090
 boy-0.090
 but-0.087
Token.
Token ID13
Feature activation+9.61e-03

Pos logit contributions
ross+0.043
andra+0.040
ation+0.038
rina+0.038
wen+0.037
Neg logit contributions
 sad-0.015
 very-0.014
s-0.014
 so-0.013
 but-0.013
Token
Token ID220
Feature activation-1.43e-04

Pos logit contributions
 sad+0.010
 very+0.007
 so+0.007
 cones+0.007
 penn+0.007
Neg logit contributions
ross-0.022
rina-0.022
andra-0.021
end-0.021
ife-0.020
Token
Token ID198
Feature activation+6.61e-03

Pos logit contributions
ross+0.000
rina+0.000
andra+0.000
ation+0.000
end+0.000
Neg logit contributions
 sad-0.000
 very-0.000
 so-0.000
 boy-0.000
 cones-0.000
Token
Token ID198
Feature activation-2.92e-03

Pos logit contributions
 sad+0.008
 penn+0.006
 cones+0.006
 rusty+0.006
 ladder+0.006
Neg logit contributions
rina-0.014
ross-0.014
andra-0.014
end-0.014
anne-0.013