TOP ACTIVATIONS
MAX = 0.087

and gave his mom a big kiss and a hug .
them and gave them a big hug . She loved her
gave Tom and Tim a big hug and a sweet treat
they gave each other a big hug and vowed to come
Mom gave Fl uffy a soft hug and a new toy
up to give Lily a big wet lick . That made
away and gave Spot a soft ball to play with instead
happily and gave Bob a big kitten hug . Bob and
gave O ll ie a big hug with her soft paw
and Lily gave him a small piece to share .
!" Bob said with a big smile on his face .
She gave her mom a big hug and thanked her for
she gave them each a big hug . The
boy and gave him a big hug . The other boy
. She gave Spot a big hug and told him ,
Bob gave the bear a big hug and he felt happy
Bob gave the bear a big hug and he felt happy
Jul ia gave him a big hug and said , "
her t eddy bear a big hug . She was happy
and gave Pac o a big hug . P

MIN ACTIVATIONS
MIN = -0.075

can solve your problem , m ister ," Ben said .
, " Thank you , m ister !" She then showed
m ule , but the m ule refused to move .
anas available for feeding the gor illas ." She said to
they went to see the gor illas . Tim my was
a m ule . The m ule was big and strong
than all of the other m ules . She had beautiful
m ule , but the m ule would not yield .
some bananas and feed the gor illas ?" Tim my nodded
was going to happen on Tues days . Today
, " Thank you , m ister . You saved the
a bar ber . The bar ber was a funny man
to choose one from the bakery . Every day she went
a m ule . The m ule was very strong and
" I am sorry , m ister . That is very
than any other donkey the m ule had seen before .
Suddenly , one of the gor illas reached out his furry
" Sorry , m ister . We didn 't
Maria : Please , m ister , can you help
" Sorry , m ister , we didn 't

INTERVAL 0.047 - 0.087
CONTAINS 0.332%

suddenly , she heard a loud noise coming from the sky
glass of milk and knocks it over . The milk spills
wipes her tears . He takes a band - aid from
She carefully scoop ed it up with a leaf and
time , there was a cute little t eddy bear .

INTERVAL 0.006 - 0.047
CONTAINS 34.465%

dish . Lily ate it all up , but then her
spoon on their nose . It 's just a spoon .
the pie . It is brown and bub bly . They
. He told her he would be okay and thanked her
but finally , the nest was finished ! The bird ie

INTERVAL -0.034 - 0.006
CONTAINS 63.976%

. Tim my loved to play with his toy gun and
it . Her mom agreed , and they found a spot
is always good to ask for permission before doing something that
to call her and tell her about it ?"
She decided to take it home , so she could show

INTERVAL -0.075 - -0.034
CONTAINS 1.227%

and Sam went to their rooms and cried . They wished
at him . She said : " Hi , I 'm
. She was only three years old and loved playing with
said , " Hi little re ind eer !" The re
He sat at his desk , looking out the window ,
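
Note on the interval boundaries: the four INTERVAL ranges above look like equal-width bins spanning MIN to MAX, and their CONTAINS percentages sum to 100%. The sketch below recomputes the edges under that assumption. The function names `equal_width_edges` and `bin_index` are hypothetical and are not taken from the tool that produced this dump; the edges agree with the displayed ones only to within the rounding of the printed MIN and MAX.

```python
# Minimal sketch, ASSUMING the dashboard uses equal-width bins between the
# displayed MIN and MAX. Names are illustrative, not the tool's own API.

def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced bin edges from lo to hi."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


def bin_index(value, lo, hi, n_bins):
    """Return the 0-based bin for a value in [lo, hi]; hi falls in the last bin."""
    width = (hi - lo) / n_bins
    return min(int((value - lo) / width), n_bins - 1)


MIN_ACT, MAX_ACT = -0.075, 0.087          # values printed in the dump
edges = equal_width_edges(MIN_ACT, MAX_ACT, 4)

# Percentages printed under each INTERVAL header, highest interval first.
contains = [0.332, 34.465, 63.976, 1.227]
total_percent = sum(contains)

# Example: the activation +8.70e-02 listed for " big" lands in the top bin,
# and +1.37e-02 (the first " and") lands in the 0.006 - 0.047 bin.
top_bin = bin_index(0.0870, MIN_ACT, MAX_ACT, 4)
mid_bin = bin_index(0.0137, MIN_ACT, MAX_ACT, 4)
```

Bins are indexed from the lowest interval (0) to the highest (3), so the dump lists them in reverse order.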
Token and
Token ID290
Feature activation+1.37e-02

Pos logit contributions
ups+0.001
grand+0.001
People+0.001
Grand+0.001
we+0.001
Neg logit contributions
 sip-0.001
 leap-0.001
 bite-0.001
 squeeze-0.001
 breaths-0.001
Token gave
Token ID2921
Feature activation+7.88e-03

Pos logit contributions
 sip+0.027
 leap+0.026
 bite+0.026
 squeeze+0.023
 bites+0.022
Neg logit contributions
grand-0.027
People-0.027
ups-0.027
Grand-0.025
ind-0.025
Token his
Token ID465
Feature activation+1.83e-02

Pos logit contributions
 bite+0.014
 sip+0.014
 leap+0.014
 squeeze+0.013
 bites+0.012
Neg logit contributions
grand-0.017
People-0.016
ups-0.016
Grand-0.015
ich-0.015
Token mom
Token ID1995
Feature activation+1.01e-02

Pos logit contributions
 sip+0.041
 leap+0.040
 bite+0.039
 squeeze+0.036
 bites+0.035
Neg logit contributions
grand-0.031
ups-0.031
People-0.030
Grand-0.028
ind-0.027
Token a
Token ID257
Feature activation+5.76e-02

Pos logit contributions
 sip+0.016
 leap+0.016
 bite+0.016
 squeeze+0.014
 bites+0.013
Neg logit contributions
grand-0.023
People-0.023
ups-0.023
Grand-0.022
ind-0.022
Token big
Token ID1263
Feature activation+8.70e-02

Pos logit contributions
 breaths+0.118
 leap+0.115
 sip+0.110
 bite+0.103
 bites+0.102
Neg logit contributions
grand-0.092
Grand-0.090
People-0.088
bel-0.086
ups-0.082
Token kiss
Token ID9245
Feature activation+1.51e-02

Pos logit contributions
 breaths+0.106
 leap+0.106
 trigger+0.105
ede+0.097
 sip+0.097
Neg logit contributions
 office-0.147
Grand-0.143
 painter-0.136
 granddaughter-0.136
grand-0.133
Token and
Token ID290
Feature activation+1.02e-02

Pos logit contributions
 sip+0.027
 leap+0.027
 bite+0.027
 squeeze+0.024
 bites+0.022
Neg logit contributions
grand-0.032
ups-0.031
People-0.031
Grand-0.029
ind-0.029
Token a
Token ID257
Feature activation+5.88e-02

Pos logit contributions
 sip+0.019
 leap+0.018
 bite+0.018
 squeeze+0.016
 bites+0.016
Neg logit contributions
grand-0.021
ups-0.021
People-0.021
Grand-0.020
ind-0.019
Token hug
Token ID16225
Feature activation+3.38e-03

Pos logit contributions
 leap+0.101
 sip+0.099
 bite+0.095
 breaths+0.094
 bites+0.088
Neg logit contributions
grand-0.119
Grand-0.116
People-0.116
ups-0.111
God-0.108
Token.
Token ID13
Feature activation+6.58e-04

Pos logit contributions
 sip+0.006
 bite+0.006
 leap+0.006
 squeeze+0.005
 paws+0.005
Neg logit contributions
People-0.008
grand-0.008
ups-0.007
ind-0.007
Grand-0.007
Token them
Token ID606
Feature activation+1.15e-02

Pos logit contributions
ups+0.038
grand+0.037
People+0.035
ich+0.034
ind+0.033
Neg logit contributions
 sip-0.035
 bite-0.035
 leap-0.034
 squeeze-0.031
 bites-0.029
Token and
Token ID290
Feature activation+2.85e-03

Pos logit contributions
 leap+0.021
 sip+0.021
 bite+0.021
 squeeze+0.018
 bites+0.018
Neg logit contributions
grand-0.023
ups-0.023
People-0.023
Grand-0.022
ind-0.021
Token gave
Token ID2921
Feature activation+2.73e-03

Pos logit contributions
 sip+0.006
 bite+0.005
 leap+0.005
 squeeze+0.005
 bites+0.005
Neg logit contributions
People-0.006
grand-0.006
ups-0.006
ind-0.005
Grand-0.005
Token them
Token ID606
Feature activation+1.47e-02

Pos logit contributions
 sip+0.006
 bite+0.006
 leap+0.006
 squeeze+0.005
 bites+0.005
Neg logit contributions
grand-0.005
People-0.005
ups-0.005
Grand-0.004
ind-0.004
Token a
Token ID257
Feature activation+5.41e-02

Pos logit contributions
 leap+0.027
 sip+0.026
 bite+0.026
 squeeze+0.023
 bites+0.022
Neg logit contributions
ups-0.031
grand-0.030
People-0.030
Grand-0.028
ind-0.028
Token big
Token ID1263
Feature activation+8.54e-02

Pos logit contributions
 breaths+0.110
 leap+0.110
 sip+0.099
 bite+0.096
 bites+0.096
Neg logit contributions
Grand-0.087
grand-0.086
People-0.082
bel-0.081
God-0.077
Token hug
Token ID16225
Feature activation+1.84e-03

Pos logit contributions
 leap+0.121
 trigger+0.117
 breaths+0.117
 Carl+0.115
ede+0.110
Neg logit contributions
Grand-0.150
 office-0.133
 writer-0.133
grand-0.132
 grandma-0.132
Token.
Token ID13
Feature activation+6.94e-04

Pos logit contributions
 sip+0.003
 bite+0.003
 leap+0.003
 squeeze+0.003
 bites+0.002
Neg logit contributions
People-0.004
grand-0.004
ups-0.004
ind-0.004
Grand-0.004
Token She
Token ID1375
Feature activation+3.49e-03

Pos logit contributions
 sip+0.001
 leap+0.001
 bite+0.001
 squeeze+0.001
 bites+0.001
Neg logit contributions
ups-0.001
grand-0.001
People-0.001
ind-0.001
ich-0.001
Token loved
Token ID6151
Feature activation-1.70e-02

Pos logit contributions
 sip+0.007
 bite+0.007
 leap+0.007
 squeeze+0.006
 bites+0.006
Neg logit contributions
grand-0.007
People-0.007
ups-0.007
ind-0.006
Grand-0.006
Token her
Token ID607
Feature activation+2.24e-02

Pos logit contributions
grand+0.037
People+0.035
ups+0.035
ich+0.034
cle+0.033
Neg logit contributions
 bite-0.031
 sip-0.031
 leap-0.028
 bites-0.027
 squeeze-0.026
Token gave
Token ID2921
Feature activation+7.30e-03

Pos logit contributions
People+0.026
grand+0.026
ups+0.026
ind+0.025
ich+0.025
Neg logit contributions
 sip-0.022
 bite-0.021
 leap-0.021
 squeeze-0.019
 bites-0.018
Token Tom
Token ID4186
Feature activation+2.34e-02

Pos logit contributions
 sip+0.015
 leap+0.014
 bite+0.014
 squeeze+0.013
 bites+0.012
Neg logit contributions
grand-0.014
ups-0.014
People-0.014
Grand-0.013
ind-0.013
Token and
Token ID290
Feature activation+8.18e-03

Pos logit contributions
 sip+0.036
 bite+0.035
 leap+0.034
 squeeze+0.030
 bites+0.029
Neg logit contributions
grand-0.055
People-0.055
ups-0.054
Grand-0.052
ind-0.052
Token Tim
Token ID5045
Feature activation+2.46e-02

Pos logit contributions
 leap+0.014
 sip+0.014
 bite+0.013
 squeeze+0.011
 breaths+0.011
Neg logit contributions
grand-0.018
ups-0.018
People-0.018
Grand-0.017
ind-0.016
Token a
Token ID257
Feature activation+5.91e-02

Pos logit contributions
 sip+0.042
 leap+0.042
 bite+0.041
 squeeze+0.036
 bites+0.035
Neg logit contributions
grand-0.054
ups-0.053
People-0.053
Grand-0.050
ind-0.049
Token big
Token ID1263
Feature activation+8.32e-02

Pos logit contributions
 leap+0.114
 sip+0.105
 breaths+0.104
 bite+0.103
 bites+0.095
Neg logit contributions
grand-0.109
Grand-0.106
People-0.105
ups-0.102
bel-0.099
Token hug
Token ID16225
Feature activation+8.48e-03

Pos logit contributions
 leap+0.118
 sip+0.102
 breaths+0.098
 Carl+0.095
 bite+0.093
Neg logit contributions
 office-0.161
grand-0.158
Grand-0.156
 priest-0.152
 writer-0.147
Token and
Token ID290
Feature activation+7.28e-03

Pos logit contributions
 sip+0.012
 bite+0.012
 leap+0.012
 squeeze+0.011
 paws+0.010
Neg logit contributions
People-0.021
grand-0.021
ups-0.020
ind-0.020
Grand-0.019
Token a
Token ID257
Feature activation+5.23e-02

Pos logit contributions
 sip+0.014
 leap+0.014
 bite+0.014
 squeeze+0.012
 bites+0.012
Neg logit contributions
grand-0.014
ups-0.014
People-0.014
Grand-0.013
ind-0.013
Token sweet
Token ID6029
Feature activation+5.81e-02

Pos logit contributions
 leap+0.091
 sip+0.087
 bite+0.084
 breaths+0.081
 bites+0.076
Neg logit contributions
grand-0.109
People-0.107
Grand-0.107
ups-0.104
God-0.099
Token treat
Token ID2190
Feature activation-1.65e-03

Pos logit contributions
 leap+0.087
 breaths+0.073
 sip+0.072
 pulls+0.071
 Carl+0.070
Neg logit contributions
grand-0.130
Grand-0.116
 priest-0.110
God-0.110
People-0.109
Token they
Token ID484
Feature activation+1.08e-02

Pos logit contributions
 sip+0.014
 leap+0.014
 bite+0.014
 squeeze+0.012
 paws+0.012
Neg logit contributions
grand-0.015
ups-0.014
People-0.014
ind-0.013
Grand-0.013
Token gave
Token ID2921
Feature activation+1.94e-02

Pos logit contributions
 sip+0.021
 leap+0.021
 bite+0.020
 squeeze+0.018
 bites+0.018
Neg logit contributions
grand-0.021
ups-0.021
People-0.021
Grand-0.020
ind-0.019
Token each
Token ID1123
Feature activation+2.12e-02

Pos logit contributions
 sip+0.040
 leap+0.040
 bite+0.039
 squeeze+0.035
 bites+0.034
Neg logit contributions
ups-0.035
grand-0.035
People-0.035
ind-0.032
Grand-0.032
Token other
Token ID584
Feature activation+1.14e-02

Pos logit contributions
 sip+0.046
 leap+0.045
 bite+0.044
 squeeze+0.040
 bites+0.039
Neg logit contributions
grand-0.037
People-0.036
ups-0.036
Grand-0.034
ind-0.033
Token a
Token ID257
Feature activation+4.94e-02

Pos logit contributions
 sip+0.020
 leap+0.019
 bite+0.019
 squeeze+0.017
 bites+0.016
Neg logit contributions
grand-0.025
ups-0.025
People-0.024
Grand-0.023
ind-0.023
Token big
Token ID1263
Feature activation+8.26e-02

Pos logit contributions
 leap+0.094
 breaths+0.094
 sip+0.089
 bite+0.087
 bites+0.085
Neg logit contributions
grand-0.084
People-0.081
Grand-0.080
ups-0.079
bel-0.077
Token hug
Token ID16225
Feature activation+7.78e-03

Pos logit contributions
 trigger+0.110
 breaths+0.102
 pulls+0.100
 leap+0.099
ede+0.095
Neg logit contributions
 office-0.139
 priest-0.135
 husband-0.134
Grand-0.133
 grandma-0.131
Token and
Token ID290
Feature activation+9.20e-03

Pos logit contributions
 sip+0.014
 leap+0.013
 bite+0.013
 squeeze+0.012
 bites+0.011
Neg logit contributions
grand-0.017
People-0.017
ups-0.017
Grand-0.016
ind-0.016
Token vowed
Token ID19982
Feature activation-2.35e-02

Pos logit contributions
 sip+0.017
 leap+0.017
 bite+0.017
 squeeze+0.015
 bites+0.014
Neg logit contributions
grand-0.019
ups-0.019
People-0.019
Grand-0.018
ind-0.017
Token to
Token ID284
Feature activation+3.61e-03

Pos logit contributions
People+0.050
ind+0.049
grand+0.049
Grand+0.048
ich+0.047
Neg logit contributions
 bite-0.039
 leap-0.038
 sip-0.037
 squeeze-0.036
 look-0.032
Token come
Token ID1282
Feature activation-4.39e-04

Pos logit contributions
 sip+0.007
 leap+0.006
 bite+0.006
 squeeze+0.006
 bites+0.005
Neg logit contributions
grand-0.008
ups-0.008
People-0.008
Grand-0.007
ind-0.007
Token Mom
Token ID11254
Feature activation-6.65e-03

Pos logit contributions
 leap+0.038
 sip+0.038
 bite+0.037
 squeeze+0.033
 bites+0.031
Neg logit contributions
grand-0.038
People-0.038
ups-0.038
Grand-0.036
ind-0.034
Token gave
Token ID2921
Feature activation+4.17e-02

Pos logit contributions
ups+0.012
People+0.011
Grand+0.011
grand+0.011
we+0.011
Neg logit contributions
 leap-0.014
 sip-0.014
 bite-0.013
 squeeze-0.012
 shock-0.012
Token Fl
Token ID1610
Feature activation+5.30e-03

Pos logit contributions
 leap+0.086
 sip+0.078
 bite+0.075
 shove+0.071
 squeeze+0.070
Neg logit contributions
ups-0.073
People-0.070
ind-0.068
we-0.067
grand-0.065
Tokenuffy
Token ID15352
Feature activation+2.60e-02

Pos logit contributions
 leap+0.013
 bite+0.012
 sip+0.012
 squeeze+0.011
 shove+0.010
Neg logit contributions
ups-0.009
ind-0.009
we-0.009
ich-0.008
People-0.008
Token a
Token ID257
Feature activation+4.52e-02

Pos logit contributions
 leap+0.051
 sip+0.048
 bite+0.046
 squeeze+0.043
 breaths+0.041
Neg logit contributions
ups-0.052
grand-0.049
People-0.049
Grand-0.045
ind-0.044
Token soft
Token ID2705
Feature activation+8.26e-02

Pos logit contributions
 leap+0.094
 breaths+0.094
 sip+0.084
 bite+0.079
ishly+0.079
Neg logit contributions
grand-0.073
Grand-0.069
People-0.065
bel-0.065
ups-0.063
Token hug
Token ID16225
Feature activation+7.38e-03

Pos logit contributions
 leap+0.126
 breaths+0.108
 pulls+0.105
 Carl+0.100
 shove+0.097
Neg logit contributions
grand-0.151
Grand-0.151
 office-0.138
bel-0.134
cle-0.133
Token and
Token ID290
Feature activation+7.35e-03

Pos logit contributions
 sip+0.013
 leap+0.013
 bite+0.013
 squeeze+0.011
 bites+0.011
Neg logit contributions
grand-0.016
ups-0.016
People-0.015
Grand-0.015
ind-0.014
Token a
Token ID257
Feature activation+4.71e-02

Pos logit contributions
 leap+0.014
 sip+0.014
 bite+0.014
 breaths+0.012
 squeeze+0.012
Neg logit contributions
grand-0.014
ups-0.014
People-0.013
Grand-0.013
God-0.012
Token new
Token ID649
Feature activation+5.82e-02

Pos logit contributions
 leap+0.086
 breaths+0.084
 sip+0.079
 bite+0.075
 bites+0.072
Neg logit contributions
grand-0.088
Grand-0.088
People-0.084
God-0.081
bel-0.081
Token toy
Token ID13373
Feature activation+2.38e-02

Pos logit contributions
 leap+0.119
 sip+0.109
 breaths+0.104
 bite+0.096
 knot+0.093
Neg logit contributions
grand-0.109
Grand-0.096
People-0.092
 office-0.092
ich-0.091
Token up
Token ID510
Feature activation+5.11e-03

Pos logit contributions
 leap+0.058
 sip+0.056
 bite+0.056
 bites+0.050
 pulls+0.050
Neg logit contributions
ups-0.041
bel-0.037
Grand-0.036
ind-0.036
People-0.036
Token to
Token ID284
Feature activation+3.18e-02

Pos logit contributions
 leap+0.011
 sip+0.011
 bite+0.011
 squeeze+0.010
 knot+0.010
Neg logit contributions
ups-0.008
grand-0.008
Grand-0.008
People-0.008
bel-0.007
Token give
Token ID1577
Feature activation+3.75e-02

Pos logit contributions
 leap+0.056
 sip+0.051
 knot+0.049
 bite+0.049
 breaths+0.048
Neg logit contributions
People-0.058
grand-0.058
Grand-0.056
we-0.055
ind-0.055
Token Lily
Token ID20037
Feature activation+2.69e-02

Pos logit contributions
 leap+0.090
 sip+0.088
 bite+0.086
 squeeze+0.078
 knot+0.077
Neg logit contributions
ups-0.056
grand-0.053
People-0.052
ind-0.048
we-0.048
Token a
Token ID257
Feature activation+5.24e-02

Pos logit contributions
 sip+0.047
 leap+0.047
 bite+0.045
 squeeze+0.040
 bites+0.039
Neg logit contributions
ups-0.057
grand-0.057
People-0.056
Grand-0.053
ind-0.052
Token big
Token ID1263
Feature activation+8.18e-02

Pos logit contributions
 breaths+0.087
 leap+0.087
 sip+0.081
 bite+0.076
 paws+0.075
Neg logit contributions
grand-0.096
Grand-0.095
People-0.091
ups-0.090
God-0.090
Token wet
Token ID9583
Feature activation+5.93e-02

Pos logit contributions
 trigger+0.109
 breaths+0.107
 webs+0.102
 leap+0.102
ede+0.099
Neg logit contributions
Grand-0.121
 husband-0.112
 bride-0.110
 priest-0.109
 granddaughter-0.109
Token lick
Token ID36728
Feature activation+1.50e-02

Pos logit contributions
 leap+0.087
 sip+0.084
 bite+0.077
 breaths+0.077
 bites+0.074
Neg logit contributions
grand-0.117
ich-0.115
ind-0.114
Grand-0.114
ups-0.110
Token.
Token ID13
Feature activation+2.91e-02

Pos logit contributions
 leap+0.032
 sip+0.031
 bite+0.031
 breaths+0.028
 bites+0.027
Neg logit contributions
ups-0.025
grand-0.023
People-0.023
we-0.023
Grand-0.022
Token That
Token ID1320
Feature activation+2.22e-02

Pos logit contributions
 sip+0.048
 leap+0.048
 bite+0.048
 squeeze+0.041
 bites+0.039
Neg logit contributions
grand-0.065
People-0.064
ups-0.064
Grand-0.061
ind-0.060
Token made
Token ID925
Feature activation+4.07e-02

Pos logit contributions
 sip+0.047
 leap+0.046
 bite+0.045
 squeeze+0.041
 breaths+0.041
Neg logit contributions
grand-0.039
ups-0.039
People-0.037
Grand-0.035
ind-0.035
Token away
Token ID1497
Feature activation+9.00e-04

Pos logit contributions
 sip+0.074
 leap+0.073
 bite+0.068
 breaths+0.067
 paws+0.064
Neg logit contributions
ups-0.070
grand-0.062
People-0.060
Grand-0.057
bel-0.056
Token and
Token ID290
Feature activation+7.95e-03

Pos logit contributions
 sip+0.002
 leap+0.002
 bite+0.002
 squeeze+0.002
 bites+0.001
Neg logit contributions
ups-0.002
grand-0.002
People-0.002
Grand-0.002
ind-0.002
Token gave
Token ID2921
Feature activation+1.90e-02

Pos logit contributions
 sip+0.015
 leap+0.015
 bite+0.014
 squeeze+0.013
 bites+0.012
Neg logit contributions
grand-0.016
ups-0.016
People-0.016
Grand-0.015
ind-0.015
Token Spot
Token ID15899
Feature activation+3.24e-02

Pos logit contributions
 leap+0.046
 sip+0.045
 bite+0.042
 squeeze+0.040
 breaths+0.040
Neg logit contributions
ups-0.028
grand-0.026
People-0.026
ind-0.024
Grand-0.024
Token a
Token ID257
Feature activation+4.77e-02

Pos logit contributions
 leap+0.067
 sip+0.064
 bite+0.062
 squeeze+0.057
 knot+0.056
Neg logit contributions
ups-0.061
grand-0.059
People-0.056
ind-0.053
Grand-0.052
Token soft
Token ID2705
Feature activation+8.17e-02

Pos logit contributions
 breaths+0.094
 leap+0.090
 sip+0.080
ishly+0.078
 paws+0.077
Neg logit contributions
grand-0.080
Grand-0.076
bel-0.073
ups-0.073
People-0.072
Token ball
Token ID2613
Feature activation+3.83e-02

Pos logit contributions
 leap+0.110
 breaths+0.102
 pulls+0.098
ishly+0.094
 Carl+0.092
Neg logit contributions
grand-0.140
Grand-0.140
bel-0.132
Dad-0.128
 office-0.127
Token to
Token ID284
Feature activation+1.18e-02

Pos logit contributions
 leap+0.091
 sip+0.087
 bite+0.082
 breaths+0.081
 shock+0.077
Neg logit contributions
grand-0.056
ups-0.056
Grand-0.049
People-0.047
bel-0.046
Token play
Token ID711
Feature activation+8.80e-03

Pos logit contributions
 leap+0.019
 sip+0.018
 bite+0.018
 breaths+0.017
 bites+0.016
Neg logit contributions
grand-0.026
ups-0.025
People-0.025
Grand-0.024
ind-0.023
Token with
Token ID351
Feature activation+1.30e-02

Pos logit contributions
 leap+0.021
 sip+0.021
 bite+0.020
 squeeze+0.018
 breaths+0.018
Neg logit contributions
ups-0.013
People-0.013
grand-0.012
we-0.012
Grand-0.012
Token instead
Token ID2427
Feature activation-9.49e-03

Pos logit contributions
 sip+0.024
 leap+0.024
 bite+0.024
 squeeze+0.021
 bites+0.020
Neg logit contributions
grand-0.027
ups-0.026
People-0.026
Grand-0.025
ind-0.025
Token happily
Token ID18177
Feature activation+2.46e-02

Pos logit contributions
 sip+0.065
 leap+0.063
 bite+0.062
 squeeze+0.055
 bites+0.053
Neg logit contributions
ups-0.059
People-0.056
Grand-0.056
ind-0.056
we-0.054
Token and
Token ID290
Feature activation+2.87e-02

Pos logit contributions
 sip+0.047
 leap+0.045
 bite+0.043
 squeeze+0.040
 bang+0.039
Neg logit contributions
ups-0.047
grand-0.046
People-0.044
ich-0.043
Grand-0.043
Token gave
Token ID2921
Feature activation+2.23e-02

Pos logit contributions
 sip+0.050
 leap+0.048
 bite+0.048
 squeeze+0.042
 knot+0.040
Neg logit contributions
ups-0.059
grand-0.059
People-0.058
Grand-0.055
we-0.054
Token Bob
Token ID5811
Feature activation+2.39e-02

Pos logit contributions
 sip+0.040
 leap+0.040
 bite+0.038
 squeeze+0.035
 knot+0.033
Neg logit contributions
ups-0.046
grand-0.045
People-0.045
ind-0.043
Grand-0.042
Token a
Token ID257
Feature activation+5.22e-02

Pos logit contributions
 sip+0.041
 leap+0.041
 bite+0.040
 squeeze+0.035
 bites+0.034
Neg logit contributions
grand-0.052
ups-0.052
People-0.051
Grand-0.048
ind-0.048
Token big
Token ID1263
Feature activation+8.15e-02

Pos logit contributions
 breaths+0.095
 leap+0.093
 sip+0.089
 bite+0.083
 bites+0.082
Neg logit contributions
grand-0.093
Grand-0.089
People-0.087
God-0.085
bel-0.084
Token kitten
Token ID42143
Feature activation+3.79e-02

Pos logit contributions
 breaths+0.104
 trigger+0.102
ede+0.096
 pulls+0.095
 leap+0.093
Neg logit contributions
 husband-0.133
Grand-0.133
 priest-0.129
God-0.128
grand-0.128
Token hug
Token ID16225
Feature activation+1.68e-03

Pos logit contributions
 leap+0.073
 sip+0.073
 bite+0.070
 breaths+0.063
 bites+0.063
Neg logit contributions
grand-0.069
ups-0.068
People-0.065
ind-0.063
ich-0.063
Token.
Token ID13
Feature activation+9.68e-03

Pos logit contributions
 sip+0.003
 bite+0.003
 leap+0.003
 squeeze+0.002
 bites+0.002
Neg logit contributions
grand-0.004
People-0.004
ups-0.004
Grand-0.004
ind-0.004
Token Bob
Token ID5811
Feature activation+1.40e-02

Pos logit contributions
 sip+0.014
 leap+0.014
 bite+0.013
 squeeze+0.011
 bites+0.011
Neg logit contributions
grand-0.024
ups-0.024
People-0.024
Grand-0.023
ind-0.022
Token and
Token ID290
Feature activation+1.15e-02

Pos logit contributions
 sip+0.029
 leap+0.028
 bite+0.028
 squeeze+0.025
 bites+0.024
Neg logit contributions
grand-0.026
ups-0.026
People-0.026
Grand-0.024
ind-0.024
Token gave
Token ID2921
Feature activation+3.12e-02

Pos logit contributions
ups+0.006
grand+0.006
People+0.006
Grand+0.006
ind+0.005
Neg logit contributions
 sip-0.006
 leap-0.006
 bite-0.006
 squeeze-0.005
 bites-0.005
Token O
Token ID440
Feature activation-1.58e-02

Pos logit contributions
 sip+0.064
 leap+0.063
 bite+0.062
 squeeze+0.055
 bites+0.054
Neg logit contributions
grand-0.058
ups-0.057
People-0.056
ind-0.053
Grand-0.053
Tokenll
Token ID297
Feature activation+4.00e-02

Pos logit contributions
grand+0.029
People+0.029
ups+0.028
Grand+0.027
ind+0.026
Neg logit contributions
 sip-0.033
 leap-0.032
 bite-0.032
 squeeze-0.029
 bites-0.028
Tokenie
Token ID494
Feature activation+2.55e-02

Pos logit contributions
 sip+0.078
 bite+0.077
 leap+0.076
 squeeze+0.069
 bites+0.065
Neg logit contributions
grand-0.079
People-0.078
ups-0.077
Grand-0.073
ich-0.072
Token a
Token ID257
Feature activation+5.50e-02

Pos logit contributions
 sip+0.044
 leap+0.043
 bite+0.043
 squeeze+0.038
 bites+0.036
Neg logit contributions
grand-0.056
ups-0.055
People-0.055
Grand-0.052
ind-0.051
Token big
Token ID1263
Feature activation+8.14e-02

Pos logit contributions
 leap+0.095
 breaths+0.094
 sip+0.091
 bite+0.085
 bites+0.081
Neg logit contributions
grand-0.108
Grand-0.104
People-0.099
God-0.097
bel-0.096
Token hug
Token ID16225
Feature activation+5.73e-03

Pos logit contributions
 breaths+0.102
 leap+0.101
 trigger+0.101
 sip+0.099
 Carl+0.089
Neg logit contributions
grand-0.149
Grand-0.148
 priest-0.143
 office-0.142
 husband-0.141
Token with
Token ID351
Feature activation+1.97e-02

Pos logit contributions
 sip+0.008
 bite+0.008
 leap+0.008
 squeeze+0.007
 paws+0.006
Neg logit contributions
People-0.015
grand-0.014
ups-0.014
ind-0.014
Grand-0.013
Token her
Token ID607
Feature activation+1.92e-02

Pos logit contributions
 sip+0.035
 leap+0.034
 bite+0.034
 squeeze+0.030
 bites+0.029
Neg logit contributions
grand-0.042
People-0.042
ups-0.042
Grand-0.040
ind-0.039
Token soft
Token ID2705
Feature activation+6.35e-02

Pos logit contributions
 sip+0.037
 leap+0.037
 bite+0.036
 squeeze+0.032
 bites+0.031
Neg logit contributions
grand-0.038
ups-0.038
People-0.037
Grand-0.035
ind-0.035
Token paw
Token ID41827
Feature activation+1.91e-02

Pos logit contributions
 sip+0.099
 leap+0.096
 bite+0.089
 shove+0.080
 bites+0.078
Neg logit contributions
grand-0.135
ind-0.127
Grand-0.126
People-0.124
ich-0.123
Token and
Token ID290
Feature activation+2.33e-02

Pos logit contributions
 leap+0.077
 sip+0.072
 breaths+0.071
 bite+0.070
 knot+0.067
Neg logit contributions
ups-0.049
Grand-0.047
grand-0.043
People-0.043
Kat-0.042
Token Lily
Token ID20037
Feature activation+2.90e-02

Pos logit contributions
 leap+0.045
 sip+0.044
 bite+0.042
 breaths+0.038
 squeeze+0.038
Neg logit contributions
ups-0.043
grand-0.042
People-0.042
Grand-0.042
we-0.039
Token gave
Token ID2921
Feature activation+3.32e-02

Pos logit contributions
 leap+0.059
 sip+0.055
 bite+0.053
 trigger+0.053
 shock+0.052
Neg logit contributions
ups-0.051
Grand-0.046
grand-0.045
People-0.044
we-0.044
Token him
Token ID683
Feature activation+3.27e-02

Pos logit contributions
 leap+0.072
 sip+0.070
 bite+0.067
 squeeze+0.061
 knot+0.060
Neg logit contributions
ups-0.058
grand-0.054
People-0.053
Grand-0.051
we-0.050
Token a
Token ID257
Feature activation+5.36e-02

Pos logit contributions
 leap+0.063
 sip+0.060
 bite+0.057
 squeeze+0.052
 breaths+0.051
Neg logit contributions
ups-0.067
grand-0.063
People-0.062
Grand-0.058
ind-0.057
Token small
Token ID1402
Feature activation+8.13e-02

Pos logit contributions
 breaths+0.100
 leap+0.098
 sip+0.081
ishly+0.081
 paws+0.079
Neg logit contributions
Grand-0.095
grand-0.093
bel-0.089
People-0.088
Mrs-0.086
Token piece
Token ID3704
Feature activation+2.67e-02

Pos logit contributions
 leap+0.118
 breaths+0.108
 Carl+0.100
 Bud+0.095
 AW+0.094
Neg logit contributions
 husband-0.133
 granddaughter-0.131
grand-0.130
Grand-0.130
 office-0.126
Token to
Token ID284
Feature activation+2.12e-02

Pos logit contributions
 leap+0.063
 sip+0.062
 bite+0.059
 breaths+0.055
 squeeze+0.054
Neg logit contributions
ups-0.040
grand-0.039
People-0.037
Grand-0.035
we-0.034
Token share
Token ID2648
Feature activation-1.09e-02

Pos logit contributions
 leap+0.029
 sip+0.027
 bite+0.025
 breaths+0.024
 bites+0.022
Neg logit contributions
grand-0.051
People-0.050
ups-0.050
Grand-0.049
ind-0.047
Token.
Token ID13
Feature activation+2.14e-02

Pos logit contributions
grand+0.023
People+0.023
ups+0.022
Grand+0.021
ind+0.021
Neg logit contributions
 sip-0.020
 bite-0.020
 leap-0.019
 squeeze-0.017
 bites-0.017
Token\n
Token ID198
Feature activation+4.57e-03

Pos logit contributions
 sip+0.034
 leap+0.033
 bite+0.033
 squeeze+0.029
 bites+0.027
Neg logit contributions
grand-0.050
ups-0.049
People-0.049
ind-0.046
Grand-0.046
Token!"
Token ID2474
Feature activation-3.41e-04

Pos logit contributions
 sip+0.020
 leap+0.020
 bite+0.019
 squeeze+0.016
 bites+0.016
Neg logit contributions
grand-0.031
ups-0.030
People-0.030
Grand-0.029
ind-0.029
Token Bob
Token ID5811
Feature activation+1.10e-03

Pos logit contributions
grand+0.001
People+0.001
ups+0.001
Grand+0.001
we+0.001
Neg logit contributions
 sip-0.001
 leap-0.001
 bite-0.001
 squeeze-0.001
 bites-0.001
Token said
Token ID531
Feature activation+7.80e-03

Pos logit contributions
 sip+0.002
 leap+0.002
 bite+0.002
 squeeze+0.002
 bites+0.002
Neg logit contributions
ups-0.002
grand-0.002
People-0.002
Grand-0.002
ich-0.002
Token with
Token ID351
Feature activation+2.47e-02

Pos logit contributions
 sip+0.016
 leap+0.015
 bite+0.015
 squeeze+0.013
 bites+0.013
Neg logit contributions
ups-0.014
grand-0.014
People-0.014
Grand-0.014
we-0.013
Token a
Token ID257
Feature activation+6.01e-02

Pos logit contributions
 leap+0.052
 sip+0.051
 bite+0.048
 shove+0.046
 breaths+0.046
Neg logit contributions
People-0.038
Grand-0.037
 known-0.035
David-0.034
grand-0.034
Token big
Token ID1263
Feature activation+8.13e-02

Pos logit contributions
 breaths+0.103
 sip+0.101
 leap+0.100
 bite+0.093
 bites+0.093
Neg logit contributions
Grand-0.111
People-0.105
grand-0.102
God-0.101
bel-0.100
Token smile
Token ID8212
Feature activation+2.91e-02

Pos logit contributions
ches+0.159
 repe+0.155
 trigger+0.154
 sip+0.154
ede+0.150
Neg logit contributions
 woman-0.065
 impressed-0.064
 pleased-0.060
Grand-0.059
 painter-0.059
Token on
Token ID319
Feature activation+2.49e-02

Pos logit contributions
 sip+0.060
 leap+0.059
 bite+0.057
 squeeze+0.051
 bites+0.051
Neg logit contributions
grand-0.052
ups-0.051
Grand-0.050
People-0.049
we-0.047
Token his
Token ID465
Feature activation+3.00e-02

Pos logit contributions
 leap+0.063
 sip+0.062
 bite+0.058
 squeeze+0.054
 knot+0.054
Neg logit contributions
People-0.032
grand-0.031
Grand-0.030
ups-0.030
ind-0.028
Token face
Token ID1986
Feature activation+9.45e-03

Pos logit contributions
 sip+0.061
 bite+0.057
 leap+0.056
 bites+0.051
 breaths+0.048
Neg logit contributions
ups-0.051
Grand-0.049
People-0.049
grand-0.048
ich-0.048
Token.
Token ID13
Feature activation+5.67e-03

Pos logit contributions
 sip+0.018
 leap+0.017
 bite+0.017
 squeeze+0.015
 bites+0.015
Neg logit contributions
grand-0.019
ups-0.019
People-0.019
Grand-0.018
ind-0.018
Token She
Token ID1375
Feature activation+4.99e-03

Pos logit contributions
grand+0.003
ups+0.003
ind+0.003
People+0.003
ich+0.003
Neg logit contributions
 sip-0.002
 leap-0.002
 bite-0.002
 squeeze-0.002
 paws-0.002
Token gave
Token ID2921
Feature activation+7.52e-03

Pos logit contributions
 sip+0.010
 leap+0.010
 bite+0.010
 squeeze+0.009
 bites+0.009
Neg logit contributions
grand-0.010
People-0.009
ups-0.009
Grand-0.009
ind-0.009
Token her
Token ID607
Feature activation+1.76e-02

Pos logit contributions
 sip+0.016
 leap+0.016
 bite+0.016
 squeeze+0.014
 bites+0.014
Neg logit contributions
grand-0.013
ups-0.013
People-0.013
Grand-0.012
ind-0.012
Token mom
Token ID1995
Feature activation+8.64e-03

Pos logit contributions
 leap+0.037
 sip+0.036
 bite+0.035
 squeeze+0.032
 breaths+0.030
Neg logit contributions
grand-0.032
ups-0.032
People-0.031
Grand-0.029
ind-0.028
Token a
Token ID257
Feature activation+5.70e-02

Pos logit contributions
 sip+0.014
 leap+0.014
 bite+0.014
 squeeze+0.012
 bites+0.011
Neg logit contributions
grand-0.020
People-0.020
ups-0.020
Grand-0.019
ind-0.018
Token big
Token ID1263
Feature activation+8.09e-02

Pos logit contributions
 breaths+0.116
 leap+0.116
 sip+0.104
 bite+0.102
 bites+0.100
Neg logit contributions
Grand-0.089
grand-0.087
People-0.083
bel-0.082
God-0.076
Token hug
Token ID16225
Feature activation+3.71e-03

Pos logit contributions
 leap+0.106
ede+0.099
 trigger+0.096
 bleed+0.095
 breaths+0.094
Neg logit contributions
Grand-0.134
 office-0.128
 writer-0.123
 priest-0.122
 painter-0.122
Token and
Token ID290
Feature activation+5.57e-03

Pos logit contributions
 sip+0.005
 bite+0.005
 leap+0.005
 squeeze+0.005
 bites+0.004
Neg logit contributions
People-0.009
grand-0.009
ups-0.009
ind-0.009
Grand-0.008
Token thanked
Token ID26280
Feature activation-2.13e-02

Pos logit contributions
 sip+0.011
 leap+0.011
 bite+0.011
 squeeze+0.010
 bites+0.009
Neg logit contributions
grand-0.011
ups-0.010
People-0.010
Grand-0.010
ind-0.009
Token her
Token ID607
Feature activation+2.35e-03

Pos logit contributions
grand+0.044
ups+0.043
People+0.042
ich+0.040
ind+0.039
Neg logit contributions
 sip-0.042
 leap-0.041
 bite-0.040
 squeeze-0.037
 breath-0.034
Token for
Token ID329
Feature activation-1.72e-02

Pos logit contributions
 sip+0.005
 leap+0.005
 bite+0.005
 squeeze+0.004
 bites+0.004
Neg logit contributions
grand-0.004
ups-0.004
People-0.004
Grand-0.004
ind-0.004
Token she
Token ID673
Feature activation-9.98e-03

Pos logit contributions
 sip+0.014
 leap+0.014
 bite+0.014
 squeeze+0.012
 paws+0.011
Neg logit contributions
grand-0.019
People-0.018
ups-0.018
ind-0.017
Grand-0.017
Token gave
Token ID2921
Feature activation+1.68e-02

Pos logit contributions
ups+0.020
grand+0.020
People+0.020
Grand+0.019
ind+0.018
Neg logit contributions
 sip-0.019
 leap-0.018
 bite-0.018
 squeeze-0.016
 bites-0.016
Token them
Token ID606
Feature activation+1.88e-02

Pos logit contributions
 sip+0.034
 leap+0.034
 bite+0.034
 squeeze+0.030
 bites+0.029
Neg logit contributions
grand-0.032
People-0.031
ups-0.031
Grand-0.029
ind-0.029
Token each
Token ID1123
Feature activation+2.44e-02

Pos logit contributions
 leap+0.034
 sip+0.034
 bite+0.033
 squeeze+0.029
 knot+0.028
Neg logit contributions
ups-0.040
People-0.039
grand-0.038
Grand-0.036
ind-0.035
Token a
Token ID257
Feature activation+5.24e-02

Pos logit contributions
 leap+0.045
 sip+0.043
 bite+0.042
 squeeze+0.037
 breaths+0.037
Neg logit contributions
grand-0.049
People-0.049
ups-0.049
Grand-0.046
ind-0.044
Token big
Token ID1263
Feature activation+8.05e-02

Pos logit contributions
 leap+0.097
 breaths+0.096
 sip+0.091
 bite+0.088
 bites+0.084
Neg logit contributions
grand-0.092
Grand-0.091
People-0.088
bel-0.085
ups-0.085
Token hug
Token ID16225
Feature activation+8.23e-03

Pos logit contributions
 breaths+0.097
 leap+0.097
 trigger+0.094
 sip+0.091
ches+0.088
Neg logit contributions
Grand-0.147
 priest-0.145
grand-0.142
 husband-0.142
 office-0.141
Token.
Token ID13
Feature activation+6.69e-03

Pos logit contributions
 sip+0.013
 bite+0.013
 leap+0.013
 squeeze+0.012
 bites+0.011
Neg logit contributions
People-0.019
grand-0.019
ups-0.018
ind-0.018
Grand-0.017
Token\n
Token ID198
Feature activation-6.81e-03

Pos logit contributions
 sip+0.012
 leap+0.011
 bite+0.011
 squeeze+0.010
 paws+0.010
Neg logit contributions
grand-0.015
ups-0.015
People-0.014
ind-0.014
ich-0.014
Token\n
Token ID198
Feature activation+8.39e-03

Pos logit contributions
ups+0.013
grand+0.013
People+0.013
ind+0.013
ich+0.013
Neg logit contributions
 sip-0.013
 bite-0.013
 leap-0.012
 squeeze-0.011
 bites-0.011
TokenThe
Token ID464
Feature activation-1.36e-02

Pos logit contributions
 sip+0.017
 bite+0.016
 leap+0.016
 squeeze+0.015
 knot+0.014
Neg logit contributions
grand-0.016
ich-0.015
ups-0.014
ber-0.014
 priest-0.014
Token boy
Token ID2933
Feature activation+9.77e-03

Pos logit contributions
grand+0.003
ups+0.003
People+0.003
Grand+0.002
ind+0.002
Neg logit contributions
 sip-0.003
 leap-0.003
 bite-0.003
 squeeze-0.002
 bites-0.002
Token and
Token ID290
Feature activation+1.25e-02

Pos logit contributions
 sip+0.017
 leap+0.017
 bite+0.017
 squeeze+0.015
 bites+0.014
Neg logit contributions
grand-0.020
ups-0.020
People-0.020
Grand-0.019
ind-0.019
Token gave
Token ID2921
Feature activation+2.01e-02

Pos logit contributions
 sip+0.023
 bite+0.023
 leap+0.023
 squeeze+0.020
 bites+0.019
Neg logit contributions
People-0.026
grand-0.026
ups-0.026
ind-0.025
Grand-0.025
Token him
Token ID683
Feature activation+2.59e-02

Pos logit contributions
 sip+0.041
 leap+0.041
 bite+0.040
 squeeze+0.036
 bites+0.035
Neg logit contributions
grand-0.037
ups-0.037
People-0.036
Grand-0.034
ind-0.034
Token a
Token ID257
Feature activation+4.45e-02

Pos logit contributions
 leap+0.047
 sip+0.047
 bite+0.045
 squeeze+0.040
 knot+0.039
Neg logit contributions
ups-0.054
grand-0.052
People-0.052
Grand-0.048
ind-0.048
Token big
Token ID1263
Feature activation+8.01e-02

Pos logit contributions
 breaths+0.085
 leap+0.083
 sip+0.077
 bite+0.074
 paws+0.072
Neg logit contributions
grand-0.077
Grand-0.073
ups-0.071
People-0.070
bel-0.070
Token hug
Token ID16225
Feature activation-4.54e-04

Pos logit contributions
 breaths+0.103
 trigger+0.101
 leap+0.099
ede+0.094
 webs+0.093
Neg logit contributions
Grand-0.131
grand-0.126
 husband-0.125
 office-0.125
 priest-0.124
Token.
Token ID13
Feature activation+9.85e-03

Pos logit contributions
People+0.001
grand+0.001
ups+0.001
ind+0.001
Grand+0.001
Neg logit contributions
 sip-0.001
 bite-0.001
 leap-0.001
 squeeze-0.001
 paws-0.001
Token The
Token ID383
Feature activation-1.20e-02

Pos logit contributions
 sip+0.018
 leap+0.017
 bite+0.017
 squeeze+0.015
 paws+0.015
Neg logit contributions
grand-0.021
ups-0.021
People-0.021
ind-0.020
Grand-0.019
Token other
Token ID584
Feature activation-9.41e-03

Pos logit contributions
grand+0.022
ups+0.022
People+0.022
Grand+0.020
ind+0.020
Neg logit contributions
 sip-0.024
 leap-0.024
 bite-0.024
 squeeze-0.021
 breaths-0.020
Token boy
Token ID2933
Feature activation+1.34e-02

Pos logit contributions
grand+0.018
People+0.018
ups+0.018
Grand+0.017
ind+0.016
Neg logit contributions
 sip-0.019
 leap-0.019
 bite-0.019
 squeeze-0.017
 bites-0.016
Token.
Token ID13
Feature activation+1.14e-02

Pos logit contributions
ups+0.001
grand+0.001
People+0.001
Grand+0.001
ind+0.001
Neg logit contributions
 leap-0.001
 sip-0.001
 bite-0.001
 squeeze-0.001
 knot-0.001
Token She
Token ID1375
Feature activation-2.22e-03

Pos logit contributions
 sip+0.025
 leap+0.025
 bite+0.024
 squeeze+0.022
 breaths+0.021
Neg logit contributions
ups-0.020
grand-0.020
People-0.020
Grand-0.018
ich-0.018
Token gave
Token ID2921
Feature activation+1.34e-02

Pos logit contributions
ups+0.004
grand+0.004
People+0.004
Grand+0.003
ind+0.003
Neg logit contributions
 leap-0.005
 sip-0.005
 bite-0.005
 squeeze-0.004
 breaths-0.004
Token Spot
Token ID15899
Feature activation+3.87e-02

Pos logit contributions
 leap+0.025
 sip+0.024
 bite+0.022
 squeeze+0.020
 knot+0.020
Neg logit contributions
ups-0.026
People-0.025
grand-0.025
ind-0.024
Grand-0.024
Token a
Token ID257
Feature activation+4.68e-02

Pos logit contributions
 leap+0.085
 sip+0.079
 bite+0.074
 squeeze+0.070
 knot+0.069
Neg logit contributions
ups-0.070
grand-0.065
People-0.062
ind-0.059
Grand-0.058
Token big
Token ID1263
Feature activation+7.99e-02

Pos logit contributions
 leap+0.089
 breaths+0.087
 sip+0.080
 bite+0.073
ishly+0.072
Neg logit contributions
grand-0.081
Grand-0.076
ups-0.076
People-0.073
bel-0.072
Token hug
Token ID16225
Feature activation+1.19e-02

Pos logit contributions
 breaths+0.107
 leap+0.106
ches+0.106
 webs+0.097
ishly+0.096
Neg logit contributions
 office-0.120
Grand-0.114
 husband-0.111
 granddaughter-0.111
 painter-0.108
Token and
Token ID290
Feature activation+1.48e-02

Pos logit contributions
 sip+0.021
 leap+0.021
 bite+0.020
 squeeze+0.018
 bites+0.017
Neg logit contributions
grand-0.025
ups-0.025
People-0.025
Grand-0.024
ind-0.023
Token told
Token ID1297
Feature activation-6.92e-03

Pos logit contributions
 leap+0.030
 sip+0.030
 bite+0.028
 squeeze+0.025
 breaths+0.025
Neg logit contributions
grand-0.028
ups-0.027
People-0.026
Grand-0.025
God-0.024
Token him
Token ID683
Feature activation-1.20e-02

Pos logit contributions
ups+0.011
grand+0.011
People+0.011
Grand+0.010
ind+0.010
Neg logit contributions
 leap-0.016
 sip-0.016
 bite-0.015
 squeeze-0.014
 breaths-0.013
Token,
Token ID11
Feature activation-3.24e-02

Pos logit contributions
grand+0.018
ups+0.018
People+0.018
Grand+0.017
ind+0.016
Neg logit contributions
 sip-0.029
 leap-0.028
 bite-0.028
 squeeze-0.026
 bites-0.025
Token Bob
Token ID5811
Feature activation-1.25e-03

Pos logit contributions
 sip+0.021
 leap+0.020
 bite+0.020
 squeeze+0.018
 bites+0.017
Neg logit contributions
grand-0.021
ups-0.021
People-0.021
Grand-0.020
ind-0.020
Token gave
Token ID2921
Feature activation+2.69e-02

Pos logit contributions
grand+0.002
ups+0.002
People+0.002
Grand+0.002
ind+0.002
Neg logit contributions
 sip-0.003
 leap-0.002
 bite-0.002
 squeeze-0.002
 bites-0.002
Token the
Token ID262
Feature activation+1.59e-02

Pos logit contributions
 sip+0.064
 leap+0.063
 bite+0.060
 squeeze+0.056
 knot+0.054
Neg logit contributions
ups-0.040
People-0.039
grand-0.039
ind-0.036
Grand-0.036
Token bear
Token ID6842
Feature activation+3.03e-02

Pos logit contributions
 leap+0.025
 sip+0.023
 bite+0.021
 breaths+0.020
 shove+0.019
Neg logit contributions
grand-0.037
Grand-0.034
People-0.034
 priest-0.033
ich-0.033
Token a
Token ID257
Feature activation+5.76e-02

Pos logit contributions
 leap+0.054
 sip+0.054
 bite+0.052
 squeeze+0.046
 breaths+0.045
Neg logit contributions
grand-0.063
ups-0.062
People-0.061
Grand-0.058
ind-0.057
Token big
Token ID1263
Feature activation+7.98e-02

Pos logit contributions
 breaths+0.106
 leap+0.103
 sip+0.095
 bite+0.090
 bites+0.090
Neg logit contributions
Grand-0.101
grand-0.100
People-0.091
bel-0.091
God-0.089
Token hug
Token ID16225
Feature activation+1.19e-02

Pos logit contributions
 leap+0.107
ede+0.102
 breaths+0.099
 sip+0.098
 Carl+0.092
Neg logit contributions
Grand-0.130
 office-0.121
grand-0.121
 priest-0.118
 writer-0.117
Token and
Token ID290
Feature activation+1.78e-02

Pos logit contributions
 sip+0.021
 leap+0.021
 bite+0.021
 squeeze+0.018
 bites+0.017
Neg logit contributions
grand-0.025
ups-0.025
People-0.025
Grand-0.024
ind-0.023
Token he
Token ID339
Feature activation+1.25e-02

Pos logit contributions
 sip+0.035
 leap+0.034
 bite+0.034
 squeeze+0.030
 breaths+0.029
Neg logit contributions
grand-0.034
ups-0.034
People-0.033
Grand-0.032
we-0.030
Token felt
Token ID2936
Feature activation+1.28e-02

Pos logit contributions
 sip+0.025
 leap+0.024
 bite+0.024
 squeeze+0.021
 bites+0.020
Neg logit contributions
grand-0.024
ups-0.024
People-0.024
Grand-0.023
ind-0.022
Token happy
Token ID3772
Feature activation+8.87e-03

Pos logit contributions
 sip+0.024
 bite+0.024
 leap+0.024
 squeeze+0.021
 bites+0.020
Neg logit contributions
grand-0.026
ups-0.026
People-0.026
ind-0.025
ich-0.025
Token Bob
Token ID5811
Feature activation-1.25e-03

Pos logit contributions
 sip+0.021
 leap+0.020
 bite+0.020
 squeeze+0.018
 bites+0.017
Neg logit contributions
grand-0.021
ups-0.021
People-0.021
Grand-0.020
ind-0.020
Token gave
Token ID2921
Feature activation+2.69e-02

Pos logit contributions
grand+0.002
ups+0.002
People+0.002
Grand+0.002
ind+0.002
Neg logit contributions
 sip-0.003
 leap-0.002
 bite-0.002
 squeeze-0.002
 bites-0.002
Token the
Token ID262
Feature activation+1.59e-02

Pos logit contributions
 sip+0.064
 leap+0.063
 bite+0.060
 squeeze+0.056
 knot+0.054
Neg logit contributions
ups-0.040
People-0.039
grand-0.039
ind-0.036
Grand-0.036
Token bear
Token ID6842
Feature activation+3.03e-02

Pos logit contributions
 leap+0.025
 sip+0.023
 bite+0.021
 breaths+0.020
 shove+0.019
Neg logit contributions
grand-0.037
Grand-0.034
People-0.034
 priest-0.033
ich-0.033
Token a
Token ID257
Feature activation+5.76e-02

Pos logit contributions
 leap+0.054
 sip+0.054
 bite+0.052
 squeeze+0.046
 breaths+0.045
Neg logit contributions
grand-0.063
ups-0.062
People-0.061
Grand-0.058
ind-0.057
Token big
Token ID1263
Feature activation+7.98e-02

Pos logit contributions
 breaths+0.106
 leap+0.103
 sip+0.095
 bite+0.090
 bites+0.090
Neg logit contributions
Grand-0.101
grand-0.100
People-0.091
bel-0.091
God-0.089
Token hug
Token ID16225
Feature activation+1.19e-02

Pos logit contributions
 leap+0.107
ede+0.102
 breaths+0.099
 sip+0.098
 Carl+0.092
Neg logit contributions
Grand-0.130
 office-0.121
grand-0.121
 priest-0.118
 writer-0.117
Token and
Token ID290
Feature activation+1.78e-02

Pos logit contributions
 sip+0.021
 leap+0.021
 bite+0.021
 squeeze+0.018
 bites+0.017
Neg logit contributions
grand-0.025
ups-0.025
People-0.025
Grand-0.024
ind-0.023
Token he
Token ID339
Feature activation+1.25e-02

Pos logit contributions
 sip+0.035
 leap+0.034
 bite+0.034
 squeeze+0.030
 breaths+0.029
Neg logit contributions
grand-0.034
ups-0.034
People-0.033
Grand-0.032
we-0.030
Token felt
Token ID2936
Feature activation+1.28e-02

Pos logit contributions
 sip+0.025
 leap+0.024
 bite+0.024
 squeeze+0.021
 bites+0.020
Neg logit contributions
grand-0.024
ups-0.024
People-0.024
Grand-0.023
ind-0.022
Token happy
Token ID3772
Feature activation+8.87e-03

Pos logit contributions
 sip+0.024
 bite+0.024
 leap+0.024
 squeeze+0.021
 bites+0.020
Neg logit contributions
grand-0.026
ups-0.026
People-0.026
ind-0.025
ich-0.025
TokenJul
Token ID16980
Feature activation-1.02e-02

Pos logit contributions
grand+0.014
ich+0.012
ind+0.012
eline+0.011
being+0.011
Neg logit contributions
 sip-0.015
 leap-0.014
 bite-0.014
 knot-0.013
 squeeze-0.013
Tokenia
Token ID544
Feature activation-1.24e-03

Pos logit contributions
grand+0.022
People+0.021
ups+0.021
Grand+0.020
ich+0.020
Neg logit contributions
 sip-0.018
 leap-0.018
 bite-0.018
 squeeze-0.015
 bites-0.015
Token gave
Token ID2921
Feature activation+1.63e-02

Pos logit contributions
ups+0.002
grand+0.002
People+0.002
Grand+0.002
ind+0.002
Neg logit contributions
 sip-0.003
 leap-0.002
 bite-0.002
 squeeze-0.002
 bites-0.002
Token him
Token ID683
Feature activation+2.66e-02

Pos logit contributions
 sip+0.034
 leap+0.034
 bite+0.033
 squeeze+0.030
 bites+0.029
Neg logit contributions
grand-0.030
ups-0.029
People-0.029
Grand-0.027
ind-0.027
Token a
Token ID257
Feature activation+4.90e-02

Pos logit contributions
 sip+0.051
 leap+0.051
 bite+0.050
 squeeze+0.044
 bites+0.043
Neg logit contributions
ups-0.053
grand-0.052
People-0.052
Grand-0.048
ind-0.048
Token big
Token ID1263
Feature activation+7.98e-02

Pos logit contributions
 leap+0.085
 breaths+0.083
 sip+0.080
 bite+0.073
 bites+0.070
Neg logit contributions
grand-0.097
Grand-0.092
bel-0.088
ups-0.088
People-0.087
Token hug
Token ID16225
Feature activation+8.34e-03

Pos logit contributions
 leap+0.103
 breaths+0.098
 Carl+0.094
ede+0.092
 sip+0.091
Neg logit contributions
 office-0.133
 priest-0.132
Grand-0.132
grand-0.128
spe-0.127
Token and
Token ID290
Feature activation+1.23e-03

Pos logit contributions
 sip+0.013
 bite+0.013
 leap+0.013
 squeeze+0.011
 bites+0.011
Neg logit contributions
People-0.019
grand-0.019
ups-0.019
ind-0.018
Grand-0.018
Token said
Token ID531
Feature activation-9.85e-03

Pos logit contributions
 sip+0.002
 leap+0.002
 bite+0.002
 squeeze+0.002
 bites+0.002
Neg logit contributions
grand-0.002
People-0.002
ups-0.002
ind-0.002
Grand-0.002
Token,
Token ID11
Feature activation-3.92e-02

Pos logit contributions
grand+0.026
ups+0.026
People+0.026
Grand+0.024
ind+0.024
Neg logit contributions
 sip-0.013
 leap-0.012
 bite-0.012
 squeeze-0.010
 bites-0.009
Token "
Token ID366
Feature activation-2.21e-02

Pos logit contributions
grand+0.087
ups+0.086
ind+0.081
People+0.080
ich+0.075
Neg logit contributions
 leap-0.068
 sip-0.068
 bite-0.067
 squeeze-0.062
 shove-0.057
Token her
Token ID607
Feature activation+2.32e-02

Pos logit contributions
 leap+0.070
 sip+0.068
 bite+0.065
 shove+0.063
 squeeze+0.062
Neg logit contributions
ups-0.029
we-0.024
People-0.023
grand-0.020
ind-0.020
Token t
Token ID256
Feature activation+3.25e-03

Pos logit contributions
 leap+0.046
 sip+0.045
 bite+0.041
 shove+0.038
 squeeze+0.037
Neg logit contributions
grand-0.044
ups-0.042
Grand-0.042
People-0.041
ind-0.038
Tokeneddy
Token ID21874
Feature activation+7.06e-02

Pos logit contributions
 sip+0.010
 leap+0.010
 bite+0.010
 squeeze+0.009
 bites+0.009
Neg logit contributions
grand-0.002
ups-0.002
People-0.002
Grand-0.002
ind-0.002
Token bear
Token ID6842
Feature activation+2.96e-02

Pos logit contributions
 leap+0.119
 sip+0.110
 bite+0.094
 shove+0.091
 squeeze+0.086
Neg logit contributions
ups-0.152
grand-0.150
ind-0.147
Grand-0.146
People-0.142
Token a
Token ID257
Feature activation+5.26e-02

Pos logit contributions
 leap+0.057
 sip+0.056
 bite+0.052
 squeeze+0.048
 shove+0.046
Neg logit contributions
ups-0.059
grand-0.058
People-0.054
Grand-0.053
ind-0.051
Token big
Token ID1263
Feature activation+7.98e-02

Pos logit contributions
 leap+0.091
 breaths+0.087
 sip+0.084
 bite+0.079
 bites+0.077
Neg logit contributions
grand-0.103
Grand-0.102
People-0.099
bel-0.094
ups-0.094
Token hug
Token ID16225
Feature activation+1.00e-02

Pos logit contributions
 leap+0.113
ede+0.110
 Carl+0.108
Carl+0.106
 breaths+0.105
Neg logit contributions
 writer-0.121
Grand-0.119
 granddaughter-0.114
grand-0.113
Dear-0.111
Token.
Token ID13
Feature activation+1.70e-02

Pos logit contributions
 sip+0.019
 leap+0.019
 bite+0.018
 squeeze+0.016
 bites+0.016
Neg logit contributions
grand-0.020
ups-0.020
People-0.020
Grand-0.019
ind-0.018
Token She
Token ID1375
Feature activation+6.41e-03

Pos logit contributions
 leap+0.038
 sip+0.038
 bite+0.038
 squeeze+0.034
 breaths+0.032
Neg logit contributions
People-0.028
grand-0.028
ups-0.028
Grand-0.026
we-0.024
Token was
Token ID373
Feature activation+2.70e-03

Pos logit contributions
 sip+0.013
 leap+0.013
 bite+0.013
 squeeze+0.012
 bites+0.011
Neg logit contributions
grand-0.012
ups-0.012
People-0.012
Grand-0.011
ind-0.011
Token happy
Token ID3772
Feature activation+1.12e-02

Pos logit contributions
 bite+0.005
 sip+0.004
 leap+0.004
 squeeze+0.004
 shock+0.004
Neg logit contributions
grand-0.006
ups-0.006
People-0.006
ind-0.006
cle-0.006
Token and
Token ID290
Feature activation-1.49e-02

Pos logit contributions
 sip+0.021
 leap+0.021
 bite+0.020
 squeeze+0.018
 bites+0.017
Neg logit contributions
ups-0.019
grand-0.018
People-0.018
we-0.017
Grand-0.017
Token gave
Token ID2921
Feature activation+1.41e-02

Pos logit contributions
grand+0.031
People+0.031
ups+0.030
ind+0.030
Grand+0.030
Neg logit contributions
 sip-0.027
 leap-0.027
 bite-0.027
 squeeze-0.024
 bites-0.023
Token Pac
Token ID6319
Feature activation-9.44e-03

Pos logit contributions
 sip+0.026
 leap+0.025
 bite+0.025
 squeeze+0.022
 bites+0.021
Neg logit contributions
ups-0.029
grand-0.029
People-0.029
Grand-0.027
ind-0.027
Tokeno
Token ID78
Feature activation+2.25e-02

Pos logit contributions
People+0.017
grand+0.016
ups+0.016
Grand+0.015
ich+0.015
Neg logit contributions
 bite-0.019
 sip-0.018
 leap-0.018
 squeeze-0.017
 paws-0.017
Token a
Token ID257
Feature activation+4.19e-02

Pos logit contributions
 leap+0.042
 sip+0.042
 bite+0.040
 squeeze+0.036
 knot+0.034
Neg logit contributions
ups-0.047
grand-0.045
People-0.044
Grand-0.041
ind-0.041
Token big
Token ID1263
Feature activation+7.97e-02

Pos logit contributions
 leap+0.070
 breaths+0.069
 sip+0.064
 paws+0.058
 bite+0.056
Neg logit contributions
grand-0.084
ups-0.080
bel-0.077
Grand-0.077
People-0.075
Token hug
Token ID16225
Feature activation+6.14e-03

Pos logit contributions
 leap+0.110
 webs+0.103
 Bud+0.100
 breaths+0.100
 Carl+0.100
Neg logit contributions
 office-0.122
Grand-0.118
 granddaughter-0.115
grand-0.115
 writer-0.115
Token.
Token ID13
Feature activation+6.41e-03

Pos logit contributions
 sip+0.011
 bite+0.011
 leap+0.011
 squeeze+0.009
 bites+0.009
Neg logit contributions
grand-0.013
People-0.013
ups-0.013
Grand-0.012
ind-0.012
Token\n
Token ID198
Feature activation-1.72e-02

Pos logit contributions
 sip+0.012
 leap+0.012
 bite+0.012
 squeeze+0.011
 paws+0.010
Neg logit contributions
grand-0.013
ups-0.013
People-0.013
ind-0.012
Grand-0.012
Token\n
Token ID198
Feature activation+2.86e-03

Pos logit contributions
grand+0.034
ups+0.034
People+0.033
ich+0.032
ind+0.032
Neg logit contributions
 sip-0.032
 bite-0.031
 leap-0.030
 squeeze-0.027
 bites-0.026
TokenP
Token ID47
Feature activation-1.43e-02

Pos logit contributions
 sip+0.007
 leap+0.006
 bite+0.006
 squeeze+0.006
 shove+0.006
Neg logit contributions
grand-0.005
ups-0.004
ich-0.004
People-0.004
eline-0.004
Token can
Token ID460
Feature activation+1.33e-02

Pos logit contributions
People+0.015
ups+0.015
grand+0.015
ind+0.013
 known+0.013
Neg logit contributions
 leap-0.012
 sip-0.012
 bite-0.011
 breaths-0.010
 cushion-0.010
Token solve
Token ID8494
Feature activation-2.44e-02

Pos logit contributions
 leap+0.023
 sip+0.023
 breaths+0.022
 bite+0.021
 bites+0.020
Neg logit contributions
ups-0.026
grand-0.026
People-0.025
ind-0.024
Grand-0.024
Token your
Token ID534
Feature activation-8.83e-03

Pos logit contributions
ups+0.042
People+0.042
grand+0.041
ind+0.039
Grand+0.038
Neg logit contributions
 leap-0.052
 sip-0.052
 bite-0.050
 squeeze-0.046
 shove-0.043
Token problem
Token ID1917
Feature activation-3.34e-02

Pos logit contributions
ups+0.020
grand+0.020
People+0.019
ind+0.018
ich+0.018
Neg logit contributions
 sip-0.014
 leap-0.013
 bite-0.013
 squeeze-0.011
 cushion-0.011
Token,
Token ID11
Feature activation-5.12e-03

Pos logit contributions
grand+0.078
People+0.076
ups+0.075
Grand+0.073
ind+0.072
Neg logit contributions
 sip-0.052
 bite-0.052
 leap-0.052
 squeeze-0.045
 bites-0.044
Token m
Token ID285
Feature activation-7.46e-02

Pos logit contributions
People+0.011
grand+0.011
ups+0.011
Grand+0.010
ind+0.010
Neg logit contributions
 sip-0.009
 leap-0.009
 bite-0.009
 squeeze-0.008
 bites-0.008
Tokenister
Token ID1694
Feature activation-1.97e-02

Pos logit contributions
People+0.108
grand+0.103
Grand+0.101
ind+0.101
Our+0.100
Neg logit contributions
 leap-0.166
 bite-0.165
 sip-0.156
 shove-0.146
 squeeze-0.146
Token,"
Token ID553
Feature activation-1.84e-02

Pos logit contributions
People+0.048
grand+0.047
ups+0.047
Grand+0.046
ind+0.046
Neg logit contributions
 bite-0.029
 sip-0.028
 leap-0.028
 squeeze-0.024
 bites-0.024
Token Ben
Token ID3932
Feature activation-1.27e-02

Pos logit contributions
grand+0.040
ups+0.039
People+0.039
Grand+0.037
ind+0.037
Neg logit contributions
 sip-0.033
 leap-0.032
 bite-0.032
 squeeze-0.028
 bites-0.027
Token said
Token ID531
Feature activation+8.25e-03

Pos logit contributions
ups+0.022
grand+0.021
Grand+0.020
People+0.020
Kat+0.018
Neg logit contributions
 sip-0.029
 leap-0.029
 bite-0.027
 squeeze-0.025
 shock-0.025
Token.
Token ID13
Feature activation+5.58e-04

Pos logit contributions
 leap+0.018
 sip+0.018
 bite+0.017
 knot+0.015
 squeeze+0.015
Neg logit contributions
ups-0.015
grand-0.014
People-0.013
Grand-0.013
we-0.013
Token,
Token ID11
Feature activation-4.04e-02

Pos logit contributions
grand+0.044
People+0.044
ups+0.044
Grand+0.042
ind+0.041
Neg logit contributions
 sip-0.021
 leap-0.021
 bite-0.021
 squeeze-0.017
 bites-0.016
Token "
Token ID366
Feature activation-1.62e-02

Pos logit contributions
ups+0.087
grand+0.087
People+0.081
ind+0.081
Grand+0.074
Neg logit contributions
 sip-0.074
 leap-0.073
 bite-0.073
 squeeze-0.067
 paws-0.060
TokenThank
Token ID10449
Feature activation-1.91e-02

Pos logit contributions
ups+0.032
ber+0.029
grand+0.029
ich+0.028
 office+0.028
Neg logit contributions
 bite-0.030
 sip-0.029
 leap-0.028
 squeeze-0.027
 lick-0.025
Token you
Token ID345
Feature activation+1.47e-02

Pos logit contributions
ups+0.036
grand+0.036
People+0.035
ind+0.034
eline+0.033
Neg logit contributions
 leap-0.037
 bite-0.036
 sip-0.036
 squeeze-0.032
 bites-0.031
Token,
Token ID11
Feature activation-2.31e-02

Pos logit contributions
 sip+0.023
 leap+0.022
 bite+0.022
 squeeze+0.019
 bites+0.019
Neg logit contributions
grand-0.034
ups-0.034
Grand-0.034
People-0.034
ind-0.032
Token m
Token ID285
Feature activation-7.30e-02

Pos logit contributions
ups+0.047
grand+0.045
People+0.045
we+0.043
ind+0.042
Neg logit contributions
 bite-0.044
 sip-0.044
 leap-0.044
 squeeze-0.039
 bites-0.037
Tokenister
Token ID1694
Feature activation-2.78e-03

Pos logit contributions
ups+0.115
People+0.112
ind+0.106
grand+0.104
Grand+0.104
Neg logit contributions
 bite-0.155
 leap-0.152
 sip-0.148
 squeeze-0.135
 shove-0.131
Token!"
Token ID2474
Feature activation+5.05e-03

Pos logit contributions
People+0.007
ups+0.007
grand+0.007
Grand+0.007
ind+0.007
Neg logit contributions
 bite-0.004
 sip-0.004
 leap-0.004
 squeeze-0.003
 bites-0.003
Token She
Token ID1375
Feature activation-6.12e-03

Pos logit contributions
 sip+0.010
 bite+0.009
 leap+0.009
 squeeze+0.009
 paws+0.008
Neg logit contributions
grand-0.010
ups-0.010
People-0.010
ind-0.010
Grand-0.009
Token then
Token ID788
Feature activation-1.11e-03

Pos logit contributions
grand+0.012
ups+0.012
People+0.012
Grand+0.011
ind+0.011
Neg logit contributions
 sip-0.012
 leap-0.012
 bite-0.012
 squeeze-0.011
 bites-0.010
Token showed
Token ID3751
Feature activation+1.71e-02

Pos logit contributions
grand+0.002
People+0.002
ups+0.002
ind+0.002
ich+0.002
Neg logit contributions
 sip-0.002
 bite-0.002
 leap-0.002
 squeeze-0.002
 bites-0.002
Token m
Token ID285
Feature activation-6.40e-02

Pos logit contributions
grand+0.040
People+0.039
ups+0.039
Grand+0.037
ind+0.037
Neg logit contributions
 bite-0.028
 leap-0.028
 sip-0.027
 squeeze-0.024
 bites-0.023
Tokenule
Token ID2261
Feature activation-1.09e-02

Pos logit contributions
People+0.089
grand+0.086
Kat+0.085
ups+0.081
Our+0.080
Neg logit contributions
 bite-0.146
 leap-0.145
 sip-0.139
 paws-0.134
 squeeze-0.132
Token,
Token ID11
Feature activation-2.75e-02

Pos logit contributions
grand+0.022
ups+0.022
People+0.021
Grand+0.020
ind+0.020
Neg logit contributions
 sip-0.021
 leap-0.020
 bite-0.020
 squeeze-0.018
 bites-0.017
Token but
Token ID475
Feature activation-1.23e-03

Pos logit contributions
grand+0.068
ups+0.067
People+0.065
ind+0.063
Grand+0.063
Neg logit contributions
 sip-0.042
 bite-0.041
 leap-0.041
 squeeze-0.036
 bites-0.033
Token the
Token ID262
Feature activation-3.78e-02

Pos logit contributions
grand+0.003
ups+0.003
ind+0.003
People+0.003
Grand+0.002
Neg logit contributions
 bite-0.002
 sip-0.002
 leap-0.002
 squeeze-0.002
 bites-0.002
Token m
Token ID285
Feature activation-7.25e-02

Pos logit contributions
grand+0.074
ups+0.069
People+0.068
we+0.065
Kat+0.064
Neg logit contributions
 bite-0.079
 leap-0.077
 sip-0.074
 squeeze-0.070
 bites-0.067
Tokenule
Token ID2261
Feature activation-2.65e-02

Pos logit contributions
People+0.106
ups+0.101
grand+0.100
Kat+0.099
bel+0.094
Neg logit contributions
 bite-0.154
 leap-0.150
 paws-0.143
 sip-0.143
 squeeze-0.137
Token refused
Token ID6520
Feature activation-7.45e-03

Pos logit contributions
grand+0.053
People+0.053
ups+0.052
Grand+0.050
ind+0.049
Neg logit contributions
 sip-0.050
 leap-0.049
 bite-0.049
 squeeze-0.044
 bites-0.042
Token to
Token ID284
Feature activation+5.67e-03

Pos logit contributions
grand+0.016
People+0.016
Grand+0.015
ups+0.015
Kat+0.015
Neg logit contributions
 bite-0.014
 sip-0.013
 leap-0.013
 squeeze-0.012
 bites-0.012
Token move
Token ID1445
Feature activation-1.03e-03

Pos logit contributions
 sip+0.009
 leap+0.009
 bite+0.008
 breaths+0.008
 bites+0.008
Neg logit contributions
grand-0.012
ups-0.012
People-0.012
Grand-0.012
God-0.011
Token.
Token ID13
Feature activation+1.88e-02

Pos logit contributions
grand+0.002
People+0.002
ups+0.002
Grand+0.002
ich+0.002
Neg logit contributions
 bite-0.002
 sip-0.002
 leap-0.002
 squeeze-0.001
 bites-0.001
Tokenanas
Token ID15991
Feature activation-2.19e-02

Pos logit contributions
ups+0.051
People+0.051
grand+0.050
Grand+0.046
ind+0.045
Neg logit contributions
 bite-0.075
 sip-0.075
 leap-0.074
 squeeze-0.068
 bites-0.066
Token available
Token ID1695
Feature activation-1.76e-02

Pos logit contributions
grand+0.051
ups+0.051
People+0.051
Grand+0.049
ind+0.048
Neg logit contributions
 bite-0.033
 sip-0.033
 leap-0.032
 squeeze-0.028
 bites-0.026
Token for
Token ID329
Feature activation-1.16e-02

Pos logit contributions
People+0.041
grand+0.041
ups+0.040
Grand+0.040
ind+0.039
Neg logit contributions
 bite-0.027
 leap-0.026
 sip-0.025
 squeeze-0.023
 bites-0.021
Token feeding
Token ID13017
Feature activation-2.58e-02

Pos logit contributions
grand+0.022
ups+0.022
People+0.021
Grand+0.020
ind+0.020
Neg logit contributions
 bite-0.023
 sip-0.023
 leap-0.023
 squeeze-0.021
 bites-0.020
Token the
Token ID262
Feature activation-1.80e-02

Pos logit contributions
grand+0.051
ups+0.050
People+0.050
Grand+0.047
ind+0.046
Neg logit contributions
 sip-0.049
 bite-0.049
 leap-0.049
 squeeze-0.043
 bites-0.042
Token gor
Token ID30344
Feature activation-7.22e-02

Pos logit contributions
grand+0.032
ups+0.031
People+0.030
Grand+0.029
David+0.029
Neg logit contributions
 bite-0.039
 sip-0.038
 leap-0.038
 bites-0.034
 squeeze-0.034
Tokenillas
Token ID25314
Feature activation-1.71e-02

Pos logit contributions
grand+0.121
Grand+0.111
People+0.107
ups+0.107
David+0.106
Neg logit contributions
 bite-0.163
 leap-0.155
 sip-0.152
 bites-0.143
 paws-0.136
Token."
Token ID526
Feature activation+2.49e-02

Pos logit contributions
grand+0.038
People+0.038
ups+0.038
Grand+0.036
ind+0.035
Neg logit contributions
 bite-0.028
 sip-0.028
 leap-0.028
 squeeze-0.024
 bites-0.023
Token She
Token ID1375
Feature activation-2.42e-02

Pos logit contributions
 sip+0.034
 leap+0.033
 bite+0.032
 squeeze+0.027
 bites+0.026
Neg logit contributions
grand-0.064
ups-0.064
People-0.063
Grand-0.061
ind-0.060
Token said
Token ID531
Feature activation-1.49e-02

Pos logit contributions
grand+0.051
ups+0.050
People+0.049
ind+0.047
ich+0.045
Neg logit contributions
 sip-0.044
 bite-0.043
 leap-0.043
 squeeze-0.039
 bites-0.037
Token to
Token ID284
Feature activation-5.65e-04

Pos logit contributions
grand+0.032
ups+0.032
People+0.031
ind+0.030
Grand+0.029
Neg logit contributions
 sip-0.027
 bite-0.026
 leap-0.026
 squeeze-0.023
 bites-0.022
Token they
Token ID484
Feature activation-1.67e-02

Pos logit contributions
 sip+0.020
 bite+0.020
 leap+0.019
 squeeze+0.018
 bites+0.016
Neg logit contributions
grand-0.022
ups-0.022
People-0.022
ind-0.021
Grand-0.021
Token went
Token ID1816
Feature activation-9.62e-03

Pos logit contributions
ups+0.037
grand+0.037
People+0.035
we+0.035
ich+0.034
Neg logit contributions
 sip-0.029
 bite-0.029
 leap-0.028
 squeeze-0.025
 bites-0.023
Token to
Token ID284
Feature activation-1.42e-02

Pos logit contributions
grand+0.016
ups+0.016
People+0.016
Grand+0.014
ind+0.014
Neg logit contributions
 sip-0.022
 leap-0.022
 bite-0.022
 squeeze-0.020
 bites-0.019
Token see
Token ID766
Feature activation-1.71e-02

Pos logit contributions
ups+0.029
grand+0.029
People+0.028
Grand+0.026
ind+0.026
Neg logit contributions
 sip-0.027
 bite-0.027
 leap-0.027
 squeeze-0.024
 bites-0.023
Token the
Token ID262
Feature activation-1.84e-02

Pos logit contributions
grand+0.041
ups+0.040
People+0.039
we+0.037
ind+0.037
Neg logit contributions
 sip-0.026
 bite-0.026
 leap-0.025
 squeeze-0.022
 bites-0.022
Token gor
Token ID30344
Feature activation-7.19e-02

Pos logit contributions
ups+0.035
grand+0.035
People+0.033
we+0.031
Grand+0.031
Neg logit contributions
 bite-0.037
 sip-0.037
 leap-0.036
 squeeze-0.032
 bites-0.031
Tokenillas
Token ID25314
Feature activation-2.06e-02

Pos logit contributions
grand+0.135
Grand+0.117
Kat+0.117
ups+0.114
People+0.114
Neg logit contributions
 bite-0.148
 sip-0.138
 leap-0.135
 bites-0.125
 paws-0.120
Token.
Token ID13
Feature activation+1.54e-02

Pos logit contributions
grand+0.043
ups+0.042
People+0.042
Grand+0.040
ind+0.039
Neg logit contributions
 sip-0.039
 bite-0.039
 leap-0.038
 squeeze-0.034
 bites-0.032
Token Tim
Token ID5045
Feature activation+2.16e-03

Pos logit contributions
 sip+0.026
 bite+0.025
 leap+0.025
 squeeze+0.022
 paws+0.021
Neg logit contributions
ups-0.035
grand-0.035
People-0.034
ind-0.033
Grand-0.032
Tokenmy
Token ID1820
Feature activation-7.16e-03

Pos logit contributions
 sip+0.004
 bite+0.004
 leap+0.004
 squeeze+0.004
 bites+0.004
Neg logit contributions
grand-0.004
People-0.004
ups-0.004
Grand-0.004
ind-0.004
Token was
Token ID373
Feature activation-1.10e-02

Pos logit contributions
People+0.016
grand+0.016
ups+0.015
we+0.015
ind+0.014
Neg logit contributions
 bite-0.012
 sip-0.012
 leap-0.012
 squeeze-0.011
 bites-0.011
Token a
Token ID257
Feature activation+7.38e-03

Pos logit contributions
grand+0.102
ups+0.099
ind+0.096
ber+0.096
we+0.096
Neg logit contributions
 bite-0.024
 sip-0.015
 bites-0.014
 paws-0.014
 squeeze-0.013
Token m
Token ID285
Feature activation-5.53e-02

Pos logit contributions
 sip+0.014
 leap+0.014
 bite+0.014
 squeeze+0.013
 bites+0.012
Neg logit contributions
grand-0.014
ups-0.014
People-0.014
Grand-0.013
ind-0.013
Tokenule
Token ID2261
Feature activation-3.04e-03

Pos logit contributions
People+0.076
ups+0.073
grand+0.071
Grand+0.068
Kat+0.065
Neg logit contributions
 bite-0.137
 leap-0.135
 sip-0.134
 squeeze-0.122
 paws-0.121
Token.
Token ID13
Feature activation+1.76e-02

Pos logit contributions
People+0.006
ups+0.006
grand+0.006
Grand+0.006
Kat+0.006
Neg logit contributions
 bite-0.006
 sip-0.006
 leap-0.005
 squeeze-0.005
 bites-0.005
Token The
Token ID383
Feature activation-3.09e-02

Pos logit contributions
 sip+0.033
 leap+0.032
 bite+0.032
 squeeze+0.029
 bites+0.027
Neg logit contributions
ups-0.037
grand-0.037
People-0.036
Grand-0.034
ind-0.034
Token m
Token ID285
Feature activation-7.09e-02

Pos logit contributions
ups+0.071
grand+0.070
People+0.068
we+0.067
Kat+0.066
Neg logit contributions
 bite-0.052
 leap-0.047
 sip-0.046
 squeeze-0.044
 bites-0.042
Tokenule
Token ID2261
Feature activation-1.77e-02

Pos logit contributions
ups+0.103
People+0.102
Kat+0.097
grand+0.096
 rightful+0.095
Neg logit contributions
 bite-0.149
 leap-0.145
 sip-0.141
 paws-0.138
 bites-0.133
Token was
Token ID373
Feature activation-1.99e-02

Pos logit contributions
ups+0.037
People+0.036
grand+0.035
ind+0.034
Kat+0.034
Neg logit contributions
 bite-0.032
 sip-0.031
 leap-0.030
 squeeze-0.028
 bites-0.028
Token big
Token ID1263
Feature activation+2.60e-02

Pos logit contributions
ups+0.049
grand+0.048
People+0.046
we+0.046
Kat+0.044
Neg logit contributions
 bite-0.034
 sip-0.030
 leap-0.029
 squeeze-0.028
 paws-0.026
Token and
Token ID290
Feature activation-1.82e-02

Pos logit contributions
 sip+0.054
 leap+0.053
 bite+0.051
 squeeze+0.046
 shove+0.044
Neg logit contributions
ups-0.047
grand-0.046
People-0.046
ind-0.045
Grand-0.044
Token strong
Token ID1913
Feature activation+6.64e-03

Pos logit contributions
ups+0.046
grand+0.044
People+0.043
Kat+0.042
ind+0.042
Neg logit contributions
 bite-0.028
 sip-0.027
 leap-0.026
 squeeze-0.024
 paws-0.022
Token than
Token ID621
Feature activation-1.05e-02

Pos logit contributions
grand+0.031
People+0.031
ups+0.029
David+0.029
Grand+0.029
Neg logit contributions
 bite-0.026
 sip-0.026
 leap-0.026
 squeeze-0.024
 paws-0.024
Token all
Token ID477
Feature activation-2.78e-02

Pos logit contributions
grand+0.022
People+0.021
ups+0.021
Grand+0.021
ich+0.020
Neg logit contributions
 bite-0.019
 sip-0.019
 leap-0.018
 squeeze-0.016
 bites-0.016
Token of
Token ID286
Feature activation-3.12e-02

Pos logit contributions
People+0.062
Grand+0.061
grand+0.061
David+0.060
ups+0.059
Neg logit contributions
 bite-0.047
 sip-0.045
 leap-0.044
 bites-0.039
 squeeze-0.039
Token the
Token ID262
Feature activation-3.46e-02

Pos logit contributions
grand+0.066
ups+0.063
People+0.061
Grand+0.059
bel+0.058
Neg logit contributions
 bite-0.059
 sip-0.059
 leap-0.057
 squeeze-0.052
 bites-0.050
Token other
Token ID584
Feature activation-2.84e-02

Pos logit contributions
grand+0.083
People+0.076
eline+0.075
bel+0.075
ups+0.075
Neg logit contributions
 bite-0.055
 leap-0.050
 sip-0.047
 squeeze-0.046
 bites-0.044
Token m
Token ID285
Feature activation-7.06e-02

Pos logit contributions
grand+0.067
bel+0.061
People+0.060
Kat+0.060
ups+0.059
Neg logit contributions
 bite-0.045
 leap-0.043
 sip-0.041
 paws-0.040
 bites-0.040
Tokenules
Token ID5028
Feature activation-3.67e-02

Pos logit contributions
People+0.104
Kat+0.103
grand+0.103
 rightful+0.099
Jessica+0.098
Neg logit contributions
 leap-0.143
 bite-0.142
 paws-0.134
 breaths-0.132
 sip-0.132
Token.
Token ID13
Feature activation+1.01e-02

Pos logit contributions
Grand+0.093
People+0.093
grand+0.092
Kat+0.090
bel+0.089
Neg logit contributions
 bite-0.048
 leap-0.047
 sip-0.045
 squeeze-0.039
 bites-0.038
Token She
Token ID1375
Feature activation-1.75e-02

Pos logit contributions
 sip+0.018
 leap+0.017
 bite+0.017
 squeeze+0.016
 paws+0.015
Neg logit contributions
ups-0.022
grand-0.021
People-0.021
ind-0.020
ich-0.020
Token had
Token ID550
Feature activation-2.15e-02

Pos logit contributions
ups+0.034
grand+0.033
People+0.032
ich+0.030
Kat+0.030
Neg logit contributions
 bite-0.036
 sip-0.035
 leap-0.033
 squeeze-0.032
 bites-0.030
Token beautiful
Token ID4950
Feature activation+1.40e-02

Pos logit contributions
grand+0.060
ups+0.056
People+0.056
ind+0.056
cle+0.056
Neg logit contributions
 bite-0.025
 leap-0.023
 sip-0.022
 paws-0.021
 squeeze-0.020
Token m
Token ID285
Feature activation-5.07e-02

Pos logit contributions
ups+0.039
grand+0.038
People+0.038
Grand+0.037
ind+0.037
Neg logit contributions
 sip-0.029
 leap-0.028
 bite-0.027
 squeeze-0.024
 breaths-0.022
Tokenule
Token ID2261
Feature activation+1.31e-02

Pos logit contributions
People+0.069
grand+0.067
Kat+0.064
ups+0.063
Grand+0.059
Neg logit contributions
 bite-0.123
 leap-0.121
 sip-0.120
 squeeze-0.112
 paws-0.110
Token,
Token ID11
Feature activation-1.80e-02

Pos logit contributions
 sip+0.025
 leap+0.023
 bite+0.022
 breaths+0.020
 bites+0.020
Neg logit contributions
ups-0.026
grand-0.022
People-0.022
 office-0.021
ich-0.021
Token but
Token ID475
Feature activation+4.78e-03

Pos logit contributions
grand+0.041
People+0.041
ups+0.041
Grand+0.039
ind+0.038
Neg logit contributions
 sip-0.029
 leap-0.028
 bite-0.028
 squeeze-0.024
 bites-0.023
Token the
Token ID262
Feature activation-3.24e-02

Pos logit contributions
 sip+0.009
 leap+0.009
 bite+0.009
 squeeze+0.008
 bites+0.008
Neg logit contributions
grand-0.010
ups-0.010
People-0.009
ind-0.009
Grand-0.009
Token m
Token ID285
Feature activation-7.02e-02

Pos logit contributions
grand+0.075
ups+0.073
People+0.073
Grand+0.068
ind+0.068
Neg logit contributions
 bite-0.053
 leap-0.053
 sip-0.053
 squeeze-0.046
 bites-0.044
Tokenule
Token ID2261
Feature activation-1.61e-02

Pos logit contributions
People+0.101
grand+0.096
ups+0.096
Kat+0.093
Our+0.087
Neg logit contributions
 bite-0.154
 leap-0.151
 sip-0.144
 paws-0.143
 squeeze-0.139
Token would
Token ID561
Feature activation+1.06e-02

Pos logit contributions
ups+0.030
grand+0.029
People+0.029
Grand+0.027
we+0.026
Neg logit contributions
 sip-0.034
 leap-0.033
 bite-0.032
 squeeze-0.029
 breaths-0.028
Token not
Token ID407
Feature activation+2.09e-02

Pos logit contributions
 sip+0.027
 leap+0.027
 breaths+0.026
 bite+0.025
 paws+0.025
Neg logit contributions
ups-0.012
People-0.012
grand-0.012
Grand-0.011
God-0.010
Token yield
Token ID7800
Feature activation-4.30e-03

Pos logit contributions
 sip+0.043
 leap+0.040
 breaths+0.039
 paws+0.037
 bite+0.036
Neg logit contributions
People-0.036
ups-0.035
grand-0.033
Grand-0.033
God-0.032
Token.
Token ID13
Feature activation+2.29e-02

Pos logit contributions
grand+0.009
People+0.009
ups+0.009
Grand+0.008
ind+0.008
Neg logit contributions
 sip-0.008
 bite-0.008
 leap-0.008
 squeeze-0.007
 bites-0.007
Token some
Token ID617
Feature activation-1.61e-02

Pos logit contributions
grand+0.006
ups+0.006
People+0.006
Grand+0.006
ind+0.006
Neg logit contributions
 sip-0.005
 leap-0.005
 bite-0.005
 squeeze-0.005
 bites-0.004
Token bananas
Token ID35484
Feature activation-2.94e-02

Pos logit contributions
grand+0.034
ups+0.033
People+0.033
Grand+0.031
ind+0.031
Neg logit contributions
 sip-0.029
 leap-0.029
 bite-0.028
 squeeze-0.025
 bites-0.024
Token and
Token ID290
Feature activation-1.57e-02

Pos logit contributions
grand+0.061
People+0.060
ups+0.060
Grand+0.056
ind+0.056
Neg logit contributions
 sip-0.054
 bite-0.054
 leap-0.053
 squeeze-0.047
 bites-0.045
Token feed
Token ID3745
Feature activation-2.89e-02

Pos logit contributions
grand+0.038
ups+0.038
People+0.037
ind+0.036
Grand+0.036
Neg logit contributions
 bite-0.023
 sip-0.023
 leap-0.023
 squeeze-0.020
 bites-0.019
Token the
Token ID262
Feature activation-1.83e-02

Pos logit contributions
grand+0.056
ups+0.054
People+0.053
Grand+0.052
Kat+0.051
Neg logit contributions
 bite-0.056
 sip-0.055
 leap-0.053
 squeeze-0.048
 bites-0.047
Token gor
Token ID30344
Feature activation-6.99e-02

Pos logit contributions
grand+0.033
ups+0.033
People+0.031
Grand+0.030
David+0.030
Neg logit contributions
 bite-0.039
 sip-0.037
 leap-0.037
 bites-0.034
 squeeze-0.033
Tokenillas
Token ID25314
Feature activation-2.27e-02

Pos logit contributions
grand+0.113
Grand+0.106
ups+0.105
People+0.104
David+0.101
Neg logit contributions
 bite-0.160
 leap-0.150
 sip-0.150
 bites-0.140
 paws-0.132
Token?"
Token ID1701
Feature activation+8.62e-03

Pos logit contributions
grand+0.050
ups+0.049
People+0.049
Grand+0.047
David+0.046
Neg logit contributions
 bite-0.040
 sip-0.040
 leap-0.037
 squeeze-0.034
 bites-0.031
Token Tim
Token ID5045
Feature activation+1.02e-02

Pos logit contributions
 sip+0.011
 bite+0.010
 leap+0.010
 squeeze+0.009
 bites+0.008
Neg logit contributions
ups-0.023
grand-0.023
People-0.023
ind-0.022
Grand-0.022
Tokenmy
Token ID1820
Feature activation+3.03e-03

Pos logit contributions
 leap+0.025
 sip+0.025
 bite+0.024
 squeeze+0.022
 shock+0.021
Neg logit contributions
ups-0.015
grand-0.015
Grand-0.014
People-0.014
ind-0.014
Token nodded
Token ID14464
Feature activation+1.72e-02

Pos logit contributions
 sip+0.006
 leap+0.006
 bite+0.006
 squeeze+0.005
 breaths+0.005
Neg logit contributions
ups-0.006
grand-0.006
People-0.006
Grand-0.005
ind-0.005
Token was
Token ID373
Feature activation-1.81e-02

Pos logit contributions
grand+0.023
People+0.023
ups+0.023
Grand+0.022
ind+0.022
Neg logit contributions
 sip-0.024
 leap-0.023
 bite-0.023
 squeeze-0.021
 bites-0.020
Token going
Token ID1016
Feature activation-2.03e-02

Pos logit contributions
grand+0.038
People+0.037
ups+0.035
Grand+0.035
ind+0.035
Neg logit contributions
 bite-0.034
 sip-0.033
 leap-0.033
 squeeze-0.030
 bites-0.028
Token to
Token ID284
Feature activation-8.62e-04

Pos logit contributions
grand+0.039
People+0.039
Grand+0.038
ups+0.037
Kat+0.037
Neg logit contributions
 bite-0.040
 leap-0.040
 sip-0.039
 squeeze-0.037
 bites-0.035
Token happen
Token ID1645
Feature activation-2.24e-02

Pos logit contributions
grand+0.002
ups+0.002
People+0.002
Grand+0.002
ind+0.002
Neg logit contributions
 sip-0.001
 bite-0.001
 leap-0.001
 squeeze-0.001
 bites-0.001
Token on
Token ID319
Feature activation-2.63e-02

Pos logit contributions
grand+0.055
People+0.051
ich+0.050
Grand+0.050
ups+0.049
Neg logit contributions
 leap-0.032
 bite-0.032
 sip-0.032
 squeeze-0.029
 bites-0.027
Token Tues
Token ID48496
Feature activation-6.96e-02

Pos logit contributions
grand+0.059
People+0.057
ups+0.056
ich+0.055
Grand+0.055
Neg logit contributions
 sip-0.044
 bite-0.044
 leap-0.043
 squeeze-0.039
 bites-0.037
Tokendays
Token ID12545
Feature activation-3.46e-02

Pos logit contributions
ups+0.143
grand+0.134
People+0.128
Mrs+0.119
Grand+0.115
Neg logit contributions
 sip-0.128
 leap-0.122
 bite-0.121
 squeeze-0.111
 breath-0.109
Token.
Token ID13
Feature activation+7.26e-03

Pos logit contributions
grand+0.082
People+0.081
ich+0.081
ind+0.079
Grand+0.078
Neg logit contributions
 sip-0.049
 leap-0.048
 bite-0.048
 squeeze-0.043
 cushion-0.039
Token
Token ID198
Feature activation-1.44e-02

Pos logit contributions
 sip+0.012
 leap+0.012
 bite+0.011
 squeeze+0.010
 paws+0.010
Neg logit contributions
ups-0.016
grand-0.016
People-0.016
ich-0.016
ind-0.016
Token
Token ID198
Feature activation+4.55e-03

Pos logit contributions
grand+0.031
ups+0.030
ich+0.030
ind+0.029
eline+0.028
Neg logit contributions
 sip-0.025
 bite-0.024
 leap-0.023
 squeeze-0.020
 cushion-0.020
TokenToday
Token ID8888
Feature activation+1.11e-02

Pos logit contributions
 sip+0.008
 bite+0.008
 leap+0.008
 squeeze+0.007
 cushion+0.006
Neg logit contributions
ich-0.010
grand-0.010
ups-0.009
ind-0.009
eline-0.009
Token,
Token ID11
Feature activation-3.50e-02

Pos logit contributions
grand+0.003
People+0.003
ups+0.003
Grand+0.003
we+0.002
Neg logit contributions
 sip-0.002
 leap-0.002
 bite-0.002
 squeeze-0.001
 bites-0.001
Token "
Token ID366
Feature activation-1.57e-02

Pos logit contributions
ups+0.081
grand+0.079
ind+0.077
People+0.073
Kat+0.069
Neg logit contributions
 bite-0.060
 sip-0.059
 leap-0.058
 squeeze-0.054
 paws-0.048
TokenThank
Token ID10449
Feature activation-9.48e-03

Pos logit contributions
ups+0.030
grand+0.028
ind+0.026
ber+0.026
People+0.025
Neg logit contributions
 bite-0.031
 sip-0.031
 leap-0.030
 squeeze-0.029
 lick-0.026
Token you
Token ID345
Feature activation+1.75e-02

Pos logit contributions
grand+0.015
ups+0.015
People+0.015
Grand+0.014
ind+0.014
Neg logit contributions
 sip-0.022
 leap-0.021
 bite-0.021
 squeeze-0.019
 bites-0.019
Token,
Token ID11
Feature activation-1.56e-02

Pos logit contributions
 sip+0.028
 leap+0.027
 bite+0.026
 squeeze+0.023
 bites+0.022
Neg logit contributions
ups-0.041
grand-0.041
Grand-0.040
People-0.040
ind-0.038
Token m
Token ID285
Feature activation-6.94e-02

Pos logit contributions
ups+0.029
grand+0.029
People+0.028
we+0.026
ind+0.026
Neg logit contributions
 sip-0.032
 leap-0.032
 bite-0.032
 squeeze-0.029
 bites-0.027
Tokenister
Token ID1694
Feature activation+3.86e-04

Pos logit contributions
People+0.099
ups+0.096
ind+0.094
grand+0.092
Grand+0.090
Neg logit contributions
 leap-0.158
 bite-0.158
 sip-0.154
 squeeze-0.141
 shove-0.136
Token.
Token ID13
Feature activation-7.90e-03

Pos logit contributions
 sip+0.001
 bite+0.001
 leap+0.001
 squeeze+0.001
 bites+0.000
Neg logit contributions
grand-0.001
People-0.001
ups-0.001
Grand-0.001
ind-0.001
Token You
Token ID921
Feature activation+2.06e-02

Pos logit contributions
grand+0.015
ups+0.015
People+0.015
Grand+0.014
ind+0.014
Neg logit contributions
 sip-0.016
 leap-0.016
 bite-0.016
 squeeze-0.014
 bites-0.013
Token saved
Token ID7448
Feature activation-7.88e-03

Pos logit contributions
 sip+0.038
 leap+0.034
 breaths+0.034
 bite+0.033
 snowball+0.032
Neg logit contributions
People-0.040
grand-0.039
 known-0.038
 ruled-0.035
Grand-0.034
Token the
Token ID262
Feature activation-1.66e-02

Pos logit contributions
grand+0.015
People+0.015
ups+0.014
Grand+0.014
ind+0.013
Neg logit contributions
 sip-0.016
 bite-0.016
 leap-0.016
 squeeze-0.014
 bites-0.014
Token a
Token ID257
Feature activation+3.85e-03

Pos logit contributions
ups+0.013
grand+0.013
People+0.012
ind+0.012
we+0.012
Neg logit contributions
 sip-0.007
 bite-0.006
 leap-0.006
 squeeze-0.005
 bites-0.005
Token bar
Token ID2318
Feature activation-4.59e-02

Pos logit contributions
 sip+0.006
 bite+0.006
 leap+0.006
 squeeze+0.006
 bites+0.005
Neg logit contributions
ups-0.009
grand-0.009
People-0.008
we-0.008
Grand-0.008
Tokenber
Token ID527
Feature activation-2.25e-02

Pos logit contributions
ups+0.040
People+0.038
grand+0.036
David+0.034
Grand+0.031
Neg logit contributions
 bite-0.137
 sip-0.135
 leap-0.131
 bites-0.125
 squeeze-0.124
Token.
Token ID13
Feature activation+6.63e-03

Pos logit contributions
People+0.053
ups+0.052
Kat+0.049
David+0.049
grand+0.049
Neg logit contributions
 bite-0.035
 sip-0.033
 leap-0.031
 squeeze-0.030
 paws-0.029
Token The
Token ID383
Feature activation-3.74e-02

Pos logit contributions
 sip+0.013
 leap+0.013
 bite+0.013
 squeeze+0.011
 bites+0.011
Neg logit contributions
grand-0.013
People-0.013
ups-0.013
Grand-0.012
ind-0.012
Token bar
Token ID2318
Feature activation-6.94e-02

Pos logit contributions
ups+0.099
grand+0.099
People+0.096
we+0.094
David+0.092
Neg logit contributions
 bite-0.050
 sip-0.046
 leap-0.043
 squeeze-0.040
 shock-0.040
Tokenber
Token ID527
Feature activation-1.96e-02

Pos logit contributions
David+0.066
ups+0.066
People+0.063
Kelly+0.061
grand+0.058
Neg logit contributions
 bite-0.186
 sip-0.180
 bites-0.177
 leap-0.169
 squeeze-0.165
Token was
Token ID373
Feature activation-3.40e-02

Pos logit contributions
ups+0.042
People+0.041
grand+0.040
ind+0.040
Kat+0.039
Neg logit contributions
 bite-0.034
 sip-0.032
 leap-0.031
 squeeze-0.030
 bites-0.029
Token a
Token ID257
Feature activation-1.31e-02

Pos logit contributions
grand+0.087
ups+0.087
we+0.084
People+0.082
Kat+0.081
Neg logit contributions
 bite-0.052
 squeeze-0.043
 leap-0.042
 sip-0.042
 push-0.040
Token funny
Token ID8258
Feature activation+4.08e-03

Pos logit contributions
ups+0.029
grand+0.028
People+0.027
we+0.026
ind+0.025
Neg logit contributions
 bite-0.024
 sip-0.023
 leap-0.022
 squeeze-0.021
 shock-0.020
Token man
Token ID582
Feature activation-8.40e-03

Pos logit contributions
 sip+0.008
 leap+0.007
 bite+0.007
 squeeze+0.007
 bites+0.006
Neg logit contributions
grand-0.008
ups-0.008
People-0.008
Grand-0.008
ind-0.008
Token to
Token ID284
Feature activation-1.06e-02

Pos logit contributions
grand+0.027
ich+0.025
ind+0.025
Grand+0.025
People+0.025
Neg logit contributions
 bite-0.021
 sip-0.021
 leap-0.019
 squeeze-0.018
 bites-0.018
Token choose
Token ID3853
Feature activation-4.69e-02

Pos logit contributions
grand+0.025
ups+0.025
People+0.025
Grand+0.024
ind+0.024
Neg logit contributions
 bite-0.017
 sip-0.017
 leap-0.016
 squeeze-0.014
 bites-0.013
Token one
Token ID530
Feature activation-1.01e-02

Pos logit contributions
grand+0.115
People+0.105
ich+0.105
Grand+0.104
cle+0.102
Neg logit contributions
 bite-0.073
 sip-0.070
 leap-0.061
 squeeze-0.060
 bites-0.058
Token from
Token ID422
Feature activation-2.23e-02

Pos logit contributions
grand+0.021
People+0.020
ups+0.020
Grand+0.019
ind+0.019
Neg logit contributions
 sip-0.019
 bite-0.019
 leap-0.019
 squeeze-0.017
 bites-0.016
Token the
Token ID262
Feature activation-2.68e-02

Pos logit contributions
grand+0.050
ups+0.049
People+0.048
Grand+0.045
ich+0.044
Neg logit contributions
 sip-0.039
 bite-0.039
 leap-0.035
 squeeze-0.033
 bites-0.032
Token bakery
Token ID38164
Feature activation-6.92e-02

Pos logit contributions
grand+0.056
ind+0.053
Grand+0.052
People+0.052
ups+0.052
Neg logit contributions
 bite-0.051
 sip-0.047
 leap-0.044
 bites-0.041
 squeeze-0.041
Token.
Token ID13
Feature activation+1.87e-02

Pos logit contributions
People+0.184
grand+0.175
David+0.175
ind+0.170
Grand+0.170
Neg logit contributions
 bite-0.083
 sip-0.077
 leap-0.069
 bites-0.066
 squeeze-0.063
Token Every
Token ID3887
Feature activation+1.54e-02

Pos logit contributions
 leap+0.030
 bite+0.030
 sip+0.030
 breaths+0.025
 squeeze+0.025
Neg logit contributions
People-0.042
grand-0.042
ups-0.041
Grand-0.039
Kat-0.038
Token day
Token ID1110
Feature activation+7.57e-03

Pos logit contributions
 sip+0.036
 leap+0.036
 bite+0.035
 squeeze+0.033
 breaths+0.031
Neg logit contributions
grand-0.024
People-0.023
ups-0.023
Grand-0.021
ind-0.021
Token she
Token ID673
Feature activation-1.21e-02

Pos logit contributions
 sip+0.012
 leap+0.012
 bite+0.012
 squeeze+0.010
 bites+0.010
Neg logit contributions
People-0.017
ups-0.017
grand-0.017
Grand-0.016
ind-0.015
Token went
Token ID1816
Feature activation-9.10e-03

Pos logit contributions
People+0.023
grand+0.023
ups+0.022
ich+0.022
David+0.022
Neg logit contributions
 bite-0.024
 sip-0.024
 leap-0.023
 squeeze-0.022
 bites-0.021
Token a
Token ID257
Feature activation+6.49e-03

Pos logit contributions
grand+0.099
ups+0.097
ind+0.095
ber+0.095
annie+0.094
Neg logit contributions
 bite-0.020
 paws-0.012
 bites-0.011
 leap-0.011
 nap-0.010
Token m
Token ID285
Feature activation-5.55e-02

Pos logit contributions
 sip+0.013
 leap+0.013
 bite+0.013
 squeeze+0.011
 bites+0.011
Neg logit contributions
grand-0.012
ups-0.012
People-0.012
Grand-0.011
ind-0.011
Tokenule
Token ID2261
Feature activation-5.69e-03

Pos logit contributions
People+0.074
ups+0.070
grand+0.069
Grand+0.067
Kat+0.064
Neg logit contributions
 leap-0.139
 bite-0.138
 sip-0.134
 paws-0.125
 squeeze-0.124
Token.
Token ID13
Feature activation+1.38e-02

Pos logit contributions
People+0.011
ups+0.011
Grand+0.011
grand+0.011
Kat+0.010
Neg logit contributions
 bite-0.011
 sip-0.011
 leap-0.010
 squeeze-0.010
 paws-0.009
Token The
Token ID383
Feature activation-2.88e-02

Pos logit contributions
 sip+0.023
 bite+0.022
 leap+0.022
 squeeze+0.020
 paws+0.018
Neg logit contributions
ups-0.033
grand-0.031
People-0.031
we-0.030
ich-0.030
Token m
Token ID285
Feature activation-6.90e-02

Pos logit contributions
grand+0.055
ups+0.055
People+0.052
we+0.051
Kat+0.050
Neg logit contributions
 bite-0.059
 leap-0.056
 sip-0.056
 squeeze-0.052
 bites-0.049
Tokenule
Token ID2261
Feature activation-1.94e-02

Pos logit contributions
ups+0.098
People+0.098
Kat+0.096
grand+0.096
 rightful+0.092
Neg logit contributions
 bite-0.145
 leap-0.144
 sip-0.137
 paws-0.136
 squeeze-0.130
Token was
Token ID373
Feature activation-1.88e-02

Pos logit contributions
ups+0.040
People+0.039
grand+0.037
ind+0.037
Kat+0.037
Neg logit contributions
 bite-0.036
 sip-0.034
 leap-0.033
 squeeze-0.031
 bites-0.031
Token very
Token ID845
Feature activation-3.07e-02

Pos logit contributions
ups+0.048
grand+0.046
we+0.045
People+0.044
Kat+0.042
Neg logit contributions
 bite-0.030
 sip-0.026
 leap-0.025
 squeeze-0.025
 paws-0.023
Token strong
Token ID1913
Feature activation+1.13e-03

Pos logit contributions
ups+0.071
we+0.070
People+0.068
grand+0.067
Kat+0.064
Neg logit contributions
 bite-0.053
 sip-0.049
 leap-0.047
 squeeze-0.045
 bites-0.041
Token and
Token ID290
Feature activation-2.74e-02

Pos logit contributions
 sip+0.002
 leap+0.002
 bite+0.002
 squeeze+0.002
 shove+0.002
Neg logit contributions
ups-0.002
grand-0.002
People-0.002
ind-0.002
Grand-0.002
Token "
Token ID366
Feature activation-1.05e-02

Pos logit contributions
ups+0.073
grand+0.073
People+0.069
ind+0.069
Kat+0.063
Neg logit contributions
 sip-0.067
 bite-0.066
 leap-0.065
 squeeze-0.060
 paws-0.056
TokenI
Token ID40
Feature activation+3.79e-03

Pos logit contributions
ups+0.017
grand+0.016
ind+0.016
ews+0.015
eline+0.015
Neg logit contributions
 bite-0.023
 sip-0.023
 leap-0.022
 squeeze-0.021
 bites-0.020
Token am
Token ID716
Feature activation-2.12e-03

Pos logit contributions
 sip+0.007
 leap+0.007
 bite+0.007
 squeeze+0.006
 bites+0.006
Neg logit contributions
grand-0.008
ups-0.008
People-0.008
Grand-0.007
ind-0.007
Token sorry
Token ID7926
Feature activation+2.29e-02

Pos logit contributions
ups+0.006
grand+0.006
People+0.006
we+0.005
ind+0.005
Neg logit contributions
 bite-0.003
 sip-0.003
 leap-0.003
 squeeze-0.002
 shock-0.002
Token,
Token ID11
Feature activation-1.67e-02

Pos logit contributions
 sip+0.034
 leap+0.033
 bite+0.033
 squeeze+0.027
 bites+0.026
Neg logit contributions
ups-0.056
People-0.056
grand-0.056
Grand-0.053
ind-0.052
Token m
Token ID285
Feature activation-6.89e-02

Pos logit contributions
grand+0.033
People+0.033
ups+0.032
Grand+0.031
ind+0.030
Neg logit contributions
 sip-0.033
 leap-0.032
 bite-0.032
 squeeze-0.028
 bites-0.027
Tokenister
Token ID1694
Feature activation-5.59e-03

Pos logit contributions
People+0.108
ups+0.103
grand+0.100
ind+0.098
Our+0.096
Neg logit contributions
 bite-0.144
 leap-0.142
 sip-0.138
 squeeze-0.128
 shove-0.127
Token.
Token ID13
Feature activation-1.62e-02

Pos logit contributions
People+0.013
grand+0.013
ups+0.013
ind+0.012
Grand+0.012
Neg logit contributions
 bite-0.009
 sip-0.009
 leap-0.008
 squeeze-0.008
 bites-0.007
Token That
Token ID1320
Feature activation+2.21e-03

Pos logit contributions
ups+0.034
People+0.033
grand+0.032
ind+0.032
Grand+0.031
Neg logit contributions
 sip-0.029
 leap-0.029
 bite-0.028
 squeeze-0.026
 paws-0.025
Token is
Token ID318
Feature activation+2.84e-03

Pos logit contributions
 sip+0.004
 bite+0.004
 leap+0.004
 squeeze+0.004
 bites+0.003
Neg logit contributions
People-0.004
grand-0.004
ups-0.004
Grand-0.004
ind-0.004
Token very
Token ID845
Feature activation-1.49e-02

Pos logit contributions
 bite+0.005
 sip+0.005
 leap+0.005
 squeeze+0.004
 bites+0.004
Neg logit contributions
grand-0.006
ups-0.006
People-0.006
we-0.006
ind-0.006
Token than
Token ID621
Feature activation+5.93e-03

Pos logit contributions
grand+0.003
People+0.003
ups+0.003
Grand+0.003
ind+0.003
Neg logit contributions
 sip-0.003
 leap-0.003
 bite-0.003
 squeeze-0.003
 bites-0.003
Token any
Token ID597
Feature activation-1.36e-02

Pos logit contributions
 bite+0.011
 sip+0.011
 leap+0.011
 squeeze+0.010
 bites+0.009
Neg logit contributions
ups-0.012
grand-0.012
People-0.012
Grand-0.012
ind-0.011
Token other
Token ID584
Feature activation-3.06e-02

Pos logit contributions
grand+0.025
ups+0.024
People+0.024
Grand+0.023
ind+0.022
Neg logit contributions
 sip-0.028
 leap-0.028
 bite-0.028
 squeeze-0.025
 bites-0.024
Token donkey
Token ID50085
Feature activation-1.92e-02

Pos logit contributions
grand+0.058
People+0.056
ups+0.055
Grand+0.053
ich+0.051
Neg logit contributions
 bite-0.062
 sip-0.062
 leap-0.062
 squeeze-0.055
 bites-0.055
Token the
Token ID262
Feature activation-2.49e-02

Pos logit contributions
grand+0.044
People+0.044
ups+0.043
Grand+0.042
ind+0.041
Neg logit contributions
 sip-0.031
 bite-0.031
 leap-0.031
 squeeze-0.026
 bites-0.025
Token m
Token ID285
Feature activation-6.89e-02

Pos logit contributions
grand+0.052
People+0.051
ups+0.051
Grand+0.047
we+0.046
Neg logit contributions
 bite-0.046
 sip-0.046
 leap-0.046
 squeeze-0.041
 bites-0.039
Tokenule
Token ID2261
Feature activation-2.06e-02

Pos logit contributions
People+0.104
grand+0.092
bel+0.092
Grand+0.091
ups+0.090
Neg logit contributions
 bite-0.149
 leap-0.146
 sip-0.138
 bites-0.136
 paws-0.135
Token had
Token ID550
Feature activation-1.51e-02

Pos logit contributions
People+0.047
grand+0.046
ups+0.045
Grand+0.044
ind+0.044
Neg logit contributions
 bite-0.034
 sip-0.034
 leap-0.034
 squeeze-0.029
 bites-0.028
Token seen
Token ID1775
Feature activation-4.38e-03

Pos logit contributions
People+0.031
grand+0.031
ups+0.029
Grand+0.029
ind+0.029
Neg logit contributions
 bite-0.029
 sip-0.028
 leap-0.028
 squeeze-0.025
 bites-0.024
Token before
Token ID878
Feature activation-4.47e-03

Pos logit contributions
grand+0.010
People+0.010
ups+0.010
Grand+0.009
ind+0.009
Neg logit contributions
 bite-0.007
 sip-0.007
 leap-0.007
 squeeze-0.006
 bites-0.006
Token.
Token ID13
Feature activation+2.18e-02

Pos logit contributions
grand+0.011
People+0.011
ups+0.011
Grand+0.010
ind+0.010
Neg logit contributions
 bite-0.007
 sip-0.007
 leap-0.007
 squeeze-0.006
 bites-0.005
TokenSuddenly
Token ID38582
Feature activation+3.13e-02

Pos logit contributions
 sip+0.013
 leap+0.012
 bite+0.012
 squeeze+0.011
 knot+0.011
Neg logit contributions
grand-0.011
ich-0.010
ups-0.010
ind-0.010
we-0.010
Token,
Token ID11
Feature activation+1.46e-02

Pos logit contributions
 sip+0.046
 bite+0.045
 leap+0.044
 squeeze+0.038
 bites+0.037
Neg logit contributions
People-0.074
ups-0.074
grand-0.072
Grand-0.070
Kat-0.067
Token one
Token ID530
Feature activation-7.83e-03

Pos logit contributions
 sip+0.030
 leap+0.029
 bite+0.029
 squeeze+0.026
 bites+0.025
Neg logit contributions
grand-0.027
ups-0.027
People-0.027
Grand-0.025
ind-0.025
Token of
Token ID286
Feature activation-2.05e-03

Pos logit contributions
grand+0.014
ups+0.014
People+0.014
Grand+0.013
ind+0.012
Neg logit contributions
 sip-0.017
 leap-0.016
 bite-0.016
 squeeze-0.015
 bites-0.014
Token the
Token ID262
Feature activation-1.27e-02

Pos logit contributions
grand+0.004
ups+0.004
People+0.004
Grand+0.004
ind+0.003
Neg logit contributions
 sip-0.004
 leap-0.004
 bite-0.004
 squeeze-0.004
 bites-0.004
Token gor
Token ID30344
Feature activation-6.88e-02

Pos logit contributions
grand+0.024
People+0.023
ups+0.023
Grand+0.022
ind+0.021
Neg logit contributions
 sip-0.026
 bite-0.026
 leap-0.026
 squeeze-0.023
 bites-0.022
Tokenillas
Token ID25314
Feature activation-9.84e-03

Pos logit contributions
grand+0.122
Grand+0.108
ups+0.108
People+0.107
Kat+0.096
Neg logit contributions
 bite-0.149
 leap-0.147
 sip-0.144
 breaths-0.131
 bites-0.131
Token reached
Token ID4251
Feature activation+1.97e-02

Pos logit contributions
grand+0.019
ups+0.019
People+0.019
Grand+0.018
ind+0.018
Neg logit contributions
 sip-0.019
 leap-0.019
 bite-0.019
 squeeze-0.017
 bites-0.016
Token out
Token ID503
Feature activation+1.68e-02

Pos logit contributions
 sip+0.033
 leap+0.032
 bite+0.032
 squeeze+0.028
 bites+0.027
Neg logit contributions
grand-0.046
People-0.045
ups-0.044
Grand-0.042
ind-0.041
Token his
Token ID465
Feature activation+1.36e-02

Pos logit contributions
 sip+0.034
 leap+0.034
 bite+0.034
 squeeze+0.030
 bites+0.029
Neg logit contributions
grand-0.031
ups-0.031
People-0.031
Grand-0.029
ind-0.029
Token furry
Token ID40857
Feature activation+3.76e-02

Pos logit contributions
 sip+0.020
 leap+0.020
 bite+0.019
 squeeze+0.016
 bites+0.015
Neg logit contributions
grand-0.033
ups-0.033
People-0.032
Grand-0.031
ind-0.031
Token
Token ID198
Feature activation-5.40e-03

Pos logit contributions
grand+0.002
People+0.002
ups+0.002
Grand+0.002
ind+0.002
Neg logit contributions
 sip-0.002
 leap-0.002
 bite-0.002
 squeeze-0.002
 bites-0.002
Token
Token ID198
Feature activation+4.37e-03

Pos logit contributions
grand+0.011
ups+0.010
ind+0.010
ich+0.010
People+0.010
Neg logit contributions
 sip-0.011
 bite-0.010
 leap-0.010
 squeeze-0.009
 bites-0.009
Token"
Token ID1
Feature activation-8.20e-03

Pos logit contributions
 sip+0.009
 bite+0.009
 leap+0.008
 squeeze+0.008
 bites+0.008
Neg logit contributions
grand-0.009
ind-0.008
ich-0.008
ups-0.007
People-0.007
TokenSorry
Token ID14385
Feature activation+8.41e-03

Pos logit contributions
grand+0.015
ups+0.014
ind+0.014
People+0.013
ich+0.013
Neg logit contributions
 sip-0.018
 bite-0.017
 leap-0.017
 squeeze-0.016
 bites-0.015
Token,
Token ID11
Feature activation-7.64e-03

Pos logit contributions
 sip+0.011
 leap+0.011
 bite+0.010
 squeeze+0.009
 bites+0.008
Neg logit contributions
ups-0.022
grand-0.022
People-0.022
Grand-0.021
ind-0.021
Token m
Token ID285
Feature activation-6.88e-02

Pos logit contributions
People+0.014
grand+0.014
ups+0.014
Grand+0.013
ich+0.013
Neg logit contributions
 sip-0.015
 leap-0.015
 bite-0.015
 squeeze-0.013
 breaths-0.013
Tokenister
Token ID1694
Feature activation-1.22e-02

Pos logit contributions
People+0.104
grand+0.097
Kat+0.096
ind+0.096
Grand+0.096
Neg logit contributions
 leap-0.149
 bite-0.147
 sip-0.143
 squeeze-0.131
 shove-0.131
Token.
Token ID13
Feature activation-3.13e-02

Pos logit contributions
grand+0.030
ups+0.029
People+0.029
Grand+0.028
ind+0.027
Neg logit contributions
 sip-0.018
 leap-0.018
 bite-0.017
 squeeze-0.015
 bites-0.014
Token We
Token ID775
Feature activation-2.85e-03

Pos logit contributions
ups+0.067
ind+0.064
People+0.064
grand+0.063
Kat+0.062
Neg logit contributions
 sip-0.054
 bite-0.051
 leap-0.051
 squeeze-0.049
 paws-0.047
Token didn
Token ID1422
Feature activation+2.40e-02

Pos logit contributions
grand+0.005
People+0.004
ups+0.004
 known+0.004
ind+0.004
Neg logit contributions
 leap-0.006
 sip-0.006
 breaths-0.006
 bite-0.005
 cushion-0.005
Token't
Token ID470
Feature activation+8.82e-03

Pos logit contributions
 leap+0.049
 sip+0.049
 bite+0.047
 squeeze+0.043
 breaths+0.042
Neg logit contributions
grand-0.044
People-0.044
ups-0.044
Grand-0.042
ind-0.040
Token
Token ID198
Feature activation+1.21e-02

Pos logit contributions
ups+0.011
grand+0.011
ind+0.011
ich+0.011
we+0.010
Neg logit contributions
 sip-0.009
 bite-0.009
 leap-0.009
 squeeze-0.008
 Carl-0.007
TokenMaria
Token ID46827
Feature activation-3.81e-03

Pos logit contributions
 bite+0.025
 sip+0.025
 leap+0.024
 squeeze+0.023
 shove+0.021
Neg logit contributions
ich-0.021
grand-0.020
ups-0.020
ind-0.020
ber-0.019
Token:
Token ID25
Feature activation-2.51e-02

Pos logit contributions
grand+0.008
People+0.008
ups+0.008
Grand+0.007
ind+0.007
Neg logit contributions
 sip-0.007
 bite-0.007
 leap-0.007
 squeeze-0.006
 bites-0.006
Token Please
Token ID4222
Feature activation-2.63e-03

Pos logit contributions
ups+0.060
grand+0.056
ind+0.053
People+0.052
we+0.050
Neg logit contributions
 bite-0.043
 leap-0.042
 sip-0.042
 squeeze-0.039
 cushion-0.036
Token,
Token ID11
Feature activation-3.02e-02

Pos logit contributions
grand+0.006
People+0.006
ups+0.006
Grand+0.006
ind+0.006
Neg logit contributions
 sip-0.004
 leap-0.004
 bite-0.004
 bites-0.003
 squeeze-0.003
Token m
Token ID285
Feature activation-6.87e-02

Pos logit contributions
ups+0.070
ind+0.066
grand+0.066
People+0.066
we+0.064
Neg logit contributions
 sip-0.052
 bite-0.051
 leap-0.050
 squeeze-0.046
 shock-0.041
Tokenister
Token ID1694
Feature activation-3.98e-03

Pos logit contributions
People+0.107
ups+0.102
Our+0.101
grand+0.100
Grand+0.095
Neg logit contributions
 leap-0.143
 bite-0.140
 sip-0.132
 squeeze-0.125
 shove-0.124
Token,
Token ID11
Feature activation-3.22e-02

Pos logit contributions
People+0.011
grand+0.011
ups+0.011
Grand+0.010
ind+0.010
Neg logit contributions
 sip-0.005
 bite-0.005
 leap-0.005
 squeeze-0.004
 bites-0.004
Token can
Token ID460
Feature activation+9.78e-03

Pos logit contributions
ups+0.076
ind+0.074
grand+0.074
People+0.073
ich+0.071
Neg logit contributions
 bite-0.051
 sip-0.051
 leap-0.050
 squeeze-0.045
 bites-0.040
Token you
Token ID345
Feature activation+2.25e-02

Pos logit contributions
 bite+0.021
 leap+0.020
 sip+0.020
 squeeze+0.019
 bites+0.017
Neg logit contributions
ups-0.018
grand-0.017
People-0.017
ind-0.016
Grand-0.016
Token help
Token ID1037
Feature activation-4.10e-03

Pos logit contributions
 sip+0.034
 leap+0.034
 bite+0.033
 squeeze+0.028
 bites+0.027
Neg logit contributions
grand-0.054
ups-0.053
People-0.053
Grand-0.050
ind-0.050
Token\n
Token ID198
Feature activation-1.42e-02

Pos logit contributions
grand+0.010
ups+0.010
ind+0.009
People+0.009
ich+0.009
Neg logit contributions
 sip-0.007
 bite-0.007
 leap-0.007
 squeeze-0.006
 paws-0.006
Token\n
Token ID198
Feature activation-2.14e-03

Pos logit contributions
grand+0.029
ups+0.029
ind+0.028
People+0.027
ich+0.027
Neg logit contributions
 sip-0.026
 bite-0.026
 leap-0.024
 squeeze-0.023
 bites-0.021
Token"
Token ID1
Feature activation-4.73e-03

Pos logit contributions
grand+0.004
ind+0.004
ups+0.004
ich+0.004
essa+0.004
Neg logit contributions
 sip-0.004
 bite-0.004
 leap-0.004
 squeeze-0.004
 shove-0.003
TokenSorry
Token ID14385
Feature activation+1.12e-02

Pos logit contributions
grand+0.008
ups+0.008
People+0.008
ind+0.008
ich+0.007
Neg logit contributions
 sip-0.011
 bite-0.010
 leap-0.010
 squeeze-0.009
 bites-0.009
Token,
Token ID11
Feature activation-1.85e-02

Pos logit contributions
 sip+0.016
 leap+0.015
 bite+0.015
 squeeze+0.012
 bites+0.012
Neg logit contributions
ups-0.028
grand-0.028
People-0.028
Grand-0.027
ind-0.026
Token m
Token ID285
Feature activation-6.87e-02

Pos logit contributions
People+0.035
grand+0.035
ups+0.035
Grand+0.033
ich+0.032
Neg logit contributions
 sip-0.037
 leap-0.036
 bite-0.035
 squeeze-0.032
 breaths-0.031
Tokenister
Token ID1694
Feature activation-8.71e-03

Pos logit contributions
People+0.097
grand+0.096
ups+0.091
ind+0.091
Grand+0.089
Neg logit contributions
 leap-0.157
 bite-0.153
 sip-0.151
 shove-0.140
 squeeze-0.139
Token,
Token ID11
Feature activation-2.14e-02

Pos logit contributions
grand+0.021
ups+0.021
People+0.021
Grand+0.020
ind+0.020
Neg logit contributions
 sip-0.013
 leap-0.013
 bite-0.012
 squeeze-0.011
 bites-0.010
Token we
Token ID356
Feature activation-2.85e-03

Pos logit contributions
grand+0.038
People+0.037
ups+0.037
Grand+0.035
ind+0.034
Neg logit contributions
 sip-0.046
 leap-0.046
 bite-0.045
 squeeze-0.041
 bites-0.039
Token didn
Token ID1422
Feature activation+3.59e-02

Pos logit contributions
grand+0.005
ups+0.005
People+0.004
 known+0.004
eline+0.004
Neg logit contributions
 leap-0.006
 sip-0.006
 breaths-0.006
 bite-0.005
 knot-0.005
Token't
Token ID470
Feature activation+2.51e-02

Pos logit contributions
 leap+0.082
 sip+0.082
 bite+0.076
 breaths+0.073
 paws+0.072
Neg logit contributions
grand-0.056
People-0.055
ups-0.054
Grand-0.053
we-0.049
Token suddenly
Token ID6451
Feature activation+2.51e-02

Pos logit contributions
 sip+0.016
 leap+0.016
 bite+0.016
 squeeze+0.015
 paws+0.014
Neg logit contributions
grand-0.014
ups-0.014
People-0.014
ind-0.014
Grand-0.013
Token,
Token ID11
Feature activation+4.01e-03

Pos logit contributions
 sip+0.035
 bite+0.034
 leap+0.033
 squeeze+0.028
 bites+0.027
Neg logit contributions
ups-0.063
People-0.062
grand-0.062
Grand-0.059
ind-0.057
Token she
Token ID673
Feature activation-1.57e-02

Pos logit contributions
 sip+0.008
 leap+0.008
 bite+0.008
 squeeze+0.007
 bites+0.007
Neg logit contributions
grand-0.007
ups-0.007
People-0.007
Grand-0.007
ind-0.007
Token heard
Token ID2982
Feature activation+4.46e-03

Pos logit contributions
grand+0.031
People+0.030
ups+0.030
Grand+0.028
ind+0.028
Neg logit contributions
 sip-0.031
 leap-0.031
 bite-0.031
 squeeze-0.028
 bites-0.026
Token a
Token ID257
Feature activation+3.07e-02

Pos logit contributions
 sip+0.007
 leap+0.007
 bite+0.007
 squeeze+0.006
 bites+0.006
Neg logit contributions
grand-0.010
ups-0.010
People-0.010
ind-0.010
ich-0.010
Token loud
Token ID7812
Feature activation+5.26e-02

Pos logit contributions
 sip+0.053
 leap+0.052
 bite+0.051
 squeeze+0.045
 bites+0.043
Neg logit contributions
grand-0.066
People-0.066
ups-0.066
Grand-0.063
ind-0.062
Token noise
Token ID7838
Feature activation+2.80e-02

Pos logit contributions
 sip+0.085
 leap+0.082
 bite+0.077
 paws+0.071
 bites+0.070
Neg logit contributions
grand-0.104
People-0.104
Grand-0.103
ind-0.102
we-0.101
Token coming
Token ID2406
Feature activation+2.08e-02

Pos logit contributions
 sip+0.056
 leap+0.055
 bite+0.054
 squeeze+0.049
 bites+0.047
Neg logit contributions
grand-0.053
ups-0.053
People-0.053
Grand-0.050
ind-0.049
Token from
Token ID422
Feature activation+1.97e-02

Pos logit contributions
 sip+0.049
 leap+0.048
 bite+0.047
 bites+0.043
 squeeze+0.043
Neg logit contributions
ups-0.031
People-0.031
grand-0.031
Grand-0.030
ind-0.029
Token the
Token ID262
Feature activation-1.40e-02

Pos logit contributions
 sip+0.040
 leap+0.039
 bite+0.039
 squeeze+0.035
 bites+0.034
Neg logit contributions
grand-0.038
ups-0.037
People-0.037
Grand-0.035
ind-0.034
Token sky
Token ID6766
Feature activation-1.07e-02

Pos logit contributions
grand+0.030
ups+0.030
People+0.030
Grand+0.028
we+0.028
Neg logit contributions
 sip-0.025
 bite-0.025
 leap-0.025
 squeeze-0.022
 bites-0.021
Token glass
Token ID5405
Feature activation+2.21e-02

Pos logit contributions
 sip+0.037
 leap+0.037
 bite+0.035
 squeeze+0.031
 breaths+0.030
Neg logit contributions
grand-0.041
ups-0.041
People-0.040
Grand-0.039
ind-0.038
Token of
Token ID286
Feature activation+1.21e-03

Pos logit contributions
 leap+0.046
 sip+0.043
 bite+0.040
 breaths+0.039
 paws+0.038
Neg logit contributions
ups-0.039
People-0.034
we-0.032
Grand-0.032
ind-0.032
Token milk
Token ID7545
Feature activation+4.67e-03

Pos logit contributions
 leap+0.002
 sip+0.002
 bite+0.002
 squeeze+0.002
 shove+0.002
Neg logit contributions
grand-0.003
People-0.003
ups-0.002
Grand-0.002
ind-0.002
Token and
Token ID290
Feature activation-6.82e-04

Pos logit contributions
 leap+0.010
 sip+0.010
 bite+0.009
 breaths+0.009
 squeeze+0.008
Neg logit contributions
ups-0.008
People-0.007
we-0.007
Grand-0.007
ich-0.007
Token knocks
Token ID36539
Feature activation+7.23e-03

Pos logit contributions
ups+0.001
People+0.001
grand+0.001
Grand+0.001
we+0.001
Neg logit contributions
 leap-0.001
 sip-0.001
 bite-0.001
 squeeze-0.001
 paws-0.001
Token it
Token ID340
Feature activation+5.04e-02

Pos logit contributions
 sip+0.013
 leap+0.013
 bite+0.013
 squeeze+0.011
 bites+0.011
Neg logit contributions
grand-0.015
ups-0.015
People-0.015
Grand-0.014
ind-0.014
Token over
Token ID625
Feature activation+6.09e-03

Pos logit contributions
 leap+0.110
 sip+0.097
 bite+0.089
 knot+0.088
 breaths+0.087
Neg logit contributions
ups-0.089
People-0.079
we-0.078
bel-0.077
Grand-0.075
Token.
Token ID13
Feature activation+1.51e-03

Pos logit contributions
 sip+0.012
 leap+0.011
 bite+0.011
 squeeze+0.010
 bites+0.010
Neg logit contributions
grand-0.012
ups-0.012
People-0.012
Grand-0.012
ind-0.011
Token The
Token ID383
Feature activation+3.33e-03

Pos logit contributions
 sip+0.003
 leap+0.003
 bite+0.003
 squeeze+0.003
 bites+0.003
Neg logit contributions
grand-0.003
ups-0.003
People-0.003
Grand-0.003
ind-0.003
Token milk
Token ID7545
Feature activation+1.17e-02

Pos logit contributions
 leap+0.005
 sip+0.005
 bite+0.005
 squeeze+0.004
 breaths+0.004
Neg logit contributions
grand-0.008
ups-0.008
People-0.008
Grand-0.007
ich-0.007
Token spills
Token ID42123
Feature activation+2.92e-02

Pos logit contributions
 leap+0.017
 sip+0.016
 bite+0.015
 squeeze+0.013
 breaths+0.013
Neg logit contributions
ups-0.028
People-0.026
we-0.026
Grand-0.025
grand-0.025
Token wipes
Token ID46142
Feature activation+3.01e-02

Pos logit contributions
 leap+0.005
 sip+0.005
 bite+0.005
 shove+0.004
 squeeze+0.004
Neg logit contributions
ups-0.006
People-0.006
grand-0.006
ind-0.006
Grand-0.006
Token her
Token ID607
Feature activation+1.80e-02

Pos logit contributions
 leap+0.078
 sip+0.074
 bite+0.073
 squeeze+0.066
 shove+0.065
Neg logit contributions
ups-0.040
grand-0.039
People-0.037
Grand-0.037
ind-0.036
Token tears
Token ID10953
Feature activation+2.78e-02

Pos logit contributions
 leap+0.029
 sip+0.027
 bite+0.026
 squeeze+0.022
 shove+0.021
Neg logit contributions
ups-0.041
grand-0.039
People-0.039
Grand-0.038
ind-0.037
Token.
Token ID13
Feature activation-3.40e-03

Pos logit contributions
 leap+0.058
 sip+0.056
 bite+0.055
 squeeze+0.049
 bites+0.048
Neg logit contributions
ups-0.051
grand-0.049
People-0.048
Grand-0.047
ind-0.046
Token He
Token ID679
Feature activation+5.81e-03

Pos logit contributions
grand+0.007
ups+0.007
People+0.006
ind+0.006
ich+0.006
Neg logit contributions
 sip-0.007
 bite-0.007
 leap-0.007
 squeeze-0.006
 bites-0.006
Token takes
Token ID2753
Feature activation+5.03e-02

Pos logit contributions
 leap+0.010
 sip+0.010
 bite+0.009
 shove+0.008
 squeeze+0.008
Neg logit contributions
ups-0.013
grand-0.012
People-0.012
ind-0.011
Grand-0.011
Token a
Token ID257
Feature activation+2.09e-02

Pos logit contributions
 leap+0.124
 sip+0.117
 bite+0.113
 squeeze+0.102
 shove+0.102
Neg logit contributions
ups-0.075
grand-0.067
People-0.065
Grand-0.064
ind-0.059
Token band
Token ID4097
Feature activation+1.51e-02

Pos logit contributions
 leap+0.034
 breaths+0.030
 sip+0.030
 bite+0.028
 paws+0.028
Neg logit contributions
grand-0.045
ups-0.043
Grand-0.043
People-0.041
bel-0.041
Token-
Token ID12
Feature activation+7.44e-03

Pos logit contributions
 sip+0.034
 leap+0.034
 bite+0.033
 squeeze+0.030
 bites+0.030
Neg logit contributions
grand-0.025
ups-0.024
People-0.024
Grand-0.023
ind-0.022
Tokenaid
Token ID1698
Feature activation+9.97e-03

Pos logit contributions
 sip+0.013
 bite+0.013
 leap+0.012
 squeeze+0.011
 bites+0.010
Neg logit contributions
grand-0.016
People-0.016
ups-0.016
ind-0.015
ich-0.015
Token from
Token ID422
Feature activation+2.26e-02

Pos logit contributions
 sip+0.020
 leap+0.020
 bite+0.019
 squeeze+0.017
 bites+0.017
Neg logit contributions
ups-0.019
grand-0.019
People-0.018
Grand-0.018
ind-0.017
Token\n
Token ID198
Feature activation-1.98e-03

Pos logit contributions
grand+0.036
ups+0.036
ich+0.034
People+0.034
ind+0.033
Neg logit contributions
 sip-0.033
 bite-0.032
 leap-0.031
 squeeze-0.027
 bites-0.026
TokenShe
Token ID3347
Feature activation-7.41e-03

Pos logit contributions
grand+0.003
ups+0.003
ich+0.003
People+0.003
ind+0.003
Neg logit contributions
 sip-0.005
 bite-0.004
 leap-0.004
 squeeze-0.004
 knot-0.004
Token carefully
Token ID7773
Feature activation+6.69e-03

Pos logit contributions
grand+0.015
People+0.015
ups+0.015
Grand+0.014
ind+0.014
Neg logit contributions
 sip-0.014
 leap-0.014
 bite-0.014
 squeeze-0.012
 bites-0.012
Token scoop
Token ID23529
Feature activation+1.17e-02

Pos logit contributions
 leap+0.013
 sip+0.013
 bite+0.012
 cushion+0.011
 shock+0.011
Neg logit contributions
ups-0.013
grand-0.012
People-0.012
Grand-0.012
ind-0.011
Tokened
Token ID276
Feature activation+2.84e-02

Pos logit contributions
 leap+0.024
 sip+0.024
 bite+0.023
 squeeze+0.021
 cushion+0.020
Neg logit contributions
ups-0.022
People-0.021
we-0.020
grand-0.020
Grand-0.019
Token it
Token ID340
Feature activation+4.66e-02

Pos logit contributions
 leap+0.058
 sip+0.057
 bite+0.056
 squeeze+0.051
 shove+0.048
Neg logit contributions
ups-0.052
People-0.047
grand-0.047
ind-0.046
we-0.045
Token up
Token ID510
Feature activation+8.16e-04

Pos logit contributions
 leap+0.099
 sip+0.097
 bite+0.092
 breaths+0.087
 bang+0.087
Neg logit contributions
ups-0.078
ind-0.066
we-0.065
grand-0.065
bel-0.064
Token with
Token ID351
Feature activation+2.83e-02

Pos logit contributions
 sip+0.002
 leap+0.002
 bite+0.001
 squeeze+0.001
 bites+0.001
Neg logit contributions
grand-0.002
ups-0.002
People-0.002
Grand-0.002
ind-0.002
Token a
Token ID257
Feature activation+4.25e-02

Pos logit contributions
 leap+0.069
 sip+0.068
 bite+0.067
 breaths+0.062
 shove+0.062
Neg logit contributions
ups-0.032
People-0.031
Grand-0.030
Kat-0.028
grand-0.028
Token leaf
Token ID12835
Feature activation+2.11e-02

Pos logit contributions
 breaths+0.067
 leap+0.065
 sip+0.062
 bite+0.059
 bites+0.059
Neg logit contributions
Grand-0.081
grand-0.077
People-0.077
God-0.075
ups-0.075
Token and
Token ID290
Feature activation+1.27e-02

Pos logit contributions
 sip+0.041
 leap+0.041
 bite+0.039
 bites+0.035
 squeeze+0.035
Neg logit contributions
ups-0.040
grand-0.039
People-0.037
ich-0.036
we-0.036
Token time
Token ID640
Feature activation+1.69e-02

Pos logit contributions
 leap+0.062
 sip+0.062
 breaths+0.057
 bite+0.057
 squeeze+0.054
Neg logit contributions
People-0.051
Grand-0.051
ind-0.050
grand-0.049
ups-0.048
Token,
Token ID11
Feature activation-1.99e-02

Pos logit contributions
 sip+0.027
 leap+0.025
 bite+0.025
 squeeze+0.022
 breaths+0.022
Neg logit contributions
People-0.039
grand-0.038
ups-0.037
Grand-0.037
ich-0.036
Token there
Token ID612
Feature activation-4.03e-03

Pos logit contributions
People+0.032
grand+0.032
ups+0.032
Grand+0.030
ind+0.029
Neg logit contributions
 sip-0.045
 leap-0.045
 bite-0.044
 squeeze-0.040
 bites-0.039
Token was
Token ID373
Feature activation+1.00e-02

Pos logit contributions
grand+0.008
ups+0.008
People+0.008
ind+0.007
Grand+0.007
Neg logit contributions
 sip-0.008
 bite-0.008
 leap-0.008
 squeeze-0.007
 bites-0.006
Token a
Token ID257
Feature activation+8.30e-03

Pos logit contributions
 sip+0.017
 leap+0.017
 bite+0.017
 squeeze+0.015
 bites+0.014
Neg logit contributions
grand-0.022
People-0.022
ups-0.021
Grand-0.021
ind-0.020
Token cute
Token ID13779
Feature activation+4.67e-02

Pos logit contributions
 sip+0.013
 leap+0.013
 bite+0.013
 squeeze+0.011
 bites+0.011
Neg logit contributions
grand-0.019
ups-0.019
People-0.019
Grand-0.018
ind-0.017
Token little
Token ID1310
Feature activation+3.74e-02

Pos logit contributions
 leap+0.084
 sip+0.083
 bite+0.081
 breaths+0.071
 squeeze+0.071
Neg logit contributions
grand-0.095
ind-0.094
People-0.094
ich-0.092
Grand-0.091
Token t
Token ID256
Feature activation-2.47e-02

Pos logit contributions
 leap+0.081
 sip+0.081
 bite+0.078
 shove+0.073
 squeeze+0.071
Neg logit contributions
ind-0.060
ich-0.059
grand-0.057
People-0.056
Grand-0.055
Tokeneddy
Token ID21874
Feature activation+3.76e-02

Pos logit contributions
People+0.030
ups+0.029
grand+0.027
 known+0.026
David+0.026
Neg logit contributions
 sip-0.061
 bite-0.060
 leap-0.057
 cushion-0.056
 squeeze-0.054
Token bear
Token ID6842
Feature activation+3.19e-03

Pos logit contributions
 sip+0.033
 bite+0.032
 leap+0.031
 squeeze+0.024
 bites+0.021
Neg logit contributions
grand-0.114
ups-0.114
People-0.113
Grand-0.108
ind-0.107
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
 bite+0.006
 sip+0.006
 leap+0.005
 squeeze+0.005
 paws+0.005
Neg logit contributions
ups-0.007
People-0.007
grand-0.006
David-0.006
Kat-0.006
Token dish
Token ID9433
Feature activation-2.02e-02

Pos logit contributions
 sip+0.013
 leap+0.012
 bite+0.012
 squeeze+0.011
 bites+0.010
Neg logit contributions
grand-0.014
ups-0.014
People-0.014
Grand-0.013
ind-0.013
Token.
Token ID13
Feature activation+8.98e-03

Pos logit contributions
People+0.044
grand+0.043
ups+0.042
Grand+0.041
ind+0.040
Neg logit contributions
 sip-0.036
 bite-0.035
 leap-0.034
 squeeze-0.031
 bites-0.029
Token Lily
Token ID20037
Feature activation+1.68e-02

Pos logit contributions
 sip+0.014
 leap+0.013
 bite+0.013
 squeeze+0.012
 paws+0.012
Neg logit contributions
ups-0.022
ind-0.020
grand-0.020
People-0.020
ich-0.020
Token ate
Token ID15063
Feature activation+1.84e-03

Pos logit contributions
 sip+0.037
 leap+0.037
 bite+0.036
 squeeze+0.033
 bites+0.031
Neg logit contributions
grand-0.028
ups-0.028
People-0.028
Grand-0.026
ind-0.025
Token it
Token ID340
Feature activation+5.04e-02

Pos logit contributions
 leap+0.004
 sip+0.004
 bite+0.004
 squeeze+0.004
 shove+0.004
Neg logit contributions
ups-0.003
People-0.003
Grand-0.003
grand-0.003
we-0.002
Token all
Token ID477
Feature activation+2.12e-02

Pos logit contributions
 leap+0.103
 sip+0.102
 bite+0.096
 shock+0.094
 breaths+0.090
Neg logit contributions
ups-0.086
grand-0.077
People-0.076
we-0.075
ind-0.075
Token up
Token ID510
Feature activation+1.54e-03

Pos logit contributions
 leap+0.040
 sip+0.040
 bite+0.038
 squeeze+0.035
 bites+0.033
Neg logit contributions
grand-0.043
ups-0.043
People-0.042
Grand-0.040
ind-0.039
Token,
Token ID11
Feature activation-3.47e-02

Pos logit contributions
 leap+0.003
 sip+0.003
 bite+0.003
 squeeze+0.002
 bites+0.002
Neg logit contributions
ups-0.003
grand-0.003
People-0.003
Grand-0.003
ind-0.003
Token but
Token ID475
Feature activation-1.38e-03

Pos logit contributions
grand+0.087
ups+0.083
People+0.082
ind+0.076
Grand+0.075
Neg logit contributions
 bite-0.055
 sip-0.055
 leap-0.052
 squeeze-0.048
 bites-0.044
Token then
Token ID788
Feature activation+2.83e-03

Pos logit contributions
grand+0.003
People+0.003
ups+0.003
ind+0.003
Grand+0.003
Neg logit contributions
 sip-0.003
 bite-0.003
 leap-0.002
 squeeze-0.002
 paws-0.002
Token her
Token ID607
Feature activation-1.85e-02

Pos logit contributions
 sip+0.006
 leap+0.006
 bite+0.006
 squeeze+0.005
 bites+0.005
Neg logit contributions
grand-0.005
People-0.005
ups-0.005
ind-0.005
Grand-0.005
Token spoon
Token ID24556
Feature activation-3.60e-03

Pos logit contributions
 leap+0.037
 sip+0.037
 breaths+0.034
 bite+0.033
 bites+0.030
Neg logit contributions
People-0.061
ups-0.061
Grand-0.060
grand-0.059
ind-0.059
Token on
Token ID319
Feature activation+1.17e-02

Pos logit contributions
grand+0.008
People+0.008
ups+0.008
Grand+0.008
ind+0.008
Neg logit contributions
 sip-0.006
 bite-0.006
 leap-0.006
 squeeze-0.005
 bites-0.005
Token their
Token ID511
Feature activation+6.22e-03

Pos logit contributions
 sip+0.023
 bite+0.023
 leap+0.022
 squeeze+0.020
 bites+0.019
Neg logit contributions
grand-0.023
ups-0.023
People-0.023
Grand-0.021
ind-0.021
Token nose
Token ID9686
Feature activation+1.10e-02

Pos logit contributions
 sip+0.010
 leap+0.010
 bite+0.010
 squeeze+0.008
 bites+0.008
Neg logit contributions
ups-0.014
grand-0.014
People-0.014
Grand-0.013
ind-0.013
Token.
Token ID13
Feature activation-7.62e-03

Pos logit contributions
 sip+0.018
 leap+0.018
 bite+0.018
 squeeze+0.016
 bites+0.015
Neg logit contributions
grand-0.025
People-0.024
ups-0.024
Grand-0.023
ind-0.023
Token It
Token ID632
Feature activation+2.26e-02

Pos logit contributions
ups+0.018
grand+0.017
ind+0.017
People+0.017
ich+0.016
Neg logit contributions
 sip-0.012
 leap-0.011
 bite-0.011
 squeeze-0.010
 paws-0.010
Token's
Token ID338
Feature activation-7.59e-03

Pos logit contributions
 leap+0.043
 sip+0.042
 bite+0.039
 breath+0.036
 breaths+0.036
Neg logit contributions
ups-0.044
grand-0.041
People-0.041
we-0.039
ind-0.038
Token just
Token ID655
Feature activation-5.75e-03

Pos logit contributions
ups+0.014
grand+0.014
People+0.014
Grand+0.013
ind+0.013
Neg logit contributions
 sip-0.015
 leap-0.015
 bite-0.015
 squeeze-0.013
 bites-0.013
Token a
Token ID257
Feature activation+1.53e-02

Pos logit contributions
grand+0.012
ups+0.012
People+0.011
Grand+0.011
ind+0.011
Neg logit contributions
 sip-0.010
 leap-0.010
 bite-0.010
 squeeze-0.009
 bites-0.008
Token spoon
Token ID24556
Feature activation-1.06e-02

Pos logit contributions
 sip+0.024
 leap+0.024
 bite+0.022
 breaths+0.020
 squeeze+0.019
Neg logit contributions
ups-0.033
grand-0.033
People-0.033
Grand-0.033
ind-0.031
Token.
Token ID13
Feature activation-8.61e-03

Pos logit contributions
grand+0.024
People+0.024
ups+0.024
Grand+0.023
ind+0.022
Neg logit contributions
 sip-0.017
 leap-0.017
 bite-0.017
 squeeze-0.015
 bites-0.014
Token the
Token ID262
Feature activation-8.01e-03

Pos logit contributions
 sip+0.008
 bite+0.008
 leap+0.008
 squeeze+0.007
 bites+0.007
Neg logit contributions
grand-0.010
People-0.010
ups-0.010
ind-0.009
Grand-0.009
Token pie
Token ID2508
Feature activation-2.09e-02

Pos logit contributions
grand+0.017
ups+0.017
People+0.017
Grand+0.016
ind+0.016
Neg logit contributions
 sip-0.014
 bite-0.014
 leap-0.014
 squeeze-0.012
 bites-0.012
Token.
Token ID13
Feature activation-1.21e-02

Pos logit contributions
grand+0.046
People+0.045
ups+0.044
Grand+0.042
ind+0.042
Neg logit contributions
 sip-0.037
 bite-0.037
 leap-0.036
 squeeze-0.032
 bites-0.031
Token It
Token ID632
Feature activation+8.30e-03

Pos logit contributions
ups+0.030
grand+0.030
ind+0.029
People+0.028
ich+0.028
Neg logit contributions
 sip-0.019
 bite-0.017
 leap-0.017
 squeeze-0.016
 paws-0.015
Token is
Token ID318
Feature activation-1.09e-02

Pos logit contributions
 leap+0.014
 sip+0.014
 bite+0.013
 squeeze+0.012
 breaths+0.012
Neg logit contributions
ups-0.018
People-0.018
grand-0.017
Grand-0.017
ind-0.016
Token brown
Token ID7586
Feature activation+2.79e-02

Pos logit contributions
grand+0.028
ups+0.026
People+0.026
ind+0.025
we+0.024
Neg logit contributions
 bite-0.018
 sip-0.017
 leap-0.016
 squeeze-0.015
 bites-0.014
Token and
Token ID290
Feature activation-1.89e-02

Pos logit contributions
 leap+0.062
 sip+0.061
 bite+0.057
 squeeze+0.053
 shove+0.053
Neg logit contributions
ups-0.047
People-0.043
we-0.043
grand-0.043
ind-0.043
Token bub
Token ID10015
Feature activation-1.54e-02

Pos logit contributions
grand+0.041
People+0.041
ups+0.041
Grand+0.039
ind+0.038
Neg logit contributions
 sip-0.032
 leap-0.032
 bite-0.031
 squeeze-0.027
 bites-0.026
Tokenbly
Token ID36874
Feature activation+1.68e-02

Pos logit contributions
grand+0.024
People+0.023
ups+0.023
Grand+0.021
ind+0.020
Neg logit contributions
 sip-0.037
 bite-0.036
 leap-0.036
 squeeze-0.033
 bites-0.031
Token.
Token ID13
Feature activation-1.09e-02

Pos logit contributions
 leap+0.034
 sip+0.033
 bite+0.032
 squeeze+0.029
 shove+0.028
Neg logit contributions
ups-0.031
People-0.030
grand-0.030
Grand-0.028
ind-0.028
Token They
Token ID1119
Feature activation-3.12e-03

Pos logit contributions
grand+0.030
ups+0.029
ind+0.029
ich+0.028
People+0.027
Neg logit contributions
 sip-0.014
 leap-0.013
 bite-0.013
 squeeze-0.012
 paws-0.011
Token.
Token ID13
Feature activation+1.39e-02

Pos logit contributions
 sip+0.013
 leap+0.012
 bite+0.012
 squeeze+0.011
 bites+0.010
Neg logit contributions
grand-0.015
ups-0.015
People-0.015
Grand-0.014
ind-0.014
Token He
Token ID679
Feature activation+6.02e-03

Pos logit contributions
 sip+0.025
 bite+0.024
 leap+0.024
 squeeze+0.022
 bites+0.021
Neg logit contributions
grand-0.029
ups-0.028
People-0.028
ind-0.028
ich-0.027
Token told
Token ID1297
Feature activation+1.33e-02

Pos logit contributions
 sip+0.013
 leap+0.013
 bite+0.013
 squeeze+0.011
 bites+0.011
Neg logit contributions
grand-0.011
ups-0.011
People-0.010
Grand-0.010
ind-0.009
Token her
Token ID607
Feature activation-1.78e-02

Pos logit contributions
 sip+0.031
 leap+0.031
 bite+0.030
 squeeze+0.028
 paws+0.026
Neg logit contributions
grand-0.021
ups-0.021
People-0.021
Grand-0.019
ind-0.019
Token he
Token ID339
Feature activation-1.52e-02

Pos logit contributions
grand+0.031
ups+0.031
People+0.031
Grand+0.028
ind+0.028
Neg logit contributions
 sip-0.039
 leap-0.038
 bite-0.038
 squeeze-0.034
 bites-0.033
Token would
Token ID561
Feature activation+1.12e-02

Pos logit contributions
grand+0.025
ups+0.025
People+0.025
Grand+0.023
ind+0.023
Neg logit contributions
 sip-0.034
 leap-0.034
 bite-0.034
 squeeze-0.031
 bites-0.029
Token be
Token ID307
Feature activation+1.34e-02

Pos logit contributions
 sip+0.020
 leap+0.020
 bite+0.019
 bites+0.017
 breaths+0.017
Neg logit contributions
grand-0.023
People-0.022
ups-0.022
Grand-0.021
ind-0.021
Token okay
Token ID8788
Feature activation-4.29e-03

Pos logit contributions
 bite+0.023
 sip+0.023
 leap+0.023
 squeeze+0.021
 paws+0.019
Neg logit contributions
grand-0.030
People-0.028
ups-0.028
ind-0.026
Grand-0.026
Token and
Token ID290
Feature activation-2.50e-03

Pos logit contributions
grand+0.011
People+0.010
ups+0.010
ind+0.010
Grand+0.010
Neg logit contributions
 bite-0.006
 sip-0.006
 leap-0.006
 squeeze-0.006
 paws-0.005
Token thanked
Token ID26280
Feature activation-9.33e-03

Pos logit contributions
grand+0.006
People+0.006
ups+0.006
Grand+0.006
ind+0.006
Neg logit contributions
 sip-0.004
 bite-0.004
 leap-0.004
 squeeze-0.003
 bites-0.003
Token her
Token ID607
Feature activation-5.27e-03

Pos logit contributions
ups+0.021
grand+0.021
People+0.020
ich+0.020
ind+0.019
Neg logit contributions
 bite-0.016
 sip-0.016
 leap-0.015
 squeeze-0.014
 bites-0.013
Token but
Token ID475
Feature activation+1.56e-03

Pos logit contributions
ups+0.046
grand+0.046
People+0.043
ind+0.043
Grand+0.042
Neg logit contributions
 leap-0.040
 sip-0.039
 bite-0.038
 squeeze-0.034
 paws-0.033
Token finally
Token ID3443
Feature activation+1.27e-02

Pos logit contributions
 leap+0.003
 sip+0.003
 bite+0.003
 squeeze+0.003
 paws+0.003
Neg logit contributions
ups-0.003
grand-0.003
People-0.003
ind-0.003
Grand-0.003
Token,
Token ID11
Feature activation-9.14e-03

Pos logit contributions
 leap+0.018
 sip+0.018
 bite+0.017
 squeeze+0.015
 bites+0.014
Neg logit contributions
grand-0.032
ups-0.032
People-0.032
Grand-0.030
ind-0.030
Token the
Token ID262
Feature activation-1.76e-02

Pos logit contributions
grand+0.020
ups+0.020
ind+0.019
People+0.018
Grand+0.018
Neg logit contributions
 leap-0.017
 bite-0.016
 sip-0.016
 squeeze-0.014
 paws-0.014
Token nest
Token ID16343
Feature activation-2.88e-02

Pos logit contributions
grand+0.036
People+0.036
ups+0.036
Grand+0.034
ind+0.034
Neg logit contributions
 sip-0.032
 leap-0.031
 bite-0.031
 squeeze-0.027
 bites-0.026
Token was
Token ID373
Feature activation+8.09e-03

Pos logit contributions
grand+0.060
People+0.059
Grand+0.058
ups+0.057
ind+0.056
Neg logit contributions
 sip-0.053
 leap-0.053
 bite-0.052
 squeeze-0.046
 bites-0.044
Token finished
Token ID5201
Feature activation-2.66e-02

Pos logit contributions
 bite+0.015
 sip+0.015
 leap+0.015
 squeeze+0.013
 bites+0.012
Neg logit contributions
grand-0.018
ups-0.017
People-0.017
we-0.016
ind-0.016
Token!
Token ID0
Feature activation+1.11e-02

Pos logit contributions
grand+0.070
People+0.069
ups+0.066
Grand+0.064
David+0.062
Neg logit contributions
 bite-0.037
 sip-0.036
 leap-0.036
 squeeze-0.031
 bites-0.029
Token The
Token ID383
Feature activation-6.72e-05

Pos logit contributions
 sip+0.019
 leap+0.019
 bite+0.018
 squeeze+0.017
 paws+0.016
Neg logit contributions
ups-0.024
grand-0.024
ind-0.023
People-0.023
ich-0.022
Token bird
Token ID6512
Feature activation+2.27e-02

Pos logit contributions
grand+0.000
People+0.000
ups+0.000
Grand+0.000
ind+0.000
Neg logit contributions
 sip-0.000
 leap-0.000
 bite-0.000
 squeeze-0.000
 bites-0.000
Tokenie
Token ID494
Feature activation+1.19e-02

Pos logit contributions
 sip+0.046
 leap+0.045
 bite+0.045
 squeeze+0.040
 bites+0.039
Neg logit contributions
grand-0.043
ups-0.043
People-0.042
Grand-0.040
ind-0.039
Token.
Token ID13
Feature activation+2.31e-02

Pos logit contributions
 bite+0.008
 sip+0.008
 leap+0.008
 squeeze+0.007
 bites+0.007
Neg logit contributions
ups-0.011
People-0.011
grand-0.011
Grand-0.010
Kat-0.010
Token Tim
Token ID5045
Feature activation-6.35e-03

Pos logit contributions
 leap+0.042
 sip+0.041
 bite+0.039
 breaths+0.036
 knot+0.035
Neg logit contributions
People-0.048
grand-0.047
Grand-0.046
ups-0.044
David-0.042
Tokenmy
Token ID1820
Feature activation-1.03e-02

Pos logit contributions
grand+0.012
ups+0.012
People+0.012
Grand+0.011
ind+0.011
Neg logit contributions
 sip-0.013
 bite-0.013
 leap-0.013
 squeeze-0.011
 bites-0.011
Token loved
Token ID6151
Feature activation-2.92e-02

Pos logit contributions
ups+0.020
People+0.019
grand+0.017
we+0.017
ich+0.017
Neg logit contributions
 bite-0.022
 sip-0.021
 squeeze-0.019
 leap-0.019
 paws-0.018
Token to
Token ID284
Feature activation+3.33e-03

Pos logit contributions
People+0.061
ups+0.061
grand+0.060
Kat+0.059
eline+0.058
Neg logit contributions
 bite-0.055
 sip-0.049
 bites-0.048
 leap-0.047
 squeeze-0.045
Token play
Token ID711
Feature activation-2.62e-03

Pos logit contributions
 sip+0.005
 leap+0.005
 bite+0.005
 breaths+0.005
 bites+0.005
Neg logit contributions
grand-0.007
People-0.007
ups-0.007
Grand-0.007
God-0.006
Token with
Token ID351
Feature activation-1.90e-02

Pos logit contributions
People+0.004
ups+0.004
grand+0.004
Grand+0.004
we+0.004
Neg logit contributions
 leap-0.006
 sip-0.006
 bite-0.006
 squeeze-0.005
 breaths-0.005
Token his
Token ID465
Feature activation+1.63e-04

Pos logit contributions
grand+0.033
ups+0.032
People+0.032
Grand+0.030
ind+0.030
Neg logit contributions
 sip-0.042
 leap-0.041
 bite-0.041
 squeeze-0.037
 bites-0.035
Token toy
Token ID13373
Feature activation+1.81e-02

Pos logit contributions
 sip+0.000
 leap+0.000
 bite+0.000
 squeeze+0.000
 bites+0.000
Neg logit contributions
grand-0.000
ups-0.000
People-0.000
ind-0.000
Grand-0.000
Token gun
Token ID2485
Feature activation+1.25e-02

Pos logit contributions
 sip+0.036
 leap+0.035
 bite+0.035
 squeeze+0.031
 bites+0.030
Neg logit contributions
grand-0.035
ups-0.035
People-0.034
ind-0.032
Grand-0.032
Token and
Token ID290
Feature activation-4.99e-03

Pos logit contributions
 sip+0.025
 leap+0.024
 bite+0.023
 squeeze+0.021
 shock+0.020
Neg logit contributions
ups-0.025
grand-0.024
People-0.024
ind-0.023
Grand-0.022
Token it
Token ID340
Feature activation+1.78e-02

Pos logit contributions
grand+0.014
ups+0.014
People+0.013
ich+0.013
Grand+0.013
Neg logit contributions
 sip-0.012
 bite-0.012
 leap-0.012
 squeeze-0.011
 paws-0.011
Token.
Token ID13
Feature activation+1.49e-02

Pos logit contributions
 sip+0.032
 leap+0.031
 bite+0.031
 squeeze+0.028
 bites+0.026
Neg logit contributions
grand-0.038
People-0.037
ups-0.037
Grand-0.035
ind-0.035
Token Her
Token ID2332
Feature activation-2.10e-02

Pos logit contributions
 sip+0.031
 bite+0.030
 leap+0.029
 squeeze+0.028
 paws+0.026
Neg logit contributions
People-0.027
ups-0.027
grand-0.027
ind-0.026
ich-0.026
Token mom
Token ID1995
Feature activation-3.71e-03

Pos logit contributions
grand+0.033
People+0.032
ups+0.032
Grand+0.030
ind+0.029
Neg logit contributions
 sip-0.050
 leap-0.050
 bite-0.049
 squeeze-0.045
 bites-0.044
Token agreed
Token ID4987
Feature activation+1.62e-02

Pos logit contributions
ups+0.006
grand+0.006
People+0.006
Grand+0.006
we+0.005
Neg logit contributions
 leap-0.008
 sip-0.008
 bite-0.008
 breaths-0.007
 shock-0.007
Token,
Token ID11
Feature activation-1.40e-02

Pos logit contributions
 leap+0.031
 sip+0.031
 bite+0.030
 squeeze+0.027
 breaths+0.026
Neg logit contributions
ups-0.032
grand-0.031
People-0.030
Grand-0.030
we-0.028
Token and
Token ID290
Feature activation+6.00e-03

Pos logit contributions
grand+0.033
ups+0.033
People+0.032
ind+0.031
Grand+0.030
Neg logit contributions
 sip-0.022
 leap-0.022
 bite-0.021
 squeeze-0.019
 bites-0.017
Token they
Token ID484
Feature activation+1.41e-02

Pos logit contributions
 sip+0.014
 leap+0.014
 bite+0.014
 squeeze+0.012
 bites+0.012
Neg logit contributions
grand-0.010
People-0.010
ups-0.010
ind-0.009
Grand-0.009
Token found
Token ID1043
Feature activation+1.88e-02

Pos logit contributions
 sip+0.029
 leap+0.029
 bite+0.028
 bites+0.026
 shock+0.025
Neg logit contributions
grand-0.024
ups-0.023
Grand-0.023
People-0.023
eline-0.022
Token a
Token ID257
Feature activation+2.61e-02

Pos logit contributions
 sip+0.038
 leap+0.038
 bite+0.037
 squeeze+0.033
 breaths+0.031
Neg logit contributions
ups-0.036
People-0.035
grand-0.035
we-0.032
Grand-0.032
Token spot
Token ID4136
Feature activation-9.22e-03

Pos logit contributions
 sip+0.056
 leap+0.055
 bite+0.054
 breaths+0.050
 bites+0.049
Neg logit contributions
grand-0.044
People-0.044
Grand-0.042
ups-0.042
ich-0.040
Token is
Token ID318
Feature activation-5.24e-03

Pos logit contributions
 sip+0.019
 leap+0.018
 bite+0.018
 squeeze+0.016
 bites+0.015
Neg logit contributions
ups-0.017
grand-0.017
People-0.017
Grand-0.016
ind-0.016
Token always
Token ID1464
Feature activation-6.53e-03

Pos logit contributions
grand+0.009
ups+0.008
People+0.008
we+0.008
ich+0.008
Neg logit contributions
 sip-0.012
 bite-0.012
 leap-0.011
 squeeze-0.011
 bites-0.010
Token good
Token ID922
Feature activation-3.72e-03

Pos logit contributions
grand+0.013
ups+0.013
People+0.013
Grand+0.012
ind+0.012
Neg logit contributions
 sip-0.012
 bite-0.012
 leap-0.012
 squeeze-0.011
 bites-0.010
Token to
Token ID284
Feature activation+5.08e-03

Pos logit contributions
ups+0.006
grand+0.006
People+0.006
Grand+0.005
ind+0.005
Neg logit contributions
 sip-0.009
 leap-0.009
 bite-0.009
 squeeze-0.008
 bites-0.008
Token ask
Token ID1265
Feature activation-1.93e-02

Pos logit contributions
 leap+0.009
 sip+0.009
 bite+0.008
 squeeze+0.007
 bites+0.007
Neg logit contributions
grand-0.011
People-0.011
ups-0.011
Grand-0.010
ind-0.010
Token for
Token ID329
Feature activation-1.94e-02

Pos logit contributions
grand+0.047
People+0.045
ups+0.044
ich+0.044
Grand+0.043
Neg logit contributions
 sip-0.029
 bite-0.029
 leap-0.027
 squeeze-0.025
 paws-0.023
Token permission
Token ID7170
Feature activation-4.17e-02

Pos logit contributions
grand+0.056
People+0.054
ups+0.054
ind+0.053
ich+0.053
Neg logit contributions
 bite-0.021
 sip-0.021
 leap-0.020
 squeeze-0.017
 bites-0.015
Token before
Token ID878
Feature activation-2.36e-02

Pos logit contributions
grand+0.098
People+0.097
ind+0.093
ich+0.092
Kat+0.092
Neg logit contributions
 bite-0.056
 sip-0.053
 leap-0.053
 squeeze-0.050
 paws-0.046
Token doing
Token ID1804
Feature activation-2.40e-02

Pos logit contributions
grand+0.043
People+0.041
ups+0.041
ind+0.039
Grand+0.039
Neg logit contributions
 bite-0.050
 leap-0.050
 sip-0.050
 squeeze-0.045
 bites-0.044
Token something
Token ID1223
Feature activation-2.55e-02

Pos logit contributions
grand+0.053
People+0.050
Grand+0.048
ups+0.047
ind+0.047
Neg logit contributions
 bite-0.043
 sip-0.042
 leap-0.041
 squeeze-0.038
 bites-0.037
Token that
Token ID326
Feature activation-4.79e-03

Pos logit contributions
grand+0.056
People+0.056
Grand+0.055
ups+0.052
ind+0.052
Neg logit contributions
 bite-0.044
 sip-0.044
 leap-0.043
 squeeze-0.039
 bites-0.037
Token to
Token ID284
Feature activation+9.57e-03

Pos logit contributions
 sip+0.046
 bite+0.046
 leap+0.045
 squeeze+0.041
 bites+0.040
Neg logit contributions
grand-0.035
People-0.035
ups-0.034
ind-0.032
ich-0.032
Token call
Token ID869
Feature activation-6.12e-04

Pos logit contributions
 sip+0.014
 bite+0.014
 leap+0.014
 squeeze+0.012
 bites+0.011
Neg logit contributions
grand-0.023
ups-0.023
People-0.023
ind-0.022
Grand-0.022
Token her
Token ID607
Feature activation+3.72e-03

Pos logit contributions
grand+0.001
ups+0.001
People+0.001
ich+0.001
ind+0.001
Neg logit contributions
 sip-0.001
 bite-0.001
 leap-0.001
 squeeze-0.001
 bites-0.001
Token and
Token ID290
Feature activation-6.47e-03

Pos logit contributions
 bite+0.007
 sip+0.007
 leap+0.007
 squeeze+0.006
 cushion+0.006
Neg logit contributions
grand-0.007
ups-0.007
People-0.007
ind-0.007
we-0.007
Token tell
Token ID1560
Feature activation-8.63e-04

Pos logit contributions
People+0.014
ups+0.014
grand+0.014
ind+0.014
we+0.014
Neg logit contributions
 bite-0.011
 sip-0.011
 leap-0.011
 squeeze-0.010
 paws-0.009
Token her
Token ID607
Feature activation-1.40e-02

Pos logit contributions
grand+0.002
People+0.002
ups+0.002
ich+0.002
Grand+0.002
Neg logit contributions
 sip-0.002
 bite-0.002
 leap-0.001
 squeeze-0.001
 bites-0.001
Token about
Token ID546
Feature activation-1.41e-02

Pos logit contributions
ind+0.028
grand+0.027
People+0.027
ups+0.027
Grand+0.026
Neg logit contributions
 bite-0.026
 sip-0.026
 leap-0.025
 squeeze-0.023
 cushion-0.023
Token it
Token ID340
Feature activation+9.85e-03

Pos logit contributions
grand+0.029
ups+0.029
People+0.029
Grand+0.027
bel+0.026
Neg logit contributions
 bite-0.026
 sip-0.026
 leap-0.025
 squeeze-0.023
 cushion-0.022
Token?"
Token ID1701
Feature activation-3.24e-03

Pos logit contributions
 bite+0.019
 sip+0.019
 leap+0.019
 squeeze+0.017
 bites+0.016
Neg logit contributions
grand-0.019
People-0.019
ups-0.019
ind-0.018
Grand-0.018
Token\n
Token ID198
Feature activation+4.39e-03

Pos logit contributions
ups+0.008
ind+0.008
People+0.007
ich+0.007
grand+0.007
Neg logit contributions
 sip-0.005
 bite-0.005
 leap-0.005
 squeeze-0.004
 paws-0.004
Token\n
Token ID198
Feature activation+9.91e-03

Pos logit contributions
 sip+0.009
 bite+0.008
 leap+0.007
 squeeze+0.007
 bites+0.007
Neg logit contributions
ups-0.009
grand-0.009
People-0.009
ich-0.008
ind-0.008
Token She
Token ID1375
Feature activation-1.02e-02

Pos logit contributions
 sip+0.012
 leap+0.012
 bite+0.011
 squeeze+0.010
 paws+0.010
Neg logit contributions
grand-0.013
ups-0.013
People-0.013
ind-0.012
Grand-0.012
Token decided
Token ID3066
Feature activation-2.09e-02

Pos logit contributions
grand+0.018
ups+0.018
People+0.018
Grand+0.017
ind+0.016
Neg logit contributions
 sip-0.022
 leap-0.022
 bite-0.021
 squeeze-0.019
 bites-0.019
Token to
Token ID284
Feature activation-5.04e-03

Pos logit contributions
grand+0.047
ind+0.047
People+0.045
Grand+0.044
cle+0.043
Neg logit contributions
 bite-0.036
 leap-0.035
 sip-0.033
 squeeze-0.031
 bites-0.029
Token take
Token ID1011
Feature activation+2.55e-02

Pos logit contributions
grand+0.011
People+0.011
ups+0.011
Grand+0.010
ind+0.010
Neg logit contributions
 sip-0.008
 leap-0.008
 bite-0.007
 breaths-0.007
 bites-0.007
Token it
Token ID340
Feature activation+2.77e-02

Pos logit contributions
 sip+0.056
 bite+0.053
 leap+0.053
 squeeze+0.048
 cushion+0.047
Neg logit contributions
ups-0.043
People-0.042
grand-0.041
Grand-0.038
Kat-0.038
Token home
Token ID1363
Feature activation-2.57e-02

Pos logit contributions
 sip+0.062
 leap+0.059
 bite+0.059
 squeeze+0.054
 bites+0.054
Neg logit contributions
ups-0.046
grand-0.045
People-0.043
ind-0.040
Grand-0.040
Token,
Token ID11
Feature activation-1.76e-02

Pos logit contributions
People+0.065
grand+0.063
ups+0.061
ind+0.060
Grand+0.059
Neg logit contributions
 bite-0.037
 sip-0.037
 leap-0.036
 squeeze-0.031
 breath-0.029
Token so
Token ID523
Feature activation-1.08e-02

Pos logit contributions
grand+0.039
ups+0.039
People+0.039
ind+0.037
Grand+0.037
Neg logit contributions
 sip-0.030
 leap-0.030
 bite-0.030
 squeeze-0.027
 paws-0.025
Token she
Token ID673
Feature activation-1.18e-02

Pos logit contributions
grand+0.019
People+0.019
ind+0.018
ups+0.018
Grand+0.017
Neg logit contributions
 leap-0.024
 sip-0.023
 bite-0.023
 squeeze-0.021
 paws-0.020
Token could
Token ID714
Feature activation+9.22e-03

Pos logit contributions
grand+0.024
ups+0.023
People+0.023
Grand+0.022
ind+0.022
Neg logit contributions
 sip-0.022
 leap-0.022
 bite-0.022
 squeeze-0.020
 bites-0.019
Token show
Token ID905
Feature activation-1.18e-03

Pos logit contributions
 sip+0.016
 leap+0.015
 bite+0.015
 bites+0.013
 squeeze+0.013
Neg logit contributions
grand-0.020
People-0.020
ups-0.019
Grand-0.018
ind-0.018
Token and
Token ID290
Feature activation+9.15e-03

Pos logit contributions
ups+0.002
grand+0.002
People+0.002
Grand+0.002
ind+0.002
Neg logit contributions
 leap-0.002
 sip-0.002
 bite-0.002
 squeeze-0.002
 paws-0.002
Token Sam
Token ID3409
Feature activation+1.18e-02

Pos logit contributions
 leap+0.016
 sip+0.016
 bite+0.015
 squeeze+0.013
 breaths+0.013
Neg logit contributions
grand-0.020
ups-0.019
People-0.019
Grand-0.019
ind-0.018
Token went
Token ID1816
Feature activation+1.91e-03

Pos logit contributions
 leap+0.020
 sip+0.020
 bite+0.019
 breaths+0.018
 paws+0.018
Neg logit contributions
grand-0.022
ups-0.022
People-0.020
Grand-0.020
eline-0.020
Token to
Token ID284
Feature activation-1.04e-03

Pos logit contributions
 sip+0.004
 bite+0.004
 leap+0.004
 squeeze+0.003
 bites+0.003
Neg logit contributions
grand-0.004
People-0.004
ups-0.003
Grand-0.003
ich-0.003
Token their
Token ID511
Feature activation-1.05e-02

Pos logit contributions
grand+0.002
ups+0.002
People+0.002
Grand+0.002
ind+0.002
Neg logit contributions
 sip-0.002
 leap-0.002
 bite-0.002
 squeeze-0.002
 bites-0.002
Token rooms
Token ID9519
Feature activation-4.35e-02

Pos logit contributions
People+0.018
grand+0.018
ups+0.017
Grand+0.016
we+0.016
Neg logit contributions
 sip-0.024
 bite-0.023
 leap-0.022
 squeeze-0.021
 bites-0.021
Token and
Token ID290
Feature activation+7.86e-03

Pos logit contributions
People+0.114
grand+0.114
ind+0.108
ups+0.108
ich+0.107
Neg logit contributions
 bite-0.055
 sip-0.054
 leap-0.049
 squeeze-0.044
 bites-0.042
Token cried
Token ID16896
Feature activation+3.49e-02

Pos logit contributions
 sip+0.014
 bite+0.014
 leap+0.013
 squeeze+0.012
 bites+0.011
Neg logit contributions
People-0.017
grand-0.017
ups-0.017
ind-0.017
ich-0.017
Token.
Token ID13
Feature activation+4.48e-03

Pos logit contributions
 leap+0.080
 bite+0.076
 sip+0.076
 bites+0.067
 squeeze+0.067
Neg logit contributions
ups-0.055
People-0.053
Grand-0.052
grand-0.051
ind-0.049
Token They
Token ID1119
Feature activation+1.07e-02

Pos logit contributions
 sip+0.008
 bite+0.008
 leap+0.008
 squeeze+0.007
 paws+0.007
Neg logit contributions
grand-0.010
ups-0.009
People-0.009
ind-0.009
ich-0.009
Token wished
Token ID16555
Feature activation-1.50e-02

Pos logit contributions
 sip+0.022
 leap+0.022
 bite+0.021
 squeeze+0.019
 bites+0.018
Neg logit contributions
grand-0.020
ups-0.020
People-0.020
Grand-0.019
ind-0.018
Token at
Token ID379
Feature activation-3.65e-03

Pos logit contributions
 sip+0.008
 bite+0.008
 leap+0.008
 squeeze+0.007
 bites+0.007
Neg logit contributions
grand-0.011
People-0.010
ups-0.010
ich-0.010
Grand-0.010
Token him
Token ID683
Feature activation+9.39e-03

Pos logit contributions
grand+0.008
ups+0.008
People+0.008
Grand+0.007
ich+0.007
Neg logit contributions
 sip-0.007
 bite-0.006
 leap-0.006
 squeeze-0.006
 bites-0.005
Token.
Token ID13
Feature activation-8.54e-03

Pos logit contributions
 sip+0.018
 leap+0.017
 bite+0.017
 squeeze+0.015
 bites+0.015
Neg logit contributions
grand-0.019
ups-0.019
People-0.019
Grand-0.018
ind-0.017
Token She
Token ID1375
Feature activation-2.18e-02

Pos logit contributions
grand+0.020
ups+0.020
People+0.019
ind+0.019
we+0.018
Neg logit contributions
 sip-0.014
 leap-0.013
 bite-0.013
 squeeze-0.012
 bites-0.012
Token said
Token ID531
Feature activation-2.24e-02

Pos logit contributions
grand+0.048
ups+0.046
People+0.046
ind+0.044
ich+0.043
Neg logit contributions
 sip-0.038
 bite-0.038
 leap-0.037
 squeeze-0.034
 bites-0.032
Token:
Token ID25
Feature activation-4.60e-02

Pos logit contributions
grand+0.057
ups+0.056
People+0.056
ind+0.056
ich+0.054
Neg logit contributions
 sip-0.030
 leap-0.029
 bite-0.029
 squeeze-0.025
 bites-0.024
Token "
Token ID366
Feature activation-1.76e-02

Pos logit contributions
ups+0.102
grand+0.099
ind+0.093
People+0.091
eline+0.086
Neg logit contributions
 sip-0.085
 bite-0.083
 leap-0.081
 squeeze-0.075
 bites-0.067
TokenHi
Token ID17250
Feature activation+1.45e-02

Pos logit contributions
ups+0.033
grand+0.032
ind+0.031
ich+0.030
 office+0.030
Neg logit contributions
 sip-0.033
 bite-0.032
 leap-0.031
 squeeze-0.031
hered-0.027
Token,
Token ID11
Feature activation-1.57e-02

Pos logit contributions
 sip+0.020
 leap+0.019
 bite+0.019
 squeeze+0.016
 breaths+0.015
Neg logit contributions
People-0.037
ups-0.036
grand-0.036
Grand-0.035
ind-0.034
Token I
Token ID314
Feature activation-1.20e-03

Pos logit contributions
People+0.026
grand+0.025
ups+0.025
Grand+0.023
ich+0.023
Neg logit contributions
 sip-0.035
 leap-0.034
 bite-0.034
 breaths-0.031
 squeeze-0.030
Token'm
Token ID1101
Feature activation+3.07e-03

Pos logit contributions
grand+0.002
ups+0.002
People+0.002
Grand+0.002
ind+0.002
Neg logit contributions
 sip-0.002
 leap-0.002
 bite-0.002
 squeeze-0.002
 bites-0.002
Token.
Token ID13
Feature activation+2.19e-02

Pos logit contributions
People+0.063
ups+0.063
David+0.062
Grand+0.059
grand+0.059
Neg logit contributions
 bite-0.039
 sip-0.037
 leap-0.033
 squeeze-0.029
 bites-0.028
Token She
Token ID1375
Feature activation-2.50e-02

Pos logit contributions
 sip+0.045
 leap+0.045
 bite+0.044
 squeeze+0.039
 bites+0.038
Neg logit contributions
grand-0.040
People-0.040
ups-0.040
Grand-0.038
ind-0.036
Token was
Token ID373
Feature activation-1.70e-02

Pos logit contributions
ups+0.052
People+0.048
grand+0.047
ich+0.046
ind+0.045
Neg logit contributions
 bite-0.049
 sip-0.047
 leap-0.044
 squeeze-0.044
 bites-0.041
Token only
Token ID691
Feature activation+2.86e-03

Pos logit contributions
ups+0.039
grand+0.039
People+0.037
we+0.035
David+0.034
Neg logit contributions
 sip-0.029
 bite-0.029
 leap-0.028
 squeeze-0.026
 shock-0.024
Token three
Token ID1115
Feature activation-1.20e-02

Pos logit contributions
 sip+0.005
 leap+0.005
 bite+0.005
 squeeze+0.004
 bites+0.004
Neg logit contributions
grand-0.006
ups-0.006
People-0.006
Grand-0.006
ind-0.006
Token years
Token ID812
Feature activation-3.70e-02

Pos logit contributions
grand+0.015
People+0.015
ups+0.014
Grand+0.013
ich+0.013
Neg logit contributions
 sip-0.031
 bite-0.031
 leap-0.030
 bites-0.028
 paws-0.028
Token old
Token ID1468
Feature activation+2.04e-02

Pos logit contributions
People+0.080
we+0.078
grand+0.077
Our+0.075
David+0.075
Neg logit contributions
 sip-0.062
 bite-0.062
 leap-0.058
 bites-0.055
 squeeze-0.055
Token and
Token ID290
Feature activation-1.82e-02

Pos logit contributions
 sip+0.033
 bite+0.032
 leap+0.032
 squeeze+0.028
 bites+0.027
Neg logit contributions
grand-0.047
People-0.046
ups-0.046
Grand-0.044
ind-0.043
Token loved
Token ID6151
Feature activation-2.64e-02

Pos logit contributions
ups+0.031
grand+0.030
People+0.029
ind+0.028
ich+0.027
Neg logit contributions
 sip-0.041
 bite-0.040
 leap-0.040
 squeeze-0.037
 paws-0.036
Token playing
Token ID2712
Feature activation-1.12e-02

Pos logit contributions
grand+0.056
People+0.055
ups+0.054
cle+0.052
David+0.052
Neg logit contributions
 bite-0.049
 sip-0.046
 leap-0.043
 bites-0.042
 paws-0.041
Token with
Token ID351
Feature activation-2.24e-02

Pos logit contributions
People+0.018
ups+0.018
grand+0.017
we+0.017
Grand+0.017
Neg logit contributions
 leap-0.026
 sip-0.025
 bite-0.025
 squeeze-0.023
 bites-0.022
Token said
Token ID531
Feature activation-1.28e-02

Pos logit contributions
grand+0.001
ups+0.001
People+0.001
Grand+0.001
ind+0.001
Neg logit contributions
 sip-0.001
 leap-0.001
 bite-0.001
 squeeze-0.001
 bites-0.001
Token,
Token ID11
Feature activation-3.93e-02

Pos logit contributions
ups+0.033
grand+0.033
People+0.033
Grand+0.031
ind+0.030
Neg logit contributions
 sip-0.017
 bite-0.016
 leap-0.016
 squeeze-0.014
 bites-0.013
Token "
Token ID366
Feature activation-1.66e-02

Pos logit contributions
ups+0.089
grand+0.089
ind+0.083
People+0.081
ich+0.075
Neg logit contributions
 sip-0.068
 leap-0.068
 bite-0.068
 squeeze-0.061
 paws-0.055
TokenHi
Token ID17250
Feature activation+1.62e-02

Pos logit contributions
ups+0.035
grand+0.033
 office+0.032
ber+0.032
ind+0.031
Neg logit contributions
 bite-0.027
 leap-0.027
 sip-0.027
 squeeze-0.026
 lick-0.022
Token little
Token ID1310
Feature activation+7.97e-03

Pos logit contributions
 sip+0.026
 leap+0.025
 bite+0.025
 squeeze+0.022
 bites+0.021
Neg logit contributions
grand-0.038
ups-0.037
People-0.037
Grand-0.035
ind-0.035
Token re
Token ID302
Feature activation-3.85e-02

Pos logit contributions
 sip+0.017
 leap+0.016
 bite+0.016
 squeeze+0.014
 bites+0.014
Neg logit contributions
grand-0.015
People-0.014
ups-0.014
ind-0.014
Grand-0.014
Tokenind
Token ID521
Feature activation-1.78e-02

Pos logit contributions
People+0.033
grand+0.028
ups+0.027
Grand+0.023
Our+0.021
Neg logit contributions
 bite-0.111
 sip-0.110
 leap-0.107
 breath-0.103
 breaths-0.102
Tokeneer
Token ID28153
Feature activation+2.24e-02

Pos logit contributions
People+0.035
ups+0.035
grand+0.034
Grand+0.033
David+0.031
Neg logit contributions
 bite-0.034
 sip-0.034
 leap-0.033
 squeeze-0.029
 bites-0.029
Token!"
Token ID2474
Feature activation+2.40e-03

Pos logit contributions
 sip+0.035
 leap+0.034
 bite+0.033
 squeeze+0.029
 bites+0.028
Neg logit contributions
grand-0.051
ups-0.051
People-0.050
ind-0.049
ich-0.049
Token The
Token ID383
Feature activation-2.63e-02

Pos logit contributions
 sip+0.004
 leap+0.004
 bite+0.004
 squeeze+0.004
 paws+0.004
Neg logit contributions
ups-0.005
grand-0.005
People-0.005
ind-0.005
Grand-0.005
Token re
Token ID302
Feature activation-3.68e-02

Pos logit contributions
ups+0.058
grand+0.057
People+0.055
we+0.052
David+0.051
Neg logit contributions
 bite-0.046
 leap-0.044
 sip-0.044
 squeeze-0.040
 bites-0.038
Token He
Token ID679
Feature activation-1.16e-02

Pos logit contributions
 sip+0.010
 leap+0.010
 bite+0.010
 squeeze+0.009
 paws+0.008
Neg logit contributions
ups-0.013
grand-0.013
People-0.012
ind-0.012
ich-0.012
Token sat
Token ID3332
Feature activation-3.48e-02

Pos logit contributions
grand+0.025
ups+0.025
People+0.025
ind+0.023
ich+0.023
Neg logit contributions
 bite-0.021
 leap-0.020
 sip-0.020
 squeeze-0.018
 bites-0.017
Token at
Token ID379
Feature activation-5.57e-03

Pos logit contributions
grand+0.109
ups+0.099
People+0.099
God+0.092
Grand+0.092
Neg logit contributions
 bite-0.032
 sip-0.031
 leap-0.031
 breaths-0.029
 breath-0.028
Token his
Token ID465
Feature activation-6.42e-03

Pos logit contributions
ups+0.011
grand+0.011
People+0.011
Grand+0.010
ind+0.010
Neg logit contributions
 sip-0.011
 bite-0.011
 leap-0.011
 squeeze-0.009
 bites-0.009
Token desk
Token ID6915
Feature activation-1.94e-02

Pos logit contributions
grand+0.012
ups+0.012
People+0.012
Grand+0.011
ind+0.011
Neg logit contributions
 sip-0.013
 leap-0.013
 bite-0.013
 squeeze-0.011
 bites-0.011
Token,
Token ID11
Feature activation-3.89e-02

Pos logit contributions
grand+0.045
People+0.045
ups+0.045
Grand+0.042
ind+0.042
Neg logit contributions
 sip-0.031
 leap-0.030
 bite-0.030
 squeeze-0.026
 bites-0.025
Token looking
Token ID2045
Feature activation-8.98e-04

Pos logit contributions
grand+0.087
ups+0.085
People+0.084
ind+0.080
ich+0.077
Neg logit contributions
 sip-0.070
 bite-0.069
 leap-0.068
 squeeze-0.063
 bites-0.058
Token out
Token ID503
Feature activation-1.08e-02

Pos logit contributions
grand+0.002
People+0.002
ups+0.002
Grand+0.002
ind+0.002
Neg logit contributions
 sip-0.002
 leap-0.002
 bite-0.002
 squeeze-0.001
 bites-0.001
Token the
Token ID262
Feature activation-3.64e-02

Pos logit contributions
grand+0.024
ups+0.023
People+0.023
ich+0.022
Grand+0.021
Neg logit contributions
 sip-0.020
 bite-0.019
 leap-0.019
 squeeze-0.017
 bites-0.016
Token window
Token ID4324
Feature activation-3.78e-02

Pos logit contributions
grand+0.073
People+0.070
ups+0.067
we+0.067
Grand+0.063
Neg logit contributions
 sip-0.072
 bite-0.072
 leap-0.072
 squeeze-0.065
 knot-0.062
Token,
Token ID11
Feature activation-3.57e-02

Pos logit contributions
grand+0.097
People+0.095
ups+0.092
ind+0.088
Grand+0.087
Neg logit contributions
 sip-0.054
 bite-0.054
 leap-0.054
 squeeze-0.047
 paws-0.043