TOP ACTIVATIONS
MAX = 0.113

And do you know what all these changes are called ?"
to worry and wondered what all the words meant .
in her hand , what shapes were in the sky ,
over and find out what all the comm otion was about
proud that she knew what all the words meant . From
. " What shape is the moon ?"
as . " Who wants pizza ?" Lily asks
incoln ". " Who is Lincoln , Ben ?
Mom my , what are all these keys for ?"
and asked her dad what all these things were . He
, " Hey , what have you found there ?"
. The kids asked what all the people were doing and
letters . " Who do you want to mail
Wow , what a great shape show ! You are very
, " Sure , what game do you want to play
of the sky , what shape a balloon was and even
curious . " Who are you ?" the cat
John , what a wonderful picture you have made ! You
" No way ! That all igator is too deep in
happy . " Who gave us these toys ?"

MIN ACTIVATIONS
MIN = -0.128

said , tears streaming down her face .
them with tears streaming down his cheeks , and said "
her , tears streaming down his face . His
, tears kept streaming down his face .
Jane with tears streaming down her face , and Jane 's
his mouth and l icks his lips . He thinks it
away , tears streaming down her face . When her mom
. Tears started streaming down her cheeks and Daddy quickly pulled
smiling with tears streaming down her cheeks . In
felt helpless and tears filled her eyes . Suddenly
Copper said , tears filling her eyes . The old bird
's face and w agged his tail . He was happy
of the sunrise streaming into his room .
so happy he w agged his tail and l icked the
bark ed and w agged his tail . He wanted to
his tail and l icks their faces . He is not
, put his arms around her and said , " Don
his tail and l icks their faces . Tim
excited , he w agged his tail and w igg led
house . Max w agged his tail and agreed to help

INTERVAL 0.053 - 0.113
CONTAINS 1.458%

I was just curious . You are different from me ,
fix the tape . " What 's wrong ?" he asked
like a crystal . I think it 's fragile ," said
They were so colorful and shiny ! L ily
Hello , little girl . What are you doing with that

INTERVAL -0.008 - 0.053
CONTAINS 61.373%

. " But I will be the winner !"
this path and it will lead you to success ."
a time , there was a little girl named Sarah who
hesitated . He didn 't want to go outside . He
Lily , you can try . But you have to give

INTERVAL -0.068 - -0.008
CONTAINS 36.234%

<|endoftext|> In the morning , Mom my reminded little Sus ie
. But he couldn 't do it alone .
his friends . He wanted them to see how powerful he
that looked sour . She said it smelled like it was
together , including racing each other . <|endoftext|> Once upon a

INTERVAL -0.128 - -0.068
CONTAINS 0.935%

too fast . It chased her all the way home and
he gives the car to Mia . He says , "
you better ." She took Ben 's hand and followed their
alert . His Mom hugged him and said ,
Sam smiled . He followed Tom and they looked for Anna
Token And
Token ID843
Feature activation+3.16e-02

Pos logit contributions
We+0.037
 of+0.035
By+0.034
 deserve+0.033
In+0.032
Neg logit contributions
 Sab-0.059
 gn-0.057
osc-0.057
gy-0.056
 K-0.056
Token do
Token ID466
Feature activation+2.68e-02

Pos logit contributions
We+0.034
 Friends+0.031
By+0.031
 deserve+0.030
 of+0.028
Neg logit contributions
 sob-0.061
 gn-0.060
 Sab-0.058
 Majesty-0.058
unct-0.057
Token you
Token ID345
Feature activation+1.98e-02

Pos logit contributions
We+0.017
 Friends+0.014
 of+0.014
By+0.014
 deserve+0.013
Neg logit contributions
 sob-0.066
unct-0.064
lee-0.064
osc-0.064
 gag-0.063
Token know
Token ID760
Feature activation+3.65e-02

Pos logit contributions
We+0.028
 Friends+0.026
 I+0.024
By+0.024
 Have+0.024
Neg logit contributions
 sob-0.033
osc-0.032
unct-0.032
lee-0.032
gy-0.031
Token what
Token ID644
Feature activation+3.66e-02

Pos logit contributions
We+0.052
 of+0.048
 I+0.048
 Friends+0.048
By+0.047
Neg logit contributions
 sob-0.061
unct-0.059
osc-0.058
lee-0.058
 gag-0.057
Token all
Token ID477
Feature activation+1.13e-01

Pos logit contributions
By+0.054
We+0.051
packing+0.049
 deserve+0.049
 Friends+0.048
Neg logit contributions
 sob-0.057
 teeth-0.055
 gn-0.053
 yawn-0.052
 M-0.051
Token these
Token ID777
Feature activation+4.09e-02

Pos logit contributions
By+0.192
We+0.185
 Friends+0.172
 Have+0.167
In+0.166
Neg logit contributions
 breaths-0.160
 sob-0.159
lee-0.157
 gn-0.155
 Sab-0.154
Token changes
Token ID2458
Feature activation+7.60e-02

Pos logit contributions
We+0.067
By+0.064
 Friends+0.062
 Have+0.060
 I+0.060
Neg logit contributions
 sob-0.059
lee-0.056
osc-0.055
unct-0.055
 breaths-0.055
Token are
Token ID389
Feature activation+1.31e-02

Pos logit contributions
By+0.120
We+0.116
 Friends+0.102
 Have+0.101
I+0.099
Neg logit contributions
 sob-0.112
lee-0.112
gy-0.111
osc-0.107
 breaths-0.106
Token called
Token ID1444
Feature activation+1.27e-02

Pos logit contributions
We+0.019
 Friends+0.018
By+0.017
 I+0.017
 of+0.017
Neg logit contributions
 sob-0.021
unct-0.021
osc-0.020
lee-0.020
 gn-0.020
Token?"
Token ID1701
Feature activation-1.66e-02

Pos logit contributions
We+0.018
 of+0.017
 I+0.017
 Friends+0.017
 deserve+0.016
Neg logit contributions
 sob-0.021
unct-0.021
 gag-0.020
osc-0.020
lee-0.020
Token to
Token ID284
Feature activation-2.93e-02

Pos logit contributions
unct+0.027
 Majesty+0.027
enery+0.026
 gag+0.025
aj+0.025
Neg logit contributions
 of-0.013
We-0.010
 I-0.009
 Friends-0.009
 can-0.008
Token worry
Token ID5490
Feature activation-2.48e-02

Pos logit contributions
lee+0.040
aj+0.038
ail+0.037
 breaths+0.036
 Majesty+0.036
Neg logit contributions
 I-0.050
 deserve-0.049
 of-0.047
 can-0.047
We-0.046
Token and
Token ID290
Feature activation+5.16e-04

Pos logit contributions
 sob+0.042
unct+0.042
osc+0.041
 gag+0.040
gy+0.040
Neg logit contributions
We-0.034
 of-0.032
 I-0.032
 Friends-0.031
By-0.030
Token wondered
Token ID14028
Feature activation-9.49e-03

Pos logit contributions
We+0.001
 of+0.001
 I+0.001
 we+0.001
By+0.001
Neg logit contributions
osc-0.001
 gag-0.001
unct-0.001
gy-0.001
lee-0.001
Token what
Token ID644
Feature activation+2.26e-02

Pos logit contributions
 sob+0.016
osc+0.016
unct+0.016
 gag+0.016
lee+0.016
Neg logit contributions
We-0.013
 of-0.012
 I-0.012
 Friends-0.012
 we-0.011
Token all
Token ID477
Feature activation+1.12e-01

Pos logit contributions
We+0.034
By+0.034
 Have+0.033
 deserve+0.033
 Friends+0.033
Neg logit contributions
 sob-0.034
lee-0.031
 Sab-0.031
 gn-0.030
osc-0.030
Token the
Token ID262
Feature activation+5.50e-02

Pos logit contributions
By+0.202
We+0.200
 Friends+0.189
 Have+0.180
In+0.173
Neg logit contributions
lee-0.143
 Sab-0.141
 sob-0.139
 breaths-0.135
 M-0.129
Token words
Token ID2456
Feature activation+6.71e-02

Pos logit contributions
We+0.092
 of+0.086
 Friends+0.086
 I+0.086
 deserve+0.085
Neg logit contributions
 sob-0.078
unct-0.077
lee-0.076
gy-0.075
osc-0.074
Token meant
Token ID4001
Feature activation+3.01e-02

Pos logit contributions
By+0.093
 Friends+0.093
We+0.092
 Have+0.088
 deserve+0.081
Neg logit contributions
 sob-0.107
lee-0.099
 Sab-0.096
 gigg-0.094
unct-0.093
Token.
Token ID13
Feature activation-1.06e-02

Pos logit contributions
We+0.058
By+0.057
 deserve+0.055
 Friends+0.054
 Have+0.053
Neg logit contributions
 sob-0.033
lee-0.031
osc-0.029
 Sab-0.029
 gn-0.028
Token
Token ID198
Feature activation+1.06e-02

Pos logit contributions
unct+0.017
osc+0.017
 sob+0.017
gy+0.017
 gag+0.016
Neg logit contributions
We-0.016
 I-0.016
 Friends-0.015
 of-0.015
By-0.014
Token in
Token ID287
Feature activation+1.97e-02

Pos logit contributions
We+0.034
 Friends+0.031
By+0.031
 I+0.031
 deserve+0.030
Neg logit contributions
 sob-0.036
unct-0.035
osc-0.035
lee-0.034
 gn-0.034
Token her
Token ID607
Feature activation-6.08e-02

Pos logit contributions
We+0.037
 Friends+0.035
By+0.034
 of+0.034
 I+0.034
Neg logit contributions
 sob-0.024
unct-0.023
osc-0.023
lee-0.023
 gag-0.022
Token hand
Token ID1021
Feature activation-7.06e-03

Pos logit contributions
 sob+0.073
unct+0.069
osc+0.068
lee+0.067
 gag+0.067
Neg logit contributions
We-0.117
 Friends-0.110
 I-0.110
 of-0.110
By-0.108
Token,
Token ID11
Feature activation-1.82e-02

Pos logit contributions
 sob+0.011
unct+0.011
 gag+0.011
osc+0.011
lee+0.011
Neg logit contributions
We-0.011
 of-0.010
 I-0.010
 Friends-0.010
 deserve-0.009
Token what
Token ID644
Feature activation+4.74e-02

Pos logit contributions
 sob+0.029
unct+0.028
osc+0.027
lee+0.027
 gn+0.027
Neg logit contributions
We-0.027
By-0.025
 Friends-0.025
 deserve-0.025
 of-0.024
Token shapes
Token ID15268
Feature activation+1.10e-01

Pos logit contributions
By+0.077
We+0.076
 Friends+0.073
 Have+0.072
 deserve+0.072
Neg logit contributions
 sob-0.067
 Sab-0.063
 gn-0.061
 teeth-0.060
lee-0.059
Token were
Token ID547
Feature activation+2.61e-02

Pos logit contributions
By+0.181
 Have+0.167
We+0.161
 Friends+0.159
 hour+0.145
Neg logit contributions
 sob-0.153
 gigg-0.152
ered-0.152
 Sab-0.146
lee-0.146
Token in
Token ID287
Feature activation+2.31e-02

Pos logit contributions
We+0.042
By+0.042
 Friends+0.040
 Have+0.040
 deserve+0.038
Neg logit contributions
lee-0.037
 sob-0.036
 Sab-0.036
 gn-0.036
unct-0.035
Token the
Token ID262
Feature activation-7.79e-03

Pos logit contributions
We+0.038
By+0.037
 Friends+0.037
 Have+0.034
 deserve+0.034
Neg logit contributions
 sob-0.032
lee-0.031
gy-0.031
 Sab-0.030
 gn-0.030
Token sky
Token ID6766
Feature activation+3.45e-02

Pos logit contributions
 sob+0.011
unct+0.011
osc+0.010
lee+0.010
 gn+0.010
Neg logit contributions
We-0.013
 Friends-0.012
By-0.012
 I-0.012
 of-0.012
Token,
Token ID11
Feature activation-1.61e-02

Pos logit contributions
We+0.061
By+0.059
 Friends+0.057
 deserve+0.056
 Have+0.055
Neg logit contributions
 sob-0.044
lee-0.043
gy-0.042
osc-0.042
unct-0.041
Token over
Token ID625
Feature activation-2.35e-02

Pos logit contributions
We+0.015
 of+0.014
 I+0.014
 Friends+0.013
 deserve+0.013
Neg logit contributions
 sob-0.013
unct-0.012
lee-0.012
osc-0.012
 gag-0.012
Token and
Token ID290
Feature activation-1.89e-02

Pos logit contributions
 sob+0.037
unct+0.036
osc+0.036
 gag+0.035
lee+0.035
Neg logit contributions
We-0.036
 of-0.034
 I-0.033
 Friends-0.033
By-0.032
Token find
Token ID1064
Feature activation+2.57e-02

Pos logit contributions
 sob+0.027
unct+0.026
 gn+0.024
osc+0.024
lee+0.024
Neg logit contributions
We-0.032
 Friends-0.030
By-0.029
 of-0.029
 deserve-0.029
Token out
Token ID503
Feature activation+3.72e-02

Pos logit contributions
We+0.043
 of+0.041
 I+0.040
 Friends+0.040
 deserve+0.039
Neg logit contributions
 sob-0.037
unct-0.036
osc-0.035
lee-0.035
 gag-0.034
Token what
Token ID644
Feature activation+3.00e-02

Pos logit contributions
We+0.055
 Friends+0.050
 of+0.050
 I+0.050
By+0.050
Neg logit contributions
 sob-0.061
unct-0.059
osc-0.058
lee-0.058
 gag-0.057
Token all
Token ID477
Feature activation+1.08e-01

Pos logit contributions
packing+0.048
By+0.047
 Have+0.045
hest+0.045
We+0.045
Neg logit contributions
 sob-0.040
 gn-0.038
 bark-0.036
 yawn-0.036
 teeth-0.036
Token the
Token ID262
Feature activation+4.51e-02

Pos logit contributions
By+0.179
We+0.175
 Friends+0.167
packing+0.167
 Have+0.163
Neg logit contributions
lee-0.149
 breaths-0.146
 yawn-0.146
 sob-0.143
 Sab-0.141
Token comm
Token ID725
Feature activation+6.44e-02

Pos logit contributions
We+0.071
 of+0.065
 Friends+0.065
 I+0.065
By+0.064
Neg logit contributions
 sob-0.069
unct-0.067
osc-0.066
lee-0.066
 gag-0.065
Tokenotion
Token ID9650
Feature activation+7.07e-02

Pos logit contributions
We+0.110
 Friends+0.106
 deserve+0.104
By+0.103
Deal+0.103
Neg logit contributions
lee-0.094
unct-0.086
 breaths-0.084
 sob-0.082
gy-0.078
Token was
Token ID373
Feature activation+1.31e-02

Pos logit contributions
We+0.110
By+0.105
 Have+0.105
 Friends+0.104
 deserve+0.099
Neg logit contributions
 sob-0.108
lee-0.096
 yawn-0.096
 gn-0.093
gy-0.093
Token about
Token ID546
Feature activation+2.89e-02

Pos logit contributions
We+0.019
 of+0.018
 Friends+0.018
 I+0.018
 deserve+0.017
Neg logit contributions
 sob-0.021
unct-0.021
osc-0.021
 gag-0.020
lee-0.020
Token proud
Token ID6613
Feature activation+5.82e-02

Pos logit contributions
We+0.037
 of+0.034
 Friends+0.034
 I+0.034
By+0.034
Neg logit contributions
 sob-0.031
unct-0.030
osc-0.029
lee-0.029
 gag-0.029
Token that
Token ID326
Feature activation+1.63e-02

Pos logit contributions
We+0.059
 deserve+0.055
By+0.050
 Have+0.050
packing+0.049
Neg logit contributions
lee-0.118
 sob-0.118
 gn-0.114
 yawn-0.110
 gigg-0.110
Token she
Token ID673
Feature activation+2.42e-02

Pos logit contributions
We+0.025
 Friends+0.023
 deserve+0.023
By+0.023
 of+0.022
Neg logit contributions
 sob-0.026
unct-0.025
osc-0.024
lee-0.024
 gag-0.024
Token knew
Token ID2993
Feature activation+1.57e-02

Pos logit contributions
 Friends+0.027
We+0.027
 of+0.026
 Have+0.025
By+0.024
Neg logit contributions
 sob-0.046
lee-0.044
unct-0.043
osc-0.043
 gn-0.043
Token what
Token ID644
Feature activation+3.07e-02

Pos logit contributions
We+0.024
By+0.022
 Friends+0.022
 deserve+0.022
 Have+0.020
Neg logit contributions
 sob-0.025
lee-0.024
unct-0.024
 Sab-0.023
 gn-0.023
Token all
Token ID477
Feature activation+1.08e-01

Pos logit contributions
We+0.053
By+0.052
 deserve+0.050
 Friends+0.050
 Have+0.049
Neg logit contributions
 sob-0.039
 gn-0.036
 Sab-0.036
lee-0.036
 teeth-0.034
Token the
Token ID262
Feature activation+4.57e-02

Pos logit contributions
By+0.209
We+0.208
 Friends+0.197
 Have+0.192
In+0.185
Neg logit contributions
 sob-0.126
lee-0.124
 breaths-0.119
 Sab-0.118
 yawn-0.113
Token words
Token ID2456
Feature activation+7.43e-02

Pos logit contributions
We+0.074
 of+0.070
 I+0.070
 Friends+0.070
 deserve+0.070
Neg logit contributions
 sob-0.066
unct-0.066
lee-0.065
gy-0.064
 gag-0.063
Token meant
Token ID4001
Feature activation+3.69e-02

Pos logit contributions
 Friends+0.109
By+0.108
We+0.104
 Have+0.101
 kicker+0.093
Neg logit contributions
 sob-0.109
 gigg-0.100
 yawn-0.099
lee-0.099
 Sab-0.098
Token.
Token ID13
Feature activation-2.79e-03

Pos logit contributions
We+0.074
By+0.073
 deserve+0.069
 Friends+0.069
 Have+0.068
Neg logit contributions
 sob-0.038
lee-0.036
osc-0.034
 Sab-0.032
gy-0.032
Token From
Token ID3574
Feature activation+5.06e-02

Pos logit contributions
 sob+0.004
unct+0.004
osc+0.004
 gag+0.004
lee+0.004
Neg logit contributions
We-0.004
 Friends-0.004
 I-0.004
 of-0.004
By-0.004
Token.
Token ID13
Feature activation-4.67e-03

Pos logit contributions
osc+0.041
unct+0.041
 coy+0.040
icol+0.040
 Majesty+0.040
Neg logit contributions
 of-0.023
 I-0.020
 you-0.018
We-0.018
 we-0.017
Token
Token ID220
Feature activation-1.03e-02

Pos logit contributions
unct+0.007
osc+0.007
 sob+0.007
 gag+0.007
lee+0.007
Neg logit contributions
We-0.008
 I-0.007
 Friends-0.007
 of-0.007
By-0.007
Token
Token ID198
Feature activation+8.91e-03

Pos logit contributions
 sob+0.019
unct+0.018
osc+0.018
lee+0.018
 gag+0.018
Neg logit contributions
We-0.013
 Friends-0.012
 of-0.012
 I-0.012
By-0.012
Token"
Token ID1
Feature activation+1.75e-02

Pos logit contributions
We+0.011
 Friends+0.010
 of+0.010
 I+0.010
By+0.010
Neg logit contributions
 sob-0.016
unct-0.016
osc-0.016
lee-0.016
 gag-0.016
TokenWhat
Token ID2061
Feature activation+8.25e-02

Pos logit contributions
 of+0.015
 Friends+0.014
 yours+0.014
 Have+0.013
 I+0.013
Neg logit contributions
 Sab-0.034
 sob-0.034
 gn-0.033
 gag-0.032
j-0.032
Token shape
Token ID5485
Feature activation+1.07e-01

Pos logit contributions
By+0.122
We+0.116
 Have+0.106
 deserve+0.103
 Friends+0.101
Neg logit contributions
 sob-0.133
 teeth-0.128
 ears-0.118
 Sab-0.116
 yawn-0.116
Token is
Token ID318
Feature activation+3.89e-02

Pos logit contributions
By+0.154
We+0.152
 Friends+0.131
 Have+0.130
In+0.125
Neg logit contributions
 sob-0.180
 breaths-0.175
unct-0.175
lee-0.173
 teeth-0.173
Token the
Token ID262
Feature activation+4.10e-02

Pos logit contributions
By+0.062
We+0.061
 Friends+0.058
 deserve+0.056
 Have+0.055
Neg logit contributions
lee-0.056
 gn-0.055
 sob-0.054
 Sab-0.053
unct-0.053
Token moon
Token ID8824
Feature activation+4.43e-02

Pos logit contributions
We+0.066
 Friends+0.061
By+0.060
 of+0.060
 I+0.060
Neg logit contributions
 sob-0.061
unct-0.059
osc-0.058
lee-0.057
 gag-0.057
Token?"
Token ID1701
Feature activation-2.12e-02

Pos logit contributions
By+0.068
We+0.068
 Friends+0.066
 deserve+0.061
 Have+0.061
Neg logit contributions
 sob-0.066
unct-0.064
gy-0.063
lee-0.062
 gn-0.061
Token
Token ID220
Feature activation-2.52e-02

Pos logit contributions
unct+0.028
osc+0.028
gy+0.027
lee+0.027
iously+0.027
Neg logit contributions
We-0.038
 I-0.037
 of-0.036
 Friends-0.035
By-0.035
Tokenas
Token ID292
Feature activation+1.06e-02

Pos logit contributions
We+0.006
 Friends+0.005
 Have+0.004
 deserve+0.004
By+0.004
Neg logit contributions
 sob-0.017
 Sab-0.017
osc-0.017
 heartbeat-0.017
unct-0.016
Token.
Token ID13
Feature activation+2.15e-02

Pos logit contributions
We+0.015
By+0.015
 Friends+0.014
 deserve+0.014
 I+0.014
Neg logit contributions
 sob-0.017
unct-0.016
osc-0.016
lee-0.016
gy-0.016
Token
Token ID198
Feature activation+3.43e-02

Pos logit contributions
We+0.030
 deserve+0.028
By+0.028
 of+0.028
 Friends+0.027
Neg logit contributions
 sob-0.035
lee-0.033
osc-0.033
 gn-0.032
 Sab-0.032
Token
Token ID198
Feature activation+3.18e-02

Pos logit contributions
We+0.050
 Friends+0.046
 deserve+0.046
 I+0.046
 of+0.045
Neg logit contributions
 sob-0.055
lee-0.052
osc-0.052
unct-0.052
 gn-0.051
Token"
Token ID1
Feature activation+4.55e-02

Pos logit contributions
 of+0.036
 Friends+0.036
 deserve+0.036
 makers+0.035
We+0.035
Neg logit contributions
 sob-0.056
 gn-0.053
osc-0.051
unct-0.050
 Sab-0.050
TokenWho
Token ID8241
Feature activation+1.07e-01

Pos logit contributions
 of+0.063
 Friends+0.058
 yours+0.054
 deserve+0.053
 I+0.053
Neg logit contributions
 sob-0.080
 gn-0.073
osc-0.069
 Sab-0.069
 babys-0.067
Token wants
Token ID3382
Feature activation+5.93e-02

Pos logit contributions
We+0.147
By+0.145
 Friends+0.143
 Have+0.137
 importantly+0.130
Neg logit contributions
 sob-0.175
 teeth-0.166
gy-0.163
 Sab-0.161
unct-0.160
Token pizza
Token ID14256
Feature activation+3.02e-02

Pos logit contributions
We+0.095
By+0.091
 Friends+0.089
 Have+0.086
 deserve+0.085
Neg logit contributions
 sob-0.087
osc-0.080
lee-0.080
 gn-0.080
unct-0.080
Token?"
Token ID1701
Feature activation-1.63e-02

Pos logit contributions
We+0.053
By+0.053
 Have+0.051
 Friends+0.050
 importantly+0.049
Neg logit contributions
 sob-0.037
gy-0.035
 yawn-0.034
 gn-0.034
 teeth-0.034
Token Lily
Token ID20037
Feature activation-3.70e-02

Pos logit contributions
unct+0.029
osc+0.029
 breaths+0.028
iously+0.027
 gag+0.027
Neg logit contributions
 I-0.023
We-0.022
 of-0.021
 we-0.020
 you-0.020
Token asks
Token ID7893
Feature activation-4.30e-02

Pos logit contributions
enery+0.074
 Majesty+0.072
gy+0.071
unct+0.071
osc+0.069
Neg logit contributions
 I-0.041
We-0.038
 of-0.036
 can-0.033
I-0.033
Tokenincoln
Token ID11690
Feature activation+1.81e-02

Pos logit contributions
 sob+0.006
unct+0.005
osc+0.005
lee+0.005
 gn+0.005
Neg logit contributions
We-0.004
 Friends-0.003
 I-0.003
 deserve-0.003
By-0.003
Token".
Token ID1911
Feature activation-1.06e-03

Pos logit contributions
We+0.027
 of+0.026
 I+0.026
 Friends+0.025
 deserve+0.024
Neg logit contributions
 sob-0.028
unct-0.028
 gag-0.027
osc-0.027
lee-0.027
Token
Token ID198
Feature activation+1.57e-02

Pos logit contributions
unct+0.002
 sob+0.002
 gag+0.002
 breaths+0.002
osc+0.002
Neg logit contributions
We-0.002
 I-0.002
 of-0.001
 Friends-0.001
 Have-0.001
Token
Token ID198
Feature activation+2.04e-02

Pos logit contributions
We+0.019
 of+0.017
 Friends+0.017
 I+0.017
By+0.017
Neg logit contributions
 sob-0.030
unct-0.029
osc-0.029
 gag-0.028
lee-0.028
Token"
Token ID1
Feature activation+3.48e-02

Pos logit contributions
We+0.024
 Friends+0.024
 of+0.023
 I+0.022
 deserve+0.022
Neg logit contributions
 sob-0.037
 gn-0.036
osc-0.035
unct-0.035
lee-0.034
TokenWho
Token ID8241
Feature activation+1.07e-01

Pos logit contributions
 of+0.048
 Friends+0.044
 I+0.041
 we+0.040
 yours+0.040
Neg logit contributions
 sob-0.061
 gn-0.058
osc-0.056
 Sab-0.056
 gag-0.056
Token is
Token ID318
Feature activation+2.68e-02

Pos logit contributions
We+0.143
By+0.135
 Friends+0.135
 Have+0.128
 importantly+0.123
Neg logit contributions
 sob-0.183
 teeth-0.178
osc-0.174
 Sab-0.171
tails-0.169
Token Lincoln
Token ID12406
Feature activation-4.34e-03

Pos logit contributions
We+0.035
By+0.033
 Friends+0.033
 deserve+0.033
 of+0.031
Neg logit contributions
 sob-0.046
 gn-0.045
unct-0.045
 Sab-0.045
gy-0.044
Token,
Token ID11
Feature activation+2.12e-02

Pos logit contributions
 sob+0.007
lee+0.007
unct+0.007
osc+0.007
 breaths+0.007
Neg logit contributions
We-0.006
 of-0.006
 I-0.006
 Friends-0.006
 deserve-0.006
Token Ben
Token ID3932
Feature activation-4.96e-03

Pos logit contributions
We+0.034
 Friends+0.032
By+0.032
 deserve+0.031
 Have+0.031
Neg logit contributions
 sob-0.031
 gn-0.029
 Sab-0.029
unct-0.029
osc-0.028
Token?
Token ID30
Feature activation+5.02e-02

Pos logit contributions
 sob+0.009
unct+0.009
osc+0.009
lee+0.009
 gag+0.009
Neg logit contributions
We-0.006
 Friends-0.005
 I-0.005
 of-0.005
 deserve-0.005
TokenMom
Token ID29252
Feature activation-1.65e-03

Pos logit contributions
We+0.039
 I+0.034
 Friends+0.034
 of+0.034
By+0.034
Neg logit contributions
 sob-0.068
unct-0.066
osc-0.066
lee-0.065
 gag-0.065
Tokenmy
Token ID1820
Feature activation+2.01e-02

Pos logit contributions
 gag+0.003
 sob+0.003
osc+0.003
 breaths+0.003
unct+0.003
Neg logit contributions
 I-0.002
 of-0.002
We-0.002
 Friends-0.002
 deserve-0.002
Token,
Token ID11
Feature activation+3.89e-02

Pos logit contributions
We+0.036
 I+0.035
 of+0.034
 Friends+0.034
 deserve+0.034
Neg logit contributions
 sob-0.026
unct-0.025
osc-0.025
 gag-0.024
lee-0.024
Token what
Token ID644
Feature activation+7.69e-02

Pos logit contributions
We+0.067
By+0.065
 Friends+0.064
 deserve+0.060
 Have+0.058
Neg logit contributions
 sob-0.054
lee-0.049
 gn-0.049
 Sab-0.049
unct-0.049
Token are
Token ID389
Feature activation+7.44e-02

Pos logit contributions
By+0.112
We+0.102
 hour+0.099
 Have+0.095
 Friends+0.093
Neg logit contributions
 sob-0.126
 teeth-0.118
 Sab-0.116
osc-0.111
 daughter-0.111
Token all
Token ID477
Feature activation+1.06e-01

Pos logit contributions
 Friends+0.059
By+0.059
We+0.057
 deserve+0.055
 Have+0.053
Neg logit contributions
 Sab-0.164
lee-0.159
 sob-0.158
unct-0.158
 K-0.154
Token these
Token ID777
Feature activation+4.28e-02

Pos logit contributions
By+0.182
 Friends+0.178
We+0.177
Deal+0.165
 Have+0.156
Neg logit contributions
 Sab-0.146
lee-0.145
 wardrobe-0.140
 breaths-0.140
unct-0.140
Token keys
Token ID8251
Feature activation+7.55e-02

Pos logit contributions
We+0.080
By+0.079
 Friends+0.077
 Have+0.074
 deserve+0.071
Neg logit contributions
 sob-0.049
osc-0.047
unct-0.047
lee-0.046
 breaths-0.046
Token for
Token ID329
Feature activation+4.65e-02

Pos logit contributions
By+0.140
 Friends+0.129
We+0.127
Deal+0.122
In+0.117
Neg logit contributions
gy-0.089
 sob-0.088
unct-0.087
lee-0.086
 wardrobe-0.082
Token?"
Token ID1701
Feature activation-2.61e-02

Pos logit contributions
We+0.066
 Friends+0.060
By+0.060
 deserve+0.058
 Have+0.058
Neg logit contributions
 sob-0.078
unct-0.074
lee-0.074
osc-0.074
 breaths-0.072
Token
Token ID198
Feature activation-5.36e-03

Pos logit contributions
unct+0.041
 gag+0.041
osc+0.040
 breaths+0.040
 gn+0.039
Neg logit contributions
We-0.042
 of-0.039
 Friends-0.039
 I-0.039
I-0.036
Token and
Token ID290
Feature activation-1.64e-02

Pos logit contributions
 sob+0.064
osc+0.064
 gag+0.064
unct+0.063
 Majesty+0.061
Neg logit contributions
We-0.053
 of-0.050
 I-0.047
 Friends-0.047
 we-0.044
Token asked
Token ID1965
Feature activation-3.42e-02

Pos logit contributions
osc+0.027
 gag+0.026
 sob+0.025
unct+0.025
lee+0.024
Neg logit contributions
We-0.025
 I-0.023
 of-0.023
 we-0.022
By-0.022
Token her
Token ID607
Feature activation-4.65e-02

Pos logit contributions
 gag+0.059
gy+0.056
osc+0.055
unct+0.055
 sob+0.055
Neg logit contributions
 of-0.054
 I-0.049
We-0.045
 you-0.045
 can-0.044
Token dad
Token ID9955
Feature activation-4.19e-02

Pos logit contributions
osc+0.071
unct+0.070
 gag+0.067
gy+0.066
 sob+0.065
Neg logit contributions
 of-0.085
 I-0.076
We-0.073
 can-0.073
 we-0.073
Token what
Token ID644
Feature activation+3.67e-02

Pos logit contributions
 sob+0.077
unct+0.075
 gag+0.075
osc+0.074
 breaths+0.074
Neg logit contributions
 of-0.053
We-0.052
 I-0.050
 Friends-0.048
 we-0.046
Token all
Token ID477
Feature activation+1.06e-01

Pos logit contributions
We+0.054
By+0.053
 deserve+0.052
 Friends+0.051
 Have+0.050
Neg logit contributions
 sob-0.056
 gn-0.055
 Sab-0.054
lee-0.054
osc-0.051
Token these
Token ID777
Feature activation+3.51e-02

Pos logit contributions
By+0.197
We+0.194
 Friends+0.185
 Have+0.178
 deserve+0.171
Neg logit contributions
lee-0.138
 sob-0.127
 Sab-0.127
 breaths-0.125
 gn-0.122
Token things
Token ID1243
Feature activation+6.15e-02

Pos logit contributions
We+0.057
 Friends+0.053
 I+0.053
By+0.053
 of+0.052
Neg logit contributions
 sob-0.051
unct-0.049
lee-0.049
osc-0.049
 gag-0.048
Token were
Token ID547
Feature activation+7.31e-03

Pos logit contributions
By+0.091
We+0.089
 Friends+0.089
 Have+0.085
 deserve+0.080
Neg logit contributions
 sob-0.097
lee-0.094
 gigg-0.088
 yawn-0.087
unct-0.086
Token.
Token ID13
Feature activation-9.67e-03

Pos logit contributions
We+0.012
By+0.012
 Friends+0.012
 I+0.011
 deserve+0.011
Neg logit contributions
 sob-0.010
lee-0.010
unct-0.010
osc-0.009
 gn-0.009
Token He
Token ID679
Feature activation+1.25e-02

Pos logit contributions
unct+0.015
 sob+0.014
 breaths+0.014
osc+0.014
 gag+0.014
Neg logit contributions
We-0.015
 I-0.015
 Friends-0.015
 of-0.014
 Have-0.014
Token,
Token ID11
Feature activation-1.82e-02

Pos logit contributions
unct+0.033
osc+0.033
 sob+0.032
gy+0.032
 gag+0.032
Neg logit contributions
 I-0.035
We-0.034
 of-0.034
 we-0.032
 Friends-0.031
Token "
Token ID366
Feature activation+5.01e-02

Pos logit contributions
 sob+0.032
unct+0.032
osc+0.032
lee+0.031
 gag+0.031
Neg logit contributions
We-0.024
 I-0.022
 of-0.021
 Friends-0.021
By-0.021
TokenHey
Token ID10814
Feature activation+3.50e-02

Pos logit contributions
 Friends+0.057
 of+0.056
 deserve+0.053
 I+0.052
 we+0.051
Neg logit contributions
 sob-0.089
 gn-0.087
 Sab-0.086
lee-0.085
unct-0.085
Token,
Token ID11
Feature activation+5.14e-02

Pos logit contributions
 I+0.057
We+0.057
 of+0.054
 we+0.054
 you+0.051
Neg logit contributions
 sob-0.053
osc-0.053
unct-0.052
 gag-0.052
gy-0.049
Token what
Token ID644
Feature activation+7.27e-02

Pos logit contributions
We+0.070
 Friends+0.067
By+0.067
 deserve+0.061
 Have+0.060
Neg logit contributions
 sob-0.088
lee-0.083
 gn-0.083
unct-0.083
 Sab-0.081
Token have
Token ID423
Feature activation+1.06e-01

Pos logit contributions
By+0.085
We+0.082
 Friends+0.076
 hour+0.072
 Have+0.070
Neg logit contributions
 sob-0.136
 teeth-0.130
osc-0.128
 Sab-0.126
 yawn-0.124
Token you
Token ID345
Feature activation+4.04e-02

Pos logit contributions
 Friends+0.074
We+0.072
By+0.067
 deserve+0.067
 Have+0.064
Neg logit contributions
 Sab-0.237
 sob-0.234
lee-0.230
unct-0.227
 M-0.223
Token found
Token ID1043
Feature activation+6.12e-02

Pos logit contributions
We+0.057
 Friends+0.056
By+0.055
 I+0.050
 of+0.050
Neg logit contributions
 sob-0.066
unct-0.065
lee-0.064
 gn-0.062
osc-0.062
Token there
Token ID612
Feature activation+4.53e-02

Pos logit contributions
We+0.086
 Friends+0.079
By+0.079
 deserve+0.076
 Have+0.074
Neg logit contributions
 sob-0.104
unct-0.100
lee-0.100
osc-0.098
 gn-0.097
Token?"
Token ID1701
Feature activation-1.19e-02

Pos logit contributions
We+0.067
By+0.063
 Friends+0.062
 deserve+0.059
 Have+0.057
Neg logit contributions
 sob-0.071
unct-0.069
lee-0.067
gy-0.067
osc-0.067
Token
Token ID198
Feature activation+5.33e-03

Pos logit contributions
 sob+0.019
unct+0.019
osc+0.018
lee+0.018
 gag+0.018
Neg logit contributions
We-0.018
 of-0.016
 Friends-0.016
 I-0.016
By-0.016
Token.
Token ID13
Feature activation-6.23e-03

Pos logit contributions
We+0.015
 of+0.014
 Friends+0.013
 I+0.013
 deserve+0.013
Neg logit contributions
 sob-0.013
unct-0.013
osc-0.013
lee-0.013
 gag-0.013
Token The
Token ID383
Feature activation+6.87e-03

Pos logit contributions
 sob+0.009
unct+0.008
lee+0.008
osc+0.008
 gag+0.008
Neg logit contributions
We-0.010
 Friends-0.010
 of-0.010
 I-0.010
By-0.010
Token kids
Token ID3988
Feature activation+2.14e-02

Pos logit contributions
We+0.010
 of+0.009
 I+0.009
 Friends+0.009
 deserve+0.009
Neg logit contributions
 sob-0.012
unct-0.011
osc-0.011
gy-0.011
lee-0.011
Token asked
Token ID1965
Feature activation-2.18e-02

Pos logit contributions
We+0.028
 Friends+0.027
By+0.026
 I+0.026
 deserve+0.026
Neg logit contributions
 sob-0.038
unct-0.036
lee-0.036
osc-0.036
gy-0.035
Token what
Token ID644
Feature activation+3.29e-02

Pos logit contributions
 sob+0.039
unct+0.039
 gag+0.039
osc+0.037
 breaths+0.037
Neg logit contributions
 of-0.029
We-0.028
 Friends-0.027
 I-0.026
 we-0.025
Token all
Token ID477
Feature activation+1.05e-01

Pos logit contributions
By+0.045
 deserve+0.044
We+0.044
 Have+0.043
 Friends+0.042
Neg logit contributions
 sob-0.052
 Sab-0.050
 gn-0.049
 M-0.047
lee-0.047
Token the
Token ID262
Feature activation+3.94e-02

Pos logit contributions
By+0.200
We+0.184
 Friends+0.176
 Have+0.172
In+0.170
Neg logit contributions
lee-0.127
 Sab-0.123
 sob-0.117
 M-0.116
 Su-0.114
Token people
Token ID661
Feature activation+5.15e-02

Pos logit contributions
We+0.059
 of+0.055
 Friends+0.053
 deserve+0.053
 I+0.053
Neg logit contributions
 sob-0.065
unct-0.064
gy-0.063
lee-0.062
osc-0.062
Token were
Token ID547
Feature activation-2.72e-03

Pos logit contributions
By+0.079
 Friends+0.073
We+0.072
 Have+0.068
In+0.065
Neg logit contributions
 sob-0.078
lee-0.074
 yawn-0.074
 gigg-0.073
unct-0.072
Token doing
Token ID1804
Feature activation+3.67e-03

Pos logit contributions
 sob+0.004
unct+0.004
osc+0.004
 gag+0.004
lee+0.004
Neg logit contributions
We-0.004
 of-0.004
 I-0.004
 Friends-0.004
 deserve-0.004
Token and
Token ID290
Feature activation+7.66e-04

Pos logit contributions
By+0.007
We+0.007
 deserve+0.006
 Friends+0.006
 Have+0.006
Neg logit contributions
 sob-0.004
lee-0.004
unct-0.004
osc-0.004
gy-0.004
Token letters
Token ID7475
Feature activation+1.99e-02

Pos logit contributions
 sob+0.015
unct+0.015
lee+0.014
osc+0.014
 gag+0.014
Neg logit contributions
We-0.021
 of-0.020
 I-0.020
 Friends-0.020
 deserve-0.019
Token.
Token ID13
Feature activation-4.33e-03

Pos logit contributions
We+0.037
By+0.036
 Friends+0.035
 deserve+0.035
 Have+0.035
Neg logit contributions
 sob-0.022
gy-0.022
 Sab-0.021
lee-0.020
osc-0.020
Token
Token ID198
Feature activation+9.62e-03

Pos logit contributions
 sob+0.007
lee+0.007
osc+0.007
unct+0.007
 gag+0.007
Neg logit contributions
We-0.006
 deserve-0.005
 Friends-0.005
By-0.005
 of-0.005
Token
Token ID198
Feature activation+1.68e-02

Pos logit contributions
 deserve+0.015
We+0.015
 Friends+0.014
 Have+0.014
 I+0.014
Neg logit contributions
 sob-0.014
 Sab-0.012
osc-0.012
lee-0.012
gy-0.012
Token"
Token ID1
Feature activation+4.67e-02

Pos logit contributions
 Friends+0.025
 deserve+0.024
 deserves+0.024
 Have+0.023
 I+0.023
Neg logit contributions
 sob-0.022
j-0.021
pee-0.020
mary-0.020
 curiously-0.020
TokenWho
Token ID8241
Feature activation+1.05e-01

Pos logit contributions
 of+0.079
 Friends+0.078
 I+0.077
 we+0.076
 Have+0.073
Neg logit contributions
 sob-0.058
osc-0.057
 gn-0.055
 Sab-0.055
 gag-0.050
Token do
Token ID466
Feature activation+5.43e-02

Pos logit contributions
By+0.148
We+0.145
 Friends+0.137
 Have+0.136
 importantly+0.134
Neg logit contributions
 teeth-0.157
 sob-0.157
osc-0.148
 Sab-0.145
 dial-0.145
Token you
Token ID345
Feature activation+3.14e-02

Pos logit contributions
By+0.075
We+0.075
 Friends+0.067
itting+0.064
 deserve+0.064
Neg logit contributions
 sob-0.084
 gn-0.084
osc-0.081
 teeth-0.081
 Sab-0.080
Token want
Token ID765
Feature activation+6.49e-02

Pos logit contributions
We+0.048
 Friends+0.047
By+0.045
 Have+0.043
 importantly+0.042
Neg logit contributions
 sob-0.046
unct-0.046
lee-0.044
 gn-0.044
 yawn-0.043
Token to
Token ID284
Feature activation+7.61e-04

Pos logit contributions
We+0.117
By+0.114
 Friends+0.109
 Have+0.108
 deserve+0.105
Neg logit contributions
 sob-0.080
osc-0.074
lee-0.074
 Sab-0.073
 gn-0.072
Token mail
Token ID6920
Feature activation+3.73e-02

Pos logit contributions
We+0.001
 Friends+0.001
By+0.001
 Have+0.001
 of+0.001
Neg logit contributions
 sob-0.001
unct-0.001
lee-0.001
 gn-0.001
 gag-0.001
TokenWow
Token ID22017
Feature activation+5.39e-02

Pos logit contributions
 of+0.045
 I+0.042
 yours+0.042
igator+0.040
 we+0.040
Neg logit contributions
 Sab-0.072
osc-0.068
 sob-0.068
 gn-0.067
Sab-0.066
Token,
Token ID11
Feature activation+4.00e-02

Pos logit contributions
We+0.102
 Friends+0.096
 of+0.095
 I+0.095
By+0.094
Neg logit contributions
 sob-0.065
unct-0.062
osc-0.061
lee-0.060
 gag-0.059
Token what
Token ID644
Feature activation+6.31e-02

Pos logit contributions
We+0.050
 Friends+0.046
By+0.045
 deserve+0.043
In+0.043
Neg logit contributions
 sob-0.073
lee-0.070
 gn-0.070
 Sab-0.069
unct-0.068
Token a
Token ID257
Feature activation+4.73e-02

Pos logit contributions
We+0.104
By+0.104
packing+0.100
Deal+0.099
 Friends+0.099
Neg logit contributions
 sob-0.077
 teeth-0.076
 yawn-0.072
 Sab-0.071
 gn-0.070
Token great
Token ID1049
Feature activation+5.01e-02

Pos logit contributions
We+0.073
 Friends+0.068
 of+0.067
 I+0.067
By+0.066
Neg logit contributions
 sob-0.073
unct-0.070
lee-0.069
osc-0.069
 gn-0.068
Token shape
Token ID5485
Feature activation+1.04e-01

Pos logit contributions
We+0.087
By+0.080
 Friends+0.079
 Have+0.077
 deserve+0.077
Neg logit contributions
 Majesty-0.067
 gn-0.066
 sob-0.065
lee-0.065
unct-0.064
Token show
Token ID905
Feature activation+9.20e-02

Pos logit contributions
By+0.192
We+0.176
In+0.176
 Friends+0.165
 deserve+0.164
Neg logit contributions
 gn-0.124
lee-0.123
 Sab-0.120
 breaths-0.119
 sob-0.119
Token!
Token ID0
Feature activation+5.34e-02

Pos logit contributions
We+0.160
By+0.159
 Friends+0.153
In+0.150
Deal+0.148
Neg logit contributions
lee-0.106
 Sab-0.105
 sob-0.099
 gn-0.096
 Tinker-0.095
Token You
Token ID921
Feature activation+7.71e-02

Pos logit contributions
We+0.056
By+0.054
 of+0.053
 deserve+0.051
In+0.049
Neg logit contributions
 Sab-0.100
lee-0.098
 gn-0.097
 sob-0.096
osc-0.095
Token are
Token ID389
Feature activation+2.15e-02

Pos logit contributions
 Friends+0.092
We+0.089
By+0.079
 importantly+0.079
 Have+0.078
Neg logit contributions
 sob-0.139
 Sab-0.132
 gn-0.131
osc-0.131
unct-0.131
Token very
Token ID845
Feature activation-1.13e-02

Pos logit contributions
We+0.042
 Friends+0.041
By+0.040
 of+0.039
 deserve+0.039
Neg logit contributions
 sob-0.023
lee-0.022
 gn-0.022
 Sab-0.022
osc-0.021
Token,
Token ID11
Feature activation-3.19e-02

Pos logit contributions
osc+0.046
 sob+0.046
unct+0.045
lee+0.044
 gag+0.044
Neg logit contributions
We-0.057
 of-0.056
 I-0.056
 Friends-0.054
By-0.053
Token "
Token ID366
Feature activation+4.21e-02

Pos logit contributions
 sob+0.057
unct+0.055
osc+0.054
lee+0.054
 gag+0.053
Neg logit contributions
We-0.042
 Friends-0.039
 I-0.038
 of-0.038
By-0.038
TokenSure
Token ID19457
Feature activation+2.56e-02

Pos logit contributions
 of+0.055
 yours+0.053
igators+0.053
ions+0.052
 importantly+0.052
Neg logit contributions
 sob-0.063
j-0.058
 gag-0.057
 bark-0.057
 gn-0.056
Token,
Token ID11
Feature activation+3.31e-02

Pos logit contributions
 I+0.042
 of+0.041
We+0.041
 we+0.039
 Friends+0.039
Neg logit contributions
 sob-0.038
unct-0.037
osc-0.037
 gag-0.036
lee-0.036
Token what
Token ID644
Feature activation+6.16e-02

Pos logit contributions
We+0.034
By+0.029
 Friends+0.029
In+0.027
 deserve+0.027
Neg logit contributions
 sob-0.068
 M-0.066
 gn-0.065
 yawn-0.063
 daughter-0.063
Token game
Token ID983
Feature activation+1.04e-01

Pos logit contributions
We+0.091
By+0.086
packing+0.081
 Have+0.080
 course+0.080
Neg logit contributions
 sob-0.095
 teeth-0.092
 yawn-0.085
 ears-0.084
 gag-0.083
Token do
Token ID466
Feature activation+7.29e-02

Pos logit contributions
We+0.166
By+0.159
Our+0.143
itting+0.131
 Have+0.130
Neg logit contributions
 sob-0.150
 up-0.148
 bark-0.140
 cries-0.138
 eyes-0.136
Token you
Token ID345
Feature activation+1.03e-02

Pos logit contributions
We+0.085
ifying+0.085
Uh+0.085
By+0.083
itting+0.079
Neg logit contributions
 M-0.117
 teeth-0.109
 knees-0.108
 Sab-0.107
 Fro-0.106
Token want
Token ID765
Feature activation+4.48e-02

Pos logit contributions
We+0.016
By+0.015
 importantly+0.014
 Friends+0.014
 makers+0.013
Neg logit contributions
 yawn-0.015
 sob-0.015
osc-0.015
 weep-0.015
unct-0.014
Token to
Token ID284
Feature activation-2.16e-02

Pos logit contributions
We+0.079
By+0.075
 Have+0.070
 deserve+0.069
 Friends+0.069
Neg logit contributions
 sob-0.060
osc-0.055
lee-0.054
 gn-0.053
 bed-0.053
Token play
Token ID711
Feature activation+1.16e-02

Pos logit contributions
 sob+0.031
unct+0.030
lee+0.030
osc+0.029
 gn+0.029
Neg logit contributions
We-0.036
 Friends-0.033
By-0.032
 of-0.032
 deserve-0.032
Token of
Token ID286
Feature activation+4.73e-02

Pos logit contributions
We+0.072
By+0.071
 Friends+0.067
 deserve+0.064
 Have+0.064
Neg logit contributions
lee-0.081
 sob-0.078
gy-0.076
 Sab-0.075
iously-0.075
Token the
Token ID262
Feature activation+7.64e-04

Pos logit contributions
We+0.070
 of+0.067
 I+0.066
 Friends+0.066
 deserve+0.065
Neg logit contributions
 sob-0.074
unct-0.073
osc-0.071
lee-0.070
 gag-0.070
Token sky
Token ID6766
Feature activation+5.08e-02

Pos logit contributions
We+0.001
 of+0.001
 I+0.001
 Friends+0.001
 deserve+0.001
Neg logit contributions
 sob-0.001
unct-0.001
osc-0.001
lee-0.001
 gag-0.001
Token,
Token ID11
Feature activation-1.09e-02

Pos logit contributions
We+0.085
By+0.083
 Friends+0.081
 deserve+0.078
 Have+0.077
Neg logit contributions
 sob-0.069
lee-0.067
gy-0.065
osc-0.065
unct-0.064
Token what
Token ID644
Feature activation+4.07e-02

Pos logit contributions
 sob+0.017
unct+0.016
osc+0.016
lee+0.016
 gag+0.016
Neg logit contributions
We-0.017
 Friends-0.016
By-0.015
 of-0.015
 I-0.015
Token shape
Token ID5485
Feature activation+1.04e-01

Pos logit contributions
By+0.061
We+0.059
 Have+0.057
 Friends+0.056
 deserve+0.055
Neg logit contributions
 sob-0.061
 teeth-0.059
 Sab-0.058
 gn-0.057
lee-0.056
Token a
Token ID257
Feature activation+1.53e-02

Pos logit contributions
By+0.165
We+0.150
 Friends+0.146
 Have+0.144
Deal+0.133
Neg logit contributions
lee-0.161
 sob-0.156
 teeth-0.154
 Sab-0.150
 breaths-0.150
Token balloon
Token ID21190
Feature activation+6.21e-02

Pos logit contributions
We+0.025
 of+0.023
 I+0.023
 Friends+0.023
 deserve+0.023
Neg logit contributions
 sob-0.022
unct-0.022
osc-0.022
lee-0.021
 gag-0.021
Token was
Token ID373
Feature activation+4.54e-04

Pos logit contributions
By+0.103
We+0.103
 Friends+0.096
In+0.094
 deserve+0.092
Neg logit contributions
 sob-0.082
 gn-0.082
gy-0.082
lee-0.079
 Sab-0.077
Token and
Token ID290
Feature activation+1.87e-02

Pos logit contributions
We+0.001
 Friends+0.001
By+0.001
 I+0.001
 deserve+0.001
Neg logit contributions
 sob-0.001
unct-0.001
lee-0.001
osc-0.001
 gn-0.001
Token even
Token ID772
Feature activation+3.29e-02

Pos logit contributions
We+0.029
 Friends+0.028
By+0.028
 deserve+0.027
 Have+0.026
Neg logit contributions
 sob-0.028
lee-0.027
 gn-0.027
unct-0.026
 breaths-0.026
Token curious
Token ID11040
Feature activation+1.53e-02

Pos logit contributions
 sob+0.011
unct+0.011
osc+0.011
lee+0.011
 gag+0.011
Neg logit contributions
We-0.015
 I-0.014
 of-0.014
 Friends-0.014
By-0.013
Token.
Token ID13
Feature activation-3.73e-03

Pos logit contributions
We+0.024
 of+0.022
 Friends+0.022
 I+0.022
 deserve+0.021
Neg logit contributions
 sob-0.024
unct-0.023
osc-0.023
lee-0.022
 gag-0.022
Token
Token ID198
Feature activation+2.16e-02

Pos logit contributions
 sob+0.006
unct+0.006
osc+0.006
lee+0.006
 gag+0.006
Neg logit contributions
We-0.005
 Friends-0.005
 I-0.005
 of-0.005
By-0.005
Token
Token ID198
Feature activation+2.36e-02

Pos logit contributions
We+0.028
 Friends+0.026
 of+0.025
 I+0.025
 deserve+0.025
Neg logit contributions
 sob-0.039
unct-0.037
lee-0.037
osc-0.037
 gag-0.036
Token"
Token ID1
Feature activation+4.15e-02

Pos logit contributions
 Friends+0.025
 of+0.024
We+0.024
 deserve+0.023
 makers+0.023
Neg logit contributions
 sob-0.045
 gn-0.042
lee-0.041
osc-0.041
unct-0.041
TokenWho
Token ID8241
Feature activation+1.03e-01

Pos logit contributions
 of+0.058
 Friends+0.053
 deserve+0.050
 I+0.050
 Have+0.049
Neg logit contributions
 sob-0.073
 gn-0.068
osc-0.065
 Sab-0.064
unct-0.062
Token are
Token ID389
Feature activation+3.91e-02

Pos logit contributions
We+0.147
By+0.144
 Friends+0.143
 Have+0.136
 importantly+0.131
Neg logit contributions
 sob-0.162
 teeth-0.155
gy-0.152
ered-0.151
unct-0.149
Token you
Token ID345
Feature activation+7.27e-03

Pos logit contributions
We+0.026
By+0.026
 Friends+0.023
 deserve+0.023
 Have+0.021
Neg logit contributions
unct-0.089
 sob-0.089
 Sab-0.089
 gn-0.088
lee-0.087
Token?"
Token ID1701
Feature activation-1.08e-02

Pos logit contributions
We+0.010
 I+0.009
 of+0.009
 Friends+0.009
 deserve+0.009
Neg logit contributions
 sob-0.013
osc-0.013
 gag-0.012
unct-0.012
lee-0.012
Token the
Token ID262
Feature activation+1.24e-03

Pos logit contributions
unct+0.019
osc+0.019
 breaths+0.019
 gag+0.018
iously+0.018
Neg logit contributions
 I-0.015
We-0.014
 of-0.014
 we-0.013
 you-0.013
Token cat
Token ID3797
Feature activation-2.80e-02

Pos logit contributions
We+0.002
 I+0.002
 of+0.002
 Friends+0.002
By+0.002
Neg logit contributions
osc-0.002
unct-0.002
 sob-0.002
lee-0.002
 gag-0.002
TokenJohn
Token ID7554
Feature activation+2.49e-02

Pos logit contributions
 we+0.060
 of+0.059
igator+0.058
 Friends+0.056
igators+0.056
Neg logit contributions
 Sab-0.072
 Kay-0.071
 Ab-0.070
 Su-0.070
lement-0.070
Token,
Token ID11
Feature activation+3.75e-02

Pos logit contributions
We+0.047
 Friends+0.044
By+0.044
 of+0.043
 deserve+0.043
Neg logit contributions
 sob-0.030
unct-0.028
lee-0.028
osc-0.028
gy-0.027
Token what
Token ID644
Feature activation+7.58e-02

Pos logit contributions
By+0.049
We+0.048
 Friends+0.044
 makers+0.042
ite+0.040
Neg logit contributions
 sob-0.060
 M-0.059
 daughter-0.057
 yawn-0.057
 gn-0.056
Token a
Token ID257
Feature activation+5.67e-02

Pos logit contributions
By+0.115
We+0.102
 hour+0.101
 Friends+0.095
hest+0.094
Neg logit contributions
 teeth-0.111
 sob-0.111
 yawn-0.105
 daughter-0.104
 M-0.098
Token wonderful
Token ID7932
Feature activation+4.78e-02

Pos logit contributions
We+0.085
 Friends+0.078
 of+0.078
 I+0.078
By+0.077
Neg logit contributions
 sob-0.091
unct-0.088
osc-0.086
lee-0.086
 gag-0.085
Token picture
Token ID4286
Feature activation+1.03e-01

Pos logit contributions
We+0.076
 Friends+0.070
By+0.069
 I+0.068
 of+0.068
Neg logit contributions
 sob-0.072
unct-0.069
lee-0.069
osc-0.068
 gn-0.067
Token you
Token ID345
Feature activation+3.65e-02

Pos logit contributions
By+0.161
We+0.150
In+0.141
 Friends+0.138
Deal+0.137
Neg logit contributions
 up-0.141
 yawn-0.135
 sob-0.133
 cries-0.132
 off-0.128
Token have
Token ID423
Feature activation+3.96e-02

Pos logit contributions
 Friends+0.048
We+0.048
By+0.045
 importantly+0.042
 of+0.041
Neg logit contributions
lee-0.061
unct-0.060
 sob-0.060
 gn-0.057
 gigg-0.056
Token made
Token ID925
Feature activation+3.15e-02

Pos logit contributions
 Friends+0.062
We+0.061
By+0.061
 Have+0.057
igator+0.056
Neg logit contributions
lee-0.056
 sob-0.055
unct-0.051
 gn-0.050
osc-0.049
Token!
Token ID0
Feature activation+6.17e-02

Pos logit contributions
By+0.056
We+0.056
 Have+0.050
 Friends+0.050
In+0.050
Neg logit contributions
 sob-0.038
lee-0.037
unct-0.036
osc-0.036
 breaths-0.034
Token You
Token ID921
Feature activation+7.16e-02

Pos logit contributions
 makers+0.082
By+0.081
 hour+0.080
We+0.079
igator+0.076
Neg logit contributions
lee-0.090
 Sab-0.088
 M-0.087
 gn-0.085
 K-0.084
Token "
Token ID366
Feature activation+4.83e-02

Pos logit contributions
 sob+0.056
unct+0.055
osc+0.054
lee+0.054
 gag+0.053
Neg logit contributions
We-0.041
 Friends-0.037
 I-0.037
 of-0.037
By-0.036
TokenNo
Token ID2949
Feature activation+2.85e-02

Pos logit contributions
 of+0.063
 yours+0.058
 Friends+0.056
 Have+0.055
 deserve+0.055
Neg logit contributions
 sob-0.082
gy-0.075
 coy-0.075
 gn-0.074
unct-0.073
Token way
Token ID835
Feature activation+2.69e-02

Pos logit contributions
 of+0.044
 I+0.044
 we+0.043
 Friends+0.042
 you+0.041
Neg logit contributions
 gag-0.045
osc-0.045
unct-0.044
 sob-0.044
 gn-0.042
Token!
Token ID0
Feature activation+5.52e-02

Pos logit contributions
We+0.050
 Friends+0.047
By+0.047
 deserve+0.047
 of+0.046
Neg logit contributions
 sob-0.033
unct-0.032
lee-0.031
osc-0.031
gy-0.030
Token That
Token ID1320
Feature activation+4.85e-02

Pos logit contributions
We+0.054
By+0.048
 of+0.048
 deserve+0.047
 Friends+0.045
Neg logit contributions
 sob-0.114
lee-0.111
gy-0.110
unct-0.108
osc-0.108
Token all
Token ID477
Feature activation+1.03e-01

Pos logit contributions
By+0.089
 Friends+0.086
We+0.085
 Have+0.083
packing+0.082
Neg logit contributions
 sob-0.058
gy-0.055
 shoulders-0.052
 teeth-0.052
 up-0.051
Tokenigator
Token ID23823
Feature activation+5.78e-02

Pos logit contributions
By+0.144
We+0.136
 Friends+0.128
In+0.124
 Have+0.122
Neg logit contributions
 cries-0.174
 breaths-0.173
 sob-0.170
unct-0.169
 up-0.168
Token is
Token ID318
Feature activation+4.37e-03

Pos logit contributions
By+0.082
We+0.080
In+0.073
 Friends+0.072
 Have+0.069
Neg logit contributions
 sob-0.096
 breaths-0.095
gy-0.094
unct-0.092
 cries-0.090
Token too
Token ID1165
Feature activation+1.14e-02

Pos logit contributions
We+0.008
 Friends+0.008
By+0.007
 Have+0.007
 deserve+0.007
Neg logit contributions
 sob-0.006
gy-0.006
 gn-0.005
osc-0.005
lee-0.005
Token deep
Token ID2769
Feature activation+2.10e-02

Pos logit contributions
We+0.019
 Friends+0.019
By+0.018
 Have+0.018
 deserve+0.017
Neg logit contributions
 sob-0.016
unct-0.015
lee-0.015
 gag-0.015
 gn-0.014
Token in
Token ID287
Feature activation+3.67e-03

Pos logit contributions
We+0.034
 Friends+0.032
By+0.032
 deserve+0.031
 Have+0.030
Neg logit contributions
 sob-0.030
unct-0.029
 breaths-0.029
osc-0.029
gy-0.029
Token happy
Token ID3772
Feature activation+2.41e-02

Pos logit contributions
 sob+0.027
unct+0.025
lee+0.025
osc+0.025
 gag+0.024
Neg logit contributions
We-0.041
 Friends-0.039
 I-0.038
 of-0.038
By-0.038
Token.
Token ID13
Feature activation-5.76e-03

Pos logit contributions
 deserve+0.041
We+0.040
By+0.040
 Have+0.038
In+0.038
Neg logit contributions
 sob-0.032
lee-0.030
iously-0.029
 gn-0.029
gy-0.028
Token
Token ID198
Feature activation+1.64e-02

Pos logit contributions
 sob+0.009
lee+0.009
osc+0.009
 Sab+0.009
gy+0.009
Neg logit contributions
We-0.008
 deserve-0.008
By-0.008
 of-0.008
 Friends-0.007
Token
Token ID198
Feature activation+2.15e-02

Pos logit contributions
We+0.023
 deserve+0.022
 Friends+0.022
 I+0.022
By+0.021
Neg logit contributions
 sob-0.027
lee-0.025
osc-0.025
unct-0.025
 gn-0.025
Token"
Token ID1
Feature activation+4.02e-02

Pos logit contributions
 Friends+0.027
 I+0.027
 deserve+0.027
 can+0.027
 of+0.026
Neg logit contributions
 sob-0.034
 gn-0.031
 nervously-0.030
osc-0.030
 coy-0.029
TokenWho
Token ID8241
Feature activation+1.03e-01

Pos logit contributions
 of+0.054
 I+0.045
body+0.045
 Friends+0.045
 yours+0.045
Neg logit contributions
 sob-0.067
 gn-0.063
 Sab-0.061
 stro-0.060
 gigg-0.058
Token gave
Token ID2921
Feature activation+4.18e-02

Pos logit contributions
We+0.138
 Friends+0.137
By+0.135
 Have+0.131
 importantly+0.130
Neg logit contributions
 sob-0.161
 teeth-0.151
ered-0.150
 yawn-0.148
 daughter-0.145
Token us
Token ID514
Feature activation-7.49e-03

Pos logit contributions
We+0.060
 I+0.055
 of+0.055
 Friends+0.054
 deserve+0.053
Neg logit contributions
 sob-0.070
unct-0.068
osc-0.067
 gag-0.067
lee-0.067
Token these
Token ID777
Feature activation+7.03e-03

Pos logit contributions
 sob+0.012
unct+0.011
lee+0.011
osc+0.011
 gag+0.011
Neg logit contributions
We-0.011
 Friends-0.010
By-0.010
 I-0.010
 deserve-0.010
Token toys
Token ID14958
Feature activation+1.27e-02

Pos logit contributions
We+0.011
By+0.011
 Friends+0.010
 Have+0.010
 deserve+0.010
Neg logit contributions
 sob-0.010
lee-0.010
unct-0.010
osc-0.010
 gn-0.009
Token?"
Token ID1701
Feature activation-2.00e-02

Pos logit contributions
We+0.022
By+0.022
 Friends+0.021
 Have+0.020
 deserve+0.019
Neg logit contributions
 sob-0.016
gy-0.016
unct-0.015
lee-0.015
osc-0.015
Token said
Token ID531
Feature activation-5.97e-02

Pos logit contributions
enery+0.046
 Majesty+0.046
osc+0.045
unct+0.045
 breaths+0.045
Neg logit contributions
 of-0.029
 I-0.027
We-0.027
 we-0.024
 Friends-0.024
Token,
Token ID11
Feature activation-5.34e-02

Pos logit contributions
unct+0.108
 Majesty+0.107
osc+0.104
 heels+0.102
enery+0.102
Neg logit contributions
 of-0.083
 I-0.075
We-0.074
 we-0.069
 Friends-0.067
Token tears
Token ID10953
Feature activation+3.18e-02

Pos logit contributions
unct+0.075
osc+0.072
 breaths+0.070
 Sab+0.069
lee+0.069
Neg logit contributions
We-0.090
 I-0.087
 of-0.084
By-0.081
 Friends-0.080
Token streaming
Token ID11305
Feature activation-2.14e-02

Pos logit contributions
We+0.051
 of+0.050
 I+0.047
By+0.047
 Friends+0.046
Neg logit contributions
unct-0.046
 sob-0.046
osc-0.045
 Majesty-0.044
 gag-0.044
Token down
Token ID866
Feature activation-3.81e-02

Pos logit contributions
 Majesty+0.031
osc+0.031
 gn+0.030
 gag+0.029
unct+0.029
Neg logit contributions
 of-0.035
We-0.035
 I-0.033
By-0.032
I-0.031
Token her
Token ID607
Feature activation-1.28e-01

Pos logit contributions
unct+0.052
 Majesty+0.049
 sob+0.048
osc+0.048
 gag+0.048
Neg logit contributions
 of-0.068
We-0.067
 I-0.063
By-0.062
 Friends-0.061
Token face
Token ID1986
Feature activation-5.86e-02

Pos logit contributions
unct+0.171
 Sab+0.169
 Majesty+0.159
 gn+0.153
enery+0.148
Neg logit contributions
 of-0.223
 I-0.218
We-0.200
 Friends-0.200
 we-0.198
Token.
Token ID13
Feature activation-1.16e-02

Pos logit contributions
unct+0.114
 Majesty+0.114
osc+0.109
 gag+0.107
 breaths+0.107
Neg logit contributions
 of-0.076
We-0.062
 I-0.062
 we-0.057
 Friends-0.055
Token
Token ID220
Feature activation-1.55e-02

Pos logit contributions
unct+0.018
osc+0.018
 gag+0.018
 breaths+0.018
 sob+0.017
Neg logit contributions
We-0.018
 I-0.018
 Friends-0.017
 of-0.017
By-0.016
Token
Token ID198
Feature activation+7.72e-03

Pos logit contributions
 sob+0.029
unct+0.029
osc+0.029
 gag+0.028
 breaths+0.028
Neg logit contributions
We-0.019
 of-0.017
 Friends-0.017
 I-0.017
By-0.016
Token
Token ID198
Feature activation+2.01e-02

Pos logit contributions
We+0.008
By+0.007
 of+0.007
 I+0.007
 Friends+0.007
Neg logit contributions
 breaths-0.016
unct-0.016
 gag-0.016
 sob-0.016
osc-0.016
Token them
Token ID606
Feature activation-4.03e-02

Pos logit contributions
 sob+0.029
unct+0.029
osc+0.028
 gag+0.028
 gn+0.027
Neg logit contributions
We-0.029
 of-0.028
 I-0.028
 Friends-0.028
 deserve-0.027
Token with
Token ID351
Feature activation+3.09e-02

Pos logit contributions
 sob+0.064
unct+0.063
osc+0.062
lee+0.062
 gag+0.061
Neg logit contributions
We-0.060
 of-0.057
 Friends-0.055
 I-0.055
By-0.054
Token tears
Token ID10953
Feature activation+2.03e-02

Pos logit contributions
We+0.067
 Friends+0.063
 I+0.062
 deserve+0.062
By+0.061
Neg logit contributions
 sob-0.031
lee-0.027
 gag-0.027
 breaths-0.026
osc-0.026
Token streaming
Token ID11305
Feature activation-1.94e-02

Pos logit contributions
We+0.032
 Friends+0.029
 I+0.029
 of+0.029
By+0.029
Neg logit contributions
 sob-0.031
unct-0.030
osc-0.030
lee-0.029
 gag-0.029
Token down
Token ID866
Feature activation-3.34e-02

Pos logit contributions
 sob+0.025
osc+0.024
unct+0.024
 Majesty+0.024
 gn+0.024
Neg logit contributions
We-0.035
 of-0.034
 I-0.032
 Friends-0.032
By-0.031
Token his
Token ID465
Feature activation-1.28e-01

Pos logit contributions
 sob+0.048
unct+0.047
osc+0.046
lee+0.046
 gag+0.045
Neg logit contributions
We-0.055
 Friends-0.051
 of-0.051
 I-0.051
By-0.050
Token cheeks
Token ID25839
Feature activation-5.29e-02

Pos logit contributions
unct+0.141
 gn+0.135
 Sab+0.129
 gag+0.120
 sob+0.117
Neg logit contributions
We-0.240
 Friends-0.237
 of-0.237
 I-0.236
 Have-0.224
Token,
Token ID11
Feature activation-5.33e-02

Pos logit contributions
 sob+0.077
unct+0.073
lee+0.072
osc+0.072
 gag+0.071
Neg logit contributions
We-0.087
 Friends-0.081
 I-0.080
 deserve-0.080
By-0.079
Token and
Token ID290
Feature activation-1.80e-02

Pos logit contributions
unct+0.075
 sob+0.072
osc+0.072
 gag+0.072
lee+0.071
Neg logit contributions
We-0.090
 I-0.083
 of-0.082
By-0.081
 Friends-0.081
Token said
Token ID531
Feature activation-4.07e-02

Pos logit contributions
unct+0.034
osc+0.034
 gag+0.033
 sob+0.032
lee+0.032
Neg logit contributions
We-0.022
 of-0.020
 I-0.020
 we-0.019
 Friends-0.018
Token "
Token ID366
Feature activation+3.39e-02

Pos logit contributions
unct+0.068
osc+0.068
 sob+0.068
 gag+0.066
 breaths+0.065
Neg logit contributions
 I-0.056
We-0.056
 of-0.056
 Friends-0.053
By-0.052
Token her
Token ID607
Feature activation-9.38e-02

Pos logit contributions
unct+0.109
osc+0.109
lee+0.108
 sob+0.108
 gag+0.107
Neg logit contributions
 of-0.090
We-0.090
 I-0.089
 deserve-0.083
 we-0.081
Token,
Token ID11
Feature activation-3.52e-02

Pos logit contributions
unct+0.165
 gag+0.158
osc+0.155
lee+0.152
 breaths+0.152
Neg logit contributions
 of-0.137
We-0.130
 I-0.127
 we-0.120
 Friends-0.118
Token tears
Token ID10953
Feature activation+3.00e-02

Pos logit contributions
 sob+0.050
lee+0.044
unct+0.044
osc+0.044
 gn+0.043
Neg logit contributions
We-0.061
 Friends-0.059
By-0.058
 deserve-0.058
 of-0.056
Token streaming
Token ID11305
Feature activation-1.50e-02

Pos logit contributions
We+0.044
 of+0.041
 I+0.040
 Friends+0.040
By+0.039
Neg logit contributions
 sob-0.049
unct-0.048
osc-0.047
lee-0.047
 gag-0.047
Token down
Token ID866
Feature activation-3.00e-02

Pos logit contributions
 Majesty+0.022
osc+0.021
 gn+0.021
 sob+0.020
unct+0.020
Neg logit contributions
 of-0.026
We-0.025
 I-0.024
 can-0.022
 we-0.022
Token his
Token ID465
Feature activation-1.28e-01

Pos logit contributions
unct+0.049
 sob+0.047
osc+0.046
 Majesty+0.046
 gag+0.045
Neg logit contributions
 of-0.047
We-0.044
 I-0.042
By-0.041
 Friends-0.040
Token face
Token ID1986
Feature activation-4.94e-02

Pos logit contributions
unct+0.138
 gn+0.127
 Sab+0.125
 gag+0.118
 sob+0.117
Neg logit contributions
 of-0.251
 Friends-0.246
We-0.245
 I-0.244
 deserve-0.234
Token.
Token ID13
Feature activation-1.68e-02

Pos logit contributions
unct+0.091
 Majesty+0.089
 sob+0.088
osc+0.087
 gag+0.086
Neg logit contributions
 of-0.068
 I-0.058
We-0.058
 Friends-0.054
 we-0.053
Token
Token ID198
Feature activation+1.16e-03

Pos logit contributions
unct+0.029
 sob+0.028
osc+0.028
 gag+0.027
 breaths+0.027
Neg logit contributions
We-0.024
 I-0.022
 of-0.022
 Friends-0.022
By-0.021
Token
Token ID198
Feature activation+1.04e-02

Pos logit contributions
We+0.001
 of+0.001
 I+0.001
By+0.001
 Friends+0.001
Neg logit contributions
unct-0.003
 breaths-0.002
 gag-0.002
 sob-0.002
iously-0.002
TokenHis
Token ID6653
Feature activation-5.26e-02

Pos logit contributions
 Friends+0.014
We+0.013
 I+0.013
 we+0.012
 yours+0.012
Neg logit contributions
 sob-0.018
 gn-0.017
osc-0.017
gy-0.016
 gag-0.016
Token,
Token ID11
Feature activation-2.94e-02

Pos logit contributions
 gag+0.054
osc+0.054
unct+0.053
 sob+0.053
gy+0.052
Neg logit contributions
 I-0.042
 of-0.041
 we-0.039
 you-0.039
We-0.038
Token tears
Token ID10953
Feature activation+4.22e-02

Pos logit contributions
 sob+0.055
unct+0.053
osc+0.052
lee+0.052
 gag+0.052
Neg logit contributions
We-0.037
 Friends-0.033
 of-0.033
 I-0.033
By-0.032
Token kept
Token ID4030
Feature activation-1.79e-02

Pos logit contributions
We+0.066
 Friends+0.061
 of+0.061
 I+0.061
By+0.060
Neg logit contributions
 sob-0.064
unct-0.062
osc-0.061
lee-0.061
 gag-0.060
Token streaming
Token ID11305
Feature activation-2.09e-02

Pos logit contributions
 sob+0.026
unct+0.026
lee+0.026
osc+0.025
 gag+0.025
Neg logit contributions
We-0.028
 of-0.027
 I-0.026
 Friends-0.026
By-0.025
Token down
Token ID866
Feature activation-2.72e-02

Pos logit contributions
 sob+0.028
 gn+0.028
osc+0.027
 Majesty+0.027
unct+0.027
Neg logit contributions
 of-0.037
We-0.035
 I-0.035
 we-0.033
 can-0.033
Token his
Token ID465
Feature activation-1.28e-01

Pos logit contributions
unct+0.043
 sob+0.043
 gag+0.041
osc+0.041
lee+0.041
Neg logit contributions
 of-0.041
We-0.040
 I-0.037
 Friends-0.037
By-0.037
Token face
Token ID1986
Feature activation-5.01e-02

Pos logit contributions
unct+0.143
 gn+0.137
 Sab+0.127
 gag+0.125
 sob+0.122
Neg logit contributions
 of-0.241
 Friends-0.240
We-0.234
 I-0.232
 Have-0.226
Token.
Token ID13
Feature activation-1.60e-02

Pos logit contributions
unct+0.081
 sob+0.080
 Majesty+0.078
osc+0.077
 gag+0.077
Neg logit contributions
 of-0.077
 I-0.069
We-0.069
 Friends-0.067
 we-0.064
Token
Token ID220
Feature activation-7.50e-03

Pos logit contributions
 sob+0.029
unct+0.029
lee+0.028
 gag+0.027
osc+0.027
Neg logit contributions
We-0.021
 Friends-0.020
 I-0.019
 of-0.019
By-0.018
Token
Token ID198
Feature activation-4.45e-04

Pos logit contributions
 sob+0.011
lee+0.010
osc+0.010
gy+0.010
 Sab+0.010
Neg logit contributions
We-0.012
By-0.011
 deserve-0.011
 I-0.011
 Friends-0.011
Token
Token ID198
Feature activation+8.89e-03

Pos logit contributions
 sob+0.001
unct+0.001
lee+0.001
 breaths+0.001
 gag+0.001
Neg logit contributions
We-0.001
 of-0.000
 Friends-0.000
 I-0.000
By-0.000
Token Jane
Token ID12091
Feature activation-4.00e-02

Pos logit contributions
We+0.010
 of+0.010
 I+0.010
 Friends+0.009
 deserve+0.009
Neg logit contributions
 sob-0.011
unct-0.011
osc-0.011
 gag-0.011
gy-0.010
Token with
Token ID351
Feature activation+3.43e-02

Pos logit contributions
 Majesty+0.070
unct+0.067
 gag+0.065
icol+0.065
 breaths+0.064
Neg logit contributions
 of-0.059
We-0.055
 I-0.049
 we-0.047
 Friends-0.046
Token tears
Token ID10953
Feature activation+2.60e-02

Pos logit contributions
We+0.064
 Friends+0.060
By+0.059
 I+0.059
 deserve+0.059
Neg logit contributions
 sob-0.044
lee-0.040
osc-0.040
 gag-0.040
unct-0.040
Token streaming
Token ID11305
Feature activation-1.71e-02

Pos logit contributions
We+0.036
 of+0.035
 Friends+0.032
 I+0.032
By+0.032
Neg logit contributions
unct-0.044
 sob-0.043
 gag-0.042
osc-0.042
 Majesty-0.042
Token down
Token ID866
Feature activation-3.53e-02

Pos logit contributions
 Majesty+0.027
osc+0.025
 gn+0.025
unct+0.025
 gag+0.025
Neg logit contributions
 of-0.028
We-0.028
 I-0.025
 we-0.024
By-0.024
Token her
Token ID607
Feature activation-1.27e-01

Pos logit contributions
unct+0.053
 sob+0.051
osc+0.050
 gag+0.050
 Majesty+0.049
Neg logit contributions
 of-0.057
We-0.057
 I-0.053
 Friends-0.052
By-0.051
Token face
Token ID1986
Feature activation-5.16e-02

Pos logit contributions
unct+0.151
 gn+0.141
 Sab+0.140
 Majesty+0.131
 gag+0.131
Neg logit contributions
 of-0.241
We-0.230
 I-0.229
 Friends-0.227
 we-0.216
Token,
Token ID11
Feature activation-5.34e-02

Pos logit contributions
unct+0.090
 Majesty+0.087
 gag+0.086
 sob+0.085
osc+0.085
Neg logit contributions
 of-0.074
We-0.068
 I-0.065
 Friends-0.062
 we-0.061
Token and
Token ID290
Feature activation-6.45e-03

Pos logit contributions
unct+0.079
osc+0.076
 gag+0.075
 breaths+0.074
gy+0.074
Neg logit contributions
We-0.088
 of-0.081
 I-0.081
By-0.078
 Friends-0.078
Token Jane
Token ID12091
Feature activation-9.43e-03

Pos logit contributions
 sob+0.010
unct+0.010
osc+0.010
lee+0.010
 gag+0.009
Neg logit contributions
We-0.010
 Friends-0.009
 of-0.009
 I-0.009
By-0.009
Token's
Token ID338
Feature activation-4.33e-02

Pos logit contributions
unct+0.016
osc+0.016
 Majesty+0.016
 gag+0.016
 breaths+0.016
Neg logit contributions
 of-0.013
We-0.013
 I-0.011
By-0.011
I-0.011
Token his
Token ID465
Feature activation-6.44e-02

Pos logit contributions
We+0.065
 Friends+0.060
 of+0.060
 I+0.060
By+0.059
Neg logit contributions
 sob-0.049
unct-0.047
osc-0.046
lee-0.046
 gag-0.045
Token mouth
Token ID5422
Feature activation-3.78e-02

Pos logit contributions
 sob+0.059
unct+0.053
 gag+0.052
lee+0.052
osc+0.051
Neg logit contributions
We-0.143
By-0.135
 Friends-0.134
 deserve-0.134
 of-0.133
Token and
Token ID290
Feature activation-9.07e-03

Pos logit contributions
 sob+0.055
unct+0.052
osc+0.052
lee+0.052
 gag+0.051
Neg logit contributions
We-0.062
 Friends-0.058
By-0.057
 I-0.057
 deserve-0.057
Token l
Token ID300
Feature activation+4.81e-04

Pos logit contributions
 sob+0.012
unct+0.012
osc+0.011
lee+0.011
 gag+0.011
Neg logit contributions
We-0.016
 I-0.015
 Friends-0.015
 of-0.015
 deserve-0.015
Tokenicks
Token ID3378
Feature activation-1.78e-02

Pos logit contributions
We+0.001
 we+0.001
 Have+0.001
 I+0.001
 Friends+0.001
Neg logit contributions
 sob-0.001
 gag-0.001
lee-0.001
 breaths-0.001
unct-0.001
Token his
Token ID465
Feature activation-1.24e-01

Pos logit contributions
 sob+0.025
unct+0.024
osc+0.024
lee+0.023
 gag+0.023
Neg logit contributions
We-0.030
 of-0.029
 Friends-0.028
 I-0.028
By-0.028
Token lips
Token ID11914
Feature activation-5.26e-02

Pos logit contributions
unct+0.120
osc+0.111
lee+0.111
 Sab+0.109
 sob+0.108
Neg logit contributions
We-0.257
 I-0.253
 of-0.251
 Friends-0.246
 deserve-0.238
Token.
Token ID13
Feature activation-1.36e-02

Pos logit contributions
 sob+0.085
unct+0.085
lee+0.083
osc+0.082
 gn+0.081
Neg logit contributions
We-0.076
 of-0.074
 Friends-0.070
 I-0.070
By-0.068
Token He
Token ID679
Feature activation+1.03e-02

Pos logit contributions
unct+0.027
 sob+0.026
osc+0.026
 breaths+0.025
lee+0.025
Neg logit contributions
We-0.016
 I-0.015
 Friends-0.015
 of-0.014
 Have-0.014
Token thinks
Token ID6834
Feature activation+2.86e-02

Pos logit contributions
We+0.015
 Friends+0.014
 of+0.014
 I+0.014
By+0.014
Neg logit contributions
 sob-0.017
unct-0.016
osc-0.016
lee-0.016
 gag-0.015
Token it
Token ID340
Feature activation+2.90e-02

Pos logit contributions
We+0.037
 Friends+0.034
By+0.033
 deserve+0.033
 I+0.032
Neg logit contributions
 sob-0.052
unct-0.050
lee-0.050
osc-0.049
 gag-0.049
Token away
Token ID1497
Feature activation-4.48e-02

Pos logit contributions
 gag+0.079
unct+0.078
 Majesty+0.078
 sob+0.073
lee+0.073
Neg logit contributions
 of-0.065
We-0.056
 I-0.052
By-0.050
 you-0.050
Token,
Token ID11
Feature activation-3.70e-02

Pos logit contributions
unct+0.086
 Majesty+0.086
osc+0.086
 gag+0.086
lee+0.083
Neg logit contributions
 of-0.058
We-0.048
 I-0.047
 we-0.047
By-0.044
Token tears
Token ID10953
Feature activation+4.53e-02

Pos logit contributions
 sob+0.043
unct+0.042
osc+0.041
lee+0.041
 gag+0.040
Neg logit contributions
We-0.071
 of-0.067
 Friends-0.067
 I-0.067
By-0.066
Token streaming
Token ID11305
Feature activation-1.16e-02

Pos logit contributions
We+0.060
 of+0.057
By+0.054
 I+0.054
 Friends+0.053
Neg logit contributions
 sob-0.080
unct-0.079
osc-0.077
 gag-0.076
lee-0.076
Token down
Token ID866
Feature activation-2.95e-02

Pos logit contributions
 Majesty+0.019
 gn+0.018
osc+0.018
unct+0.017
 gag+0.017
Neg logit contributions
 of-0.019
We-0.017
 I-0.016
 we-0.015
 can-0.015
Token her
Token ID607
Feature activation-1.23e-01

Pos logit contributions
unct+0.046
 Majesty+0.044
 sob+0.043
 gag+0.042
gy+0.042
Neg logit contributions
 of-0.052
We-0.046
 I-0.043
By-0.043
In-0.042
Token face
Token ID1986
Feature activation-5.00e-02

Pos logit contributions
unct+0.163
 gn+0.157
 Sab+0.152
 grizz+0.140
 Majesty+0.140
Neg logit contributions
 of-0.212
 Friends-0.195
We-0.194
 I-0.193
 hour-0.191
Token.
Token ID13
Feature activation-2.12e-02

Pos logit contributions
unct+0.095
 Majesty+0.093
 sob+0.091
osc+0.090
 gag+0.090
Neg logit contributions
 of-0.068
We-0.057
 I-0.056
 Friends-0.052
 we-0.051
Token When
Token ID1649
Feature activation-3.11e-03

Pos logit contributions
osc+0.038
unct+0.038
 gag+0.037
 sob+0.037
 gn+0.037
Neg logit contributions
 I-0.030
We-0.027
 Friends-0.027
 Have-0.025
I-0.025
Token her
Token ID607
Feature activation-4.60e-02

Pos logit contributions
osc+0.008
 Majesty+0.007
aj+0.007
 gag+0.007
gy+0.007
Neg logit contributions
We-0.003
 of-0.003
 I-0.003
By-0.003
 we-0.002
Token mom
Token ID1995
Feature activation-2.63e-02

Pos logit contributions
osc+0.055
unct+0.053
enery+0.051
 gn+0.050
 gag+0.050
Neg logit contributions
We-0.086
 of-0.084
By-0.080
 Friends-0.080
 we-0.079
Token.
Token ID13
Feature activation-1.18e-02

Pos logit contributions
unct+0.039
 Majesty+0.038
osc+0.038
lee+0.037
 gn+0.037
Neg logit contributions
 of-0.024
We-0.024
 I-0.021
 we-0.020
 Friends-0.020
Token Tears
Token ID40328
Feature activation-1.21e-03

Pos logit contributions
unct+0.022
osc+0.021
 breaths+0.020
gy+0.020
 sob+0.020
Neg logit contributions
 I-0.016
We-0.015
 of-0.014
I-0.013
 we-0.013
Token started
Token ID2067
Feature activation-3.62e-02

Pos logit contributions
 sob+0.002
lee+0.002
unct+0.002
osc+0.002
 gag+0.002
Neg logit contributions
We-0.002
 Friends-0.002
 I-0.002
 deserve-0.002
 Have-0.002
Token streaming
Token ID11305
Feature activation-1.75e-02

Pos logit contributions
 Majesty+0.067
unct+0.063
osc+0.062
 gag+0.059
enery+0.059
Neg logit contributions
 of-0.054
We-0.044
 I-0.042
 we-0.041
 they-0.041
Token down
Token ID866
Feature activation-2.76e-02

Pos logit contributions
 Majesty+0.024
osc+0.024
 gn+0.024
 sob+0.023
unct+0.023
Neg logit contributions
 of-0.030
We-0.030
 I-0.028
By-0.027
 we-0.027
Token her
Token ID607
Feature activation-1.23e-01

Pos logit contributions
unct+0.041
 sob+0.041
osc+0.040
 Majesty+0.039
 gag+0.039
Neg logit contributions
 of-0.045
We-0.044
 I-0.041
By-0.040
 Friends-0.040
Token cheeks
Token ID25839
Feature activation-4.97e-02

Pos logit contributions
unct+0.143
 gn+0.143
 Sab+0.134
 Majesty+0.132
gy+0.127
Neg logit contributions
 of-0.229
 I-0.218
We-0.209
 Friends-0.206
 we-0.201
Token and
Token ID290
Feature activation-1.68e-02

Pos logit contributions
 sob+0.077
unct+0.076
osc+0.075
lee+0.074
 gag+0.073
Neg logit contributions
We-0.075
 of-0.072
 I-0.069
 Friends-0.068
By-0.068
Token Daddy
Token ID26926
Feature activation-3.10e-02

Pos logit contributions
unct+0.022
osc+0.022
lee+0.021
gy+0.021
 Majesty+0.021
Neg logit contributions
We-0.029
 I-0.028
 of-0.028
 we-0.026
By-0.026
Token quickly
Token ID2952
Feature activation-2.04e-02

Pos logit contributions
 Majesty+0.052
unct+0.051
 gag+0.049
osc+0.048
enery+0.048
Neg logit contributions
 of-0.045
We-0.045
 I-0.040
By-0.039
 we-0.038
Token pulled
Token ID5954
Feature activation-8.88e-03

Pos logit contributions
unct+0.032
 gag+0.031
 Majesty+0.031
osc+0.031
gy+0.030
Neg logit contributions
 of-0.029
We-0.029
 deserve-0.027
By-0.027
 I-0.026
Token smiling
Token ID16755
Feature activation-3.59e-03

Pos logit contributions
unct+0.018
osc+0.017
 gag+0.017
 sob+0.016
 breaths+0.016
Neg logit contributions
 of-0.018
We-0.018
 I-0.017
 deserve-0.017
By-0.016
Token with
Token ID351
Feature activation+3.24e-02

Pos logit contributions
unct+0.006
 sob+0.006
osc+0.006
 gag+0.006
 gn+0.006
Neg logit contributions
We-0.005
 of-0.005
 I-0.005
 Friends-0.005
By-0.005
Token tears
Token ID10953
Feature activation+3.32e-02

Pos logit contributions
We+0.065
 Friends+0.061
By+0.060
 I+0.060
 deserve+0.059
Neg logit contributions
 sob-0.035
lee-0.034
osc-0.032
unct-0.032
 breaths-0.031
Token streaming
Token ID11305
Feature activation-1.22e-02

Pos logit contributions
We+0.049
 of+0.046
 Friends+0.045
 I+0.045
By+0.045
Neg logit contributions
 sob-0.053
unct-0.052
osc-0.051
lee-0.050
 gag-0.050
Token down
Token ID866
Feature activation-2.62e-02

Pos logit contributions
osc+0.016
 sob+0.016
 gn+0.016
 gag+0.016
 Majesty+0.016
Neg logit contributions
 of-0.021
We-0.021
 I-0.020
 we-0.019
 Friends-0.019
Token her
Token ID607
Feature activation-1.22e-01

Pos logit contributions
 sob+0.037
unct+0.036
osc+0.035
 gag+0.035
lee+0.034
Neg logit contributions
We-0.044
 of-0.042
 I-0.041
 Friends-0.041
By-0.040
Token cheeks
Token ID25839
Feature activation-4.74e-02

Pos logit contributions
unct+0.128
 gn+0.121
 Sab+0.119
 gag+0.119
 sob+0.111
Neg logit contributions
 of-0.238
We-0.235
 Friends-0.233
 I-0.233
 we-0.222
Token.
Token ID13
Feature activation-9.24e-03

Pos logit contributions
 sob+0.071
unct+0.069
osc+0.068
 gag+0.067
lee+0.067
Neg logit contributions
We-0.075
 of-0.070
 I-0.069
 Friends-0.068
By-0.068
Token
Token ID198
Feature activation+1.44e-02

Pos logit contributions
unct+0.016
 sob+0.015
osc+0.015
 gag+0.015
 breaths+0.015
Neg logit contributions
We-0.013
 I-0.012
 Friends-0.012
 of-0.012
By-0.012
Token
Token ID198
Feature activation+2.15e-02

Pos logit contributions
We+0.018
 Friends+0.017
 of+0.017
 I+0.016
By+0.016
Neg logit contributions
 sob-0.026
unct-0.026
osc-0.025
 gag-0.025
lee-0.025
TokenIn
Token ID818
Feature activation+1.04e-02

Pos logit contributions
 Friends+0.025
We+0.025
 I+0.025
 deserve+0.025
 of+0.024
Neg logit contributions
 sob-0.038
lee-0.036
 gn-0.036
osc-0.035
unct-0.035
Token felt
Token ID2936
Feature activation-4.68e-03

Pos logit contributions
unct+0.025
osc+0.024
 gag+0.024
 sob+0.024
 breaths+0.023
Neg logit contributions
We-0.021
 of-0.020
 I-0.019
By-0.019
 deserve-0.018
Token helpless
Token ID21144
Feature activation+4.89e-03

Pos logit contributions
 sob+0.005
unct+0.005
osc+0.005
lee+0.005
gy+0.005
Neg logit contributions
We-0.009
 of-0.008
 I-0.008
 Friends-0.008
 deserve-0.008
Token and
Token ID290
Feature activation-1.30e-02

Pos logit contributions
We+0.007
 of+0.007
 I+0.007
 Friends+0.007
 deserve+0.006
Neg logit contributions
 sob-0.008
unct-0.008
osc-0.008
lee-0.008
 gag-0.007
Token tears
Token ID10953
Feature activation+4.47e-02

Pos logit contributions
osc+0.021
unct+0.020
 sob+0.019
gy+0.019
 Sab+0.019
Neg logit contributions
We-0.020
 I-0.019
 of-0.019
By-0.018
 deserve-0.018
Token filled
Token ID5901
Feature activation+7.47e-03

Pos logit contributions
We+0.069
 of+0.065
 I+0.063
By+0.063
 Friends+0.063
Neg logit contributions
 sob-0.068
unct-0.067
osc-0.066
lee-0.065
 gag-0.064
Token her
Token ID607
Feature activation-1.20e-01

Pos logit contributions
We+0.011
 of+0.011
 you+0.010
You+0.010
 I+0.010
Neg logit contributions
unct-0.012
lee-0.012
 Majesty-0.012
aj-0.012
 gag-0.012
Token eyes
Token ID2951
Feature activation-5.00e-02

Pos logit contributions
 Sab+0.137
unct+0.134
 gn+0.123
lee+0.122
osc+0.122
Neg logit contributions
 of-0.226
 I-0.216
We-0.216
 Friends-0.208
 deserve-0.206
Token.
Token ID13
Feature activation-5.68e-03

Pos logit contributions
 sob+0.080
unct+0.080
lee+0.079
osc+0.079
 Majesty+0.079
Neg logit contributions
 of-0.072
We-0.072
By-0.066
 I-0.066
 deserve-0.065
Token
Token ID198
Feature activation+1.13e-02

Pos logit contributions
unct+0.011
osc+0.011
 gag+0.011
 breaths+0.010
 sob+0.010
Neg logit contributions
We-0.007
 I-0.007
 Friends-0.006
I-0.006
 Have-0.006
Token
Token ID198
Feature activation+2.33e-02

Pos logit contributions
We+0.012
 of+0.011
By+0.011
 I+0.010
In+0.010
Neg logit contributions
unct-0.023
 breaths-0.023
 gag-0.023
osc-0.023
 sob-0.023
TokenSuddenly
Token ID38582
Feature activation+4.34e-03

Pos logit contributions
We+0.030
 Friends+0.028
 of+0.027
 I+0.027
 deserve+0.027
Neg logit contributions
 sob-0.042
unct-0.041
osc-0.040
lee-0.040
 gag-0.039
Token Copper
Token ID27157
Feature activation-1.89e-02

Pos logit contributions
lee+0.014
 sob+0.014
unct+0.013
 gag+0.013
osc+0.013
Neg logit contributions
We-0.015
 I-0.014
 Friends-0.014
 of-0.014
By-0.014
Token said
Token ID531
Feature activation-4.84e-02

Pos logit contributions
 gag+0.037
 elbows+0.035
 breaths+0.035
unct+0.035
 mischief+0.035
Neg logit contributions
 I-0.020
 of-0.019
We-0.019
 can-0.019
 deserve-0.018
Token,
Token ID11
Feature activation-4.47e-02

Pos logit contributions
unct+0.097
lee+0.096
 Majesty+0.096
aj+0.095
 heels+0.093
Neg logit contributions
 of-0.061
 I-0.054
 we-0.048
 they-0.044
We-0.044
Token tears
Token ID10953
Feature activation+4.08e-02

Pos logit contributions
unct+0.081
osc+0.079
lee+0.077
 gag+0.077
 breaths+0.076
Neg logit contributions
We-0.056
 I-0.056
 of-0.052
By-0.050
 we-0.049
Token filling
Token ID12591
Feature activation-1.15e-03

Pos logit contributions
We+0.059
 Friends+0.054
 I+0.053
 of+0.053
By+0.053
Neg logit contributions
 sob-0.068
unct-0.065
osc-0.064
lee-0.064
 gag-0.063
Token her
Token ID607
Feature activation-1.19e-01

Pos logit contributions
 Majesty+0.002
aj+0.002
lee+0.002
 gag+0.002
unct+0.002
Neg logit contributions
 of-0.002
 you-0.001
You-0.001
 I-0.001
We-0.001
Token eyes
Token ID2951
Feature activation-5.65e-02

Pos logit contributions
unct+0.152
 Sab+0.151
lee+0.150
 gn+0.145
 coy+0.142
Neg logit contributions
 of-0.209
 I-0.197
We-0.191
 deserve-0.189
 Have-0.183
Token.
Token ID13
Feature activation-7.65e-03

Pos logit contributions
 sob+0.088
lee+0.087
unct+0.087
osc+0.087
 Majesty+0.085
Neg logit contributions
 of-0.085
We-0.084
 I-0.079
By-0.078
 deserve-0.076
Token The
Token ID383
Feature activation-1.10e-03

Pos logit contributions
 gag+0.014
unct+0.014
 breaths+0.014
 sob+0.013
lee+0.013
Neg logit contributions
We-0.010
 I-0.010
 Friends-0.010
 Have-0.009
 of-0.009
Token old
Token ID1468
Feature activation+4.13e-03

Pos logit contributions
osc+0.002
 sob+0.002
unct+0.002
lee+0.002
 gag+0.002
Neg logit contributions
We-0.002
 of-0.002
 I-0.002
 Friends-0.002
By-0.002
Token bird
Token ID6512
Feature activation+4.29e-03

Pos logit contributions
 of+0.005
 I+0.005
We+0.005
 we+0.005
 Friends+0.005
Neg logit contributions
osc-0.007
 gag-0.007
unct-0.007
lee-0.007
 breaths-0.007
Token's
Token ID338
Feature activation-8.83e-02

Pos logit contributions
 Majesty+0.138
 Sab+0.137
unct+0.135
lee+0.134
enery+0.131
Neg logit contributions
 of-0.090
We-0.080
 I-0.075
 Friends-0.073
 deserve-0.072
Token face
Token ID1986
Feature activation-3.40e-02

Pos logit contributions
unct+0.096
 Sab+0.094
lee+0.092
 Majesty+0.085
 gn+0.084
Neg logit contributions
 of-0.173
We-0.172
 I-0.166
 Friends-0.166
By-0.160
Token and
Token ID290
Feature activation-1.12e-02

Pos logit contributions
unct+0.068
 Majesty+0.066
 Sab+0.064
lee+0.064
osc+0.063
Neg logit contributions
 of-0.042
We-0.039
By-0.035
 I-0.035
 can-0.033
Token w
Token ID266
Feature activation-2.64e-02

Pos logit contributions
unct+0.018
osc+0.018
lee+0.018
 sob+0.017
 gag+0.017
Neg logit contributions
We-0.016
 I-0.015
 of-0.015
 we-0.014
 Friends-0.014
Tokenagged
Token ID14655
Feature activation-1.53e-02

Pos logit contributions
 sob+0.045
unct+0.045
osc+0.044
 gn+0.043
 gag+0.043
Neg logit contributions
We-0.036
 Friends-0.033
 of-0.033
By-0.032
 deserve-0.032
Token his
Token ID465
Feature activation-1.18e-01

Pos logit contributions
unct+0.032
 Majesty+0.029
lee+0.028
 babys+0.028
osc+0.027
Neg logit contributions
 of-0.018
 Friends-0.014
 than-0.014
 I-0.014
We-0.014
Token tail
Token ID7894
Feature activation-4.80e-02

Pos logit contributions
unct+0.168
 Sab+0.160
lee+0.154
 gn+0.151
osc+0.151
Neg logit contributions
We-0.195
 of-0.187
 I-0.186
 Friends-0.185
By-0.184
Token.
Token ID13
Feature activation-2.36e-02

Pos logit contributions
unct+0.081
lee+0.078
 sob+0.078
osc+0.077
 Sab+0.076
Neg logit contributions
We-0.067
 of-0.067
 Friends-0.063
By-0.061
 I-0.061
Token He
Token ID679
Feature activation+5.69e-03

Pos logit contributions
 sob+0.041
unct+0.039
lee+0.039
osc+0.039
 gag+0.039
Neg logit contributions
We-0.032
 Friends-0.029
 of-0.029
 deserve-0.028
 I-0.028
Token was
Token ID373
Feature activation-1.60e-02

Pos logit contributions
We+0.009
 of+0.008
 I+0.008
By+0.008
 deserve+0.008
Neg logit contributions
unct-0.009
osc-0.009
 breaths-0.009
lee-0.009
 gag-0.009
Token happy
Token ID3772
Feature activation+1.34e-02

Pos logit contributions
unct+0.030
 breaths+0.028
osc+0.028
iously+0.028
enery+0.028
Neg logit contributions
 of-0.022
We-0.021
 I-0.020
By-0.019
 deserve-0.019
Token of
Token ID286
Feature activation+1.01e-02

Pos logit contributions
We+0.037
By+0.034
 Friends+0.034
 deserve+0.034
 Have+0.033
Neg logit contributions
 sob-0.031
lee-0.030
 breaths-0.029
osc-0.029
unct-0.029
Token the
Token ID262
Feature activation-1.17e-02

Pos logit contributions
We+0.016
 of+0.015
 I+0.015
 Friends+0.015
By+0.014
Neg logit contributions
 sob-0.015
unct-0.015
osc-0.014
lee-0.014
 gag-0.014
Token sunrise
Token ID40048
Feature activation+7.59e-03

Pos logit contributions
 sob+0.018
unct+0.017
lee+0.017
osc+0.017
 gag+0.017
Neg logit contributions
We-0.019
 of-0.017
 I-0.017
 Friends-0.017
By-0.017
Token streaming
Token ID11305
Feature activation-1.51e-02

Pos logit contributions
We+0.013
 deserve+0.013
By+0.013
 Friends+0.012
 Have+0.012
Neg logit contributions
 sob-0.010
osc-0.010
gy-0.009
unct-0.009
 gn-0.009
Token into
Token ID656
Feature activation-4.22e-02

Pos logit contributions
 sob+0.020
osc+0.020
 gn+0.019
 gag+0.019
unct+0.018
Neg logit contributions
 of-0.027
We-0.027
 I-0.026
By-0.025
 we-0.025
Token his
Token ID465
Feature activation-1.18e-01

Pos logit contributions
 sob+0.069
unct+0.069
osc+0.067
gy+0.067
 gag+0.065
Neg logit contributions
We-0.061
 of-0.061
 I-0.057
By-0.054
 Friends-0.054
Token room
Token ID2119
Feature activation-4.23e-02

Pos logit contributions
unct+0.139
 gn+0.130
 sob+0.126
 gag+0.121
iously+0.121
Neg logit contributions
We-0.225
 of-0.220
 Friends-0.219
 I-0.215
By-0.208
Token.
Token ID13
Feature activation-5.46e-03

Pos logit contributions
 sob+0.068
unct+0.068
osc+0.065
lee+0.065
 gag+0.064
Neg logit contributions
We-0.061
 of-0.060
 I-0.057
 Friends-0.057
By-0.055
Token
Token ID220
Feature activation-7.48e-03

Pos logit contributions
 sob+0.009
osc+0.008
lee+0.008
unct+0.008
gy+0.008
Neg logit contributions
We-0.008
By-0.007
 deserve-0.007
 of-0.007
 Friends-0.007
Token
Token ID198
Feature activation+3.62e-03

Pos logit contributions
 sob+0.010
lee+0.010
gy+0.009
 Sab+0.009
osc+0.009
Neg logit contributions
We-0.012
By-0.012
 deserve-0.012
 Have-0.011
 Friends-0.011
Token
Token ID198
Feature activation+1.25e-02

Pos logit contributions
We+0.005
 Friends+0.004
 I+0.004
 deserve+0.004
By+0.004
Neg logit contributions
 sob-0.006
osc-0.006
unct-0.006
lee-0.006
gy-0.006
Token so
Token ID523
Feature activation-2.58e-02

Pos logit contributions
unct+0.032
 sob+0.030
osc+0.030
 gag+0.030
lee+0.029
Neg logit contributions
We-0.036
 of-0.036
 I-0.034
By-0.033
 deserve-0.033
Token happy
Token ID3772
Feature activation+2.48e-02

Pos logit contributions
 sob+0.031
unct+0.029
lee+0.029
osc+0.028
 gag+0.028
Neg logit contributions
We-0.050
 Friends-0.047
 I-0.046
 of-0.046
By-0.046
Token he
Token ID339
Feature activation+9.99e-04

Pos logit contributions
We+0.042
 deserve+0.041
By+0.040
In+0.039
 Have+0.039
Neg logit contributions
 sob-0.033
lee-0.032
iously-0.031
 gn-0.031
gy-0.030
Token w
Token ID266
Feature activation-1.15e-02

Pos logit contributions
We+0.001
 Friends+0.001
 of+0.001
By+0.001
 Have+0.001
Neg logit contributions
 sob-0.002
lee-0.002
unct-0.001
 gn-0.001
osc-0.001
Tokenagged
Token ID14655
Feature activation-4.26e-03

Pos logit contributions
lee+0.016
 sob+0.016
unct+0.016
 breaths+0.015
osc+0.015
Neg logit contributions
We-0.020
 I-0.019
 we-0.018
 deserve-0.018
 Have-0.018
Token his
Token ID465
Feature activation-1.18e-01

Pos logit contributions
unct+0.007
 sob+0.007
 gag+0.007
lee+0.006
osc+0.006
Neg logit contributions
 of-0.006
We-0.006
 I-0.006
 Friends-0.006
By-0.006
Token tail
Token ID7894
Feature activation-4.41e-02

Pos logit contributions
 sob+0.138
unct+0.136
osc+0.130
lee+0.129
 gag+0.129
Neg logit contributions
We-0.224
 Friends-0.212
 of-0.211
 I-0.211
By-0.209
Token and
Token ID290
Feature activation-2.26e-02

Pos logit contributions
 sob+0.067
unct+0.065
osc+0.063
lee+0.063
 gag+0.062
Neg logit contributions
We-0.069
 Friends-0.064
 of-0.064
 I-0.064
By-0.063
Token l
Token ID300
Feature activation-8.46e-03

Pos logit contributions
unct+0.030
osc+0.029
 sob+0.029
 gag+0.028
lee+0.028
Neg logit contributions
We-0.040
 I-0.039
 Friends-0.037
 of-0.037
By-0.037
Tokenicked
Token ID9484
Feature activation-1.85e-02

Pos logit contributions
 sob+0.012
lee+0.012
unct+0.012
osc+0.011
 gn+0.011
Neg logit contributions
We-0.014
 I-0.013
 deserve-0.013
 we-0.013
 Friends-0.013
Token the
Token ID262
Feature activation-4.65e-02

Pos logit contributions
 sob+0.026
unct+0.025
osc+0.024
lee+0.024
 gag+0.024
Neg logit contributions
We-0.032
 of-0.030
 Friends-0.030
 I-0.029
By-0.029
Token bark
Token ID21405
Feature activation-2.47e-02

Pos logit contributions
unct+0.054
 gag+0.050
 Sab+0.049
 breaths+0.049
osc+0.049
Neg logit contributions
We-0.055
 I-0.053
 deserve-0.051
 Friends-0.051
 of-0.051
Tokened
Token ID276
Feature activation-5.03e-02

Pos logit contributions
unct+0.041
 Majesty+0.038
 Sab+0.038
 babys+0.038
 gag+0.037
Neg logit contributions
 of-0.039
We-0.038
By-0.036
 Friends-0.034
 I-0.034
Token and
Token ID290
Feature activation-2.10e-02

Pos logit contributions
unct+0.105
tails+0.100
 gag+0.099
osc+0.098
 Sab+0.097
Neg logit contributions
 of-0.065
 I-0.049
We-0.048
 we-0.046
 Friends-0.045
Token w
Token ID266
Feature activation-2.65e-02

Pos logit contributions
unct+0.032
osc+0.031
lee+0.030
 gag+0.030
 sob+0.030
Neg logit contributions
We-0.033
 I-0.032
 we-0.031
By-0.030
 of-0.029
Tokenagged
Token ID14655
Feature activation-1.04e-02

Pos logit contributions
 sob+0.041
lee+0.039
unct+0.039
osc+0.038
 breaths+0.038
Neg logit contributions
We-0.043
 I-0.040
 Friends-0.039
 Have-0.039
 deserve-0.038
Token his
Token ID465
Feature activation-1.18e-01

Pos logit contributions
unct+0.022
 gag+0.020
 Majesty+0.020
 babys+0.019
osc+0.019
Neg logit contributions
 of-0.012
 they-0.010
 than-0.010
 I-0.009
 Friends-0.009
Token tail
Token ID7894
Feature activation-4.57e-02

Pos logit contributions
unct+0.148
 sob+0.144
osc+0.138
 gag+0.137
 Sab+0.137
Neg logit contributions
We-0.213
 Friends-0.202
 of-0.201
 I-0.200
By-0.199
Token.
Token ID13
Feature activation-1.50e-02

Pos logit contributions
unct+0.080
lee+0.076
 sob+0.076
osc+0.075
 gag+0.074
Neg logit contributions
 of-0.064
We-0.061
 Friends-0.057
By-0.057
 I-0.056
Token He
Token ID679
Feature activation+7.42e-03

Pos logit contributions
 sob+0.027
unct+0.027
osc+0.026
lee+0.026
 gag+0.026
Neg logit contributions
We-0.020
 Friends-0.018
 I-0.018
 of-0.018
By-0.018
Token wanted
Token ID2227
Feature activation+1.73e-02

Pos logit contributions
We+0.010
By+0.009
 of+0.009
 I+0.009
 deserve+0.009
Neg logit contributions
unct-0.013
 gag-0.012
osc-0.012
 breaths-0.012
 sob-0.012
Token to
Token ID284
Feature activation-2.78e-02

Pos logit contributions
We+0.023
 of+0.022
 I+0.022
 Friends+0.021
 deserve+0.021
Neg logit contributions
unct-0.031
 sob-0.030
lee-0.030
 gag-0.029
osc-0.029
Token his
Token ID465
Feature activation-1.04e-01

Pos logit contributions
We+0.036
 of+0.036
 I+0.034
 Friends+0.034
By+0.034
Neg logit contributions
unct-0.037
lee-0.035
 Majesty-0.035
osc-0.035
 gag-0.035
Token tail
Token ID7894
Feature activation-2.17e-02

Pos logit contributions
unct+0.134
 sob+0.128
lee+0.127
 Sab+0.124
 gag+0.124
Neg logit contributions
We-0.186
 Friends-0.178
 of-0.175
 I-0.175
By-0.175
Token and
Token ID290
Feature activation-5.73e-03

Pos logit contributions
 sob+0.033
unct+0.031
osc+0.031
lee+0.031
 gag+0.030
Neg logit contributions
We-0.035
 Friends-0.032
 I-0.032
 of-0.032
By-0.032
Token l
Token ID300
Feature activation-2.79e-03

Pos logit contributions
 sob+0.006
unct+0.006
osc+0.006
lee+0.006
 gag+0.006
Neg logit contributions
We-0.012
 Friends-0.011
 I-0.011
 of-0.011
By-0.011
Tokenicks
Token ID3378
Feature activation-7.96e-03

Pos logit contributions
 sob+0.004
 breaths+0.004
lee+0.004
osc+0.004
 gag+0.004
Neg logit contributions
We-0.005
 I-0.004
 deserve-0.004
 we-0.004
 Have-0.004
Token their
Token ID511
Feature activation-1.17e-01

Pos logit contributions
 sob+0.012
lee+0.011
osc+0.011
unct+0.011
 gag+0.011
Neg logit contributions
We-0.013
 deserve-0.012
 Have-0.012
 Friends-0.012
By-0.011
Token faces
Token ID6698
Feature activation-2.08e-02

Pos logit contributions
unct+0.101
 Sab+0.094
 gn+0.093
 sob+0.091
iously+0.089
Neg logit contributions
We-0.261
 Friends-0.250
 of-0.249
By-0.242
 I-0.239
Token.
Token ID13
Feature activation-1.02e-02

Pos logit contributions
 sob+0.029
lee+0.027
unct+0.027
osc+0.026
 gag+0.026
Neg logit contributions
We-0.036
 Friends-0.033
 deserve-0.033
 I-0.033
 Have-0.033
Token He
Token ID679
Feature activation+9.90e-03

Pos logit contributions
 sob+0.017
lee+0.017
unct+0.017
osc+0.016
 gag+0.016
Neg logit contributions
We-0.014
 Friends-0.013
 of-0.013
 deserve-0.013
 I-0.013
Token is
Token ID318
Feature activation-1.14e-02

Pos logit contributions
We+0.016
 Friends+0.015
 of+0.015
 I+0.015
By+0.015
Neg logit contributions
 sob-0.014
unct-0.014
osc-0.014
lee-0.013
 gag-0.013
Token not
Token ID407
Feature activation+1.17e-02

Pos logit contributions
unct+0.019
osc+0.018
 sob+0.018
lee+0.018
 breaths+0.018
Neg logit contributions
We-0.017
 of-0.016
 I-0.015
By-0.015
 Friends-0.015
Token,
Token ID11
Feature activation-4.69e-02

Pos logit contributions
 sob+0.022
unct+0.021
lee+0.021
osc+0.021
 gag+0.021
Neg logit contributions
We-0.023
 Friends-0.022
 I-0.021
 deserve-0.021
By-0.021
Token put
Token ID1234
Feature activation+6.19e-03

Pos logit contributions
unct+0.070
osc+0.068
 gag+0.067
 Majesty+0.066
lee+0.066
Neg logit contributions
We-0.074
 I-0.070
 of-0.069
By-0.066
 we-0.065
Token his
Token ID465
Feature activation-7.21e-02

Pos logit contributions
We+0.011
 of+0.011
 I+0.011
 Friends+0.011
 deserve+0.010
Neg logit contributions
 sob-0.008
unct-0.008
lee-0.007
osc-0.007
 gag-0.007
Token arms
Token ID5101
Feature activation-2.24e-02

Pos logit contributions
unct+0.092
lee+0.091
osc+0.090
 Sab+0.090
 sob+0.085
Neg logit contributions
 I-0.123
 of-0.122
We-0.120
 Have-0.112
 can-0.112
Token around
Token ID1088
Feature activation-1.36e-02

Pos logit contributions
 sob+0.033
unct+0.032
osc+0.031
lee+0.031
 gag+0.031
Neg logit contributions
We-0.036
 Friends-0.034
 of-0.033
 I-0.033
By-0.033
Token her
Token ID607
Feature activation-1.17e-01

Pos logit contributions
unct+0.022
 Majesty+0.021
 sob+0.021
 gag+0.021
osc+0.020
Neg logit contributions
 of-0.020
We-0.020
 I-0.019
 Friends-0.019
 we-0.018
Token and
Token ID290
Feature activation-2.93e-02

Pos logit contributions
unct+0.203
 gag+0.191
 Sab+0.189
 Majesty+0.181
lee+0.180
Neg logit contributions
 of-0.157
 I-0.156
 can-0.142
We-0.141
 we-0.140
Token said
Token ID531
Feature activation-4.44e-02

Pos logit contributions
osc+0.046
unct+0.046
lee+0.044
 gag+0.044
 Majesty+0.043
Neg logit contributions
We-0.042
 I-0.041
 we-0.039
 of-0.038
By-0.037
Token,
Token ID11
Feature activation-4.06e-02

Pos logit contributions
unct+0.065
osc+0.065
 sob+0.064
 gag+0.063
 breaths+0.062
Neg logit contributions
 of-0.070
 I-0.070
We-0.069
 Friends-0.065
By-0.065
Token "
Token ID366
Feature activation+3.95e-02

Pos logit contributions
unct+0.077
 sob+0.077
osc+0.076
 gag+0.075
lee+0.075
Neg logit contributions
We-0.048
 I-0.044
By-0.042
 Friends-0.042
 of-0.042
TokenDon
Token ID3987
Feature activation+2.95e-02

Pos logit contributions
 Friends+0.039
 of+0.038
 we+0.036
 I+0.035
 makers+0.033
Neg logit contributions
 sob-0.080
 gn-0.074
unct-0.074
osc-0.074
 Sab-0.073
Token his
Token ID465
Feature activation-1.00e-01

Pos logit contributions
We+0.047
 of+0.046
 I+0.044
 Friends+0.044
By+0.043
Neg logit contributions
unct-0.049
lee-0.047
 sob-0.046
osc-0.046
 gag-0.046
Token tail
Token ID7894
Feature activation-1.67e-02

Pos logit contributions
unct+0.127
 sob+0.121
lee+0.120
 gag+0.117
osc+0.117
Neg logit contributions
We-0.183
 Friends-0.174
 I-0.171
 of-0.171
By-0.171
Token and
Token ID290
Feature activation-8.45e-03

Pos logit contributions
 sob+0.025
unct+0.023
osc+0.023
lee+0.023
 gag+0.023
Neg logit contributions
We-0.027
 Friends-0.025
 I-0.025
By-0.025
 of-0.025
Token l
Token ID300
Feature activation-3.62e-03

Pos logit contributions
 sob+0.010
unct+0.010
osc+0.010
lee+0.010
 gag+0.010
Neg logit contributions
We-0.016
 Friends-0.015
 of-0.015
 I-0.015
By-0.015
Tokenicks
Token ID3378
Feature activation-1.15e-02

Pos logit contributions
 sob+0.005
lee+0.005
 breaths+0.005
 gag+0.005
osc+0.004
Neg logit contributions
We-0.006
 I-0.006
 deserve-0.006
 we-0.006
In-0.006
Token their
Token ID511
Feature activation-1.16e-01

Pos logit contributions
 sob+0.018
unct+0.017
osc+0.017
lee+0.017
 gag+0.017
Neg logit contributions
We-0.018
 Friends-0.017
 of-0.017
 I-0.016
By-0.016
Token faces
Token ID6698
Feature activation-1.60e-02

Pos logit contributions
unct+0.095
 Sab+0.090
 gn+0.085
 sob+0.085
osc+0.084
Neg logit contributions
We-0.267
 Friends-0.256
 of-0.252
 I-0.249
By-0.247
Token.
Token ID13
Feature activation-1.96e-02

Pos logit contributions
 sob+0.020
 gag+0.018
lee+0.018
osc+0.018
unct+0.018
Neg logit contributions
We-0.030
 deserve-0.028
 Have-0.027
 Friends-0.027
By-0.027
Token
Token ID198
Feature activation+1.41e-02

Pos logit contributions
 sob+0.033
lee+0.030
osc+0.030
 gn+0.030
 Sab+0.030
Neg logit contributions
We-0.027
 deserve-0.026
 of-0.025
By-0.025
 Friends-0.024
Token
Token ID198
Feature activation+1.91e-02

Pos logit contributions
We+0.019
 Friends+0.017
 I+0.017
 of+0.017
 deserve+0.017
Neg logit contributions
 sob-0.025
unct-0.024
lee-0.024
osc-0.024
 gag-0.023
TokenTim
Token ID14967
Feature activation-1.58e-02

Pos logit contributions
 deserve+0.027
 yours+0.025
 Friends+0.025
 wins+0.025
 losers+0.025
Neg logit contributions
 sob-0.029
 gn-0.025
heads-0.024
tails-0.024
pee-0.024
Token excited
Token ID6568
Feature activation+1.74e-02

Pos logit contributions
 sob+0.026
unct+0.025
lee+0.024
osc+0.024
 gag+0.024
Neg logit contributions
We-0.046
 Friends-0.043
 I-0.042
 of-0.042
By-0.042
Token,
Token ID11
Feature activation-2.15e-02

Pos logit contributions
We+0.027
By+0.025
 deserve+0.025
 Friends+0.024
 Have+0.024
Neg logit contributions
 sob-0.027
lee-0.027
unct-0.026
iously-0.026
osc-0.025
Token he
Token ID339
Feature activation+5.40e-03

Pos logit contributions
 sob+0.040
lee+0.038
unct+0.038
 gn+0.037
osc+0.037
Neg logit contributions
We-0.027
 Friends-0.024
 deserve-0.024
By-0.024
 of-0.023
Token w
Token ID266
Feature activation-1.22e-02

Pos logit contributions
We+0.009
 Friends+0.008
 of+0.008
 I+0.008
By+0.008
Neg logit contributions
 sob-0.008
unct-0.007
lee-0.007
osc-0.007
 gag-0.007
Tokenagged
Token ID14655
Feature activation-2.90e-03

Pos logit contributions
 sob+0.018
lee+0.018
unct+0.018
osc+0.017
 breaths+0.017
Neg logit contributions
We-0.020
 I-0.019
 Friends-0.018
 deserve-0.018
 Have-0.018
Token his
Token ID465
Feature activation-1.16e-01

Pos logit contributions
unct+0.005
 gag+0.005
 sob+0.005
osc+0.005
 Majesty+0.005
Neg logit contributions
 of-0.004
We-0.004
 I-0.004
By-0.004
 Friends-0.004
Token tail
Token ID7894
Feature activation-4.35e-02

Pos logit contributions
 sob+0.135
unct+0.132
osc+0.127
lee+0.126
 gag+0.125
Neg logit contributions
We-0.223
 Friends-0.211
 of-0.210
 I-0.209
By-0.208
Token and
Token ID290
Feature activation-2.37e-02

Pos logit contributions
 sob+0.067
unct+0.066
osc+0.065
lee+0.064
 gag+0.064
Neg logit contributions
We-0.066
 of-0.063
 Friends-0.062
 I-0.061
By-0.061
Token w
Token ID266
Feature activation-3.21e-02

Pos logit contributions
unct+0.031
osc+0.030
 sob+0.029
 gag+0.029
lee+0.028
Neg logit contributions
We-0.042
 I-0.041
 of-0.039
 we-0.039
 Friends-0.039
Tokenigg
Token ID6950
Feature activation-2.68e-03

Pos logit contributions
 sob+0.051
lee+0.049
unct+0.049
osc+0.048
 breaths+0.047
Neg logit contributions
We-0.050
 I-0.047
 Friends-0.045
 deserve-0.045
 of-0.045
Tokenled
Token ID992
Feature activation-1.09e-02

Pos logit contributions
 sob+0.004
 gag+0.004
osc+0.004
unct+0.004
 gn+0.004
Neg logit contributions
We-0.004
 of-0.004
 Friends-0.004
 deserve-0.004
 I-0.004
Token house
Token ID2156
Feature activation-2.22e-02

Pos logit contributions
 sob+0.031
unct+0.031
lee+0.031
osc+0.030
 gag+0.029
Neg logit contributions
We-0.042
 of-0.039
 I-0.039
 Friends-0.038
 deserve-0.038
Token.
Token ID13
Feature activation-5.84e-03

Pos logit contributions
 sob+0.031
unct+0.029
osc+0.029
lee+0.029
gy+0.028
Neg logit contributions
We-0.038
 Friends-0.035
By-0.035
 deserve-0.034
 I-0.034
Token Max
Token ID5436
Feature activation-1.60e-02

Pos logit contributions
unct+0.010
 sob+0.010
osc+0.009
 gag+0.009
 breaths+0.009
Neg logit contributions
We-0.008
 I-0.008
 Friends-0.008
 of-0.008
By-0.008
Token w
Token ID266
Feature activation-1.75e-02

Pos logit contributions
unct+0.028
 gag+0.026
lee+0.026
osc+0.026
 breaths+0.026
Neg logit contributions
We-0.022
 I-0.020
 of-0.020
 deserve-0.019
By-0.019
Tokenagged
Token ID14655
Feature activation-1.16e-02

Pos logit contributions
 sob+0.025
lee+0.025
osc+0.024
unct+0.024
 breaths+0.024
Neg logit contributions
We-0.031
 I-0.029
 Have-0.027
 we-0.027
By-0.027
Token his
Token ID465
Feature activation-1.16e-01

Pos logit contributions
unct+0.023
 Majesty+0.022
 gag+0.021
lee+0.021
 Sab+0.020
Neg logit contributions
 of-0.015
In-0.012
By-0.012
 I-0.012
We-0.012
Token tail
Token ID7894
Feature activation-4.11e-02

Pos logit contributions
unct+0.158
 Sab+0.148
 sob+0.146
lee+0.146
 gag+0.144
Neg logit contributions
We-0.198
 Friends-0.189
 I-0.189
 of-0.189
By-0.187
Token and
Token ID290
Feature activation-2.26e-02

Pos logit contributions
unct+0.069
 sob+0.066
lee+0.066
 gag+0.066
osc+0.066
Neg logit contributions
 of-0.058
We-0.057
By-0.054
 Friends-0.054
 I-0.053
Token agreed
Token ID4987
Feature activation-1.57e-02

Pos logit contributions
unct+0.027
osc+0.025
lee+0.024
 gag+0.024
 sob+0.023
Neg logit contributions
We-0.043
 I-0.042
 of-0.040
 we-0.040
 Friends-0.040
Token to
Token ID284
Feature activation-2.43e-02

Pos logit contributions
 sob+0.026
unct+0.026
osc+0.026
lee+0.025
 gag+0.025
Neg logit contributions
We-0.022
 of-0.021
 I-0.020
 Friends-0.020
By-0.020
Token help
Token ID1037
Feature activation-1.19e-02

Pos logit contributions
 sob+0.043
unct+0.042
osc+0.042
lee+0.041
 gag+0.041
Neg logit contributions
We-0.032
 Friends-0.029
 I-0.029
 of-0.029
 deserve-0.029
Token I
Token ID314
Feature activation+3.84e-02

Pos logit contributions
We+0.015
By+0.014
 of+0.013
In+0.012
hest+0.012
Neg logit contributions
 sob-0.051
 gn-0.049
 teeth-0.048
osc-0.048
 gag-0.048
Token was
Token ID373
Feature activation+2.99e-03

Pos logit contributions
We+0.049
 I+0.045
 Friends+0.045
 of+0.044
By+0.044
Neg logit contributions
 sob-0.070
osc-0.065
 gag-0.065
unct-0.065
gy-0.064
Token just
Token ID655
Feature activation+2.88e-02

Pos logit contributions
We+0.005
By+0.005
 Friends+0.004
 deserve+0.004
 of+0.004
Neg logit contributions
 sob-0.005
unct-0.004
lee-0.004
osc-0.004
 gag-0.004
Token curious
Token ID11040
Feature activation+2.77e-02

Pos logit contributions
We+0.049
 I+0.046
 of+0.045
 Friends+0.045
 deserve+0.044
Neg logit contributions
 sob-0.040
unct-0.039
osc-0.039
lee-0.038
 gag-0.038
Token.
Token ID13
Feature activation+1.74e-02

Pos logit contributions
We+0.052
By+0.050
In+0.048
 deserve+0.048
 Friends+0.048
Neg logit contributions
 sob-0.034
lee-0.031
 breaths-0.030
osc-0.030
 teeth-0.029
Token You
Token ID921
Feature activation+6.87e-02

Pos logit contributions
We+0.013
 of+0.011
By+0.011
 makers+0.010
 deserve+0.010
Neg logit contributions
 sob-0.040
 gn-0.038
osc-0.037
 Sab-0.037
 teeth-0.037
Token are
Token ID389
Feature activation+1.26e-02

Pos logit contributions
 hour+0.078
 Have+0.074
 Friends+0.074
We+0.073
By+0.071
Neg logit contributions
 sob-0.134
 gigg-0.119
 teeth-0.119
gy-0.119
 weep-0.116
Token different
Token ID1180
Feature activation+5.70e-02

Pos logit contributions
We+0.023
By+0.022
 Friends+0.022
ite+0.022
t+0.021
Neg logit contributions
 sob-0.017
 gn-0.014
 ears-0.014
 teeth-0.014
 whim-0.014
Token from
Token ID422
Feature activation+1.24e-02

Pos logit contributions
We+0.106
By+0.104
 Have+0.097
 deserve+0.097
In+0.097
Neg logit contributions
 sob-0.073
lee-0.066
osc-0.065
 breaths-0.065
 teeth-0.065
Token me
Token ID502
Feature activation-2.21e-02

Pos logit contributions
We+0.016
 I+0.015
 of+0.014
 Friends+0.014
 deserve+0.014
Neg logit contributions
 sob-0.023
osc-0.022
unct-0.022
 gag-0.022
lee-0.021
Token,
Token ID11
Feature activation-3.43e-03

Pos logit contributions
 sob+0.033
unct+0.031
osc+0.031
lee+0.031
 gag+0.030
Neg logit contributions
We-0.036
 Friends-0.033
By-0.033
 I-0.032
 of-0.032
Token fix
Token ID4259
Feature activation+5.02e-03

Pos logit contributions
 sob+0.035
 gn+0.033
gy+0.032
unct+0.031
osc+0.031
Neg logit contributions
We-0.046
 Friends-0.046
 Have-0.044
By-0.043
 importantly-0.041
Token the
Token ID262
Feature activation-1.03e-02

Pos logit contributions
 I+0.005
 of+0.005
We+0.005
 you+0.005
 we+0.005
Neg logit contributions
unct-0.010
 sob-0.010
 gag-0.010
lee-0.010
 Majesty-0.010
Token tape
Token ID9154
Feature activation-1.34e-02

Pos logit contributions
 sob+0.017
 gn+0.016
osc+0.016
 bed+0.016
gy+0.016
Neg logit contributions
We-0.014
 Friends-0.014
By-0.013
 deserve-0.013
 of-0.013
Token.
Token ID13
Feature activation-1.50e-02

Pos logit contributions
 sob+0.017
gy+0.017
osc+0.016
unct+0.016
 gn+0.016
Neg logit contributions
We-0.024
By-0.023
 Friends-0.023
 Have-0.022
 deserve-0.022
Token "
Token ID366
Feature activation+3.31e-02

Pos logit contributions
 sob+0.024
unct+0.024
osc+0.023
lee+0.023
 gag+0.023
Neg logit contributions
We-0.022
 Friends-0.020
 I-0.020
 of-0.020
By-0.020
TokenWhat
Token ID2061
Feature activation+7.87e-02

Pos logit contributions
 of+0.045
 Friends+0.044
 we+0.042
 I+0.041
 importantly+0.039
Neg logit contributions
 sob-0.056
osc-0.052
 gn-0.048
 Sab-0.047
 gag-0.047
Token's
Token ID338
Feature activation+3.85e-02

Pos logit contributions
By+0.119
 Have+0.112
We+0.110
 Friends+0.105
 hour+0.104
Neg logit contributions
 sob-0.124
 teeth-0.117
 yawn-0.102
 daughter-0.099
 gag-0.099
Token wrong
Token ID2642
Feature activation+2.17e-02

Pos logit contributions
We+0.073
 Friends+0.072
 of+0.070
By+0.070
 Have+0.068
Neg logit contributions
 sob-0.045
unct-0.042
 breaths-0.041
 up-0.040
lee-0.037
Token?"
Token ID1701
Feature activation-2.44e-02

Pos logit contributions
We+0.032
 I+0.031
 of+0.031
 Friends+0.030
 deserve+0.029
Neg logit contributions
unct-0.035
 sob-0.034
osc-0.034
lee-0.034
 gag-0.033
Token he
Token ID339
Feature activation-1.56e-02

Pos logit contributions
 sob+0.042
unct+0.041
osc+0.041
lee+0.041
 gag+0.040
Neg logit contributions
We-0.033
 I-0.031
 of-0.030
 Friends-0.030
By-0.030
Token asked
Token ID1965
Feature activation-4.48e-02

Pos logit contributions
 Majesty+0.032
enery+0.032
icol+0.032
unct+0.032
osc+0.032
Neg logit contributions
 I-0.016
We-0.015
 of-0.015
 we-0.014
 you-0.014
Token like
Token ID588
Feature activation+4.96e-02

Pos logit contributions
 of+0.008
 I+0.008
We+0.008
 can+0.007
 deserve+0.007
Neg logit contributions
osc-0.017
unct-0.017
 sob-0.017
gy-0.017
 gag-0.017
Token a
Token ID257
Feature activation+1.23e-02

Pos logit contributions
We+0.091
 Friends+0.086
By+0.085
 deserve+0.083
In+0.080
Neg logit contributions
 sob-0.060
 gn-0.059
lee-0.058
 Sab-0.056
 breaths-0.056
Token crystal
Token ID15121
Feature activation+3.76e-02

Pos logit contributions
We+0.018
 I+0.018
 of+0.017
By+0.016
 Friends+0.016
Neg logit contributions
osc-0.020
 sob-0.020
lee-0.019
unct-0.019
 gag-0.019
Token.
Token ID13
Feature activation+2.43e-02

Pos logit contributions
We+0.070
By+0.067
 Friends+0.066
 Have+0.065
 deserve+0.063
Neg logit contributions
 sob-0.047
 gn-0.043
unct-0.043
gy-0.043
lee-0.042
Token I
Token ID314
Feature activation+4.56e-02

Pos logit contributions
We+0.026
 I+0.023
 Friends+0.023
 of+0.023
By+0.022
Neg logit contributions
 sob-0.050
unct-0.049
osc-0.048
lee-0.048
 gag-0.047
Token think
Token ID892
Feature activation+5.40e-02

Pos logit contributions
We+0.065
 Friends+0.060
 I+0.059
By+0.059
 of+0.059
Neg logit contributions
 sob-0.076
unct-0.073
osc-0.072
lee-0.071
 gag-0.071
Token it
Token ID340
Feature activation+6.04e-02

Pos logit contributions
We+0.078
By+0.071
 Friends+0.071
 deserve+0.069
 Have+0.067
Neg logit contributions
 sob-0.089
lee-0.087
unct-0.086
 Sab-0.084
 gn-0.084
Token's
Token ID338
Feature activation+1.95e-02

Pos logit contributions
We+0.074
 I+0.067
 deserve+0.066
 of+0.066
 Friends+0.065
Neg logit contributions
 sob-0.113
unct-0.112
lee-0.111
osc-0.109
 gag-0.109
Token fragile
Token ID21049
Feature activation+5.35e-02

Pos logit contributions
We+0.031
 I+0.029
 of+0.029
 deserve+0.028
 Friends+0.028
Neg logit contributions
 sob-0.029
unct-0.029
osc-0.028
lee-0.028
 gag-0.028
Token,"
Token ID553
Feature activation-1.29e-02

Pos logit contributions
We+0.084
 of+0.081
 I+0.081
 deserve+0.079
 Friends+0.078
Neg logit contributions
 sob-0.079
unct-0.078
osc-0.077
lee-0.076
 gag-0.075
Token said
Token ID531
Feature activation-4.70e-02

Pos logit contributions
 gag+0.026
osc+0.026
unct+0.026
 breaths+0.025
 Majesty+0.025
Neg logit contributions
We-0.015
 I-0.014
 Friends-0.013
 of-0.013
 you-0.013
Token They
Token ID1119
Feature activation+2.80e-02

Pos logit contributions
 sob+0.004
unct+0.004
osc+0.004
lee+0.003
 gag+0.003
Neg logit contributions
We-0.004
 Friends-0.004
 of-0.004
 I-0.004
By-0.004
Token were
Token ID547
Feature activation-1.11e-02

Pos logit contributions
 Friends+0.046
We+0.045
 Have+0.044
By+0.043
 I+0.042
Neg logit contributions
 sob-0.041
unct-0.038
lee-0.038
gy-0.037
 Sab-0.037
Token so
Token ID523
Feature activation-1.39e-02

Pos logit contributions
 sob+0.016
unct+0.016
osc+0.015
 gag+0.015
lee+0.015
Neg logit contributions
We-0.018
 of-0.018
 I-0.017
By-0.017
 deserve-0.017
Token colorful
Token ID20239
Feature activation+6.27e-02

Pos logit contributions
 sob+0.017
unct+0.016
osc+0.016
lee+0.016
 gag+0.016
Neg logit contributions
We-0.026
 Friends-0.025
 of-0.024
 I-0.024
By-0.024
Token and
Token ID290
Feature activation+1.55e-02

Pos logit contributions
We+0.110
By+0.106
 deserve+0.105
 Friends+0.103
 Have+0.102
Neg logit contributions
 gn-0.079
 sob-0.079
gy-0.076
lee-0.076
osc-0.076
Token shiny
Token ID22441
Feature activation+5.56e-02

Pos logit contributions
We+0.025
 Friends+0.024
By+0.024
 of+0.023
 deserve+0.023
Neg logit contributions
 sob-0.022
lee-0.021
unct-0.021
 gn-0.021
osc-0.020
Token!
Token ID0
Feature activation+3.60e-03

Pos logit contributions
We+0.092
 Friends+0.086
By+0.085
 deserve+0.084
 I+0.084
Neg logit contributions
 sob-0.080
unct-0.076
lee-0.075
osc-0.075
 breaths-0.074
Token
Token ID198
Feature activation+3.80e-03

Pos logit contributions
We+0.006
 Friends+0.005
By+0.005
 deserve+0.005
 of+0.005
Neg logit contributions
 sob-0.005
lee-0.005
unct-0.005
osc-0.005
 gag-0.005
Token
Token ID198
Feature activation+1.04e-02

Pos logit contributions
We+0.005
 Friends+0.005
 deserve+0.005
 I+0.005
 Have+0.005
Neg logit contributions
 sob-0.006
lee-0.006
osc-0.006
unct-0.006
 Sab-0.006
TokenL
Token ID43
Feature activation+5.85e-03

Pos logit contributions
 Friends+0.015
 Have+0.014
 I+0.014
We+0.014
 deserve+0.014
Neg logit contributions
 sob-0.016
lee-0.014
 gn-0.014
osc-0.014
 nervously-0.013
Tokenily
Token ID813
Feature activation-4.08e-03

Pos logit contributions
We+0.008
 Friends+0.007
By+0.007
 Have+0.007
 I+0.007
Neg logit contributions
 sob-0.011
lee-0.010
unct-0.010
osc-0.010
 gn-0.010
TokenHello
Token ID15496
Feature activation+1.14e-02

Pos logit contributions
 of+0.054
 Friends+0.054
We+0.052
 Have+0.052
 deserve+0.052
Neg logit contributions
 sob-0.059
 gn-0.053
 Sab-0.052
unct-0.052
osc-0.052
Token,
Token ID11
Feature activation+3.75e-02

Pos logit contributions
 I+0.019
We+0.019
 of+0.018
 we+0.018
 Friends+0.017
Neg logit contributions
unct-0.017
 gag-0.017
osc-0.017
 sob-0.017
lee-0.016
Token little
Token ID1310
Feature activation+2.62e-02

Pos logit contributions
We+0.067
By+0.063
 Friends+0.062
 deserve+0.061
In+0.060
Neg logit contributions
 sob-0.051
 gn-0.045
 Sab-0.044
lee-0.042
 babys-0.042
Token girl
Token ID2576
Feature activation+9.36e-03

Pos logit contributions
We+0.042
 Friends+0.039
 deserve+0.039
By+0.039
 of+0.039
Neg logit contributions
 sob-0.039
unct-0.036
osc-0.036
lee-0.036
 gn-0.036
Token.
Token ID13
Feature activation+4.07e-02

Pos logit contributions
We+0.015
 of+0.014
 I+0.014
 Friends+0.013
 we+0.013
Neg logit contributions
unct-0.014
 sob-0.014
 gag-0.014
osc-0.014
lee-0.014
Token What
Token ID1867
Feature activation+8.25e-02

Pos logit contributions
We+0.046
By+0.042
 of+0.042
 deserve+0.042
 Friends+0.041
Neg logit contributions
 sob-0.078
unct-0.075
osc-0.075
 gn-0.074
lee-0.074
Token are
Token ID389
Feature activation+6.72e-02

Pos logit contributions
We+0.093
By+0.090
 hour+0.083
 Have+0.081
packing+0.081
Neg logit contributions
 sob-0.157
 teeth-0.147
 gn-0.141
 Sab-0.140
 yawn-0.140
Token you
Token ID345
Feature activation+3.33e-02

Pos logit contributions
 deserve+0.051
We+0.047
By+0.047
 Friends+0.044
 Have+0.041
Neg logit contributions
 Sab-0.148
unct-0.145
 sob-0.143
 Tinker-0.139
 K-0.139
Token doing
Token ID1804
Feature activation+4.44e-02

Pos logit contributions
 I+0.033
 you+0.031
We+0.031
 of+0.028
 we+0.027
Neg logit contributions
 gag-0.074
 breaths-0.072
osc-0.072
iously-0.072
lee-0.070
Token with
Token ID351
Feature activation+6.17e-02

Pos logit contributions
We+0.060
By+0.057
 deserve+0.056
 Friends+0.055
 Have+0.054
Neg logit contributions
 sob-0.077
unct-0.072
 gn-0.071
osc-0.071
gy-0.070
Token that
Token ID326
Feature activation+1.09e-02

Pos logit contributions
We+0.077
 I+0.072
 of+0.071
 Friends+0.070
 deserve+0.068
Neg logit contributions
 sob-0.114
unct-0.112
osc-0.109
 gag-0.109
lee-0.109
Token.
Token ID13
Feature activation-1.19e-02

Pos logit contributions
unct+0.059
 sob+0.058
 Majesty+0.058
lee+0.057
osc+0.057
Neg logit contributions
We-0.049
 of-0.047
 Friends-0.045
 I-0.045
 we-0.043
Token "
Token ID366
Feature activation+1.98e-02

Pos logit contributions
 sob+0.020
lee+0.019
unct+0.019
osc+0.019
 gag+0.019
Neg logit contributions
We-0.017
 deserve-0.015
By-0.015
 of-0.015
 Friends-0.015
TokenBut
Token ID1537
Feature activation+4.55e-02

Pos logit contributions
 of+0.025
 importantly+0.024
 Friends+0.022
 I+0.021
igator+0.021
Neg logit contributions
osc-0.033
 sob-0.031
j-0.030
unct-0.030
pee-0.030
Token I
Token ID314
Feature activation+4.36e-02

Pos logit contributions
We+0.057
By+0.056
 deserve+0.056
 Friends+0.053
 of+0.051
Neg logit contributions
 sob-0.079
 knees-0.078
 Sab-0.075
unct-0.074
 gn-0.073
Token will
Token ID481
Feature activation-1.77e-02

Pos logit contributions
We+0.063
 Friends+0.060
 importantly+0.058
By+0.055
ions+0.054
Neg logit contributions
 sob-0.067
 yawn-0.066
 gn-0.064
gy-0.064
 gigg-0.064
Token be
Token ID307
Feature activation-7.28e-03

Pos logit contributions
 sob+0.026
 gn+0.024
unct+0.024
gy+0.023
osc+0.023
Neg logit contributions
We-0.029
 Friends-0.027
By-0.026
 of-0.025
 importantly-0.025
Token the
Token ID262
Feature activation+2.43e-02

Pos logit contributions
 sob+0.010
unct+0.009
lee+0.009
osc+0.009
 gn+0.009
Neg logit contributions
We-0.013
 Friends-0.012
By-0.012
 of-0.012
 I-0.012
Token winner
Token ID8464
Feature activation+5.37e-02

Pos logit contributions
We+0.032
 Friends+0.031
 of+0.029
 deserve+0.028
By+0.028
Neg logit contributions
 gn-0.041
 sob-0.040
unct-0.039
 Majesty-0.039
 heels-0.038
Token!"
Token ID2474
Feature activation-9.59e-03

Pos logit contributions
We+0.105
In+0.103
By+0.103
 Friends+0.098
I+0.096
Neg logit contributions
iously-0.051
 Sab-0.046
 knees-0.045
 Majesty-0.045
 breaths-0.044
Token
Token ID198
Feature activation+1.47e-02

Pos logit contributions
 sob+0.018
unct+0.018
osc+0.017
lee+0.017
 gag+0.017
Neg logit contributions
We-0.012
 Friends-0.011
 of-0.011
 I-0.011
By-0.010
Token
Token ID198
Feature activation+2.34e-02

Pos logit contributions
We+0.018
 of+0.016
 Friends+0.016
 I+0.016
By+0.015
Neg logit contributions
 sob-0.028
unct-0.028
osc-0.027
 gag-0.027
lee-0.027
Token this
Token ID428
Feature activation-2.39e-02

Pos logit contributions
 of+0.013
 I+0.013
We+0.013
 you+0.013
 we+0.012
Neg logit contributions
 gag-0.017
 sob-0.017
unct-0.016
osc-0.015
lee-0.015
Token path
Token ID3108
Feature activation+2.55e-02

Pos logit contributions
 sob+0.037
 gag+0.037
osc+0.036
unct+0.036
lee+0.035
Neg logit contributions
 of-0.038
 I-0.037
 we-0.035
We-0.034
 deserve-0.034
Token and
Token ID290
Feature activation+1.23e-02

Pos logit contributions
We+0.045
By+0.043
 Friends+0.042
 deserve+0.041
 importantly+0.039
Neg logit contributions
gy-0.031
lee-0.030
iously-0.030
unct-0.030
 sob-0.030
Token it
Token ID340
Feature activation+3.86e-02

Pos logit contributions
We+0.018
 Friends+0.017
 of+0.016
By+0.016
 deserve+0.016
Neg logit contributions
 sob-0.019
unct-0.019
 gn-0.019
gy-0.018
 Sab-0.018
Token will
Token ID481
Feature activation+2.54e-03

Pos logit contributions
We+0.052
 Friends+0.049
By+0.048
 of+0.046
 Have+0.045
Neg logit contributions
 sob-0.065
unct-0.064
gy-0.063
 gn-0.062
 Sab-0.062
Token lead
Token ID1085
Feature activation+1.93e-02

Pos logit contributions
We+0.004
 Friends+0.004
 importantly+0.004
 makers+0.004
By+0.004
Neg logit contributions
 gn-0.003
unct-0.003
 sob-0.003
 Sab-0.003
gy-0.003
Token you
Token ID345
Feature activation-3.40e-02

Pos logit contributions
We+0.020
 Friends+0.018
By+0.017
 of+0.017
 I+0.017
Neg logit contributions
 sob-0.039
unct-0.038
osc-0.038
lee-0.038
 gag-0.037
Token to
Token ID284
Feature activation-3.25e-02

Pos logit contributions
 sob+0.048
unct+0.047
osc+0.046
lee+0.045
gy+0.045
Neg logit contributions
We-0.056
 Friends-0.053
By-0.051
 I-0.050
 of-0.050
Token success
Token ID1943
Feature activation+2.89e-02

Pos logit contributions
 sob+0.051
unct+0.049
osc+0.048
lee+0.048
 gag+0.048
Neg logit contributions
We-0.050
 I-0.047
 of-0.047
 Friends-0.046
 deserve-0.046
Token."
Token ID526
Feature activation-1.60e-02

Pos logit contributions
We+0.048
By+0.045
 Friends+0.044
 deserve+0.043
 Have+0.043
Neg logit contributions
 sob-0.039
lee-0.038
osc-0.038
unct-0.038
 breaths-0.037
Token
Token ID198
Feature activation+1.06e-02

Pos logit contributions
 sob+0.026
unct+0.025
osc+0.024
lee+0.024
 gag+0.024
Neg logit contributions
We-0.024
 Friends-0.022
 I-0.022
 of-0.022
By-0.022
Token a
Token ID257
Feature activation+9.80e-03

Pos logit contributions
We+0.066
 of+0.061
 I+0.061
By+0.060
 Friends+0.059
Neg logit contributions
unct-0.062
 sob-0.062
lee-0.061
osc-0.060
 breaths-0.059
Token time
Token ID640
Feature activation+2.64e-03

Pos logit contributions
We+0.013
 I+0.012
 Friends+0.012
 deserve+0.012
 Have+0.012
Neg logit contributions
 sob-0.017
osc-0.017
 yawn-0.016
 gag-0.016
gy-0.015
Token,
Token ID11
Feature activation-1.13e-02

Pos logit contributions
We+0.005
 Friends+0.005
 deserve+0.005
By+0.005
 I+0.005
Neg logit contributions
 sob-0.003
unct-0.003
lee-0.003
osc-0.003
 gn-0.003
Token there
Token ID612
Feature activation+3.89e-02

Pos logit contributions
 sob+0.019
gy+0.018
 gn+0.017
 Sab+0.016
osc+0.016
Neg logit contributions
 Friends-0.016
We-0.016
 deserve-0.016
By-0.015
 Have-0.014
Token was
Token ID373
Feature activation-1.24e-02

Pos logit contributions
We+0.057
 Friends+0.052
 of+0.052
 I+0.052
By+0.051
Neg logit contributions
 sob-0.064
unct-0.062
osc-0.061
lee-0.061
 gag-0.060
Token a
Token ID257
Feature activation+3.27e-03

Pos logit contributions
unct+0.021
 sob+0.020
lee+0.020
osc+0.020
 breaths+0.020
Neg logit contributions
We-0.019
 of-0.018
 I-0.017
By-0.017
 Friends-0.016
Token little
Token ID1310
Feature activation+1.66e-02

Pos logit contributions
We+0.005
 I+0.005
 of+0.005
 Friends+0.005
By+0.005
Neg logit contributions
unct-0.005
osc-0.005
 sob-0.005
 breaths-0.005
lee-0.005
Token girl
Token ID2576
Feature activation+1.58e-02

Pos logit contributions
We+0.026
 Friends+0.024
 of+0.024
 I+0.024
By+0.024
Neg logit contributions
 sob-0.025
unct-0.024
osc-0.024
lee-0.024
 gag-0.023
Token named
Token ID3706
Feature activation-7.39e-03

Pos logit contributions
We+0.027
 Friends+0.025
 I+0.024
 of+0.024
By+0.024
Neg logit contributions
 sob-0.023
unct-0.022
osc-0.021
lee-0.021
 gag-0.021
Token Sarah
Token ID10490
Feature activation+6.87e-03

Pos logit contributions
unct+0.014
 breaths+0.013
 gag+0.013
iously+0.013
osc+0.013
Neg logit contributions
 I-0.011
We-0.010
 of-0.010
 we-0.009
 you-0.009
Token who
Token ID508
Feature activation+4.18e-02

Pos logit contributions
We+0.013
By+0.012
 Friends+0.012
 Have+0.012
 deserve+0.012
Neg logit contributions
 sob-0.009
gy-0.008
lee-0.008
osc-0.008
 gn-0.008
Token hesitated
Token ID41723
Feature activation-1.94e-03

Pos logit contributions
unct+0.036
 sob+0.036
 gag+0.035
lee+0.035
osc+0.035
Neg logit contributions
We-0.037
 of-0.034
 I-0.034
By-0.034
 Friends-0.033
Token.
Token ID13
Feature activation-1.57e-02

Pos logit contributions
 sob+0.003
osc+0.003
unct+0.003
gy+0.003
lee+0.003
Neg logit contributions
We-0.003
 of-0.003
 I-0.003
 Friends-0.003
By-0.002
Token He
Token ID679
Feature activation+1.33e-02

Pos logit contributions
 sob+0.029
unct+0.029
osc+0.028
lee+0.028
 gag+0.028
Neg logit contributions
We-0.020
 Friends-0.018
 I-0.018
 of-0.018
By-0.017
Token didn
Token ID1422
Feature activation-2.35e-02

Pos logit contributions
We+0.017
 of+0.015
By+0.015
 I+0.015
 Friends+0.015
Neg logit contributions
 sob-0.024
unct-0.024
osc-0.023
lee-0.023
 gag-0.023
Token't
Token ID470
Feature activation-7.26e-03

Pos logit contributions
 sob+0.048
unct+0.047
osc+0.046
lee+0.046
 gag+0.046
Neg logit contributions
We-0.025
 Friends-0.022
 deserve-0.021
By-0.021
 I-0.021
Token want
Token ID765
Feature activation+3.45e-02

Pos logit contributions
 sob+0.014
unct+0.014
lee+0.014
osc+0.014
 gag+0.014
Neg logit contributions
We-0.008
 of-0.007
 I-0.007
 deserve-0.007
 Friends-0.007
Token to
Token ID284
Feature activation-3.63e-02

Pos logit contributions
We+0.049
 of+0.047
 I+0.046
 Friends+0.045
 deserve+0.044
Neg logit contributions
 sob-0.058
unct-0.057
osc-0.056
lee-0.055
 gag-0.055
Token go
Token ID467
Feature activation-4.59e-02

Pos logit contributions
lee+0.069
 sob+0.067
osc+0.067
unct+0.065
 breaths+0.065
Neg logit contributions
 I-0.047
 of-0.045
 deserve-0.043
We-0.043
 we-0.042
Token outside
Token ID2354
Feature activation-2.25e-02

Pos logit contributions
 sob+0.085
 gag+0.084
lee+0.084
unct+0.083
osc+0.083
Neg logit contributions
 of-0.065
We-0.062
 I-0.060
 we-0.056
 you-0.053
Token.
Token ID13
Feature activation-1.38e-02

Pos logit contributions
unct+0.036
 sob+0.036
lee+0.035
osc+0.034
 gag+0.034
Neg logit contributions
 of-0.034
We-0.034
 I-0.032
 Friends-0.031
 we-0.030
Token He
Token ID679
Feature activation+1.64e-02

Pos logit contributions
 sob+0.026
unct+0.025
osc+0.025
lee+0.025
 gag+0.024
Neg logit contributions
We-0.017
 Friends-0.016
 I-0.016
 of-0.016
By-0.015
Token Lily
Token ID20037
Feature activation+1.48e-02

Pos logit contributions
We+0.062
By+0.057
 Friends+0.056
 Have+0.055
 deserve+0.054
Neg logit contributions
 sob-0.059
 gn-0.050
lee-0.048
 Sab-0.048
 footsteps-0.047
Token,
Token ID11
Feature activation+4.18e-02

Pos logit contributions
We+0.031
By+0.029
 Friends+0.029
 deserve+0.028
 Have+0.028
Neg logit contributions
 sob-0.015
osc-0.014
lee-0.014
gy-0.014
unct-0.014
Token you
Token ID345
Feature activation+5.31e-02

Pos logit contributions
We+0.049
 Friends+0.045
By+0.044
 deserve+0.044
In+0.041
Neg logit contributions
 sob-0.082
 gn-0.075
unct-0.074
 knees-0.074
 Sab-0.073
Token can
Token ID460
Feature activation-1.69e-02

Pos logit contributions
We+0.038
 Friends+0.032
By+0.026
 of+0.024
You+0.024
Neg logit contributions
 sob-0.125
lee-0.121
unct-0.119
 gn-0.118
 Majesty-0.118
Token try
Token ID1949
Feature activation+2.68e-04

Pos logit contributions
 sob+0.025
unct+0.025
osc+0.024
gy+0.024
 gag+0.024
Neg logit contributions
We-0.026
By-0.024
 Friends-0.024
 deserve-0.024
 of-0.023
Token.
Token ID13
Feature activation+3.96e-02

Pos logit contributions
 I+0.000
 we+0.000
 you+0.000
We+0.000
 of+0.000
Neg logit contributions
 sob-0.001
 gag-0.001
unct-0.001
osc-0.001
lee-0.001
Token But
Token ID887
Feature activation+5.55e-02

Pos logit contributions
We+0.057
 of+0.053
 deserve+0.052
By+0.052
 Friends+0.049
Neg logit contributions
 sob-0.063
osc-0.062
 Sab-0.059
 gn-0.059
unct-0.058
Token you
Token ID345
Feature activation+5.48e-02

Pos logit contributions
We+0.094
 Friends+0.087
 deserve+0.086
By+0.085
 of+0.082
Neg logit contributions
 sob-0.078
unct-0.075
 knees-0.073
 gn-0.071
 Sab-0.070
Token have
Token ID423
Feature activation+5.12e-02

Pos logit contributions
We+0.067
 Friends+0.064
 importantly+0.056
By+0.054
 Have+0.053
Neg logit contributions
 sob-0.102
 yawn-0.099
gy-0.097
 gn-0.097
 gag-0.095
Token to
Token ID284
Feature activation+4.98e-03

Pos logit contributions
We+0.091
 Friends+0.087
By+0.086
 deserve+0.082
 Have+0.081
Neg logit contributions
 sob-0.068
 breaths-0.062
osc-0.061
lee-0.061
 teeth-0.060
Token give
Token ID1577
Feature activation+1.32e-02

Pos logit contributions
We+0.007
 I+0.006
 of+0.006
 Friends+0.006
 deserve+0.006
Neg logit contributions
 sob-0.009
unct-0.008
lee-0.008
osc-0.008
 gag-0.008
Token<|endoftext|>
Token ID50256
Feature activation+4.84e-03

Pos logit contributions
We+0.012
By+0.012
 deserve+0.012
 makers+0.011
 of+0.011
Neg logit contributions
 sob-0.007
lee-0.006
gy-0.006
 Sab-0.006
 nervously-0.006
TokenIn
Token ID818
Feature activation-4.26e-04

Pos logit contributions
We+0.007
 Friends+0.006
 of+0.006
 I+0.006
By+0.006
Neg logit contributions
 sob-0.008
unct-0.008
osc-0.008
lee-0.008
 gag-0.008
Token the
Token ID262
Feature activation-8.92e-03

Pos logit contributions
 sob+0.000
 gn+0.000
gy+0.000
 breaths+0.000
 Sab+0.000
Neg logit contributions
We-0.001
 deserve-0.001
 Friends-0.001
By-0.001
 makers-0.001
Token morning
Token ID3329
Feature activation-5.18e-02

Pos logit contributions
 sob+0.014
unct+0.013
lee+0.013
osc+0.013
 gag+0.013
Neg logit contributions
We-0.014
 Friends-0.013
 of-0.013
 I-0.013
By-0.013
Token,
Token ID11
Feature activation-2.38e-02

Pos logit contributions
 sob+0.063
unct+0.060
lee+0.059
 gn+0.057
osc+0.057
Neg logit contributions
We-0.097
 Friends-0.092
 deserve-0.091
By-0.089
 Have-0.088
Token Mom
Token ID11254
Feature activation-1.81e-02

Pos logit contributions
 sob+0.038
gy+0.038
unct+0.036
 gn+0.034
lee+0.034
Neg logit contributions
 Friends-0.034
We-0.033
 deserve-0.033
By-0.031
 Have-0.030
Tokenmy
Token ID1820
Feature activation-2.90e-03

Pos logit contributions
lee+0.020
 sob+0.020
gy+0.020
unct+0.019
 gn+0.019
Neg logit contributions
We-0.034
By-0.034
 Friends-0.034
 deserve-0.032
 Have-0.032
Token reminded
Token ID14516
Feature activation+2.44e-02

Pos logit contributions
 sob+0.004
lee+0.004
gy+0.004
unct+0.004
 gn+0.004
Neg logit contributions
We-0.005
 Friends-0.005
By-0.005
 deserve-0.005
 Have-0.004
Token little
Token ID1310
Feature activation+2.50e-02

Pos logit contributions
We+0.043
 of+0.043
 I+0.041
 Friends+0.040
By+0.040
Neg logit contributions
 sob-0.032
 gn-0.031
unct-0.031
osc-0.031
 Majesty-0.031
Token Sus
Token ID8932
Feature activation+8.88e-04

Pos logit contributions
We+0.041
 I+0.038
 of+0.038
 Friends+0.038
By+0.038
Neg logit contributions
 sob-0.036
unct-0.035
osc-0.034
 gag-0.034
lee-0.034
Tokenie
Token ID494
Feature activation-7.26e-03

Pos logit contributions
We+0.002
 of+0.002
 I+0.002
 Friends+0.002
 deserve+0.002
Neg logit contributions
 sob-0.001
 gag-0.001
unct-0.001
osc-0.001
 gn-0.001
Token.
Token ID13
Feature activation-1.49e-02

Pos logit contributions
 sob+0.075
unct+0.075
lee+0.075
osc+0.074
 Majesty+0.074
Neg logit contributions
 of-0.055
 I-0.051
We-0.050
 we-0.046
 deserve-0.046
Token But
Token ID887
Feature activation+8.94e-03

Pos logit contributions
unct+0.029
 breaths+0.028
osc+0.028
 gag+0.028
 sob+0.027
Neg logit contributions
We-0.018
 I-0.018
 of-0.017
 Friends-0.017
 we-0.016
Token he
Token ID339
Feature activation+1.44e-02

Pos logit contributions
 I+0.011
 of+0.010
We+0.010
 we+0.010
By+0.009
Neg logit contributions
osc-0.017
lee-0.017
 gag-0.017
unct-0.017
 sob-0.017
Token couldn
Token ID3521
Feature activation-1.31e-02

Pos logit contributions
We+0.018
 of+0.017
 I+0.017
By+0.016
I+0.016
Neg logit contributions
osc-0.026
 breaths-0.026
unct-0.026
 Majesty-0.026
 gag-0.025
Token't
Token ID470
Feature activation-1.22e-02

Pos logit contributions
 sob+0.021
gy+0.021
unct+0.020
 gn+0.020
osc+0.020
Neg logit contributions
We-0.018
 Friends-0.017
 deserve-0.017
By-0.017
 importantly-0.016
Token do
Token ID466
Feature activation-7.99e-03

Pos logit contributions
 sob+0.020
lee+0.020
unct+0.020
 Majesty+0.020
osc+0.020
Neg logit contributions
We-0.017
 of-0.017
 I-0.016
 deserve-0.016
By-0.016
Token it
Token ID340
Feature activation-1.86e-02

Pos logit contributions
 sob+0.017
osc+0.016
lee+0.016
unct+0.016
 gag+0.016
Neg logit contributions
We-0.008
 I-0.008
 of-0.008
 we-0.007
 you-0.007
Token alone
Token ID3436
Feature activation+5.71e-03

Pos logit contributions
 sob+0.032
lee+0.031
osc+0.031
 Majesty+0.031
 gag+0.031
Neg logit contributions
 of-0.027
We-0.025
 I-0.025
 we-0.024
 deserve-0.023
Token.
Token ID13
Feature activation-3.73e-03

Pos logit contributions
We+0.009
 of+0.008
 I+0.008
 Friends+0.008
 deserve+0.008
Neg logit contributions
 sob-0.009
unct-0.009
lee-0.009
 gag-0.009
osc-0.009
Token
Token ID198
Feature activation+2.20e-03

Pos logit contributions
unct+0.007
 breaths+0.007
osc+0.007
 gag+0.007
 sob+0.007
Neg logit contributions
We-0.004
 I-0.004
 Friends-0.004
 of-0.004
I-0.004
Token
Token ID198
Feature activation+9.08e-03

Pos logit contributions
We+0.003
 of+0.002
 Friends+0.002
 I+0.002
By+0.002
Neg logit contributions
 sob-0.004
unct-0.004
osc-0.004
lee-0.004
 gag-0.004
Token his
Token ID465
Feature activation-4.53e-02

Pos logit contributions
 of+0.036
We+0.035
 I+0.034
 Friends+0.034
 deserve+0.032
Neg logit contributions
 sob-0.034
unct-0.033
 gag-0.033
osc-0.032
gy-0.032
Token friends
Token ID2460
Feature activation-1.83e-02

Pos logit contributions
 sob+0.070
osc+0.068
unct+0.068
 gag+0.066
lee+0.064
Neg logit contributions
We-0.068
 of-0.068
 Friends-0.066
 I-0.065
 we-0.064
Token.
Token ID13
Feature activation+1.06e-03

Pos logit contributions
 sob+0.028
unct+0.027
osc+0.027
lee+0.027
 gag+0.026
Neg logit contributions
We-0.028
 of-0.026
 Friends-0.026
 I-0.026
By-0.026
Token He
Token ID679
Feature activation+1.24e-02

Pos logit contributions
We+0.002
 deserve+0.002
By+0.002
 of+0.002
 makers+0.002
Neg logit contributions
lee-0.001
 sob-0.001
gy-0.001
 Sab-0.001
osc-0.001
Token wanted
Token ID2227
Feature activation+2.66e-02

Pos logit contributions
We+0.019
 Friends+0.018
 I+0.017
 of+0.017
By+0.017
Neg logit contributions
 sob-0.019
unct-0.019
lee-0.019
osc-0.018
 gn-0.018
Token them
Token ID606
Feature activation-1.22e-02

Pos logit contributions
We+0.039
 of+0.036
 I+0.036
 Friends+0.036
 deserve+0.035
Neg logit contributions
 sob-0.044
unct-0.043
osc-0.042
 gag-0.042
lee-0.041
Token to
Token ID284
Feature activation-1.50e-02

Pos logit contributions
 sob+0.018
unct+0.018
osc+0.017
lee+0.017
 gag+0.017
Neg logit contributions
We-0.020
 Friends-0.018
 of-0.018
 I-0.018
By-0.018
Token see
Token ID766
Feature activation+2.38e-02

Pos logit contributions
 sob+0.022
 gn+0.021
unct+0.021
lee+0.021
osc+0.021
Neg logit contributions
We-0.025
 Friends-0.022
By-0.022
 of-0.021
 deserve-0.021
Token how
Token ID703
Feature activation-2.88e-03

Pos logit contributions
We+0.038
By+0.036
 Friends+0.035
 deserve+0.035
 I+0.034
Neg logit contributions
 sob-0.035
lee-0.033
unct-0.033
osc-0.033
 gn-0.032
Token powerful
Token ID3665
Feature activation+5.45e-02

Pos logit contributions
lee+0.004
 sob+0.004
unct+0.004
 gn+0.004
 gag+0.004
Neg logit contributions
We-0.004
 Friends-0.004
By-0.004
 deserve-0.004
 Have-0.004
Token he
Token ID339
Feature activation+1.94e-02

Pos logit contributions
We+0.077
By+0.076
 deserve+0.072
In+0.070
 Friends+0.070
Neg logit contributions
 sob-0.083
 gn-0.082
 breaths-0.081
lee-0.081
 Sab-0.080
Token that
Token ID326
Feature activation+1.97e-02

Pos logit contributions
We+0.041
 Friends+0.038
By+0.038
 deserve+0.038
 Have+0.037
Neg logit contributions
 sob-0.037
lee-0.035
 gn-0.035
osc-0.035
unct-0.035
Token looked
Token ID3114
Feature activation+1.77e-02

Pos logit contributions
We+0.030
 Friends+0.029
By+0.029
 deserve+0.028
 Have+0.028
Neg logit contributions
 sob-0.030
 Sab-0.028
 gn-0.028
lee-0.028
unct-0.028
Token sour
Token ID11348
Feature activation+1.89e-02

Pos logit contributions
We+0.022
 of+0.021
 I+0.020
 deserve+0.019
 we+0.018
Neg logit contributions
unct-0.033
osc-0.033
 sob-0.033
gy-0.032
 gag-0.031
Token.
Token ID13
Feature activation-1.41e-02

Pos logit contributions
We+0.032
 Friends+0.030
By+0.030
 deserve+0.029
 I+0.029
Neg logit contributions
 sob-0.027
lee-0.025
unct-0.025
osc-0.025
 gag-0.024
Token She
Token ID1375
Feature activation+1.17e-02

Pos logit contributions
unct+0.025
osc+0.023
gy+0.023
 sob+0.022
 breaths+0.022
Neg logit contributions
 I-0.021
We-0.021
 of-0.020
 Friends-0.019
I-0.018
Token said
Token ID531
Feature activation-2.31e-02

Pos logit contributions
We+0.020
 Friends+0.019
By+0.019
 Have+0.019
 deserve+0.019
Neg logit contributions
 sob-0.017
lee-0.015
unct-0.015
osc-0.015
 gn-0.015
Token it
Token ID340
Feature activation+2.98e-02

Pos logit contributions
osc+0.043
unct+0.043
gy+0.042
 sob+0.042
 gag+0.041
Neg logit contributions
 I-0.029
We-0.029
 of-0.028
 we-0.026
 Friends-0.025
Token smelled
Token ID37609
Feature activation+1.73e-02

Pos logit contributions
We+0.038
 I+0.035
 of+0.035
 Friends+0.033
 deserve+0.033
Neg logit contributions
unct-0.055
 sob-0.053
lee-0.053
osc-0.052
 breaths-0.052
Token like
Token ID588
Feature activation+4.38e-02

Pos logit contributions
We+0.028
 Friends+0.026
By+0.026
 I+0.026
 of+0.025
Neg logit contributions
 sob-0.025
unct-0.024
lee-0.024
osc-0.024
 gag-0.024
Token it
Token ID340
Feature activation+3.56e-02

Pos logit contributions
We+0.074
 Friends+0.071
By+0.069
 deserve+0.069
 Have+0.067
Neg logit contributions
 sob-0.060
lee-0.057
 gn-0.056
unct-0.055
 gag-0.055
Token was
Token ID373
Feature activation-1.70e-03

Pos logit contributions
We+0.055
By+0.053
 Friends+0.052
 Have+0.049
 deserve+0.049
Neg logit contributions
 sob-0.056
lee-0.051
unct-0.051
osc-0.051
 gag-0.051
Token together
Token ID1978
Feature activation+1.86e-02

Pos logit contributions
 deserve+0.050
We+0.050
By+0.050
I+0.048
In+0.047
Neg logit contributions
gy-0.036
osc-0.036
 breaths-0.034
iously-0.034
lee-0.034
Token,
Token ID11
Feature activation-1.50e-02

Pos logit contributions
We+0.039
 deserve+0.037
By+0.037
 I+0.036
body+0.035
Neg logit contributions
 sob-0.016
osc-0.015
gy-0.015
lee-0.015
iously-0.015
Token including
Token ID1390
Feature activation+3.25e-02

Pos logit contributions
 sob+0.017
osc+0.017
lee+0.016
 gigg+0.016
 ears+0.016
Neg logit contributions
We-0.027
 deserve-0.026
By-0.025
 Friends-0.025
 I-0.025
Token racing
Token ID11717
Feature activation+2.99e-02

Pos logit contributions
We+0.056
By+0.053
 deserve+0.052
 Friends+0.052
 Have+0.050
Neg logit contributions
 sob-0.041
lee-0.039
osc-0.038
 gn-0.038
unct-0.038
Token each
Token ID1123
Feature activation+1.40e-02

Pos logit contributions
We+0.050
By+0.048
 deserve+0.047
 I+0.045
 Friends+0.044
Neg logit contributions
lee-0.038
gy-0.038
 sob-0.038
osc-0.038
 heels-0.037
Token other
Token ID584
Feature activation-2.98e-02

Pos logit contributions
We+0.029
By+0.028
 Friends+0.027
 I+0.027
packing+0.027
Neg logit contributions
 heels-0.011
 home-0.011
 ears-0.010
osc-0.010
 daughter-0.010
Token.
Token ID13
Feature activation+2.33e-03

Pos logit contributions
 sob+0.038
lee+0.037
osc+0.037
gy+0.037
 breaths+0.036
Neg logit contributions
We-0.052
By-0.049
 deserve-0.047
 Friends-0.047
 I-0.046
Token<|endoftext|>
Token ID50256
Feature activation+9.24e-03

Pos logit contributions
We+0.004
By+0.003
body+0.003
 deserve+0.003
packing+0.003
Neg logit contributions
lee-0.003
 sob-0.003
osc-0.003
 teeth-0.003
gy-0.003
TokenOnce
Token ID7454
Feature activation-5.04e-02

Pos logit contributions
We+0.014
 deserve+0.013
 Friends+0.013
By+0.013
 I+0.013
Neg logit contributions
 sob-0.014
osc-0.013
lee-0.013
unct-0.013
 gn-0.013
Token upon
Token ID2402
Feature activation+4.14e-02

Pos logit contributions
lee+0.060
 sob+0.058
unct+0.055
 Sab+0.054
ail+0.052
Neg logit contributions
We-0.097
 deserve-0.094
By-0.091
 Friends-0.090
 Have-0.088
Token a
Token ID257
Feature activation+8.30e-03

Pos logit contributions
We+0.068
 of+0.063
 I+0.063
 Friends+0.062
By+0.062
Neg logit contributions
 sob-0.060
unct-0.059
lee-0.058
osc-0.057
 breaths-0.056
Token too
Token ID1165
Feature activation-6.43e-03

Pos logit contributions
 sob+0.017
unct+0.016
osc+0.016
lee+0.016
 gag+0.016
Neg logit contributions
We-0.019
 of-0.018
 I-0.018
 Friends-0.018
By-0.017
Token fast
Token ID3049
Feature activation+1.87e-02

Pos logit contributions
 sob+0.009
unct+0.009
lee+0.008
osc+0.008
 gag+0.008
Neg logit contributions
We-0.011
 Friends-0.010
 I-0.010
 of-0.010
By-0.010
Token.
Token ID13
Feature activation+6.32e-03

Pos logit contributions
We+0.037
By+0.035
 deserve+0.034
 Friends+0.034
In+0.034
Neg logit contributions
 sob-0.022
 breaths-0.020
iously-0.019
osc-0.019
gy-0.019
Token It
Token ID632
Feature activation+4.39e-02

Pos logit contributions
We+0.010
 I+0.009
 Friends+0.009
 of+0.009
By+0.009
Neg logit contributions
 sob-0.010
unct-0.010
osc-0.010
lee-0.010
 gag-0.010
Token chased
Token ID26172
Feature activation-2.90e-03

Pos logit contributions
We+0.071
 Friends+0.067
 I+0.065
 of+0.065
 deserve+0.065
Neg logit contributions
 sob-0.066
unct-0.062
osc-0.061
lee-0.061
 gn-0.060
Token her
Token ID607
Feature activation-8.62e-02

Pos logit contributions
 gag+0.006
 sob+0.006
 Majesty+0.005
unct+0.005
 gn+0.005
Neg logit contributions
We-0.004
 of-0.003
 I-0.003
 you-0.003
By-0.003
Token all
Token ID477
Feature activation+1.93e-03

Pos logit contributions
unct+0.139
lee+0.138
 sob+0.138
 gag+0.135
osc+0.134
Neg logit contributions
We-0.122
 of-0.122
 I-0.119
 Friends-0.116
 we-0.111
Token the
Token ID262
Feature activation-2.22e-02

Pos logit contributions
We+0.003
 Friends+0.003
By+0.003
 deserve+0.003
 Have+0.003
Neg logit contributions
 breaths-0.003
 sob-0.003
lee-0.003
osc-0.003
unct-0.003
Token way
Token ID835
Feature activation-2.56e-02

Pos logit contributions
 sob+0.027
osc+0.025
unct+0.025
 breaths+0.024
 gn+0.024
Neg logit contributions
We-0.043
 Friends-0.040
By-0.040
 I-0.039
 of-0.039
Token home
Token ID1363
Feature activation-2.58e-02

Pos logit contributions
 sob+0.031
unct+0.030
lee+0.030
gy+0.029
osc+0.029
Neg logit contributions
We-0.047
 Friends-0.045
 deserve-0.044
By-0.044
 Have-0.043
Token and
Token ID290
Feature activation-1.31e-03

Pos logit contributions
 sob+0.039
unct+0.039
 gag+0.038
osc+0.038
lee+0.037
Neg logit contributions
We-0.040
 of-0.039
 I-0.038
 Friends-0.036
 deserve-0.036
Token he
Token ID339
Feature activation+5.57e-03

Pos logit contributions
 sob+0.001
unct+0.001
 gag+0.001
lee+0.001
 gn+0.001
Neg logit contributions
We-0.001
 Friends-0.001
 deserve-0.001
By-0.001
 Have-0.001
Token gives
Token ID3607
Feature activation+1.59e-02

Pos logit contributions
We+0.008
By+0.008
 Friends+0.008
 of+0.008
 deserve+0.008
Neg logit contributions
 sob-0.009
unct-0.008
lee-0.008
osc-0.008
 gn-0.008
Token the
Token ID262
Feature activation-3.19e-03

Pos logit contributions
 of+0.024
 I+0.023
We+0.022
 you+0.022
I+0.021
Neg logit contributions
osc-0.028
 sob-0.027
 gag-0.026
unct-0.026
iously-0.026
Token car
Token ID1097
Feature activation-2.02e-02

Pos logit contributions
 sob+0.004
 gn+0.003
 babys+0.003
osc+0.003
 gag+0.003
Neg logit contributions
We-0.006
 Friends-0.006
By-0.006
 deserve-0.006
 I-0.006
Token to
Token ID284
Feature activation-3.10e-02

Pos logit contributions
 sob+0.028
unct+0.026
osc+0.026
lee+0.026
 gag+0.025
Neg logit contributions
We-0.035
 of-0.033
 Friends-0.033
 I-0.033
By-0.032
Token Mia
Token ID32189
Feature activation-7.61e-02

Pos logit contributions
unct+0.045
osc+0.045
 sob+0.045
lee+0.042
gy+0.042
Neg logit contributions
We-0.051
 I-0.050
 of-0.050
 Friends-0.047
By-0.046
Token.
Token ID13
Feature activation-2.04e-02

Pos logit contributions
 sob+0.120
unct+0.119
lee+0.115
osc+0.114
 gag+0.114
Neg logit contributions
We-0.114
 of-0.108
 I-0.107
 Friends-0.106
 deserve-0.103
Token He
Token ID679
Feature activation+1.43e-02

Pos logit contributions
unct+0.034
osc+0.033
 sob+0.033
gy+0.032
lee+0.031
Neg logit contributions
We-0.030
 I-0.029
 of-0.028
 Friends-0.028
By-0.027
Token says
Token ID1139
Feature activation-1.20e-02

Pos logit contributions
We+0.025
By+0.024
 Friends+0.023
 deserve+0.023
 Have+0.022
Neg logit contributions
 sob-0.020
 gn-0.019
lee-0.018
unct-0.018
osc-0.018
Token,
Token ID11
Feature activation-2.12e-02

Pos logit contributions
 sob+0.017
unct+0.017
osc+0.017
lee+0.017
gy+0.017
Neg logit contributions
 I-0.020
 of-0.020
We-0.019
 we-0.019
 Friends-0.019
Token "
Token ID366
Feature activation+4.72e-02

Pos logit contributions
 sob+0.036
unct+0.034
osc+0.034
lee+0.034
 gn+0.033
Neg logit contributions
We-0.030
 Friends-0.027
 deserve-0.027
 of-0.027
By-0.026
Token you
Token ID345
Feature activation-4.47e-02

Pos logit contributions
 you+0.012
We+0.011
 I+0.010
 of+0.010
 we+0.009
Neg logit contributions
osc-0.067
gy-0.066
enery-0.065
lee-0.064
 gn-0.064
Token better
Token ID1365
Feature activation-2.19e-02

Pos logit contributions
lee+0.066
unct+0.062
iously+0.062
 Sab+0.061
osc+0.059
Neg logit contributions
 of-0.077
We-0.075
 deserve-0.072
 I-0.071
 we-0.070
Token."
Token ID526
Feature activation-2.89e-02

Pos logit contributions
 sob+0.040
lee+0.040
unct+0.040
osc+0.039
 gag+0.038
Neg logit contributions
We-0.029
 of-0.026
 I-0.026
 Friends-0.026
 deserve-0.025
Token She
Token ID1375
Feature activation+4.83e-03

Pos logit contributions
unct+0.056
 sob+0.055
osc+0.055
 gag+0.054
lee+0.054
Neg logit contributions
We-0.035
 Friends-0.032
 I-0.031
 of-0.031
By-0.029
Token took
Token ID1718
Feature activation-5.57e-03

Pos logit contributions
We+0.007
 of+0.006
 Friends+0.006
 I+0.006
By+0.006
Neg logit contributions
 sob-0.008
unct-0.008
osc-0.008
lee-0.008
 gag-0.008
Token Ben
Token ID3932
Feature activation-9.82e-02

Pos logit contributions
 sob+0.009
unct+0.009
lee+0.009
osc+0.009
 gag+0.009
Neg logit contributions
We-0.008
 of-0.008
 I-0.008
 Friends-0.007
By-0.007
Token's
Token ID338
Feature activation-7.72e-02

Pos logit contributions
unct+0.166
lee+0.166
 Majesty+0.162
 sob+0.159
 gag+0.159
Neg logit contributions
 of-0.138
We-0.137
 deserve-0.122
 Friends-0.122
 I-0.122
Token hand
Token ID1021
Feature activation-4.75e-02

Pos logit contributions
unct+0.104
lee+0.103
 sob+0.100
osc+0.097
 Sab+0.096
Neg logit contributions
We-0.135
 of-0.129
 I-0.129
 Friends-0.125
 deserve-0.122
Token and
Token ID290
Feature activation-2.08e-02

Pos logit contributions
 Majesty+0.111
unct+0.103
lee+0.097
 gn+0.095
 gag+0.095
Neg logit contributions
 of-0.058
We-0.049
 can-0.040
 Friends-0.039
 I-0.038
Token followed
Token ID3940
Feature activation-2.02e-02

Pos logit contributions
 sob+0.031
unct+0.029
osc+0.029
lee+0.029
 gag+0.028
Neg logit contributions
We-0.034
 Friends-0.031
 of-0.031
By-0.031
 I-0.031
Token their
Token ID511
Feature activation-7.74e-02

Pos logit contributions
 Majesty+0.037
unct+0.036
 sob+0.036
osc+0.035
 gn+0.035
Neg logit contributions
 of-0.026
 I-0.025
We-0.025
 you-0.025
 we-0.024
Token alert
Token ID7995
Feature activation+9.05e-03

Pos logit contributions
 sob+0.017
lee+0.015
unct+0.015
 gag+0.015
 breaths+0.015
Neg logit contributions
We-0.029
 Friends-0.028
 Have-0.027
 I-0.026
 deserve-0.026
Token.
Token ID13
Feature activation-1.62e-02

Pos logit contributions
We+0.014
 of+0.013
 I+0.013
 Friends+0.013
By+0.013
Neg logit contributions
 sob-0.014
unct-0.014
osc-0.013
lee-0.013
 gag-0.013
Token His
Token ID2399
Feature activation-3.67e-02

Pos logit contributions
 sob+0.032
unct+0.031
osc+0.031
 gag+0.030
 breaths+0.030
Neg logit contributions
We-0.019
 I-0.018
 Friends-0.017
 of-0.017
 Have-0.017
Token Mom
Token ID11254
Feature activation-1.25e-02

Pos logit contributions
osc+0.043
unct+0.043
 sob+0.042
 Sab+0.041
gy+0.040
Neg logit contributions
We-0.068
 of-0.066
 I-0.065
 Friends-0.065
By-0.063
Token hugged
Token ID41130
Feature activation+6.87e-03

Pos logit contributions
 sob+0.019
unct+0.019
osc+0.018
 gag+0.018
lee+0.018
Neg logit contributions
We-0.020
 of-0.018
 I-0.018
 Friends-0.018
By-0.018
Token him
Token ID683
Feature activation-7.72e-02

Pos logit contributions
We+0.011
 of+0.011
 I+0.010
 Friends+0.010
By+0.010
Neg logit contributions
 sob-0.010
unct-0.010
 gag-0.010
osc-0.010
lee-0.010
Token and
Token ID290
Feature activation-2.04e-03

Pos logit contributions
unct+0.150
 Sab+0.146
 Majesty+0.145
lee+0.143
 gag+0.143
Neg logit contributions
 of-0.095
 I-0.082
We-0.079
 Friends-0.078
 we-0.073
Token said
Token ID531
Feature activation-2.17e-02

Pos logit contributions
 sob+0.003
unct+0.003
lee+0.003
 gn+0.003
osc+0.003
Neg logit contributions
We-0.003
 Friends-0.003
By-0.003
 deserve-0.003
 Have-0.003
Token,
Token ID11
Feature activation-1.90e-02

Pos logit contributions
 sob+0.029
unct+0.028
osc+0.028
lee+0.027
 gag+0.027
Neg logit contributions
We-0.038
 I-0.036
 of-0.036
 Friends-0.035
By-0.035
Token
Token ID6184
Feature activation+9.71e-04

Pos logit contributions
 sob+0.033
unct+0.031
lee+0.031
osc+0.031
 gag+0.030
Neg logit contributions
We-0.026
 Friends-0.024
 of-0.024
 deserve-0.024
By-0.024
Token
Token ID95
Feature activation+9.60e-04

Pos logit contributions
We+0.001
 of+0.001
 Friends+0.001
 deserve+0.001
 I+0.001
Neg logit contributions
 sob-0.001
unct-0.001
osc-0.001
 gn-0.001
lee-0.001
TokenSam
Token ID16305
Feature activation-1.59e-02

Pos logit contributions
 yours+0.020
 I+0.019
 deserve+0.018
 Friends+0.018
 Have+0.018
Neg logit contributions
 sob-0.030
 gn-0.029
 wounds-0.027
osc-0.026
 whim-0.026
Token smiled
Token ID13541
Feature activation+1.33e-05

Pos logit contributions
unct+0.026
 gag+0.025
 sob+0.025
lee+0.024
osc+0.024
Neg logit contributions
We-0.024
 of-0.022
 I-0.021
By-0.021
 Friends-0.021
Token.
Token ID13
Feature activation-1.35e-02

Pos logit contributions
We+0.000
 of+0.000
 I+0.000
 Friends+0.000
By+0.000
Neg logit contributions
unct-0.000
osc-0.000
 sob-0.000
 gag-0.000
lee-0.000
Token He
Token ID679
Feature activation+1.57e-02

Pos logit contributions
 sob+0.022
lee+0.021
 gn+0.020
osc+0.020
 Sab+0.020
Neg logit contributions
We-0.020
 deserve-0.018
By-0.018
 Friends-0.018
 of-0.017
Token followed
Token ID3940
Feature activation-1.89e-02

Pos logit contributions
We+0.024
 Friends+0.022
 of+0.022
 I+0.022
By+0.022
Neg logit contributions
 sob-0.025
unct-0.024
osc-0.024
lee-0.024
 gag-0.023
Token Tom
Token ID4186
Feature activation-7.48e-02

Pos logit contributions
 sob+0.037
 gag+0.037
unct+0.037
 Majesty+0.036
osc+0.035
Neg logit contributions
 of-0.023
We-0.022
 I-0.021
 you-0.020
 we-0.020
Token and
Token ID290
Feature activation-1.95e-02

Pos logit contributions
unct+0.134
 gag+0.133
 sob+0.130
lee+0.129
 Majesty+0.127
Neg logit contributions
 of-0.102
We-0.093
 I-0.089
 Friends-0.088
 we-0.085
Token they
Token ID484
Feature activation-9.19e-03

Pos logit contributions
unct+0.030
 gag+0.029
osc+0.029
 sob+0.029
lee+0.029
Neg logit contributions
We-0.030
 I-0.029
 of-0.028
By-0.027
 we-0.027
Token looked
Token ID3114
Feature activation-2.05e-03

Pos logit contributions
unct+0.017
iously+0.016
 Majesty+0.016
 gag+0.016
 heartbeat+0.016
Neg logit contributions
We-0.012
By-0.011
 of-0.011
 can-0.010
In-0.010
Token for
Token ID329
Feature activation+1.73e-02

Pos logit contributions
 sob+0.004
unct+0.004
osc+0.004
 gag+0.004
lee+0.004
Neg logit contributions
We-0.002
 of-0.002
 Friends-0.002
 I-0.002
By-0.002
Token Anna
Token ID11735
Feature activation-4.52e-02

Pos logit contributions
 of+0.025
We+0.025
 I+0.024
 we+0.023
 Friends+0.022
Neg logit contributions
unct-0.030
gy-0.029
osc-0.028
 sob-0.028
 gag-0.028