TOP ACTIVATIONS
MAX = 5.971

all he wanted for his 3 rd birthday was to try
new baby brother for the 3 year old to meet .
The dad told the 3 year old son , "
Dad showed mom and the 3 year old a big tree
was always there teaching the 3 - year - old girl
toys and games for his 3 year old daughter to enjoy
't like all of the unnecessary extravag ance .
my said , " S oph ie , let 's go
The princess also showed the 3 year old how to draw
! The parents told their 3 - year - old child
. She told the little 3 year old boy to come
and said , " S oph ie , no matter how
mum said , " R alph , I need you to
, so he said " Stan , let 's stretch your
my said , " S oph ia , don 't be
The m ama bear , pap a bear and baby bear
, they proudly showed their 3 year old the thing they
. She explained to her 3 year old daughter what they
was sad , because her pap a never had time to
instinct can often save us unnecessary danger . <|endoftext|> Once upon

MIN ACTIVATIONS
MIN = -4.517

Mom asks . " Are you okay ?" S
!" " Are you okay ?" Ben asked .
was so relieved â he knew that if
!" " Are you OK ?" Mia says .
was so happy â she had so much
And even though he was still lonely sometimes , he knew
mom said , " Are you ready to take care of
Are you ready ? â M
Are you ok ? â He
and said , " Are you okay ?" Lily said ,
Sue says . " Are you fine ?" "
the rabbit . " Are you hungry ? I have some
and said , " Are you okay , Tim my ?"
and asked , " Are you glad to have pizza for
was too late â the ice melted and
and said , " Are you ready to open the box
ara says , " Are you OK , bird ? That
she continued to make more dolls , Jane learned how to
. " Are you ready ?" Tom asked .
. " Are you okay , Sara ?" Ben

INTERVAL 3.349 - 5.971
CONTAINS 0.059%

. The moral of the story is to always listen to
, " Yes , it sure is ! I 'm so
for your things and for others . That is what grand
from all the other tasks Piper had seen before .
. The moral of the story is to be kind and

INTERVAL 0.727 - 3.349
CONTAINS 18.898%

on , Fl uffy 's secret box stayed hidden under the
. The moral of the story is that taking
, Ben and his friends went to the park . Ben
But he knew there would be other days full of fun
, " OK , Mom . I will do it .

INTERVAL -1.895 - 0.727
CONTAINS 78.816%

myself then ! â The cat smiled
to the lake and put it in the water . He
, they wanted to go to the park . Tim was
said proudly . Her mom and dad came and
Mom hears the noise and comes to the room

INTERVAL -4.517 - -1.895
CONTAINS 2.227%

went to the park to play . While
asked . Mom smiled . " No ,
Every day Mia would blink at the radio and smile .
wants to build a big castle and Mia wants to build
Mom my was angry . She grabbed Lily and
Token all
Token ID477
Feature activation+1.16e+00

Pos logit contributions
 tunes+0.036
 films+0.034
 tune+0.033
 melodies+0.032
 songs+0.032
Neg logit contributions
m-0.019
-0.016
ll-0.015
 -0.015
ossible-0.014
Token he
Token ID339
Feature activation+1.26e+00

Pos logit contributions
 tunes+0.062
 films+0.058
rated+0.057
ening+0.056
 melodies+0.055
Neg logit contributions
-0.035
m-0.034
,-0.033
ll-0.032
 -0.031
Token wanted
Token ID2227
Feature activation+7.63e-01

Pos logit contributions
 tunes+0.060
 films+0.055
 tune+0.052
phony+0.052
 melodies+0.051
Neg logit contributions
 -0.043
-0.041
ossible-0.040
m-0.040
,-0.040
Token for
Token ID329
Feature activation+2.58e-01

Pos logit contributions
 tunes+0.053
 films+0.051
rated+0.051
rily+0.050
phony+0.050
Neg logit contributions
,-0.014
-0.013
 -0.013
 she-0.011
 are-0.010
Token his
Token ID465
Feature activation+2.09e+00

Pos logit contributions
 tunes+0.016
rated+0.016
rily+0.016
 films+0.016
phony+0.015
Neg logit contributions
,-0.006
 -0.006
-0.006
 she-0.005
 are-0.005
Token 3
Token ID513
Feature activation+5.97e+00

Pos logit contributions
 tunes+0.101
 films+0.098
 tune+0.088
 melodies+0.087
 songs+0.083
Neg logit contributions
 -0.079
-0.076
m-0.075
,-0.072
 she-0.072
Tokenrd
Token ID4372
Feature activation+4.36e-01

Pos logit contributions
 tunes+0.273
 films+0.257
rated+0.242
phony+0.240
 melodies+0.240
Neg logit contributions
 -0.244
-0.244
,-0.241
 she-0.228
 are-0.222
Token birthday
Token ID10955
Feature activation+1.30e+00

Pos logit contributions
 tunes+0.019
 films+0.018
phony+0.017
rated+0.017
rily+0.017
Neg logit contributions
,-0.019
 -0.019
-0.019
 she-0.017
 are-0.017
Token was
Token ID373
Feature activation-7.03e-02

Pos logit contributions
 tunes+0.082
 films+0.079
phony+0.075
 tune+0.075
 melodies+0.075
Neg logit contributions
 -0.030
-0.030
,-0.027
 are-0.025
 she-0.025
Token to
Token ID284
Feature activation-3.92e-01

Pos logit contributions
+0.002
 +0.002
,+0.002
 are+0.002
 she+0.002
Neg logit contributions
 tunes-0.004
 films-0.004
phony-0.004
rated-0.004
rily-0.004
Token try
Token ID1949
Feature activation+6.17e-01

Pos logit contributions
+0.012
 +0.012
,+0.011
m+0.011
 she+0.011
Neg logit contributions
 tunes-0.022
 films-0.021
phony-0.020
 melodies-0.020
 tune-0.020
Token new
Token ID649
Feature activation-6.28e-01

Pos logit contributions
 tunes+0.008
 films+0.008
 tune+0.007
phony+0.007
 melody+0.007
Neg logit contributions
 -0.006
-0.006
 are-0.006
,-0.006
m-0.006
Token baby
Token ID5156
Feature activation-1.11e-01

Pos logit contributions
 +0.025
+0.024
m+0.023
 are+0.023
 she+0.021
Neg logit contributions
 tunes-0.030
 films-0.028
 tune-0.028
 melody-0.026
 melodies-0.025
Token brother
Token ID3956
Feature activation-8.51e-02

Pos logit contributions
 +0.004
ossible+0.003
 are+0.003
+0.003
 forgive+0.003
Neg logit contributions
 tunes-0.006
 films-0.006
 tune-0.006
rily-0.005
en-0.005
Token for
Token ID329
Feature activation+8.25e-01

Pos logit contributions
 +0.002
 are+0.002
+0.002
m+0.002
 she+0.002
Neg logit contributions
 tunes-0.005
 films-0.005
en-0.005
 tune-0.005
phony-0.005
Token the
Token ID262
Feature activation+1.86e+00

Pos logit contributions
 tunes+0.053
 films+0.051
phony+0.050
rated+0.049
rily+0.049
Neg logit contributions
,-0.018
 -0.018
-0.018
 she-0.016
 are-0.015
Token 3
Token ID513
Feature activation+5.97e+00

Pos logit contributions
 tunes+0.087
 films+0.083
 tune+0.078
 melodies+0.075
rily+0.072
Neg logit contributions
 -0.077
-0.073
,-0.072
 are-0.069
ossible-0.069
Token year
Token ID614
Feature activation+1.57e+00

Pos logit contributions
 tunes+0.275
 films+0.259
phony+0.246
rated+0.245
 melodies+0.243
Neg logit contributions
 -0.242
,-0.241
-0.241
 she-0.224
 are-0.220
Token old
Token ID1468
Feature activation+1.24e+00

Pos logit contributions
 tunes+0.058
 films+0.056
ening+0.052
 melodies+0.050
rily+0.050
Neg logit contributions
 -0.077
-0.074
 are-0.073
m-0.071
,-0.070
Token to
Token ID284
Feature activation+1.01e+00

Pos logit contributions
 tunes+0.067
 films+0.063
 tune+0.059
 melodies+0.058
ening+0.056
Neg logit contributions
 -0.043
-0.042
 are-0.038
m-0.036
,-0.036
Token meet
Token ID1826
Feature activation+7.60e-01

Pos logit contributions
 tunes+0.057
 films+0.054
 melodies+0.052
 tune+0.052
phony+0.051
Neg logit contributions
 -0.029
-0.029
m-0.028
,-0.027
 she-0.026
Token.
Token ID13
Feature activation-1.11e-01

Pos logit contributions
 tunes+0.046
 films+0.045
 melodies+0.042
 tune+0.041
rily+0.041
Neg logit contributions
-0.018
 -0.018
 are-0.017
m-0.017
,-0.016
Token
Token ID198
Feature activation+8.42e-02

Pos logit contributions
,+0.003
 are+0.003
 she+0.002
 help+0.002
 pr+0.002
Neg logit contributions
 tunes-0.008
 films-0.007
rily-0.007
 chords-0.007
 tune-0.007
TokenThe
Token ID464
Feature activation-9.39e-01

Pos logit contributions
 tunes+0.004
 tune+0.003
 chords+0.003
phony+0.003
 films+0.003
Neg logit contributions
,-0.004
 are-0.004
m-0.003
 -0.003
 help-0.003
Token dad
Token ID9955
Feature activation+1.01e+00

Pos logit contributions
 +0.035
ossible+0.034
+0.033
 are+0.033
,+0.032
Neg logit contributions
 tunes-0.047
 films-0.043
 tune-0.043
 melodies-0.041
 melody-0.040
Token told
Token ID1297
Feature activation+7.69e-01

Pos logit contributions
 tunes+0.052
en+0.050
 films+0.050
ening+0.049
rily+0.048
Neg logit contributions
 -0.031
ossible-0.026
 are-0.026
 forgive-0.024
,-0.024
Token the
Token ID262
Feature activation+1.74e+00

Pos logit contributions
 tunes+0.044
 films+0.043
 melodies+0.040
rily+0.040
 tune+0.040
Neg logit contributions
-0.022
 -0.022
,-0.021
m-0.020
 she-0.020
Token 3
Token ID513
Feature activation+5.92e+00

Pos logit contributions
 tunes+0.078
 films+0.073
 tune+0.070
 melodies+0.067
rated+0.065
Neg logit contributions
 -0.072
ossible-0.070
-0.069
,-0.069
 are-0.068
Token year
Token ID614
Feature activation+1.26e+00

Pos logit contributions
 tunes+0.266
 films+0.249
rated+0.236
 melodies+0.233
 tune+0.233
Neg logit contributions
 -0.245
-0.245
,-0.242
 she-0.230
 are-0.226
Token old
Token ID1468
Feature activation+9.56e-01

Pos logit contributions
 tunes+0.046
 films+0.042
ening+0.041
lights+0.040
rily+0.040
Neg logit contributions
 -0.063
 are-0.060
-0.060
m-0.058
ossible-0.058
Token son
Token ID3367
Feature activation+2.46e+00

Pos logit contributions
 tunes+0.051
 films+0.047
 tune+0.045
 melodies+0.044
ening+0.044
Neg logit contributions
 -0.030
 are-0.030
-0.030
m-0.029
ossible-0.029
Token,
Token ID11
Feature activation+3.12e-01

Pos logit contributions
 tunes+0.117
ening+0.111
 films+0.110
en+0.110
rily+0.105
Neg logit contributions
ossible-0.073
m-0.073
 -0.068
Once-0.061
ll-0.061
Token "
Token ID366
Feature activation+1.26e+00

Pos logit contributions
 tunes+0.016
 films+0.015
 tune+0.015
 melodies+0.014
ening+0.014
Neg logit contributions
ossible-0.009
m-0.009
 -0.009
 are-0.009
-0.009
Token Dad
Token ID17415
Feature activation+8.66e-01

Pos logit contributions
 tunes+0.008
rily+0.008
 melodies+0.008
rated+0.008
izing+0.008
Neg logit contributions
 -0.004
,-0.003
-0.003
 are-0.003
-0.003
Token showed
Token ID3751
Feature activation+9.71e-01

Pos logit contributions
 tunes+0.045
 films+0.044
ening+0.043
en+0.043
rated+0.042
Neg logit contributions
 -0.022
-0.021
ossible-0.020
 forgive-0.020
 are-0.019
Token mom
Token ID1995
Feature activation+7.81e-01

Pos logit contributions
 tunes+0.063
 films+0.060
phony+0.059
rated+0.058
 melodies+0.058
Neg logit contributions
,-0.023
 -0.022
-0.022
 she-0.019
 are-0.019
Token and
Token ID290
Feature activation+1.15e+00

Pos logit contributions
 films+0.043
 tunes+0.043
en+0.041
rily+0.040
ening+0.039
Neg logit contributions
-0.022
 -0.021
 she-0.019
 forgive-0.018
,-0.018
Token the
Token ID262
Feature activation+1.63e+00

Pos logit contributions
 tunes+0.056
 films+0.054
rily+0.050
 melodies+0.049
 tune+0.049
Neg logit contributions
-0.042
 -0.039
,-0.037
m-0.036
 are-0.035
Token 3
Token ID513
Feature activation+5.70e+00

Pos logit contributions
 tunes+0.080
 films+0.077
 tune+0.071
 melodies+0.070
phony+0.069
Neg logit contributions
-0.061
,-0.060
 -0.059
 she-0.056
m-0.053
Token year
Token ID614
Feature activation+1.35e+00

Pos logit contributions
 tunes+0.252
 films+0.237
rated+0.223
 melodies+0.220
 tune+0.220
Neg logit contributions
-0.241
 -0.237
,-0.234
 she-0.223
 are-0.218
Token old
Token ID1468
Feature activation+9.05e-01

Pos logit contributions
 tunes+0.053
 films+0.050
rated+0.046
 melodies+0.046
ening+0.046
Neg logit contributions
 -0.063
-0.063
,-0.061
 she-0.059
 are-0.059
Token a
Token ID257
Feature activation+6.32e-01

Pos logit contributions
 tunes+0.045
 films+0.041
 melodies+0.039
 tune+0.039
 stations+0.038
Neg logit contributions
-0.034
 -0.032
ossible-0.030
 are-0.029
m-0.029
Token big
Token ID1263
Feature activation-1.35e-02

Pos logit contributions
 tunes+0.029
 films+0.027
 tune+0.027
phony+0.025
 melodies+0.025
Neg logit contributions
-0.025
,-0.024
 -0.023
 she-0.023
 are-0.023
Token tree
Token ID5509
Feature activation-9.30e-01

Pos logit contributions
+0.000
 +0.000
 she+0.000
m+0.000
ll+0.000
Neg logit contributions
 tunes-0.001
 films-0.001
 tune-0.001
ening-0.001
 melodies-0.001
Token was
Token ID373
Feature activation-5.85e-02

Pos logit contributions
 tunes+0.018
 tune+0.016
 melodies+0.016
en+0.016
ening+0.015
Neg logit contributions
 -0.010
ossible-0.009
-0.008
 are-0.008
m-0.008
Token always
Token ID1464
Feature activation-2.47e-01

Pos logit contributions
 +0.002
+0.002
,+0.002
 are+0.002
m+0.002
Neg logit contributions
 tunes-0.003
 films-0.003
ening-0.003
rily-0.003
 tune-0.003
Token there
Token ID612
Feature activation-1.04e+00

Pos logit contributions
 +0.007
+0.007
,+0.006
 are+0.006
ossible+0.006
Neg logit contributions
 tunes-0.014
 films-0.013
rily-0.013
ening-0.013
 tune-0.013
Token teaching
Token ID7743
Feature activation+1.91e+00

Pos logit contributions
 +0.027
ossible+0.027
+0.025
m+0.024
 are+0.021
Neg logit contributions
 tunes-0.058
ening-0.053
 tune-0.053
 films-0.052
 melodies-0.052
Token the
Token ID262
Feature activation+2.33e+00

Pos logit contributions
 tunes+0.121
 films+0.115
 melodies+0.112
en+0.110
ening+0.110
Neg logit contributions
 -0.042
-0.042
,-0.039
ossible-0.038
m-0.037
Token 3
Token ID513
Feature activation+5.70e+00

Pos logit contributions
 tunes+0.115
 films+0.107
 melodies+0.102
rated+0.102
 tune+0.102
Neg logit contributions
 -0.088
,-0.086
-0.086
 she-0.080
ossible-0.079
Token-
Token ID12
Feature activation+8.69e-01

Pos logit contributions
 tunes+0.258
 films+0.242
rated+0.229
 melodies+0.227
 tune+0.226
Neg logit contributions
 -0.234
-0.234
,-0.231
 she-0.219
 are-0.215
Tokenyear
Token ID1941
Feature activation+2.01e+00

Pos logit contributions
 tunes+0.035
 melodies+0.031
 chords+0.030
 tune+0.029
en+0.029
Neg logit contributions
 -0.039
,-0.038
ossible-0.037
 are-0.036
m-0.035
Token-
Token ID12
Feature activation+7.97e-01

Pos logit contributions
 tunes+0.088
 films+0.081
rily+0.078
ening+0.078
 melodies+0.078
Neg logit contributions
 -0.088
-0.087
,-0.081
 are-0.081
m-0.081
Tokenold
Token ID727
Feature activation+1.72e+00

Pos logit contributions
 tunes+0.033
 melodies+0.029
 chords+0.028
 tune+0.027
 songs+0.027
Neg logit contributions
 -0.036
,-0.034
ossible-0.033
 are-0.033
m-0.033
Token girl
Token ID2576
Feature activation+2.58e+00

Pos logit contributions
 tunes+0.098
 films+0.090
 melodies+0.089
 tune+0.088
en+0.087
Neg logit contributions
 -0.048
ossible-0.046
-0.045
m-0.044
 are-0.042
Token toys
Token ID14958
Feature activation-1.66e+00

Pos logit contributions
,+0.008
 +0.008
+0.007
 she+0.007
 are+0.007
Neg logit contributions
 tunes-0.010
 films-0.010
phony-0.010
rated-0.010
rily-0.010
Token and
Token ID290
Feature activation+2.12e+00

Pos logit contributions
 +0.045
+0.043
ossible+0.038
m+0.035
 forgive+0.034
Neg logit contributions
 tunes-0.097
 films-0.093
ening-0.087
 stations-0.086
en-0.085
Token games
Token ID1830
Feature activation-8.55e-02

Pos logit contributions
 tunes+0.095
 films+0.089
phony+0.086
rated+0.086
rily+0.085
Neg logit contributions
,-0.089
 -0.088
-0.088
 she-0.083
 are-0.081
Token for
Token ID329
Feature activation+1.24e+00

Pos logit contributions
+0.002
 +0.002
m+0.002
ossible+0.002
 are+0.001
Neg logit contributions
 tunes-0.005
 films-0.005
lights-0.005
 stations-0.005
 tune-0.005
Token his
Token ID465
Feature activation+2.59e+00

Pos logit contributions
rated+0.089
rily+0.084
 tunes+0.084
ening+0.083
phony+0.083
Neg logit contributions
,-0.026
 -0.023
 she-0.020
 are-0.018
 he-0.017
Token 3
Token ID513
Feature activation+5.67e+00

Pos logit contributions
 tunes+0.145
 films+0.140
 tune+0.129
 melodies+0.129
 songs+0.124
Neg logit contributions
 -0.085
-0.082
m-0.075
,-0.073
 are-0.071
Token year
Token ID614
Feature activation+1.75e+00

Pos logit contributions
 tunes+0.268
 films+0.253
phony+0.248
rily+0.244
rated+0.242
Neg logit contributions
,-0.225
 -0.220
-0.217
 she-0.203
 are-0.199
Token old
Token ID1468
Feature activation+8.97e-01

Pos logit contributions
 tunes+0.069
 films+0.065
rated+0.060
phony+0.060
 melodies+0.060
Neg logit contributions
 -0.082
-0.081
,-0.077
 are-0.077
m-0.076
Token daughter
Token ID4957
Feature activation+2.16e+00

Pos logit contributions
 tunes+0.048
 films+0.045
 melodies+0.042
 tune+0.041
 stations+0.040
Neg logit contributions
-0.032
 -0.031
m-0.028
 she-0.027
,-0.027
Token to
Token ID284
Feature activation+7.21e-01

Pos logit contributions
 tunes+0.121
 films+0.117
en+0.108
 stations+0.104
 songs+0.104
Neg logit contributions
 -0.049
m-0.049
-0.046
ossible-0.043
 forgive-0.042
Token enjoy
Token ID2883
Feature activation-3.87e-01

Pos logit contributions
 tunes+0.039
 films+0.038
 melodies+0.035
 tune+0.035
phony+0.034
Neg logit contributions
m-0.023
-0.022
 -0.021
,-0.020
 she-0.019
Token't
Token ID470
Feature activation-7.56e-01

Pos logit contributions
 +0.019
,+0.018
ossible+0.016
m+0.016
 are+0.015
Neg logit contributions
 tunes-0.052
 films-0.052
rily-0.049
phony-0.049
en-0.048
Token like
Token ID588
Feature activation-2.64e+00

Pos logit contributions
 +0.020
,+0.020
+0.020
 she+0.018
 are+0.017
Neg logit contributions
 tunes-0.046
 films-0.044
phony-0.042
rated-0.042
 melodies-0.042
Token all
Token ID477
Feature activation-5.17e-02

Pos logit contributions
,+0.075
 +0.074
+0.074
 she+0.068
 are+0.065
Neg logit contributions
 tunes-0.154
 films-0.147
rated-0.143
phony-0.143
rily-0.141
Token of
Token ID286
Feature activation+3.72e-01

Pos logit contributions
ossible+0.001
 +0.001
+0.001
m+0.001
 forgive+0.001
Neg logit contributions
 tunes-0.003
 films-0.003
ening-0.003
en-0.003
 melodies-0.003
Token the
Token ID262
Feature activation+5.63e-01

Pos logit contributions
 tunes+0.022
 films+0.021
rated+0.021
phony+0.021
rily+0.021
Neg logit contributions
,-0.010
 -0.010
-0.010
 she-0.009
 are-0.008
Token unnecessary
Token ID13114
Feature activation+5.67e+00

Pos logit contributions
 tunes+0.026
 films+0.024
 melodies+0.023
 tune+0.022
ening+0.022
Neg logit contributions
-0.024
 -0.023
ossible-0.022
,-0.022
 she-0.021
Token extravag
Token ID36827
Feature activation+2.64e+00

Pos logit contributions
 tunes+0.276
 films+0.259
 melodies+0.243
ening+0.242
phony+0.241
Neg logit contributions
 -0.215
-0.212
,-0.204
ossible-0.197
 she-0.193
Tokenance
Token ID590
Feature activation+1.86e+00

Pos logit contributions
 tunes+0.082
rated+0.074
phony+0.074
en+0.074
 films+0.073
Neg logit contributions
 -0.145
-0.142
,-0.136
 she-0.132
m-0.127
Token.
Token ID13
Feature activation-2.76e-01

Pos logit contributions
 tunes+0.127
 films+0.119
ening+0.118
 tune+0.117
rily+0.116
Neg logit contributions
 -0.035
-0.034
ossible-0.027
m-0.026
,-0.025
Token
Token ID198
Feature activation+2.46e-01

Pos logit contributions
+0.005
 are+0.005
m+0.004
,+0.004
 sc+0.004
Neg logit contributions
 tunes-0.017
 films-0.017
phony-0.016
rily-0.016
 habitat-0.016
Token
Token ID198
Feature activation+4.50e-01

Pos logit contributions
 tunes+0.017
 films+0.017
phony+0.016
rily+0.016
 melodies+0.016
Neg logit contributions
-0.003
,-0.003
 are-0.003
 -0.003
 she-0.002
Tokenmy
Token ID1820
Feature activation+9.51e-01

Pos logit contributions
 +0.009
+0.008
,+0.007
 are+0.007
 forgive+0.007
Neg logit contributions
 films-0.023
 tunes-0.023
en-0.023
rated-0.022
ening-0.022
Token said
Token ID531
Feature activation+5.67e-01

Pos logit contributions
 tunes+0.055
 films+0.054
rated+0.052
phony+0.052
en+0.051
Neg logit contributions
 -0.024
-0.023
,-0.022
 she-0.021
 are-0.020
Token,
Token ID11
Feature activation+1.10e+00

Pos logit contributions
 films+0.039
 tunes+0.039
rily+0.037
 tune+0.036
phony+0.036
Neg logit contributions
-0.006
 -0.006
m-0.005
my-0.005
ossible-0.005
Token "
Token ID366
Feature activation+1.91e+00

Pos logit contributions
 films+0.054
 tunes+0.053
rily+0.050
 tune+0.049
phony+0.049
Neg logit contributions
-0.034
 are-0.034
ossible-0.033
m-0.031
 -0.030
TokenS
Token ID50
Feature activation+1.53e+00

Pos logit contributions
 films+0.107
 tunes+0.106
lights+0.103
rily+0.100
ening+0.100
Neg logit contributions
-0.063
 -0.062
m-0.058
,-0.057
 she-0.057
Tokenoph
Token ID2522
Feature activation+5.66e+00

Pos logit contributions
 tunes+0.054
 films+0.050
phony+0.045
rily+0.045
 tune+0.045
Neg logit contributions
,-0.082
-0.080
 -0.080
 she-0.077
m-0.075
Tokenie
Token ID494
Feature activation+3.84e+00

Pos logit contributions
 tunes+0.219
 films+0.205
rated+0.196
rily+0.194
 melodies+0.192
Neg logit contributions
 -0.268
-0.266
,-0.259
 she-0.247
 are-0.243
Token,
Token ID11
Feature activation+1.10e+00

Pos logit contributions
 tunes+0.258
ening+0.251
rated+0.248
en+0.247
 films+0.245
Neg logit contributions
 -0.052
-0.047
m-0.044
 forgive-0.038
ossible-0.037
Token let
Token ID1309
Feature activation-1.49e-01

Pos logit contributions
 tunes+0.061
 films+0.059
 tune+0.056
phony+0.056
 melodies+0.055
Neg logit contributions
 -0.032
-0.030
,-0.030
ossible-0.028
m-0.028
Token's
Token ID338
Feature activation-2.58e-01

Pos logit contributions
 +0.002
,+0.002
+0.002
 she+0.002
 are+0.002
Neg logit contributions
 tunes-0.011
 films-0.010
phony-0.010
rated-0.010
 melodies-0.010
Token go
Token ID467
Feature activation-1.41e-01

Pos logit contributions
 +0.007
,+0.007
+0.007
m+0.007
 she+0.007
Neg logit contributions
 tunes-0.015
 films-0.014
phony-0.013
 melodies-0.013
 tune-0.013
TokenThe
Token ID464
Feature activation-1.25e+00

Pos logit contributions
,+0.001
 +0.001
 are+0.001
m+0.001
+0.001
Neg logit contributions
 tunes-0.001
 films-0.001
 melodies-0.001
 tune-0.001
phony-0.001
Token princess
Token ID21752
Feature activation+4.40e-01

Pos logit contributions
 +0.065
+0.064
,+0.064
 are+0.061
 she+0.060
Neg logit contributions
 tunes-0.043
 films-0.041
 melodies-0.037
 tune-0.036
phony-0.036
Token also
Token ID635
Feature activation-4.87e-01

Pos logit contributions
 tunes+0.025
 films+0.023
en+0.023
 melodies+0.022
 tune+0.022
Neg logit contributions
 -0.013
-0.011
m-0.011
 are-0.011
ossible-0.011
Token showed
Token ID3751
Feature activation+7.26e-01

Pos logit contributions
 +0.015
+0.014
,+0.014
m+0.013
 she+0.013
Neg logit contributions
 tunes-0.027
 films-0.025
phony-0.024
 melodies-0.024
rated-0.024
Token the
Token ID262
Feature activation+1.36e+00

Pos logit contributions
 tunes+0.048
 films+0.046
 melodies+0.044
phony+0.044
rily+0.044
Neg logit contributions
-0.015
 -0.015
,-0.015
 she-0.013
 are-0.013
Token 3
Token ID513
Feature activation+5.65e+00

Pos logit contributions
 tunes+0.052
 films+0.050
 melodies+0.044
 tune+0.044
phony+0.044
Neg logit contributions
-0.066
 -0.065
,-0.063
 she-0.061
ossible-0.060
Token year
Token ID614
Feature activation+8.98e-01

Pos logit contributions
 tunes+0.270
 films+0.255
phony+0.249
rated+0.246
rily+0.243
Neg logit contributions
,-0.220
 -0.218
-0.216
 she-0.199
 are-0.195
Token old
Token ID1468
Feature activation+6.75e-01

Pos logit contributions
 tunes+0.033
 films+0.030
lights+0.028
rily+0.028
ening+0.028
Neg logit contributions
 -0.045
-0.044
 are-0.043
m-0.042
,-0.041
Token how
Token ID703
Feature activation+4.32e-01

Pos logit contributions
 tunes+0.030
 films+0.028
 melodies+0.026
 tune+0.025
 songs+0.024
Neg logit contributions
-0.030
 -0.030
m-0.028
ossible-0.027
 are-0.027
Token to
Token ID284
Feature activation-3.46e-01

Pos logit contributions
 tunes+0.022
 films+0.021
 melodies+0.021
 songs+0.020
 galaxies+0.020
Neg logit contributions
m-0.013
-0.012
 -0.012
ossible-0.012
,-0.011
Token draw
Token ID3197
Feature activation-3.48e-01

Pos logit contributions
m+0.012
+0.011
 +0.011
ossible+0.010
 she+0.010
Neg logit contributions
 tunes-0.018
en-0.016
 films-0.016
 melodies-0.016
 tune-0.016
Token!
Token ID0
Feature activation+3.09e-01

Pos logit contributions
 tunes+0.073
ening+0.069
rily+0.067
rated+0.067
 tune+0.067
Neg logit contributions
-0.025
 -0.024
ossible-0.024
m-0.021
 forgive-0.021
Token The
Token ID383
Feature activation-9.22e-01

Pos logit contributions
 tunes+0.017
 films+0.016
phony+0.016
rily+0.015
rated+0.015
Neg logit contributions
,-0.009
-0.009
m-0.008
 are-0.008
 -0.008
Token parents
Token ID3397
Feature activation+1.32e+00

Pos logit contributions
,+0.029
 +0.029
+0.029
 she+0.026
 are+0.025
Neg logit contributions
 tunes-0.051
 films-0.049
phony-0.047
rated-0.047
rily-0.047
Token told
Token ID1297
Feature activation+9.65e-01

Pos logit contributions
 tunes+0.068
en+0.066
ening+0.065
 films+0.064
rated+0.062
Neg logit contributions
ossible-0.037
 -0.036
-0.034
 she-0.033
,-0.030
Token their
Token ID511
Feature activation+1.52e+00

Pos logit contributions
 tunes+0.058
 films+0.055
phony+0.053
rated+0.053
rily+0.053
Neg logit contributions
,-0.027
 -0.026
-0.026
 she-0.024
 are-0.023
Token 3
Token ID513
Feature activation+5.64e+00

Pos logit contributions
 tunes+0.077
 films+0.073
rated+0.069
phony+0.069
 melodies+0.069
Neg logit contributions
 -0.055
-0.055
,-0.054
 she-0.052
 are-0.050
Token-
Token ID12
Feature activation+7.10e-01

Pos logit contributions
 tunes+0.252
 films+0.237
rated+0.224
 tune+0.221
 melodies+0.220
Neg logit contributions
-0.236
 -0.233
,-0.226
 she-0.220
 are-0.216
Tokenyear
Token ID1941
Feature activation+2.16e+00

Pos logit contributions
 tunes+0.020
 chords+0.017
 films+0.016
 melodies+0.016
rant+0.016
Neg logit contributions
,-0.040
 -0.039
 are-0.038
-0.037
 still-0.036
Token-
Token ID12
Feature activation+8.17e-01

Pos logit contributions
 tunes+0.099
 films+0.094
rated+0.088
rily+0.088
 melodies+0.088
Neg logit contributions
 -0.088
-0.088
,-0.085
 she-0.081
 are-0.081
Tokenold
Token ID727
Feature activation+1.40e+00

Pos logit contributions
 tunes+0.029
 chords+0.026
 melodies+0.025
 films+0.025
ands+0.025
Neg logit contributions
 -0.038
,-0.038
 are-0.037
-0.036
m-0.036
Token child
Token ID1200
Feature activation+2.75e+00

Pos logit contributions
 tunes+0.074
 films+0.071
en+0.067
 tune+0.067
ening+0.067
Neg logit contributions
-0.043
 -0.042
m-0.041
 are-0.041
ossible-0.039
Token.
Token ID13
Feature activation-3.32e-01

Pos logit contributions
 tunes+0.106
ening+0.097
 films+0.096
 stations+0.094
en+0.094
Neg logit contributions
 -0.048
ossible-0.045
-0.041
m-0.040
 forgive-0.036
Token She
Token ID1375
Feature activation-4.69e-02

Pos logit contributions
 are+0.008
,+0.008
 forgive+0.008
+0.008
m+0.008
Neg logit contributions
 tunes-0.018
 films-0.018
rily-0.017
phony-0.017
ishment-0.017
Token told
Token ID1297
Feature activation+8.36e-01

Pos logit contributions
 +0.001
,+0.001
 are+0.001
+0.001
m+0.001
Neg logit contributions
 tunes-0.003
rated-0.002
 films-0.002
 melodies-0.002
en-0.002
Token the
Token ID262
Feature activation+1.54e+00

Pos logit contributions
 tunes+0.049
 films+0.048
 melodies+0.045
 tune+0.045
rily+0.045
Neg logit contributions
-0.023
 -0.023
,-0.022
m-0.020
 are-0.020
Token little
Token ID1310
Feature activation+3.79e+00

Pos logit contributions
 tunes+0.075
 films+0.070
 tune+0.066
 melodies+0.065
phony+0.062
Neg logit contributions
 -0.060
,-0.058
-0.057
ossible-0.056
 are-0.054
Token 3
Token ID513
Feature activation+5.59e+00

Pos logit contributions
 tunes+0.170
 films+0.159
 tune+0.151
 melodies+0.146
ening+0.146
Neg logit contributions
 -0.161
-0.153
 are-0.149
,-0.141
ossible-0.141
Token year
Token ID614
Feature activation+1.14e+00

Pos logit contributions
 tunes+0.251
 films+0.236
rated+0.223
 melodies+0.221
phony+0.221
Neg logit contributions
 -0.232
-0.231
,-0.228
 she-0.215
 are-0.213
Token old
Token ID1468
Feature activation+8.42e-01

Pos logit contributions
 tunes+0.042
 films+0.039
ening+0.037
 melodies+0.036
lights+0.036
Neg logit contributions
 -0.057
-0.055
 are-0.054
m-0.053
,-0.052
Token boy
Token ID2933
Feature activation+2.26e+00

Pos logit contributions
 tunes+0.044
 films+0.041
 melodies+0.039
 tune+0.039
 chords+0.038
Neg logit contributions
 -0.030
-0.028
m-0.027
 are-0.026
ossible-0.025
Token to
Token ID284
Feature activation+9.67e-01

Pos logit contributions
 tunes+0.109
 films+0.107
en+0.104
ening+0.103
rated+0.097
Neg logit contributions
m-0.066
 -0.065
ossible-0.060
Once-0.056
 are-0.053
Token come
Token ID1282
Feature activation-4.79e-02

Pos logit contributions
 tunes+0.051
 films+0.046
 tune+0.046
 melodies+0.046
en+0.045
Neg logit contributions
m-0.031
 -0.030
-0.029
ossible-0.028
,-0.027
Token and
Token ID290
Feature activation+1.78e-01

Pos logit contributions
ossible+0.064
 +0.063
+0.056
 forgive+0.054
m+0.050
Neg logit contributions
 tunes-0.113
 films-0.108
ening-0.104
izing-0.102
 stations-0.102
Token said
Token ID531
Feature activation+4.40e-01

Pos logit contributions
 tunes+0.010
 films+0.009
 melodies+0.009
 tune+0.009
rily+0.009
Neg logit contributions
-0.004
ossible-0.004
 are-0.004
 -0.004
m-0.004
Token,
Token ID11
Feature activation+5.52e-01

Pos logit contributions
 films+0.027
 tunes+0.027
rily+0.026
ening+0.025
izing+0.025
Neg logit contributions
ossible-0.008
my-0.007
m-0.006
-0.006
 -0.006
Token "
Token ID366
Feature activation+1.67e+00

Pos logit contributions
 tunes+0.026
 films+0.025
rily+0.025
 tune+0.024
ening+0.024
Neg logit contributions
ossible-0.019
 are-0.018
-0.017
m-0.016
 forgive-0.016
TokenS
Token ID50
Feature activation+1.40e+00

Pos logit contributions
 tunes+0.089
 films+0.089
lights+0.084
phony+0.083
rily+0.081
Neg logit contributions
-0.059
 -0.059
,-0.055
 she-0.055
m-0.055
Tokenoph
Token ID2522
Feature activation+5.55e+00

Pos logit contributions
 tunes+0.048
 films+0.044
phony+0.040
rily+0.040
 tune+0.040
Neg logit contributions
,-0.076
 -0.075
-0.075
 she-0.072
m-0.070
Tokenie
Token ID494
Feature activation+3.77e+00

Pos logit contributions
 tunes+0.197
 films+0.184
rated+0.180
rily+0.178
en+0.177
Neg logit contributions
 -0.278
-0.273
,-0.259
 she-0.253
 are-0.251
Token,
Token ID11
Feature activation+8.35e-01

Pos logit contributions
ening+0.212
en+0.211
uders+0.198
izing+0.198
 tunes+0.198
Neg logit contributions
ossible-0.075
m-0.075
 -0.071
She-0.064
 forgive-0.064
Token no
Token ID645
Feature activation+8.18e-02

Pos logit contributions
 tunes+0.041
 films+0.039
 tune+0.037
ening+0.037
 melodies+0.037
Neg logit contributions
 -0.028
ossible-0.026
m-0.025
,-0.025
-0.025
Token matter
Token ID2300
Feature activation+2.37e+00

Pos logit contributions
 tunes+0.005
 films+0.005
phony+0.005
rated+0.005
rily+0.005
Neg logit contributions
,-0.002
 -0.002
-0.002
 she-0.001
 are-0.001
Token how
Token ID703
Feature activation-8.95e-01

Pos logit contributions
 tunes+0.125
 films+0.119
phony+0.114
rated+0.114
 melodies+0.113
Neg logit contributions
,-0.080
 -0.080
-0.079
 she-0.073
 are-0.071
Token mum
Token ID25682
Feature activation+1.07e+00

Pos logit contributions
 tunes+0.057
 films+0.054
 tune+0.052
 melodies+0.052
phony+0.051
Neg logit contributions
 -0.024
-0.023
 she-0.021
,-0.021
m-0.020
Token said
Token ID531
Feature activation+6.96e-01

Pos logit contributions
en+0.051
 tunes+0.051
 films+0.050
rated+0.049
ening+0.049
Neg logit contributions
 -0.031
ossible-0.030
 are-0.029
 forgive-0.027
 am-0.026
Token,
Token ID11
Feature activation+1.22e+00

Pos logit contributions
 films+0.042
 tunes+0.040
rily+0.039
ening+0.038
 tune+0.037
Neg logit contributions
ossible-0.013
m-0.012
my-0.012
ll-0.012
 -0.010
Token "
Token ID366
Feature activation+1.68e+00

Pos logit contributions
 films+0.055
 tunes+0.054
rily+0.053
ening+0.051
 tune+0.050
Neg logit contributions
ossible-0.042
 are-0.041
m-0.038
 forgive-0.037
ll-0.036
TokenR
Token ID49
Feature activation+4.26e+00

Pos logit contributions
 tunes+0.081
 films+0.077
phony+0.074
rated+0.073
rily+0.072
Neg logit contributions
-0.065
 -0.065
,-0.064
 she-0.060
m-0.058
Tokenalph
Token ID17307
Feature activation+5.48e+00

Pos logit contributions
 tunes+0.152
 films+0.144
rated+0.132
phony+0.132
rily+0.131
Neg logit contributions
,-0.226
-0.219
 -0.211
m-0.201
 she-0.200
Token,
Token ID11
Feature activation+1.10e+00

Pos logit contributions
ening+0.360
en+0.360
rated+0.354
 films+0.350
 tunes+0.349
Neg logit contributions
 -0.078
-0.071
 she-0.060
 forgive-0.057
m-0.054
Token I
Token ID314
Feature activation-6.86e-01

Pos logit contributions
 tunes+0.058
 films+0.057
rily+0.052
 tune+0.052
rated+0.052
Neg logit contributions
 -0.034
,-0.032
-0.031
ossible-0.031
m-0.030
Token need
Token ID761
Feature activation+1.77e+00

Pos logit contributions
 +0.017
 she+0.017
,+0.014
ossible+0.013
m+0.013
Neg logit contributions
rated-0.036
en-0.035
 tunes-0.034
phony-0.033
 films-0.033
Token you
Token ID345
Feature activation-3.73e-01

Pos logit contributions
 tunes+0.111
 films+0.107
phony+0.102
 melodies+0.101
 tune+0.101
Neg logit contributions
 -0.041
-0.040
,-0.040
 she-0.035
m-0.034
Token to
Token ID284
Feature activation+1.15e+00

Pos logit contributions
 +0.010
+0.009
m+0.008
 she+0.008
,+0.008
Neg logit contributions
 tunes-0.021
 films-0.020
rated-0.019
 melodies-0.019
 tune-0.019
Token,
Token ID11
Feature activation-3.63e-01

Pos logit contributions
 tunes+0.062
 films+0.060
 melodies+0.057
rated+0.057
phony+0.057
Neg logit contributions
 -0.014
-0.013
,-0.013
 she-0.011
 are-0.010
Token so
Token ID523
Feature activation+1.02e+00

Pos logit contributions
 +0.011
+0.011
,+0.011
m+0.010
 are+0.010
Neg logit contributions
 tunes-0.020
 films-0.018
 tune-0.018
 melodies-0.017
rated-0.017
Token he
Token ID339
Feature activation+2.44e-01

Pos logit contributions
 tunes+0.073
 films+0.070
rated+0.068
 melodies+0.068
 tune+0.067
Neg logit contributions
 -0.015
-0.014
,-0.014
m-0.012
 are-0.011
Token said
Token ID531
Feature activation+5.75e-02

Pos logit contributions
 tunes+0.014
 films+0.013
rily+0.013
rated+0.013
phony+0.012
Neg logit contributions
 -0.007
-0.007
,-0.007
 are-0.006
m-0.006
Token "
Token ID366
Feature activation+1.28e+00

Pos logit contributions
 tunes+0.004
 films+0.003
rily+0.003
 tune+0.003
 melodies+0.003
Neg logit contributions
m-0.001
ossible-0.001
-0.001
my-0.001
 -0.001
TokenStan
Token ID32140
Feature activation+5.41e+00

Pos logit contributions
 tunes+0.055
 films+0.054
phony+0.051
rily+0.050
rated+0.049
Neg logit contributions
-0.055
 -0.055
,-0.054
 she-0.051
m-0.050
Token,
Token ID11
Feature activation+8.36e-01

Pos logit contributions
rated+0.241
 tunes+0.237
rily+0.228
ening+0.227
en+0.223
Neg logit contributions
 -0.206
 she-0.195
-0.195
m-0.182
 are-0.181
Token let
Token ID1309
Feature activation-1.35e-01

Pos logit contributions
 tunes+0.043
 films+0.040
 tune+0.038
 melodies+0.038
rated+0.037
Neg logit contributions
 -0.028
ossible-0.026
m-0.026
,-0.025
-0.025
Token's
Token ID338
Feature activation-1.44e-01

Pos logit contributions
 +0.003
,+0.003
+0.003
 she+0.003
m+0.002
Neg logit contributions
 tunes-0.009
 films-0.008
rated-0.008
phony-0.008
 melodies-0.008
Token stretch
Token ID7539
Feature activation+1.51e+00

Pos logit contributions
 +0.004
m+0.004
,+0.004
+0.004
 she+0.004
Neg logit contributions
 tunes-0.008
 films-0.007
 tune-0.007
phony-0.007
 melodies-0.007
Token your
Token ID534
Feature activation-4.14e-01

Pos logit contributions
 tunes+0.086
 films+0.082
rily+0.079
rated+0.079
 tune+0.078
Neg logit contributions
 -0.042
-0.042
,-0.041
 she-0.039
m-0.037
Tokenmy
Token ID1820
Feature activation+5.40e-01

Pos logit contributions
 am+0.052
 are+0.051
 +0.050
+0.047
 forgive+0.046
Neg logit contributions
en-0.115
 tunes-0.109
ening-0.108
rated-0.106
ples-0.101
Token said
Token ID531
Feature activation-5.98e-02

Pos logit contributions
 tunes+0.026
en+0.025
ening+0.024
 tune+0.024
 films+0.024
Neg logit contributions
ossible-0.017
 are-0.017
 -0.016
 am-0.015
 forgive-0.014
Token,
Token ID11
Feature activation+3.43e-01

Pos logit contributions
ossible+0.001
 +0.001
+0.001
my+0.001
m+0.001
Neg logit contributions
 tunes-0.004
 films-0.004
rily-0.004
ening-0.004
 tune-0.004
Token "
Token ID366
Feature activation+1.43e+00

Pos logit contributions
 tunes+0.018
 films+0.018
rily+0.017
 tune+0.017
ening+0.016
Neg logit contributions
-0.010
ossible-0.010
 are-0.010
m-0.009
 -0.009
TokenS
Token ID50
Feature activation+1.22e+00

Pos logit contributions
 tunes+0.077
 films+0.075
phony+0.071
rily+0.071
rated+0.070
Neg logit contributions
-0.048
 -0.047
,-0.045
 she-0.043
m-0.042
Tokenoph
Token ID2522
Feature activation+5.41e+00

Pos logit contributions
 tunes+0.042
 films+0.038
phony+0.036
rated+0.036
 melodies+0.035
Neg logit contributions
 -0.064
,-0.064
-0.064
 she-0.060
 are-0.059
Tokenia
Token ID544
Feature activation+8.30e-01

Pos logit contributions
 tunes+0.203
en+0.195
rily+0.192
 films+0.191
rated+0.191
Neg logit contributions
 -0.256
-0.247
,-0.226
 she-0.226
 are-0.222
Token,
Token ID11
Feature activation+4.81e-01

Pos logit contributions
ening+0.053
en+0.052
rated+0.052
 tunes+0.051
 films+0.050
Neg logit contributions
 -0.014
ossible-0.013
m-0.010
 forgive-0.010
-0.009
Token don
Token ID836
Feature activation-6.39e-01

Pos logit contributions
 tunes+0.026
 films+0.025
 tune+0.023
 melodies+0.023
rated+0.023
Neg logit contributions
 -0.015
ossible-0.013
-0.013
,-0.013
m-0.013
Token't
Token ID470
Feature activation-3.63e-01

Pos logit contributions
 +0.011
,+0.011
+0.011
 she+0.009
 are+0.009
Neg logit contributions
 tunes-0.043
 films-0.042
phony-0.040
rated-0.040
rily-0.040
Token be
Token ID307
Feature activation+3.32e-01

Pos logit contributions
 +0.011
ossible+0.010
,+0.010
m+0.010
+0.010
Neg logit contributions
 tunes-0.020
 tune-0.018
rated-0.018
 melodies-0.018
 films-0.018
Token The
Token ID383
Feature activation-5.15e-01

Pos logit contributions
 tunes+0.018
 films+0.017
phony+0.016
rily+0.016
rated+0.016
Neg logit contributions
-0.011
,-0.011
 -0.010
 she-0.010
m-0.010
Token m
Token ID285
Feature activation+4.88e-01

Pos logit contributions
+0.017
 +0.016
,+0.016
 she+0.015
 are+0.014
Neg logit contributions
 tunes-0.028
 films-0.028
 melodies-0.025
 tune-0.025
rated-0.025
Tokenama
Token ID1689
Feature activation+1.34e+00

Pos logit contributions
 films+0.022
 tunes+0.022
rily+0.021
lights+0.021
phony+0.020
Neg logit contributions
-0.020
 -0.020
,-0.019
 she-0.018
 forgive-0.018
Token bear
Token ID6842
Feature activation+9.39e-01

Pos logit contributions
 films+0.076
 tunes+0.075
en+0.071
ening+0.069
rily+0.067
Neg logit contributions
 -0.036
-0.031
 forgive-0.031
ossible-0.030
 she-0.029
Token,
Token ID11
Feature activation+1.12e+00

Pos logit contributions
 tunes+0.050
 films+0.049
en+0.045
ening+0.043
 melodies+0.043
Neg logit contributions
 -0.026
-0.023
 forgive-0.023
ossible-0.022
m-0.022
Token pap
Token ID20461
Feature activation+5.38e+00

Pos logit contributions
 films+0.057
 tunes+0.053
 ger+0.047
 tune+0.047
rily+0.047
Neg logit contributions
-0.037
 -0.033
m-0.033
,-0.031
 forgive-0.030
Tokena
Token ID64
Feature activation+1.59e+00

Pos logit contributions
 tunes+0.219
rily+0.211
 films+0.202
rated+0.197
le+0.193
Neg logit contributions
 -0.243
-0.242
,-0.220
 she-0.218
 are-0.204
Token bear
Token ID6842
Feature activation+9.20e-01

Pos logit contributions
 tunes+0.110
 films+0.106
en+0.101
 tune+0.098
 melodies+0.098
Neg logit contributions
 -0.023
-0.019
ossible-0.016
 forgive-0.016
m-0.016
Token and
Token ID290
Feature activation+1.37e+00

Pos logit contributions
 tunes+0.058
 films+0.057
en+0.055
uders+0.053
ening+0.052
Neg logit contributions
-0.015
 -0.014
ossible-0.011
 forgive-0.011
m-0.010
Token baby
Token ID5156
Feature activation+1.03e+00

Pos logit contributions
 tunes+0.077
 films+0.076
 melodies+0.069
 tune+0.069
rily+0.068
Neg logit contributions
-0.041
 -0.040
,-0.038
m-0.037
 are-0.034
Token bear
Token ID6842
Feature activation+7.55e-01

Pos logit contributions
 tunes+0.041
 films+0.040
 melodies+0.035
en+0.035
ening+0.033
Neg logit contributions
-0.048
 -0.047
ossible-0.043
 she-0.042
 forgive-0.041
Token,
Token ID11
Feature activation-2.23e-01

Pos logit contributions
 +0.002
+0.002
ossible+0.002
m+0.001
 are+0.001
Neg logit contributions
 tunes-0.028
 films-0.027
ening-0.027
en-0.026
lights-0.026
Token they
Token ID484
Feature activation-1.44e-01

Pos logit contributions
+0.005
 +0.005
,+0.005
m+0.004
 are+0.004
Neg logit contributions
 tunes-0.014
 films-0.014
phony-0.013
 melodies-0.013
rily-0.013
Token proudly
Token ID23017
Feature activation+1.77e+00

Pos logit contributions
 +0.005
+0.004
,+0.004
 she+0.004
m+0.004
Neg logit contributions
 tunes-0.007
en-0.007
rily-0.007
 films-0.007
rated-0.007
Token showed
Token ID3751
Feature activation+6.54e-01

Pos logit contributions
 tunes+0.088
 films+0.082
phony+0.079
ening+0.078
en+0.078
Neg logit contributions
 -0.064
-0.063
,-0.061
 she-0.057
m-0.056
Token their
Token ID511
Feature activation+1.36e+00

Pos logit contributions
 tunes+0.039
 films+0.037
phony+0.036
rated+0.036
 melodies+0.035
Neg logit contributions
 -0.018
,-0.018
-0.018
 she-0.016
 are-0.015
Token 3
Token ID513
Feature activation+5.36e+00

Pos logit contributions
 tunes+0.066
 films+0.061
 melodies+0.057
 tune+0.057
ening+0.055
Neg logit contributions
 -0.054
-0.053
,-0.051
 she-0.050
m-0.049
Token year
Token ID614
Feature activation+9.40e-01

Pos logit contributions
 tunes+0.255
 films+0.241
phony+0.231
rated+0.229
rily+0.227
Neg logit contributions
,-0.209
 -0.209
-0.207
 she-0.192
 are-0.188
Token old
Token ID1468
Feature activation+4.54e-01

Pos logit contributions
 tunes+0.035
 films+0.033
ening+0.031
rily+0.031
 melodies+0.031
Neg logit contributions
 -0.046
-0.045
 are-0.043
,-0.043
 she-0.042
Token the
Token ID262
Feature activation+6.19e-01

Pos logit contributions
 tunes+0.022
 films+0.021
 melodies+0.019
 tune+0.018
ening+0.018
Neg logit contributions
-0.019
 -0.019
m-0.017
 are-0.016
ossible-0.016
Token thing
Token ID1517
Feature activation+9.31e-01

Pos logit contributions
 tunes+0.028
 films+0.025
 tune+0.024
 melodies+0.024
ening+0.023
Neg logit contributions
 -0.027
-0.026
,-0.026
 she-0.025
ossible-0.024
Token they
Token ID484
Feature activation+6.04e-01

Pos logit contributions
 tunes+0.051
lights+0.047
ening+0.047
 films+0.046
rily+0.046
Neg logit contributions
ossible-0.028
 -0.027
m-0.023
 are-0.022
-0.022
Token.
Token ID13
Feature activation-2.71e-01

Pos logit contributions
 tunes+0.027
 films+0.026
rily+0.025
 tune+0.025
phony+0.025
Neg logit contributions
 -0.007
-0.007
,-0.006
 are-0.006
m-0.006
Token She
Token ID1375
Feature activation+4.49e-02

Pos logit contributions
 forgive+0.007
 are+0.007
m+0.007
,+0.007
+0.007
Neg logit contributions
 tunes-0.014
 films-0.014
phony-0.014
rily-0.013
ishment-0.013
Token explained
Token ID4893
Feature activation+6.13e-01

Pos logit contributions
 tunes+0.002
 films+0.002
rated+0.002
rily+0.002
phony+0.002
Neg logit contributions
 -0.001
,-0.001
-0.001
m-0.001
 she-0.001
Token to
Token ID284
Feature activation+1.61e+00

Pos logit contributions
 tunes+0.034
 films+0.034
rily+0.032
 melodies+0.031
ening+0.031
Neg logit contributions
 -0.017
-0.017
m-0.017
ossible-0.015
 are-0.015
Token her
Token ID607
Feature activation+5.48e-01

Pos logit contributions
 tunes+0.101
 films+0.098
rily+0.093
 melodies+0.093
en+0.092
Neg logit contributions
-0.035
,-0.035
 -0.035
m-0.033
 she-0.030
Token 3
Token ID513
Feature activation+5.34e+00

Pos logit contributions
 films+0.026
 tunes+0.026
rily+0.024
 tune+0.023
 melodies+0.023
Neg logit contributions
 -0.019
m-0.019
-0.018
 are-0.018
 she-0.017
Token year
Token ID614
Feature activation+1.15e+00

Pos logit contributions
 tunes+0.236
 films+0.223
rated+0.208
 tune+0.206
 melodies+0.206
Neg logit contributions
-0.225
 -0.223
,-0.214
 she-0.211
 are-0.210
Token old
Token ID1468
Feature activation+3.79e-01

Pos logit contributions
 tunes+0.042
 films+0.040
ening+0.037
 melodies+0.036
rily+0.036
Neg logit contributions
 -0.057
 are-0.055
-0.055
m-0.053
,-0.052
Token daughter
Token ID4957
Feature activation+1.98e+00

Pos logit contributions
 tunes+0.018
 films+0.017
 melodies+0.016
 tune+0.016
en+0.015
Neg logit contributions
 -0.015
-0.014
m-0.013
 are-0.013
 forgive-0.012
Token what
Token ID644
Feature activation+5.12e-02

Pos logit contributions
 films+0.124
 tunes+0.121
en+0.120
ening+0.118
rily+0.117
Neg logit contributions
ossible-0.028
m-0.027
ll-0.024
 -0.024
 are-0.022
Token they
Token ID484
Feature activation-5.23e-01

Pos logit contributions
 tunes+0.003
 films+0.003
 melodies+0.003
 topic+0.003
 songs+0.003
Neg logit contributions
m-0.001
ossible-0.001
 -0.001
ll-0.001
 forgive-0.001
Token was
Token ID373
Feature activation+3.99e-01

Pos logit contributions
 tunes+0.022
 films+0.021
phony+0.020
rated+0.020
 melodies+0.020
Neg logit contributions
 -0.010
,-0.010
-0.010
 she-0.009
 are-0.008
Token sad
Token ID6507
Feature activation+8.35e-02

Pos logit contributions
 tunes+0.025
 films+0.025
phony+0.024
 tune+0.023
 melodies+0.023
Neg logit contributions
 -0.009
-0.009
,-0.009
 she-0.007
 are-0.007
Token,
Token ID11
Feature activation-7.08e-01

Pos logit contributions
 tunes+0.005
 films+0.004
phony+0.004
rated+0.004
rily+0.004
Neg logit contributions
,-0.003
 -0.003
-0.003
 she-0.002
 are-0.002
Token because
Token ID780
Feature activation-2.24e-01

Pos logit contributions
+0.021
 +0.021
,+0.021
 are+0.019
m+0.018
Neg logit contributions
 tunes-0.040
 films-0.038
 tune-0.036
phony-0.036
 melodies-0.036
Token her
Token ID607
Feature activation+8.50e-01

Pos logit contributions
 +0.003
+0.003
ossible+0.003
,+0.003
m+0.003
Neg logit contributions
 tunes-0.015
 films-0.015
lights-0.014
 melodies-0.014
ening-0.014
Token pap
Token ID20461
Feature activation+5.34e+00

Pos logit contributions
 tunes+0.049
 films+0.047
 tune+0.045
 melodies+0.044
 ger+0.043
Neg logit contributions
-0.026
 -0.024
,-0.022
m-0.021
 she-0.020
Tokena
Token ID64
Feature activation+1.60e+00

Pos logit contributions
 tunes+0.239
 tune+0.210
 melodies+0.208
rily+0.205
rated+0.202
Neg logit contributions
 -0.228
-0.220
,-0.219
 she-0.200
 are-0.197
Token never
Token ID1239
Feature activation+5.24e-02

Pos logit contributions
 tunes+0.093
 films+0.085
en+0.084
ening+0.083
 tune+0.082
Neg logit contributions
 -0.041
ossible-0.036
-0.034
m-0.033
 she-0.032
Token had
Token ID550
Feature activation-2.14e-01

Pos logit contributions
 tunes+0.003
 films+0.003
rily+0.003
phony+0.003
 melodies+0.003
Neg logit contributions
,-0.001
-0.001
 -0.001
 she-0.001
 are-0.001
Token time
Token ID640
Feature activation+1.72e+00

Pos logit contributions
,+0.005
 +0.005
+0.005
 she+0.005
 are+0.004
Neg logit contributions
 tunes-0.014
 films-0.013
rated-0.013
phony-0.013
rily-0.012
Token to
Token ID284
Feature activation+5.66e-02

Pos logit contributions
 tunes+0.094
 films+0.089
 tune+0.087
vet+0.085
ening+0.084
Neg logit contributions
-0.047
 -0.046
 are-0.041
,-0.041
ossible-0.040
Token instinct
Token ID13311
Feature activation+1.74e+00

Pos logit contributions
 tunes+0.006
 films+0.006
 melodies+0.005
 tune+0.005
phony+0.005
Neg logit contributions
 -0.004
-0.004
,-0.004
 she-0.004
m-0.004
Token can
Token ID460
Feature activation-4.82e-02

Pos logit contributions
 tunes+0.109
 tune+0.099
ening+0.099
 films+0.099
 melodies+0.098
Neg logit contributions
 -0.040
-0.037
ossible-0.036
m-0.033
,-0.031
Token often
Token ID1690
Feature activation+1.18e-01

Pos logit contributions
 +0.001
ossible+0.001
m+0.001
,+0.001
 she+0.001
Neg logit contributions
 tunes-0.003
en-0.002
 tune-0.002
 melodies-0.002
 films-0.002
Token save
Token ID3613
Feature activation+1.35e+00

Pos logit contributions
 tunes+0.007
 films+0.006
 tune+0.006
 melodies+0.006
rated+0.006
Neg logit contributions
 -0.003
-0.003
m-0.003
ossible-0.003
,-0.003
Token us
Token ID514
Feature activation+5.63e-02

Pos logit contributions
 tunes+0.076
 films+0.072
phony+0.070
rated+0.069
 melodies+0.069
Neg logit contributions
,-0.041
 -0.041
-0.041
 she-0.037
 are-0.036
Token unnecessary
Token ID13114
Feature activation+5.33e+00

Pos logit contributions
 tunes+0.003
 films+0.003
rated+0.003
 tune+0.003
 melodies+0.003
Neg logit contributions
 -0.002
m-0.001
ossible-0.001
 she-0.001
-0.001
Token danger
Token ID3514
Feature activation+1.77e+00

Pos logit contributions
 tunes+0.237
 films+0.221
 melodies+0.208
phony+0.206
rated+0.206
Neg logit contributions
 -0.225
-0.219
,-0.215
m-0.207
 she-0.207
Token.
Token ID13
Feature activation-4.00e-01

Pos logit contributions
 tunes+0.118
 films+0.112
rated+0.109
ening+0.108
 melodies+0.107
Neg logit contributions
 -0.035
-0.032
,-0.031
ossible-0.029
 she-0.028
Token<|endoftext|>
Token ID50256
Feature activation+1.23e+00

Pos logit contributions
,+0.007
 are+0.006
+0.006
m+0.006
 she+0.005
Neg logit contributions
 tunes-0.026
 films-0.026
rily-0.025
phony-0.025
en-0.024
TokenOnce
Token ID7454
Feature activation-3.94e-01

Pos logit contributions
 tunes+0.071
 films+0.070
phony+0.068
en+0.066
rily+0.066
Neg logit contributions
-0.027
,-0.026
 she-0.026
 forgive-0.024
ossible-0.024
Token upon
Token ID2402
Feature activation-1.68e-03

Pos logit contributions
+0.012
 help+0.009
 forgive+0.009
 pr+0.009
 are+0.009
Neg logit contributions
lights-0.019
rily-0.019
ening-0.018
rated-0.018
 films-0.018
Token Mom
Token ID11254
Feature activation-3.18e+00

Pos logit contributions
 are+0.024
m+0.023
ll+0.021
+0.021
 forgive+0.020
Neg logit contributions
 tunes-0.037
 films-0.037
 tune-0.035
rily-0.035
 melodies-0.033
Token asks
Token ID7893
Feature activation-1.40e-01

Pos logit contributions
 are+0.105
+0.099
 +0.099
ossible+0.097
 forgive+0.096
Neg logit contributions
en-0.149
ening-0.149
 tunes-0.147
 films-0.144
rated-0.143
Token.
Token ID13
Feature activation-4.97e-01

Pos logit contributions
ossible+0.003
+0.003
 are+0.003
 +0.003
 am+0.003
Neg logit contributions
rily-0.008
 tunes-0.008
 films-0.008
ening-0.008
en-0.007
Token "
Token ID366
Feature activation+4.26e-01

Pos logit contributions
 are+0.012
m+0.011
ll+0.010
ossible+0.010
+0.010
Neg logit contributions
 films-0.028
 tunes-0.027
rily-0.026
phony-0.026
 tune-0.026
TokenAre
Token ID8491
Feature activation+8.61e-01

Pos logit contributions
 tunes+0.018
 films+0.016
 tune+0.016
phony+0.015
 melodies+0.015
Neg logit contributions
,-0.020
 -0.018
 are-0.017
-0.017
ossible-0.016
Token you
Token ID345
Feature activation-4.52e+00

Pos logit contributions
 films+0.035
 tunes+0.034
 ger+0.032
uders+0.032
lights+0.030
Neg logit contributions
 -0.030
ossible-0.030
,-0.028
m-0.028
 forgive-0.027
Token okay
Token ID8788
Feature activation-7.73e-01

Pos logit contributions
 +0.130
+0.122
,+0.119
 she+0.113
ossible+0.112
Neg logit contributions
 tunes-0.250
 films-0.244
rated-0.240
rily-0.237
phony-0.236
Token?"
Token ID1701
Feature activation-1.62e-01

Pos logit contributions
 +0.012
+0.011
ossible+0.010
 are+0.009
 she+0.008
Neg logit contributions
 films-0.053
 tunes-0.053
en-0.050
rily-0.050
ening-0.050
Token
Token ID198
Feature activation-3.09e-01

Pos logit contributions
 are+0.004
m+0.004
+0.003
ll+0.003
 am+0.003
Neg logit contributions
 films-0.009
 tunes-0.009
rily-0.008
 tune-0.008
phony-0.008
Token
Token ID198
Feature activation+1.37e-01

Pos logit contributions
 are+0.006
+0.005
m+0.005
,+0.005
're+0.004
Neg logit contributions
 tunes-0.019
 films-0.019
phony-0.018
 tune-0.017
lights-0.017
TokenS
Token ID50
Feature activation-1.06e+00

Pos logit contributions
 tunes+0.006
 films+0.005
 tune+0.005
phony+0.005
 melodies+0.005
Neg logit contributions
 are-0.005
're-0.005
,-0.005
m-0.005
 -0.005
Token!"
Token ID2474
Feature activation-1.67e-01

Pos logit contributions
,+0.004
+0.004
 +0.004
 are+0.003
 she+0.003
Neg logit contributions
 tunes-0.009
 films-0.009
phony-0.009
rily-0.009
 melodies-0.009
Token
Token ID198
Feature activation-6.49e-02

Pos logit contributions
+0.003
m+0.002
 are+0.002
,+0.002
ossible+0.002
Neg logit contributions
 tunes-0.011
 films-0.011
phony-0.010
rily-0.010
 tune-0.010
Token
Token ID198
Feature activation+2.27e-01

Pos logit contributions
+0.001
,+0.001
 are+0.001
 +0.001
m+0.001
Neg logit contributions
 tunes-0.005
 films-0.004
phony-0.004
rily-0.004
 melodies-0.004
Token"
Token ID1
Feature activation-9.60e-01

Pos logit contributions
 tunes+0.012
 films+0.011
phony+0.010
 tune+0.010
 melodies+0.010
Neg logit contributions
 -0.008
,-0.008
 are-0.008
m-0.007
-0.007
TokenAre
Token ID8491
Feature activation+5.63e-01

Pos logit contributions
 +0.035
+0.035
,+0.034
 she+0.032
 are+0.031
Neg logit contributions
 tunes-0.049
 films-0.046
phony-0.044
rated-0.044
rily-0.044
Token you
Token ID345
Feature activation-4.43e+00

Pos logit contributions
 tunes+0.026
 films+0.025
 ger+0.024
izing+0.023
 melodies+0.023
Neg logit contributions
 -0.018
-0.017
,-0.017
m-0.017
ossible-0.016
Token okay
Token ID8788
Feature activation-8.24e-01

Pos logit contributions
 +0.090
,+0.089
+0.089
 she+0.077
 are+0.072
Neg logit contributions
 tunes-0.294
 films-0.283
phony-0.273
rated-0.273
 melodies-0.271
Token?"
Token ID1701
Feature activation+5.93e-01

Pos logit contributions
 +0.018
+0.017
,+0.017
 she+0.015
 are+0.014
Neg logit contributions
 tunes-0.054
 films-0.052
phony-0.050
rated-0.050
 melodies-0.050
Token Ben
Token ID3932
Feature activation-8.07e-01

Pos logit contributions
 tunes+0.030
 films+0.030
 tune+0.027
ishment+0.027
phony+0.027
Neg logit contributions
m-0.019
-0.018
 are-0.017
ll-0.016
ossible-0.016
Token asked
Token ID1965
Feature activation-8.12e-02

Pos logit contributions
,+0.023
 +0.022
+0.022
 she+0.020
 are+0.018
Neg logit contributions
 tunes-0.048
 films-0.046
phony-0.045
rated-0.045
 melodies-0.044
Token.
Token ID13
Feature activation-1.01e-01

Pos logit contributions
+0.002
ossible+0.002
 are+0.002
 +0.002
m+0.001
Neg logit contributions
 tunes-0.005
 films-0.005
rily-0.005
en-0.004
 tune-0.004
Token was
Token ID373
Feature activation-7.49e-01

Pos logit contributions
 tunes+0.044
en+0.040
 tune+0.040
 melodies+0.040
 films+0.039
Neg logit contributions
 -0.020
m-0.019
ossible-0.018
 are-0.018
-0.017
Token so
Token ID523
Feature activation-7.58e-01

Pos logit contributions
 are+0.022
ossible+0.022
 +0.021
+0.021
,+0.021
Neg logit contributions
 tunes-0.038
 films-0.036
 tune-0.036
rily-0.036
ening-0.035
Token relieved
Token ID22598
Feature activation-7.49e-01

Pos logit contributions
ossible+0.022
 +0.022
+0.021
,+0.021
 are+0.020
Neg logit contributions
 tunes-0.036
rily-0.034
 tune-0.034
rated-0.034
 films-0.034
Tokenâ
Token ID22940
Feature activation-6.89e-01

Pos logit contributions
 +0.019
+0.018
ossible+0.018
 are+0.017
m+0.017
Neg logit contributions
 tunes-0.043
en-0.040
 tune-0.039
ening-0.039
rily-0.039
Token
Token ID26391
Feature activation+1.80e+00

Pos logit contributions
 +0.032
,+0.032
+0.032
 she+0.030
 are+0.030
Neg logit contributions
 tunes-0.027
 films-0.025
rated-0.025
phony-0.024
 melodies-0.024
Token
Token ID447
Feature activation-4.35e+00

Pos logit contributions
 tunes+0.096
phony+0.093
 films+0.089
uders+0.085
 tune+0.085
Neg logit contributions
 forgive-0.047
-0.046
m-0.045
,-0.044
 are-0.043
Token
Token ID251
Feature activation+5.24e-01

Pos logit contributions
+0.181
,+0.179
 she+0.173
m+0.173
 forgive+0.172
Neg logit contributions
 tunes-0.167
phony-0.158
rated-0.149
rily-0.149
 films-0.148
Tokenhe
Token ID258
Feature activation-2.52e+00

Pos logit contributions
 tunes+0.033
 films+0.031
 tune+0.030
 melodies+0.030
rily+0.030
Neg logit contributions
-0.010
,-0.009
 are-0.009
m-0.009
 -0.009
Token knew
Token ID2993
Feature activation-1.69e+00

Pos logit contributions
+0.051
 +0.051
,+0.047
 are+0.045
ossible+0.044
Neg logit contributions
 tunes-0.164
 films-0.155
 tune-0.151
rily-0.151
 melodies-0.150
Token that
Token ID326
Feature activation-1.68e+00

Pos logit contributions
ossible+0.044
 +0.041
 are+0.041
+0.040
m+0.038
Neg logit contributions
 tunes-0.097
rily-0.089
 films-0.088
 tune-0.088
ening-0.088
Token if
Token ID611
Feature activation-7.58e-01

Pos logit contributions
ossible+0.033
 +0.030
m+0.029
 are+0.029
+0.028
Neg logit contributions
 tunes-0.105
 films-0.098
 tune-0.096
 melodies-0.095
ening-0.094
Token!"
Token ID2474
Feature activation+3.32e-01

Pos logit contributions
,+0.004
+0.003
 +0.003
 are+0.003
 she+0.003
Neg logit contributions
 tunes-0.007
 films-0.007
phony-0.007
rily-0.007
 melodies-0.007
Token
Token ID198
Feature activation+2.83e-01

Pos logit contributions
 tunes+0.022
 films+0.022
lights+0.021
phony+0.021
rily+0.020
Neg logit contributions
-0.005
 are-0.005
m-0.005
,-0.004
ossible-0.004
Token
Token ID198
Feature activation+5.50e-01

Pos logit contributions
 tunes+0.020
 films+0.019
phony+0.019
lights+0.018
rily+0.018
Neg logit contributions
-0.004
,-0.003
 -0.003
 are-0.003
 she-0.003
Token"
Token ID1
Feature activation-1.03e+00

Pos logit contributions
 tunes+0.028
phony+0.026
 films+0.026
 tune+0.025
 stations+0.025
Neg logit contributions
 are-0.017
,-0.016
 -0.015
m-0.015
 she-0.015
TokenAre
Token ID8491
Feature activation+6.37e-01

Pos logit contributions
 +0.037
,+0.037
+0.037
 she+0.034
 are+0.033
Neg logit contributions
 tunes-0.053
 films-0.050
phony-0.048
rated-0.048
 melodies-0.047
Token you
Token ID345
Feature activation-4.34e+00

Pos logit contributions
 tunes+0.029
 films+0.028
izing+0.027
 ger+0.026
lights+0.026
Neg logit contributions
 -0.020
ossible-0.019
,-0.019
-0.019
m-0.019
Token OK
Token ID7477
Feature activation-1.13e+00

Pos logit contributions
 +0.098
+0.095
,+0.094
 she+0.084
 are+0.077
Neg logit contributions
 tunes-0.275
 films-0.265
rated-0.258
phony-0.256
 tune-0.253
Token?"
Token ID1701
Feature activation+9.45e-01

Pos logit contributions
,+0.035
 +0.034
+0.034
 she+0.031
 are+0.030
Neg logit contributions
 tunes-0.064
 films-0.061
phony-0.059
rated-0.058
 melodies-0.058
Token Mia
Token ID32189
Feature activation-1.94e+00

Pos logit contributions
 films+0.046
 tunes+0.045
phony+0.041
 tune+0.041
lights+0.041
Neg logit contributions
m-0.031
-0.031
 are-0.030
,-0.028
ll-0.028
Token says
Token ID1139
Feature activation-1.90e+00

Pos logit contributions
 +0.080
,+0.078
+0.078
 she+0.075
 are+0.075
Neg logit contributions
 tunes-0.080
 films-0.079
en-0.073
ening-0.073
 melodies-0.071
Token.
Token ID13
Feature activation+4.78e-01

Pos logit contributions
+0.042
ossible+0.041
 +0.039
 are+0.038
m+0.033
Neg logit contributions
 tunes-0.114
rily-0.111
 films-0.109
ening-0.107
 tune-0.107
Token was
Token ID373
Feature activation-8.23e-01

Pos logit contributions
 tunes+0.073
en+0.066
 tune+0.066
 melodies+0.065
ening+0.064
Neg logit contributions
ossible-0.030
 -0.029
 are-0.029
 forgive-0.029
m-0.028
Token so
Token ID523
Feature activation-1.05e+00

Pos logit contributions
 +0.025
+0.024
 are+0.024
,+0.024
ossible+0.024
Neg logit contributions
 tunes-0.041
 films-0.040
 tune-0.039
rily-0.038
phony-0.038
Token happy
Token ID3772
Feature activation-2.14e+00

Pos logit contributions
 +0.029
+0.029
,+0.029
 are+0.027
ossible+0.027
Neg logit contributions
 tunes-0.052
rily-0.050
 tune-0.050
rated-0.049
 films-0.049
Tokenâ
Token ID22940
Feature activation-6.51e-01

Pos logit contributions
 +0.062
+0.060
ossible+0.058
 are+0.056
 forgive+0.051
Neg logit contributions
 tunes-0.122
 tune-0.111
 films-0.109
en-0.108
 melodies-0.108
Token
Token ID26391
Feature activation+1.48e+00

Pos logit contributions
+0.030
,+0.030
 +0.030
 she+0.028
 are+0.027
Neg logit contributions
 tunes-0.027
 films-0.025
phony-0.024
rily-0.023
 melodies-0.023
Token
Token ID447
Feature activation-4.33e+00

Pos logit contributions
 tunes+0.077
phony+0.074
 films+0.071
 melodies+0.068
lights+0.068
Neg logit contributions
-0.043
,-0.043
 -0.042
 forgive-0.041
m-0.041
Token
Token ID251
Feature activation-1.84e-01

Pos logit contributions
+0.131
,+0.123
 +0.120
 she+0.116
 are+0.112
Neg logit contributions
 tunes-0.236
phony-0.221
 films-0.218
rated-0.215
 melodies-0.213
Tokenshe
Token ID7091
Feature activation-1.81e+00

Pos logit contributions
+0.003
m+0.003
,+0.003
 are+0.003
 +0.003
Neg logit contributions
 tunes-0.012
 films-0.011
 melodies-0.011
 tune-0.011
phony-0.010
Token had
Token ID550
Feature activation-1.58e+00

Pos logit contributions
 +0.052
m+0.050
,+0.049
 she+0.049
+0.048
Neg logit contributions
 tunes-0.096
phony-0.088
en-0.087
 tune-0.087
rily-0.086
Token so
Token ID523
Feature activation+3.83e-01

Pos logit contributions
,+0.045
 +0.044
+0.043
m+0.042
 are+0.041
Neg logit contributions
 tunes-0.087
 films-0.081
phony-0.080
 melodies-0.079
 tune-0.079
Token much
Token ID881
Feature activation-8.03e-01

Pos logit contributions
 tunes+0.019
 films+0.018
 melodies+0.017
rated+0.016
phony+0.016
Neg logit contributions
 -0.012
,-0.012
-0.011
m-0.011
my-0.010
Token And
Token ID843
Feature activation-1.37e+00

Pos logit contributions
m+0.016
+0.015
,+0.015
 are+0.015
 she+0.014
Neg logit contributions
 tunes-0.025
 films-0.024
rily-0.022
phony-0.022
 tune-0.022
Token even
Token ID772
Feature activation-4.71e-02

Pos logit contributions
 +0.035
m+0.034
+0.034
ossible+0.033
 are+0.032
Neg logit contributions
 tunes-0.080
 films-0.074
 tune-0.073
 melodies-0.072
rily-0.072
Token though
Token ID996
Feature activation-3.82e-01

Pos logit contributions
,+0.002
 she+0.002
+0.002
 +0.002
 he+0.002
Neg logit contributions
 tunes-0.002
 films-0.002
phony-0.002
lights-0.002
uders-0.002
Token he
Token ID339
Feature activation-1.16e+00

Pos logit contributions
+0.006
 +0.006
ossible+0.006
m+0.006
 are+0.005
Neg logit contributions
 tunes-0.026
 films-0.025
ening-0.024
rated-0.024
 melodies-0.023
Token was
Token ID373
Feature activation-2.43e+00

Pos logit contributions
 +0.027
,+0.024
+0.024
 she+0.023
ossible+0.022
Neg logit contributions
 tunes-0.072
 films-0.067
en-0.067
rated-0.067
ening-0.067
Token still
Token ID991
Feature activation-4.32e+00

Pos logit contributions
 +0.077
+0.074
ossible+0.072
 are+0.071
,+0.070
Neg logit contributions
 tunes-0.128
 films-0.123
ening-0.119
 tune-0.118
rily-0.116
Token lonely
Token ID21757
Feature activation-2.08e+00

Pos logit contributions
 +0.140
+0.132
 are+0.126
ossible+0.125
m+0.123
Neg logit contributions
 tunes-0.227
 films-0.221
ening-0.212
 tune-0.211
 melodies-0.205
Token sometimes
Token ID3360
Feature activation-2.51e+00

Pos logit contributions
 +0.011
+0.011
,+0.006
ossible+0.004
 she+0.003
Neg logit contributions
 tunes-0.169
 films-0.163
phony-0.158
ening-0.157
 tune-0.157
Token,
Token ID11
Feature activation-1.20e+00

Pos logit contributions
 +0.011
+0.009
m+0.004
ossible+0.004
,+0.003
Neg logit contributions
 tunes-0.205
 films-0.198
ening-0.192
 tune-0.191
 melodies-0.190
Token he
Token ID339
Feature activation+1.24e-01

Pos logit contributions
+0.018
 +0.018
,+0.017
ossible+0.016
m+0.016
Neg logit contributions
 tunes-0.083
 films-0.080
 tune-0.077
 melodies-0.076
rated-0.076
Token knew
Token ID2993
Feature activation-3.51e-01

Pos logit contributions
 tunes+0.007
en+0.006
 melodies+0.006
rated+0.006
ening+0.006
Neg logit contributions
 -0.004
ossible-0.003
,-0.003
-0.003
m-0.003
Token mom
Token ID1995
Feature activation-2.01e+00

Pos logit contributions
 are+0.019
+0.019
m+0.018
 +0.018
ossible+0.017
Neg logit contributions
 tunes-0.039
 films-0.037
rily-0.036
 tune-0.036
 melodies-0.035
Token said
Token ID531
Feature activation-4.20e-01

Pos logit contributions
 +0.056
 are+0.055
ossible+0.055
'm+0.054
+0.053
Neg logit contributions
en-0.101
 tunes-0.097
le-0.094
ening-0.093
les-0.093
Token,
Token ID11
Feature activation-8.97e-02

Pos logit contributions
ossible+0.006
 are+0.006
+0.005
m+0.005
my+0.005
Neg logit contributions
rily-0.028
 tunes-0.027
 tune-0.026
ening-0.026
 films-0.026
Token "
Token ID366
Feature activation+1.04e+00

Pos logit contributions
 are+0.003
ossible+0.003
+0.003
m+0.002
 am+0.002
Neg logit contributions
 tunes-0.005
 tune-0.005
rily-0.004
 films-0.004
phony-0.004
TokenAre
Token ID8491
Feature activation+7.89e-01

Pos logit contributions
 tunes+0.052
 films+0.047
 tune+0.047
 melodies+0.047
phony+0.046
Neg logit contributions
,-0.040
 -0.038
-0.038
 are-0.036
 she-0.034
Token you
Token ID345
Feature activation-4.31e+00

Pos logit contributions
 tunes+0.035
 films+0.035
izing+0.032
 melodies+0.032
ishment+0.031
Neg logit contributions
 -0.026
ossible-0.026
,-0.025
m-0.024
-0.024
Token ready
Token ID3492
Feature activation-5.50e-01

Pos logit contributions
 +0.129
+0.118
,+0.117
ossible+0.114
 she+0.111
Neg logit contributions
 tunes-0.221
 films-0.218
rated-0.217
rily-0.212
ening-0.208
Token to
Token ID284
Feature activation-1.11e+00

Pos logit contributions
 +0.010
+0.010
,+0.008
m+0.008
 she+0.008
Neg logit contributions
 tunes-0.036
 films-0.035
rated-0.034
phony-0.034
rily-0.034
Token take
Token ID1011
Feature activation+1.78e-01

Pos logit contributions
m+0.036
 +0.036
ossible+0.032
 she+0.032
+0.032
Neg logit contributions
 tunes-0.055
en-0.053
 films-0.052
 tune-0.051
 melodies-0.050
Token care
Token ID1337
Feature activation+6.06e-01

Pos logit contributions
 tunes+0.011
rated+0.011
 films+0.011
phony+0.011
rily+0.010
Neg logit contributions
,-0.005
 -0.005
-0.004
 she-0.004
 are-0.004
Token of
Token ID286
Feature activation-5.15e-01

Pos logit contributions
 tunes+0.036
 films+0.034
rily+0.033
en+0.033
lights+0.033
Neg logit contributions
 -0.017
-0.015
,-0.014
 she-0.013
ossible-0.013
Token
Token ID95
Feature activation+3.66e-01

Pos logit contributions
 +0.004
,+0.004
+0.004
 she+0.003
 are+0.003
Neg logit contributions
 tunes-0.004
 films-0.004
phony-0.004
rated-0.004
 melodies-0.004
Token
Token ID26391
Feature activation+2.08e+00

Pos logit contributions
 tunes+0.015
 films+0.014
phony+0.014
rated+0.013
 melodies+0.013
Neg logit contributions
,-0.016
 -0.016
-0.016
 she-0.016
 are-0.015
Token
Token ID129
Feature activation+1.65e+00

Pos logit contributions
phony+0.101
 tunes+0.094
 films+0.089
uders+0.089
plets+0.085
Neg logit contributions
 forgive-0.066
m-0.062
-0.061
 are-0.059
 she-0.059
Token
Token ID241
Feature activation-3.45e-01

Pos logit contributions
 tunes+0.063
 films+0.059
lights+0.055
rily+0.055
rated+0.055
Neg logit contributions
-0.082
 -0.081
,-0.080
 she-0.073
 are-0.071
TokenAre
Token ID8491
Feature activation-1.98e-01

Pos logit contributions
,+0.012
 +0.011
+0.011
 are+0.010
 she+0.010
Neg logit contributions
 tunes-0.018
 films-0.017
phony-0.016
 tune-0.016
rated-0.016
Token you
Token ID345
Feature activation-4.29e+00

Pos logit contributions
 +0.006
,+0.006
ossible+0.006
 forgive+0.005
+0.005
Neg logit contributions
 tunes-0.010
 films-0.010
 melodies-0.009
vet-0.009
en-0.009
Token ready
Token ID3492
Feature activation-6.04e-02

Pos logit contributions
 +0.119
,+0.112
+0.111
 she+0.111
ossible+0.109
Neg logit contributions
 tunes-0.230
 films-0.224
rated-0.219
ening-0.214
phony-0.214
Token?
Token ID30
Feature activation-4.06e-01

Pos logit contributions
 +0.002
+0.002
,+0.001
 she+0.001
 are+0.001
Neg logit contributions
 tunes-0.003
 films-0.003
phony-0.003
rated-0.003
rily-0.003
Tokenâ
Token ID22940
Feature activation-1.92e-01

Pos logit contributions
,+0.013
 are+0.012
m+0.012
 +0.011
ossible+0.011
Neg logit contributions
 tunes-0.020
 films-0.020
phony-0.019
rily-0.019
vet-0.018
Token
Token ID26391
Feature activation+1.74e+00

Pos logit contributions
,+0.009
 +0.009
+0.009
 she+0.009
 are+0.009
Neg logit contributions
 tunes-0.007
rated-0.007
 films-0.006
phony-0.006
 melodies-0.006
Token M
Token ID337
Feature activation+1.98e-01

Pos logit contributions
phony+0.098
 tunes+0.094
 films+0.092
uders+0.088
rated+0.087
Neg logit contributions
 forgive-0.040
m-0.038
 are-0.038
 help-0.036
 she-0.036
Token
Token ID95
Feature activation+6.05e-01

Pos logit contributions
 +0.001
,+0.001
+0.001
 she+0.001
 are+0.001
Neg logit contributions
 tunes-0.001
 films-0.000
phony-0.000
 melodies-0.000
rily-0.000
Token
Token ID26391
Feature activation+2.09e+00

Pos logit contributions
 tunes+0.025
 films+0.024
rated+0.022
rily+0.022
 melodies+0.022
Neg logit contributions
 -0.028
-0.028
,-0.027
 she-0.025
 are-0.025
Token
Token ID129
Feature activation+1.97e+00

Pos logit contributions
phony+0.126
 tunes+0.126
 films+0.119
uders+0.117
 tune+0.116
Neg logit contributions
-0.041
m-0.039
,-0.039
 are-0.039
 she-0.038
Token
Token ID241
Feature activation-4.72e-01

Pos logit contributions
 tunes+0.076
 films+0.071
rated+0.067
phony+0.066
rily+0.065
Neg logit contributions
 -0.096
,-0.095
-0.095
 she-0.088
 are-0.086
TokenAre
Token ID8491
Feature activation+1.44e-01

Pos logit contributions
,+0.010
 +0.009
+0.009
 are+0.009
ossible+0.008
Neg logit contributions
 tunes-0.030
 films-0.027
phony-0.027
 tune-0.027
 melodies-0.027
Token you
Token ID345
Feature activation-4.27e+00

Pos logit contributions
 tunes+0.007
 films+0.007
vet+0.006
 melodies+0.006
ening+0.006
Neg logit contributions
ossible-0.004
 -0.004
,-0.004
m-0.004
 forgive-0.004
Token ok
Token ID12876
Feature activation-2.98e-01

Pos logit contributions
 +0.115
,+0.108
+0.107
ossible+0.107
 she+0.103
Neg logit contributions
 tunes-0.235
 films-0.228
rated-0.223
ening-0.220
phony-0.219
Token?
Token ID30
Feature activation-3.42e-01

Pos logit contributions
 +0.007
+0.007
,+0.006
 are+0.006
 she+0.006
Neg logit contributions
 tunes-0.018
 films-0.017
phony-0.017
rated-0.017
rily-0.017
Tokenâ
Token ID22940
Feature activation-2.30e-02

Pos logit contributions
,+0.010
 are+0.009
m+0.009
ossible+0.009
 she+0.008
Neg logit contributions
 tunes-0.018
 films-0.018
phony-0.017
rily-0.017
 tune-0.016
Token
Token ID26391
Feature activation+1.79e+00

Pos logit contributions
,+0.001
 +0.001
+0.001
 she+0.001
 are+0.001
Neg logit contributions
 tunes-0.001
rated-0.001
phony-0.001
 films-0.001
ening-0.001
Token He
Token ID679
Feature activation+2.06e-01

Pos logit contributions
 tunes+0.109
phony+0.108
 films+0.105
rated+0.100
uders+0.100
Neg logit contributions
-0.035
,-0.034
m-0.034
 are-0.033
 she-0.032
Token and
Token ID290
Feature activation-1.59e+00

Pos logit contributions
+0.023
 +0.022
 are+0.021
ossible+0.021
,+0.019
Neg logit contributions
 tunes-0.040
 films-0.038
ening-0.037
 tune-0.037
en-0.036
Token said
Token ID531
Feature activation-6.44e-01

Pos logit contributions
+0.049
ossible+0.047
m+0.046
 are+0.045
 +0.045
Neg logit contributions
 tunes-0.081
en-0.075
phony-0.074
 tune-0.074
rated-0.073
Token,
Token ID11
Feature activation-1.29e-01

Pos logit contributions
ossible+0.014
my+0.013
m+0.013
ll+0.011
+0.011
Neg logit contributions
 tunes-0.037
rily-0.036
 tune-0.035
ening-0.035
ened-0.033
Token "
Token ID366
Feature activation+1.46e+00

Pos logit contributions
ossible+0.005
 are+0.005
+0.004
m+0.004
 am+0.004
Neg logit contributions
 tunes-0.006
 tune-0.006
 films-0.005
rily-0.005
ening-0.005
TokenAre
Token ID8491
Feature activation+4.49e-01

Pos logit contributions
 tunes+0.070
 tune+0.064
rated+0.063
 melodies+0.061
phony+0.060
Neg logit contributions
,-0.061
 -0.056
 are-0.055
-0.054
ossible-0.051
Token you
Token ID345
Feature activation-4.26e+00

Pos logit contributions
 tunes+0.018
 films+0.018
izing+0.016
uders+0.016
 melodies+0.016
Neg logit contributions
 -0.015
ossible-0.015
m-0.015
,-0.014
-0.014
Token okay
Token ID8788
Feature activation-8.30e-01

Pos logit contributions
 +0.104
+0.099
,+0.092
 she+0.088
ossible+0.084
Neg logit contributions
 tunes-0.255
 films-0.247
rated-0.244
phony-0.240
 tune-0.239
Token?"
Token ID1701
Feature activation-2.25e-01

Pos logit contributions
 +0.014
+0.014
,+0.013
 she+0.012
 are+0.011
Neg logit contributions
 tunes-0.057
 films-0.056
phony-0.054
rated-0.054
rily-0.053
Token Lily
Token ID20037
Feature activation-1.87e+00

Pos logit contributions
 are+0.006
+0.006
m+0.006
 am+0.005
,+0.005
Neg logit contributions
 tunes-0.013
 tune-0.012
 films-0.012
 melodies-0.011
phony-0.011
Token said
Token ID531
Feature activation-1.09e+00

Pos logit contributions
 are+0.044
+0.041
ossible+0.040
 +0.039
m+0.038
Neg logit contributions
 tunes-0.110
 tune-0.102
en-0.101
 melodies-0.100
rated-0.099
Token,
Token ID11
Feature activation-5.91e-01

Pos logit contributions
ossible+0.017
+0.016
 are+0.012
m+0.011
 am+0.010
Neg logit contributions
 tunes-0.074
rily-0.073
 tune-0.072
ening-0.070
 films-0.068
Token Sue
Token ID26089
Feature activation-1.24e+00

Pos logit contributions
 films+0.025
 tunes+0.024
 songs+0.024
 melodies+0.022
lights+0.022
Neg logit contributions
ll-0.024
-0.024
m-0.024
'-0.023
'm-0.023
Token says
Token ID1139
Feature activation-1.81e+00

Pos logit contributions
m+0.047
'm+0.046
 +0.046
 she+0.045
on+0.042
Neg logit contributions
 films-0.050
en-0.047
 tunes-0.046
oline-0.044
les-0.043
Token.
Token ID13
Feature activation+3.41e-01

Pos logit contributions
+0.048
ossible+0.043
'm+0.042
 +0.042
m+0.040
Neg logit contributions
rily-0.098
 tunes-0.096
 films-0.096
 tune-0.090
lights-0.089
Token "
Token ID366
Feature activation+6.96e-01

Pos logit contributions
 tunes+0.018
 films+0.018
phony+0.016
rily+0.016
lights+0.016
Neg logit contributions
-0.011
 are-0.009
m-0.009
ossible-0.009
 -0.009
TokenAre
Token ID8491
Feature activation+7.80e-01

Pos logit contributions
 tunes+0.039
 films+0.038
rated+0.037
ening+0.036
rily+0.036
Neg logit contributions
-0.021
 -0.021
,-0.021
 she-0.020
 are-0.018
Token you
Token ID345
Feature activation-4.26e+00

Pos logit contributions
 tunes+0.035
 films+0.035
izing+0.032
 melodies+0.031
lights+0.030
Neg logit contributions
 -0.025
,-0.025
-0.025
ossible-0.025
m-0.024
Token fine
Token ID3734
Feature activation-2.27e+00

Pos logit contributions
 +0.120
+0.114
 she+0.107
,+0.106
ossible+0.099
Neg logit contributions
 tunes-0.232
 films-0.232
rated-0.225
phony-0.221
vet-0.216
Token?"
Token ID1701
Feature activation+8.39e-01

Pos logit contributions
 +0.081
+0.080
,+0.076
 she+0.072
 are+0.071
Neg logit contributions
 tunes-0.116
 films-0.112
phony-0.105
rily-0.105
rated-0.105
Token
Token ID198
Feature activation-1.99e-01

Pos logit contributions
 films+0.046
 tunes+0.045
phony+0.042
lights+0.040
 melodies+0.040
Neg logit contributions
m-0.019
 are-0.018
-0.018
ossible-0.016
'm-0.015
Token
Token ID198
Feature activation+1.65e-01

Pos logit contributions
+0.004
,+0.003
 are+0.003
m+0.003
 she+0.003
Neg logit contributions
 tunes-0.013
 films-0.012
lights-0.012
phony-0.012
rily-0.012
Token"
Token ID1
Feature activation-1.32e+00

Pos logit contributions
 tunes+0.007
izing+0.006
 films+0.006
 tune+0.006
 stations+0.006
Neg logit contributions
 -0.007
 are-0.006
,-0.006
-0.006
m-0.006
Token the
Token ID262
Feature activation-4.88e-01

Pos logit contributions
m+0.018
 are+0.018
+0.017
ll+0.017
ossible+0.017
Neg logit contributions
 tunes-0.036
rily-0.035
 tune-0.035
 films-0.035
en-0.032
Token rabbit
Token ID22746
Feature activation+1.15e+00

Pos logit contributions
 +0.018
+0.018
,+0.018
m+0.017
ossible+0.017
Neg logit contributions
 tunes-0.024
 films-0.023
 tune-0.021
 melodies-0.020
phony-0.020
Token.
Token ID13
Feature activation-5.66e-01

Pos logit contributions
 tunes+0.060
rily+0.059
en+0.058
 tune+0.058
ening+0.056
Neg logit contributions
 are-0.033
ossible-0.029
 am-0.029
m-0.029
-0.026
Token "
Token ID366
Feature activation+4.45e-01

Pos logit contributions
 are+0.013
m+0.011
+0.010
,+0.010
ossible+0.009
Neg logit contributions
 tunes-0.033
phony-0.033
 films-0.033
rily-0.032
 tune-0.031
TokenAre
Token ID8491
Feature activation+9.25e-01

Pos logit contributions
 tunes+0.021
 films+0.019
 tune+0.019
phony+0.018
 melodies+0.018
Neg logit contributions
,-0.019
 -0.017
-0.017
 are-0.016
 she-0.016
Token you
Token ID345
Feature activation-4.26e+00

Pos logit contributions
 tunes+0.037
 films+0.037
uders+0.033
 melodies+0.033
 ger+0.032
Neg logit contributions
 -0.032
m-0.032
ossible-0.032
,-0.031
-0.029
Token hungry
Token ID14720
Feature activation-9.98e-01

Pos logit contributions
 +0.133
+0.120
,+0.119
ossible+0.118
 she+0.114
Neg logit contributions
 tunes-0.222
 films-0.217
rated-0.212
ening-0.209
rily-0.208
Token?
Token ID30
Feature activation-3.32e-01

Pos logit contributions
,+0.032
+0.029
 +0.029
 she+0.027
 are+0.026
Neg logit contributions
 tunes-0.058
 films-0.055
phony-0.053
 melodies-0.053
rily-0.052
Token I
Token ID314
Feature activation-1.58e+00

Pos logit contributions
,+0.013
 +0.013
+0.013
m+0.012
 she+0.012
Neg logit contributions
 tunes-0.014
 films-0.014
phony-0.013
rily-0.013
 tune-0.012
Token have
Token ID423
Feature activation+4.11e-01

Pos logit contributions
 +0.043
,+0.039
 she+0.039
+0.038
ossible+0.037
Neg logit contributions
rated-0.084
 tunes-0.082
en-0.077
 films-0.077
ening-0.077
Token some
Token ID617
Feature activation+1.44e+00

Pos logit contributions
 tunes+0.024
 films+0.023
phony+0.022
rily+0.022
rated+0.022
Neg logit contributions
 -0.011
-0.011
,-0.011
 she-0.009
 are-0.009
Token and
Token ID290
Feature activation-1.53e+00

Pos logit contributions
 +0.034
+0.034
ossible+0.031
 are+0.029
,+0.027
Neg logit contributions
 tunes-0.081
 films-0.077
ening-0.077
 tune-0.075
en-0.075
Token said
Token ID531
Feature activation-2.30e-01

Pos logit contributions
+0.049
ossible+0.049
m+0.048
 +0.045
,+0.044
Neg logit contributions
 tunes-0.073
rated-0.070
ening-0.068
 films-0.068
 tune-0.067
Token,
Token ID11
Feature activation-6.39e-02

Pos logit contributions
my+0.005
ossible+0.005
m+0.005
ma+0.004
+0.004
Neg logit contributions
 tunes-0.012
rily-0.012
ening-0.012
der-0.012
 tune-0.012
Token "
Token ID366
Feature activation+1.36e+00

Pos logit contributions
ossible+0.002
 are+0.002
m+0.002
+0.002
ma+0.002
Neg logit contributions
 tunes-0.003
 tune-0.003
 films-0.003
rily-0.003
ening-0.002
TokenAre
Token ID8491
Feature activation+4.68e-01

Pos logit contributions
 tunes+0.068
 films+0.063
rated+0.061
 tune+0.061
phony+0.061
Neg logit contributions
,-0.051
 -0.050
-0.049
 are-0.045
 she-0.045
Token you
Token ID345
Feature activation-4.25e+00

Pos logit contributions
 tunes+0.019
 films+0.019
izing+0.018
 melodies+0.017
 ger+0.017
Neg logit contributions
 -0.016
ossible-0.015
,-0.015
m-0.015
-0.015
Token okay
Token ID8788
Feature activation-7.51e-01

Pos logit contributions
 +0.105
+0.096
,+0.091
ossible+0.086
 she+0.086
Neg logit contributions
 tunes-0.255
rated-0.244
 films-0.244
phony-0.240
 tune-0.238
Token,
Token ID11
Feature activation-5.92e-01

Pos logit contributions
 +0.012
+0.011
 she+0.009
 are+0.009
,+0.009
Neg logit contributions
 tunes-0.052
 films-0.051
phony-0.049
rated-0.049
 tune-0.049
Token Tim
Token ID5045
Feature activation+1.85e-02

Pos logit contributions
 +0.018
ossible+0.017
m+0.017
+0.016
,+0.016
Neg logit contributions
 tunes-0.029
 films-0.029
 tune-0.028
 melodies-0.027
rily-0.026
Tokenmy
Token ID1820
Feature activation+9.06e-01

Pos logit contributions
 tunes+0.001
rated+0.001
 films+0.001
en+0.001
ening+0.001
Neg logit contributions
 -0.000
-0.000
 she-0.000
,-0.000
 are-0.000
Token?"
Token ID1701
Feature activation-5.76e-02

Pos logit contributions
 tunes+0.050
 films+0.048
rated+0.046
phony+0.046
 melodies+0.046
Neg logit contributions
 -0.028
-0.027
,-0.027
 she-0.025
 are-0.024
Token and
Token ID290
Feature activation-2.70e-01

Pos logit contributions
 are+0.030
+0.030
 +0.029
 am+0.028
ossible+0.028
Neg logit contributions
 tunes-0.066
rily-0.065
ening-0.062
 tune-0.062
 films-0.062
Token asked
Token ID1965
Feature activation+6.87e-01

Pos logit contributions
+0.007
ossible+0.006
 +0.006
 are+0.006
,+0.006
Neg logit contributions
 tunes-0.015
rily-0.014
 films-0.014
phony-0.014
 tune-0.014
Token,
Token ID11
Feature activation-8.84e-01

Pos logit contributions
 tunes+0.036
en+0.035
 films+0.035
ening+0.034
rily+0.034
Neg logit contributions
-0.018
ossible-0.017
m-0.016
 -0.016
ll-0.015
Token "
Token ID366
Feature activation+3.36e-01

Pos logit contributions
 are+0.032
ossible+0.031
 am+0.028
+0.027
m+0.025
Neg logit contributions
rily-0.040
 tunes-0.038
 tune-0.038
 films-0.037
ening-0.037
TokenAre
Token ID8491
Feature activation+4.14e-01

Pos logit contributions
 tunes+0.013
 tune+0.011
 films+0.011
phony+0.011
 melodies+0.011
Neg logit contributions
,-0.017
 -0.016
-0.016
 are-0.016
 she-0.015
Token you
Token ID345
Feature activation-4.24e+00

Pos logit contributions
 tunes+0.018
 films+0.018
izing+0.017
 melodies+0.017
en+0.017
Neg logit contributions
 -0.013
ossible-0.013
,-0.013
m-0.013
-0.012
Token glad
Token ID9675
Feature activation-1.73e+00

Pos logit contributions
 +0.126
+0.120
,+0.118
 she+0.116
ossible+0.113
Neg logit contributions
 films-0.222
 tunes-0.221
rated-0.212
phony-0.210
rily-0.210
Token to
Token ID284
Feature activation-1.77e+00

Pos logit contributions
 +0.052
+0.050
ossible+0.045
m+0.045
,+0.043
Neg logit contributions
 tunes-0.096
 films-0.094
 melodies-0.087
rily-0.086
en-0.085
Token have
Token ID423
Feature activation-1.47e+00

Pos logit contributions
+0.049
 +0.049
 she+0.049
m+0.047
,+0.042
Neg logit contributions
 films-0.092
 tunes-0.092
en-0.089
phony-0.086
rant-0.086
Token pizza
Token ID14256
Feature activation-3.02e-01

Pos logit contributions
,+0.040
 +0.038
+0.037
 she+0.034
 are+0.032
Neg logit contributions
 tunes-0.091
rated-0.087
 films-0.087
phony-0.085
rily-0.085
Token for
Token ID329
Feature activation+4.53e-01

Pos logit contributions
 +0.009
+0.008
,+0.008
 she+0.008
 are+0.007
Neg logit contributions
 tunes-0.017
 films-0.017
rily-0.016
rated-0.016
 tune-0.016
Token was
Token ID373
Feature activation-4.04e-01

Pos logit contributions
 tunes+0.041
rated+0.040
ening+0.038
en+0.038
rily+0.037
Neg logit contributions
 -0.019
ossible-0.018
,-0.017
 are-0.016
 she-0.015
Token too
Token ID1165
Feature activation-2.39e+00

Pos logit contributions
 +0.011
,+0.010
+0.010
 are+0.010
ossible+0.010
Neg logit contributions
 tunes-0.023
rated-0.022
 films-0.022
ening-0.022
rily-0.021
Token late
Token ID2739
Feature activation+6.14e-01

Pos logit contributions
 +0.116
,+0.115
+0.115
 she+0.108
 are+0.108
Neg logit contributions
 tunes-0.089
 films-0.083
rated-0.081
phony-0.078
ening-0.077
Tokenâ
Token ID22940
Feature activation-2.87e-01

Pos logit contributions
 tunes+0.037
ening+0.036
rily+0.035
en+0.034
rated+0.034
Neg logit contributions
 -0.013
-0.013
ossible-0.013
m-0.012
 are-0.012
Token
Token ID26391
Feature activation+1.50e+00

Pos logit contributions
+0.013
 +0.013
,+0.013
 she+0.012
 are+0.012
Neg logit contributions
 tunes-0.012
 films-0.011
phony-0.010
rily-0.010
rated-0.010
Token
Token ID447
Feature activation-4.24e+00

Pos logit contributions
 tunes+0.059
phony+0.058
uders+0.053
 tune+0.052
 films+0.052
Neg logit contributions
-0.061
,-0.060
m-0.060
 -0.059
 are-0.059
Token
Token ID251
Feature activation-4.21e-01

Pos logit contributions
 are+0.156
,+0.154
+0.153
 she+0.146
 forgive+0.144
Neg logit contributions
phony-0.169
izing-0.164
 tunes-0.162
rily-0.160
ishment-0.155
Token the
Token ID262
Feature activation-7.53e-01

Pos logit contributions
 are+0.006
+0.006
m+0.006
my+0.005
ossible+0.005
Neg logit contributions
 tunes-0.028
 tune-0.026
 films-0.025
rily-0.025
 melodies-0.025
Token ice
Token ID4771
Feature activation-5.96e-01

Pos logit contributions
 +0.029
,+0.028
+0.028
 are+0.027
ossible+0.026
Neg logit contributions
 tunes-0.037
 films-0.034
 tune-0.033
 melodies-0.032
phony-0.032
Token melted
Token ID24178
Feature activation-4.50e-01

Pos logit contributions
 +0.025
+0.025
ossible+0.023
 are+0.021
,+0.020
Neg logit contributions
 tunes-0.027
lights-0.025
rated-0.025
 tune-0.025
ening-0.024
Token and
Token ID290
Feature activation-6.23e-01

Pos logit contributions
,+0.012
 +0.012
+0.012
 she+0.010
 are+0.010
Neg logit contributions
 tunes-0.027
 films-0.026
phony-0.025
rated-0.025
 melodies-0.025
Token and
Token ID290
Feature activation-1.84e-01

Pos logit contributions
+0.043
 +0.040
ossible+0.037
 are+0.037
 forgive+0.035
Neg logit contributions
 tunes-0.076
 films-0.072
 tune-0.070
ening-0.068
en-0.068
Token said
Token ID531
Feature activation-5.47e-03

Pos logit contributions
+0.005
 +0.005
,+0.005
ossible+0.005
 are+0.005
Neg logit contributions
 tunes-0.010
 films-0.010
rated-0.009
phony-0.009
rily-0.009
Token,
Token ID11
Feature activation+4.99e-01

Pos logit contributions
ossible+0.000
my+0.000
m+0.000
ll+0.000
+0.000
Neg logit contributions
 tunes-0.000
 films-0.000
rily-0.000
ening-0.000
 tune-0.000
Token "
Token ID366
Feature activation+1.64e+00

Pos logit contributions
 tunes+0.022
 films+0.021
 tune+0.021
rily+0.020
phony+0.020
Neg logit contributions
ossible-0.018
 are-0.018
m-0.016
-0.016
 am-0.016
TokenAre
Token ID8491
Feature activation+4.18e-01

Pos logit contributions
 tunes+0.075
 tune+0.068
 melodies+0.066
rated+0.065
phony+0.065
Neg logit contributions
,-0.073
 -0.065
 are-0.063
-0.061
ossible-0.059
Token you
Token ID345
Feature activation-4.24e+00

Pos logit contributions
 films+0.017
 tunes+0.017
izing+0.016
uders+0.016
ishment+0.016
Neg logit contributions
 -0.014
ossible-0.014
,-0.013
m-0.013
 forgive-0.013
Token ready
Token ID3492
Feature activation-3.49e-01

Pos logit contributions
 +0.110
+0.104
,+0.100
 she+0.095
ossible+0.089
Neg logit contributions
 tunes-0.249
 films-0.244
rated-0.237
phony-0.235
 tune-0.230
Token to
Token ID284
Feature activation-6.38e-01

Pos logit contributions
 +0.008
+0.008
,+0.008
 she+0.007
 are+0.007
Neg logit contributions
 tunes-0.022
 films-0.021
phony-0.020
rated-0.020
rily-0.020
Token open
Token ID1280
Feature activation-1.43e+00

Pos logit contributions
 +0.020
m+0.020
+0.018
 she+0.018
ossible+0.018
Neg logit contributions
 tunes-0.032
 films-0.031
en-0.031
phony-0.030
 melodies-0.029
Token the
Token ID262
Feature activation+2.07e-01

Pos logit contributions
,+0.036
 +0.033
+0.033
 she+0.029
 are+0.027
Neg logit contributions
 tunes-0.092
rated-0.087
 films-0.086
phony-0.086
 melodies-0.085
Token box
Token ID3091
Feature activation-1.38e+00

Pos logit contributions
 tunes+0.010
 films+0.010
ening+0.009
phony+0.009
rated+0.009
Neg logit contributions
 -0.008
-0.008
,-0.007
 she-0.007
m-0.007
Tokenara
Token ID3301
Feature activation-5.33e-01

Pos logit contributions
 +0.030
+0.029
 she+0.027
,+0.027
 are+0.025
Neg logit contributions
 tunes-0.015
 films-0.015
phony-0.015
en-0.014
 melodies-0.014
Token says
Token ID1139
Feature activation-1.03e+00

Pos logit contributions
 +0.017
+0.017
,+0.016
 she+0.016
m+0.015
Neg logit contributions
 tunes-0.028
 films-0.027
phony-0.026
en-0.026
 melodies-0.026
Token,
Token ID11
Feature activation-1.31e-01

Pos logit contributions
+0.017
 +0.014
my+0.014
m+0.013
ossible+0.011
Neg logit contributions
 films-0.064
 tunes-0.064
 songs-0.061
 melodies-0.060
ening-0.059
Token "
Token ID366
Feature activation+1.79e+00

Pos logit contributions
+0.004
ossible+0.004
 are+0.004
m+0.004
my+0.004
Neg logit contributions
 tunes-0.006
 films-0.006
phony-0.006
 tune-0.006
ening-0.006
TokenAre
Token ID8491
Feature activation+5.91e-01

Pos logit contributions
 films+0.099
 tunes+0.098
rily+0.097
rated+0.095
ening+0.094
Neg logit contributions
-0.057
 -0.057
,-0.054
 she-0.052
 are-0.049
Token you
Token ID345
Feature activation-4.24e+00

Pos logit contributions
 tunes+0.028
 films+0.027
 melodies+0.025
izing+0.025
phony+0.025
Neg logit contributions
 -0.018
-0.018
ossible-0.017
,-0.017
m-0.017
Token OK
Token ID7477
Feature activation-1.11e+00

Pos logit contributions
 +0.096
+0.094
,+0.087
 she+0.083
ossible+0.075
Neg logit contributions
 tunes-0.265
 films-0.256
phony-0.248
rated-0.247
 melodies-0.246
Token,
Token ID11
Feature activation-6.29e-01

Pos logit contributions
 +0.023
,+0.023
+0.023
 she+0.020
 are+0.019
Neg logit contributions
 tunes-0.073
 films-0.070
phony-0.067
rated-0.067
 melodies-0.067
Token bird
Token ID6512
Feature activation-1.56e-01

Pos logit contributions
 +0.021
+0.021
m+0.019
ossible+0.019
,+0.018
Neg logit contributions
 films-0.031
 tunes-0.030
phony-0.028
 tune-0.028
 melodies-0.028
Token?
Token ID30
Feature activation+8.60e-01

Pos logit contributions
 +0.006
+0.006
 she+0.005
,+0.005
 are+0.005
Neg logit contributions
 tunes-0.008
 films-0.007
rated-0.007
phony-0.007
 tune-0.007
Token That
Token ID1320
Feature activation-1.56e+00

Pos logit contributions
 tunes+0.038
 films+0.036
phony+0.035
 melodies+0.033
 tune+0.032
Neg logit contributions
 -0.034
,-0.033
m-0.032
-0.032
 she-0.031
Token she
Token ID673
Feature activation-2.10e+00

Pos logit contributions
 tunes+0.045
 films+0.045
lights+0.043
 melodies+0.042
 tune+0.042
Neg logit contributions
 -0.023
-0.023
ossible-0.022
 are-0.022
 forgive-0.021
Token continued
Token ID3767
Feature activation-3.77e-01

Pos logit contributions
,+0.039
 +0.039
+0.039
 she+0.033
 are+0.031
Neg logit contributions
 tunes-0.143
 films-0.138
rated-0.134
phony-0.133
rily-0.133
Token to
Token ID284
Feature activation-5.27e-01

Pos logit contributions
+0.011
 +0.010
 are+0.010
m+0.010
 forgive+0.010
Neg logit contributions
 films-0.022
 tunes-0.021
ening-0.020
 tune-0.019
en-0.019
Token make
Token ID787
Feature activation-5.53e-01

Pos logit contributions
m+0.020
 +0.018
+0.017
ll+0.017
ossible+0.017
Neg logit contributions
 tunes-0.025
en-0.024
 melodies-0.023
 films-0.023
 tune-0.022
Token more
Token ID517
Feature activation-1.12e+00

Pos logit contributions
,+0.014
 +0.014
+0.013
 she+0.012
 are+0.011
Neg logit contributions
 tunes-0.034
 films-0.033
rated-0.032
phony-0.032
rily-0.031
Token dolls
Token ID36062
Feature activation-4.24e+00

Pos logit contributions
 +0.047
+0.046
,+0.045
ossible+0.043
m+0.042
Neg logit contributions
 tunes-0.052
 films-0.049
 melodies-0.046
ening-0.045
 tune-0.045
Token,
Token ID11
Feature activation+7.71e-02

Pos logit contributions
ossible+0.092
 +0.092
+0.076
m+0.071
 forgive+0.065
Neg logit contributions
 tunes-0.266
 films-0.250
en-0.245
 melodies-0.244
 tune-0.243
Token Jane
Token ID12091
Feature activation+2.22e-01

Pos logit contributions
 tunes+0.005
 films+0.005
 melodies+0.004
 tune+0.004
rily+0.004
Neg logit contributions
-0.002
ossible-0.002
 -0.002
m-0.002
,-0.001
Token learned
Token ID4499
Feature activation+3.42e-01

Pos logit contributions
 tunes+0.012
en+0.012
ening+0.011
 tune+0.011
 films+0.011
Neg logit contributions
 -0.006
ossible-0.006
-0.006
 are-0.006
 forgive-0.005
Token how
Token ID703
Feature activation+1.00e+00

Pos logit contributions
 tunes+0.019
 films+0.018
 melodies+0.017
 tune+0.016
rily+0.016
Neg logit contributions
 -0.010
-0.010
m-0.010
,-0.009
ossible-0.009
Token to
Token ID284
Feature activation+3.40e-01

Pos logit contributions
 tunes+0.059
 films+0.055
 melodies+0.054
 tune+0.053
phony+0.052
Neg logit contributions
 -0.027
-0.025
,-0.025
 are-0.024
ossible-0.024
Token.
Token ID13
Feature activation+3.72e-01

Pos logit contributions
+0.027
ossible+0.027
 are+0.027
 +0.025
m+0.024
Neg logit contributions
 tunes-0.078
rily-0.076
 tune-0.074
 films-0.073
ening-0.073
Token
Token ID198
Feature activation+4.86e-01

Pos logit contributions
 tunes+0.021
 films+0.021
rily+0.020
phony+0.020
 tune+0.020
Neg logit contributions
-0.008
 are-0.008
m-0.008
ossible-0.008
,-0.007
Token
Token ID198
Feature activation+6.29e-01

Pos logit contributions
 tunes+0.033
 films+0.032
phony+0.031
rily+0.030
 melodies+0.030
Neg logit contributions
-0.007
,-0.006
 are-0.006
m-0.006
 -0.006
Token"
Token ID1
Feature activation-5.00e-01

Pos logit contributions
 tunes+0.036
 films+0.033
phony+0.033
 tune+0.032
 chords+0.032
Neg logit contributions
 -0.017
,-0.017
 are-0.016
m-0.016
 were-0.014
TokenAre
Token ID8491
Feature activation+7.74e-01

Pos logit contributions
+0.016
 +0.016
,+0.015
 she+0.015
 are+0.014
Neg logit contributions
 tunes-0.027
 films-0.027
rated-0.026
ening-0.026
rily-0.026
Token you
Token ID345
Feature activation-4.23e+00

Pos logit contributions
 films+0.034
 tunes+0.034
izing+0.030
 melodies+0.030
uders+0.030
Neg logit contributions
 -0.025
m-0.025
ossible-0.024
 forgive-0.024
-0.023
Token ready
Token ID3492
Feature activation-2.00e-01

Pos logit contributions
 +0.118
+0.111
,+0.109
 she+0.103
ossible+0.102
Neg logit contributions
 tunes-0.240
 films-0.233
phony-0.227
rated-0.224
ening-0.222
Token?"
Token ID1701
Feature activation+8.71e-01

Pos logit contributions
 +0.004
+0.004
,+0.004
 she+0.003
 are+0.003
Neg logit contributions
 tunes-0.013
 films-0.013
phony-0.012
rated-0.012
rily-0.012
Token Tom
Token ID4186
Feature activation-1.75e+00

Pos logit contributions
 films+0.038
 tunes+0.038
phony+0.034
 tune+0.034
ishment+0.033
Neg logit contributions
m-0.033
ll-0.032
 are-0.031
-0.030
 forgive-0.028
Token asked
Token ID1965
Feature activation-3.65e-03

Pos logit contributions
,+0.049
 +0.049
+0.049
 she+0.044
 are+0.042
Neg logit contributions
 tunes-0.102
 films-0.098
phony-0.094
rated-0.094
rily-0.093
Token.
Token ID13
Feature activation-9.44e-02

Pos logit contributions
+0.000
ossible+0.000
 are+0.000
 +0.000
ll+0.000
Neg logit contributions
 tunes-0.000
 films-0.000
rily-0.000
en-0.000
lights-0.000
Token.
Token ID13
Feature activation+1.10e-01

Pos logit contributions
 tunes+0.020
 films+0.019
rily+0.019
en+0.019
ening+0.018
Neg logit contributions
 -0.007
-0.007
 she-0.007
m-0.006
'm-0.006
Token
Token ID198
Feature activation+3.78e-01

Pos logit contributions
 tunes+0.007
 films+0.007
phony+0.007
rily+0.007
 tune+0.007
Neg logit contributions
-0.002
,-0.002
 -0.001
m-0.001
 are-0.001
Token
Token ID198
Feature activation+6.79e-01

Pos logit contributions
 tunes+0.025
 films+0.025
phony+0.024
rily+0.024
 melodies+0.023
Neg logit contributions
-0.005
,-0.005
m-0.004
 -0.004
 are-0.004
Token"
Token ID1
Feature activation+1.44e-01

Pos logit contributions
 tunes+0.029
 films+0.028
ands+0.027
phony+0.027
izing+0.027
Neg logit contributions
 -0.024
,-0.023
 are-0.023
're-0.021
m-0.021
TokenAre
Token ID8491
Feature activation+8.42e-01

Pos logit contributions
 tunes+0.007
 films+0.006
phony+0.006
rated+0.006
 melodies+0.006
Neg logit contributions
,-0.006
 -0.006
-0.006
 she-0.005
 are-0.005
Token you
Token ID345
Feature activation-4.22e+00

Pos logit contributions
 films+0.035
 tunes+0.035
 ger+0.033
izing+0.032
 melodies+0.032
Neg logit contributions
 -0.028
-0.027
ossible-0.026
 forgive-0.026
,-0.025
Token okay
Token ID8788
Feature activation-4.70e-01

Pos logit contributions
 +0.091
+0.089
,+0.083
 she+0.078
ossible+0.070
Neg logit contributions
 tunes-0.269
 films-0.262
phony-0.254
rated-0.253
 tune-0.249
Token,
Token ID11
Feature activation-6.11e-01

Pos logit contributions
 +0.005
+0.005
,+0.004
 she+0.003
 are+0.003
Neg logit contributions
 tunes-0.036
 films-0.035
phony-0.034
rated-0.033
rily-0.033
Token Sara
Token ID24799
Feature activation+9.67e-01

Pos logit contributions
 +0.023
+0.022
,+0.021
ossible+0.020
 she+0.019
Neg logit contributions
 films-0.029
 tunes-0.029
rily-0.026
phony-0.026
 tune-0.026
Token?"
Token ID1701
Feature activation+7.19e-01

Pos logit contributions
 tunes+0.044
 films+0.043
rated+0.042
ening+0.040
rily+0.040
Neg logit contributions
 -0.037
-0.036
 are-0.033
 she-0.033
,-0.033
Token Ben
Token ID3932
Feature activation-8.34e-01

Pos logit contributions
 tunes+0.033
 films+0.032
rily+0.030
 tune+0.029
phony+0.029
Neg logit contributions
-0.026
 -0.024
m-0.024
 are-0.024
,-0.024
Token.
Token ID13
Feature activation+3.68e-01

Pos logit contributions
 tunes+0.006
 films+0.006
 tune+0.006
rily+0.006
ishment+0.006
Neg logit contributions
-0.002
 -0.002
ossible-0.002
 are-0.002
 she-0.002
Token The
Token ID383
Feature activation-6.64e-01

Pos logit contributions
 tunes+0.019
 films+0.018
rated+0.017
phony+0.017
 melodies+0.017
Neg logit contributions
-0.011
,-0.011
m-0.011
 are-0.010
 -0.010
Token moral
Token ID6573
Feature activation+2.46e+00

Pos logit contributions
+0.024
 +0.024
,+0.022
 she+0.022
 She+0.019
Neg logit contributions
 tunes-0.032
 films-0.032
 ger-0.029
 tune-0.029
 melodies-0.028
Token of
Token ID286
Feature activation+2.51e+00

Pos logit contributions
 tunes+0.163
 films+0.156
phony+0.151
rated+0.150
 melodies+0.150
Neg logit contributions
 -0.051
,-0.050
-0.050
 she-0.043
 are-0.041
Token the
Token ID262
Feature activation+2.69e+00

Pos logit contributions
 tunes+0.165
 films+0.159
phony+0.154
rated+0.153
 melodies+0.152
Neg logit contributions
 -0.052
,-0.052
-0.051
 she-0.044
 are-0.042
Token story
Token ID1621
Feature activation+3.35e+00

Pos logit contributions
 films+0.139
 tunes+0.134
 melodies+0.122
 tune+0.120
 songs+0.117
Neg logit contributions
 -0.085
-0.084
,-0.083
 she-0.081
 are-0.077
Token is
Token ID318
Feature activation+7.65e-01

Pos logit contributions
 tunes+0.182
 films+0.176
rated+0.172
ening+0.165
en+0.165
Neg logit contributions
 -0.093
-0.089
 she-0.082
m-0.079
,-0.073
Token to
Token ID284
Feature activation+1.15e+00

Pos logit contributions
 tunes+0.041
 films+0.040
phony+0.037
 melodies+0.037
 tune+0.037
Neg logit contributions
 -0.023
-0.022
m-0.019
 are-0.019
 she-0.018
Token always
Token ID1464
Feature activation+1.47e+00

Pos logit contributions
 tunes+0.061
 melodies+0.057
 tune+0.056
en+0.055
 films+0.053
Neg logit contributions
m-0.037
-0.035
 -0.034
 she-0.032
,-0.030
Token listen
Token ID6004
Feature activation+1.40e+00

Pos logit contributions
 tunes+0.085
 films+0.079
 tune+0.078
 melodies+0.077
phony+0.076
Neg logit contributions
 -0.041
-0.040
m-0.037
 she-0.036
,-0.036
Token to
Token ID284
Feature activation+2.19e-01

Pos logit contributions
 tunes+0.095
 films+0.091
phony+0.089
rated+0.088
rily+0.088
Neg logit contributions
,-0.027
-0.026
 -0.026
 she-0.023
 are-0.021
Token,
Token ID11
Feature activation-2.03e-01

Pos logit contributions
ossible+0.003
m+0.003
my+0.003
+0.002
 +0.002
Neg logit contributions
rily-0.014
 tunes-0.013
ening-0.012
 films-0.012
en-0.012
Token "
Token ID366
Feature activation+8.94e-01

Pos logit contributions
 are+0.006
+0.006
ossible+0.006
m+0.006
 am+0.005
Neg logit contributions
 tunes-0.010
 tune-0.010
rily-0.010
 films-0.010
phony-0.009
TokenYes
Token ID5297
Feature activation-1.78e+00

Pos logit contributions
 tunes+0.048
 films+0.045
phony+0.044
 melodies+0.043
rated+0.043
Neg logit contributions
,-0.030
 -0.029
-0.029
 she-0.027
 are-0.026
Token,
Token ID11
Feature activation+2.80e-01

Pos logit contributions
 +0.021
ossible+0.021
m+0.021
+0.020
 am+0.020
Neg logit contributions
rily-0.118
 tunes-0.116
ening-0.116
en-0.116
izing-0.115
Token it
Token ID340
Feature activation+1.19e+00

Pos logit contributions
 tunes+0.015
 films+0.014
 melodies+0.013
 tune+0.013
rily+0.013
Neg logit contributions
 -0.009
m-0.009
-0.008
,-0.008
ossible-0.008
Token sure
Token ID1654
Feature activation+3.82e+00

Pos logit contributions
 tunes+0.066
rated+0.063
en+0.063
 films+0.061
ening+0.061
Neg logit contributions
ossible-0.033
 -0.030
 she-0.028
m-0.027
 forgive-0.027
Token is
Token ID318
Feature activation-8.03e-01

Pos logit contributions
 tunes+0.224
 films+0.209
 melodies+0.207
rated+0.204
ening+0.204
Neg logit contributions
 -0.103
-0.096
m-0.084
 she-0.083
,-0.081
Token!
Token ID0
Feature activation+4.50e-01

Pos logit contributions
 +0.020
+0.019
ossible+0.019
m+0.019
 she+0.018
Neg logit contributions
 tunes-0.047
rated-0.045
 films-0.045
ening-0.044
rily-0.044
Token I
Token ID314
Feature activation-1.40e+00

Pos logit contributions
 tunes+0.020
 films+0.019
phony+0.019
 melodies+0.018
rated+0.018
Neg logit contributions
,-0.018
 -0.017
m-0.017
 she-0.016
 are-0.016
Token'm
Token ID1101
Feature activation+2.07e-01

Pos logit contributions
 she+0.034
 +0.031
,+0.028
 he+0.026
m+0.025
Neg logit contributions
rated-0.074
 tunes-0.074
en-0.073
phony-0.072
 tune-0.068
Token so
Token ID523
Feature activation-5.56e-01

Pos logit contributions
 films+0.010
rily+0.010
 tunes+0.010
phony+0.010
vet+0.010
Neg logit contributions
ossible-0.006
 -0.006
 are-0.006
-0.006
,-0.006
Token for
Token ID329
Feature activation+6.16e-01

Pos logit contributions
 tunes+0.125
 films+0.120
ening+0.114
 melodies+0.113
rily+0.113
Neg logit contributions
 -0.050
-0.049
,-0.044
 she-0.042
m-0.041
Token your
Token ID534
Feature activation+1.14e+00

Pos logit contributions
rated+0.035
rily+0.034
 tunes+0.034
lights+0.034
phony+0.033
Neg logit contributions
,-0.020
 -0.018
-0.017
 are-0.017
 she-0.017
Token things
Token ID1243
Feature activation+8.66e-01

Pos logit contributions
 tunes+0.058
 films+0.055
 tune+0.051
 melodies+0.051
 songs+0.048
Neg logit contributions
-0.044
 -0.044
 she-0.040
,-0.040
m-0.039
Token and
Token ID290
Feature activation+1.75e+00

Pos logit contributions
 tunes+0.053
 films+0.050
rily+0.048
lights+0.048
ening+0.047
Neg logit contributions
 -0.021
ossible-0.018
-0.017
m-0.016
'm-0.015
Token for
Token ID329
Feature activation+1.09e+00

Pos logit contributions
 tunes+0.097
 films+0.091
 melodies+0.087
 tune+0.086
rated+0.084
Neg logit contributions
 -0.055
-0.054
,-0.053
m-0.050
 she-0.047
Token others
Token ID1854
Feature activation+3.71e+00

Pos logit contributions
rated+0.060
 tunes+0.060
rily+0.059
lights+0.058
phony+0.058
Neg logit contributions
,-0.035
 -0.033
-0.032
 she-0.030
 are-0.030
Token.
Token ID13
Feature activation+5.32e-01

Pos logit contributions
 tunes+0.237
 films+0.225
 melodies+0.213
ening+0.210
 tune+0.210
Neg logit contributions
 -0.083
-0.072
 she-0.065
ossible-0.064
m-0.064
Token That
Token ID1320
Feature activation-1.27e+00

Pos logit contributions
 tunes+0.029
 films+0.028
phony+0.027
rated+0.027
 melodies+0.027
Neg logit contributions
,-0.016
 -0.016
-0.016
 she-0.015
 are-0.014
Token is
Token ID318
Feature activation+2.05e-01

Pos logit contributions
 +0.035
ossible+0.031
 she+0.031
+0.030
,+0.030
Neg logit contributions
 tunes-0.074
 films-0.072
 melodies-0.067
rated-0.066
en-0.065
Token what
Token ID644
Feature activation-6.04e-02

Pos logit contributions
 tunes+0.012
 films+0.012
rated+0.011
 melodies+0.011
phony+0.011
Neg logit contributions
 -0.006
,-0.005
-0.005
 she-0.005
m-0.005
Token grand
Token ID4490
Feature activation+1.43e+00

Pos logit contributions
 +0.002
+0.002
,+0.002
m+0.002
ossible+0.002
Neg logit contributions
 tunes-0.003
 films-0.003
 melodies-0.003
 chords-0.003
 tune-0.003
Token from
Token ID422
Feature activation-3.85e-01

Pos logit contributions
,+0.034
+0.032
 +0.032
 she+0.029
 are+0.027
Neg logit contributions
 tunes-0.093
 films-0.090
phony-0.087
rated-0.086
 tune-0.085
Token all
Token ID477
Feature activation+1.93e-01

Pos logit contributions
,+0.009
 +0.009
+0.009
 she+0.008
 are+0.007
Neg logit contributions
 tunes-0.024
 films-0.023
phony-0.023
rated-0.022
 melodies-0.022
Token the
Token ID262
Feature activation+5.07e-01

Pos logit contributions
 tunes+0.011
 films+0.011
ening+0.011
rated+0.010
en+0.010
Neg logit contributions
 -0.005
-0.005
ossible-0.005
,-0.005
 forgive-0.005
Token other
Token ID584
Feature activation+2.46e-01

Pos logit contributions
 tunes+0.024
 films+0.022
rily+0.022
phony+0.022
rated+0.021
Neg logit contributions
,-0.020
-0.020
 -0.020
 she-0.019
 are-0.018
Token tasks
Token ID8861
Feature activation+1.79e+00

Pos logit contributions
 tunes+0.012
 films+0.011
rily+0.011
phony+0.011
rated+0.011
Neg logit contributions
,-0.010
-0.010
 -0.010
 she-0.009
 are-0.009
Token Piper
Token ID33503
Feature activation+4.14e+00

Pos logit contributions
 tunes+0.110
 films+0.105
ening+0.104
rily+0.102
lights+0.102
Neg logit contributions
ossible-0.035
 -0.035
m-0.031
 forgive-0.029
-0.028
Token had
Token ID550
Feature activation+5.35e-01

Pos logit contributions
 tunes+0.244
 films+0.226
rated+0.223
ening+0.221
phony+0.220
Neg logit contributions
 -0.102
ossible-0.091
m-0.090
,-0.088
 forgive-0.087
Token seen
Token ID1775
Feature activation-4.23e-01

Pos logit contributions
 tunes+0.028
 films+0.026
rated+0.026
vet+0.026
phony+0.026
Neg logit contributions
 -0.017
-0.016
,-0.016
ossible-0.015
m-0.014
Token before
Token ID878
Feature activation-7.21e-01

Pos logit contributions
 +0.010
+0.010
ossible+0.009
,+0.009
 she+0.008
Neg logit contributions
 tunes-0.026
 films-0.025
ening-0.024
 melodies-0.024
rated-0.024
Token.
Token ID13
Feature activation-3.89e-01

Pos logit contributions
+0.013
 +0.013
ossible+0.012
 are+0.011
m+0.011
Neg logit contributions
 tunes-0.047
 films-0.046
ening-0.044
phony-0.044
 melodies-0.043
Token
Token ID220
Feature activation+1.33e-01

Pos logit contributions
,+0.006
+0.006
 +0.005
 are+0.004
 she+0.004
Neg logit contributions
 tunes-0.028
 films-0.027
phony-0.026
rated-0.026
 melodies-0.026
Token.
Token ID13
Feature activation-3.73e-01

Pos logit contributions
 tunes+0.003
 films+0.003
phony+0.003
 tune+0.003
rily+0.003
Neg logit contributions
-0.001
 -0.001
,-0.001
 are-0.001
 she-0.001
Token The
Token ID383
Feature activation-1.07e+00

Pos logit contributions
,+0.009
+0.009
 +0.009
 she+0.008
 are+0.008
Neg logit contributions
 tunes-0.023
 films-0.022
phony-0.022
rated-0.021
 melodies-0.021
Token moral
Token ID6573
Feature activation+2.35e+00

Pos logit contributions
 +0.033
,+0.033
+0.033
 she+0.030
 are+0.029
Neg logit contributions
 tunes-0.059
 films-0.057
phony-0.054
rated-0.054
 melodies-0.054
Token of
Token ID286
Feature activation+2.48e+00

Pos logit contributions
 tunes+0.167
rily+0.163
 films+0.161
rated+0.160
phony+0.159
Neg logit contributions
,-0.041
-0.040
 -0.037
 she-0.033
 are-0.031
Token the
Token ID262
Feature activation+2.69e+00

Pos logit contributions
 tunes+0.168
 films+0.160
phony+0.158
rated+0.158
rily+0.157
Neg logit contributions
,-0.049
 -0.048
-0.048
 she-0.040
 are-0.039
Token story
Token ID1621
Feature activation+3.40e+00

Pos logit contributions
 tunes+0.163
 films+0.158
 melodies+0.149
 tune+0.148
phony+0.148
Neg logit contributions
 -0.068
,-0.067
-0.067
 she-0.061
 are-0.057
Token is
Token ID318
Feature activation+7.54e-01

Pos logit contributions
 tunes+0.201
 films+0.192
rated+0.187
 melodies+0.182
ening+0.182
Neg logit contributions
 -0.087
-0.083
 she-0.076
,-0.074
m-0.074
Token to
Token ID284
Feature activation+1.35e+00

Pos logit contributions
 tunes+0.046
 films+0.044
phony+0.042
 melodies+0.042
 tune+0.042
Neg logit contributions
 -0.019
-0.018
,-0.016
 are-0.016
m-0.015
Token be
Token ID307
Feature activation+1.23e+00

Pos logit contributions
 tunes+0.071
 melodies+0.066
 tune+0.064
 films+0.063
en+0.062
Neg logit contributions
m-0.042
-0.042
 -0.042
 she-0.039
,-0.038
Token kind
Token ID1611
Feature activation-5.42e-01

Pos logit contributions
 tunes+0.071
 films+0.067
 tune+0.064
 melodies+0.064
phony+0.063
Neg logit contributions
 -0.034
-0.034
,-0.032
 she-0.030
 are-0.030
Token and
Token ID290
Feature activation+8.13e-01

Pos logit contributions
 +0.012
+0.011
,+0.011
 she+0.010
 are+0.009
Neg logit contributions
 tunes-0.036
 films-0.034
phony-0.033
 melodies-0.033
rated-0.033
Token on
Token ID319
Feature activation+3.71e-02

Pos logit contributions
 tunes+0.052
 films+0.050
ening+0.049
rated+0.048
 melodies+0.047
Neg logit contributions
 -0.021
-0.020
 forgive-0.019
,-0.019
 she-0.018
Token,
Token ID11
Feature activation-6.82e-02

Pos logit contributions
 films+0.003
 tunes+0.003
ening+0.002
en+0.002
rily+0.002
Neg logit contributions
 forgive-0.000
ossible-0.000
-0.000
 -0.000
m-0.000
Token Fl
Token ID1610
Feature activation+6.98e-01

Pos logit contributions
ossible+0.003
+0.002
 +0.002
 forgive+0.002
m+0.002
Neg logit contributions
 tunes-0.003
 films-0.003
 melodies-0.003
 tune-0.003
rily-0.003
Tokenuffy
Token ID15352
Feature activation+4.51e-01

Pos logit contributions
 tunes+0.041
 films+0.039
phony+0.038
rated+0.037
rily+0.037
Neg logit contributions
 -0.020
-0.020
,-0.019
 she-0.018
 are-0.017
Token's
Token ID338
Feature activation+6.21e-01

Pos logit contributions
 tunes+0.027
 films+0.025
en+0.024
 melodies+0.024
ening+0.023
Neg logit contributions
 -0.012
ossible-0.011
-0.011
 she-0.010
m-0.010
Token secret
Token ID3200
Feature activation+1.09e+00

Pos logit contributions
 tunes+0.031
 films+0.030
 melodies+0.027
 tune+0.027
 songs+0.027
Neg logit contributions
 -0.024
-0.024
m-0.023
 she-0.022
,-0.021
Token box
Token ID3091
Feature activation-8.95e-01

Pos logit contributions
 tunes+0.067
 films+0.061
lights+0.059
 melodies+0.059
 tune+0.059
Neg logit contributions
 -0.029
-0.028
ossible-0.025
m-0.025
 are-0.021
Token stayed
Token ID9658
Feature activation+5.03e-01

Pos logit contributions
 +0.024
ossible+0.022
+0.021
m+0.020
 forgive+0.018
Neg logit contributions
 tunes-0.053
lights-0.049
en-0.048
ening-0.047
 films-0.047
Token hidden
Token ID7104
Feature activation+3.88e-02

Pos logit contributions
 tunes+0.029
 films+0.028
rily+0.027
phony+0.026
 tune+0.026
Neg logit contributions
 -0.015
-0.014
,-0.013
m-0.013
 are-0.013
Token under
Token ID739
Feature activation-2.28e-01

Pos logit contributions
 tunes+0.002
 films+0.002
phony+0.002
rated+0.002
 melodies+0.002
Neg logit contributions
 -0.001
,-0.001
-0.001
 she-0.001
 are-0.001
Token the
Token ID262
Feature activation+9.91e-01

Pos logit contributions
+0.005
 +0.005
,+0.005
m+0.004
 she+0.004
Neg logit contributions
 tunes-0.015
 films-0.014
phony-0.013
 melodies-0.013
rily-0.013
Token.
Token ID13
Feature activation-6.59e-01

Pos logit contributions
 tunes+0.081
 films+0.077
ening+0.076
rated+0.075
 tune+0.075
Neg logit contributions
 -0.025
-0.024
ossible-0.021
,-0.019
 are-0.019
Token
Token ID198
Feature activation-5.66e-02

Pos logit contributions
 are+0.015
,+0.014
m+0.013
 help+0.013
+0.013
Neg logit contributions
 tunes-0.038
 films-0.037
phony-0.036
rily-0.036
ishment-0.035
Token
Token ID198
Feature activation+1.54e-01

Pos logit contributions
 are+0.001
,+0.001
m+0.001
+0.001
 she+0.001
Neg logit contributions
 tunes-0.003
 films-0.003
 chords-0.003
 tune-0.003
phony-0.003
TokenThe
Token ID464
Feature activation-1.08e+00

Pos logit contributions
 tunes+0.006
 chords+0.006
 tune+0.006
phony+0.005
 films+0.005
Neg logit contributions
 are-0.007
,-0.007
m-0.006
 -0.006
're-0.006
Token moral
Token ID6573
Feature activation+1.87e+00

Pos logit contributions
 +0.041
,+0.037
 are+0.036
+0.036
ossible+0.034
Neg logit contributions
 tunes-0.055
 films-0.053
 tune-0.050
 melodies-0.049
 melody-0.047
Token of
Token ID286
Feature activation+2.54e+00

Pos logit contributions
 tunes+0.120
 films+0.115
 melodies+0.110
phony+0.109
rated+0.109
Neg logit contributions
 -0.042
,-0.040
-0.039
 she-0.035
 are-0.034
Token the
Token ID262
Feature activation+2.59e+00

Pos logit contributions
 tunes+0.165
 films+0.159
phony+0.152
rated+0.152
 melodies+0.152
Neg logit contributions
 -0.054
-0.054
,-0.054
 she-0.047
 are-0.045
Token story
Token ID1621
Feature activation+2.97e+00

Pos logit contributions
 films+0.140
 tunes+0.140
 melodies+0.129
 tune+0.125
 melody+0.123
Neg logit contributions
 -0.079
,-0.075
 are-0.072
-0.071
ossible-0.070
Token is
Token ID318
Feature activation+3.78e-01

Pos logit contributions
 tunes+0.140
lights+0.136
rated+0.135
 films+0.135
en+0.133
Neg logit contributions
 -0.094
-0.085
m-0.084
 pr-0.082
ossible-0.080
Token that
Token ID326
Feature activation+3.00e-02

Pos logit contributions
 tunes+0.020
 films+0.020
en+0.019
phony+0.019
 melodies+0.019
Neg logit contributions
 -0.011
-0.010
m-0.010
 are-0.009
ossible-0.009
Token taking
Token ID2263
Feature activation+1.88e+00

Pos logit contributions
 tunes+0.002
 films+0.002
 melodies+0.001
 tune+0.001
ening+0.001
Neg logit contributions
 -0.001
-0.001
,-0.001
ossible-0.001
m-0.001
Token,
Token ID11
Feature activation-2.25e-02

Pos logit contributions
 tunes+0.010
 films+0.010
ening+0.009
rily+0.009
phony+0.009
Neg logit contributions
 forgive-0.002
m-0.002
my-0.002
 are-0.002
 -0.002
Token Ben
Token ID3932
Feature activation+1.02e+00

Pos logit contributions
m+0.001
ossible+0.001
+0.001
 are+0.001
 +0.001
Neg logit contributions
 tunes-0.001
phony-0.001
 films-0.001
 tune-0.001
rily-0.001
Token and
Token ID290
Feature activation+1.52e-01

Pos logit contributions
 tunes+0.060
 films+0.058
rated+0.057
en+0.056
ening+0.056
Neg logit contributions
 -0.025
-0.022
m-0.022
,-0.022
 she-0.021
Token his
Token ID465
Feature activation+8.37e-01

Pos logit contributions
 tunes+0.009
 films+0.009
rated+0.009
 melodies+0.009
 tune+0.009
Neg logit contributions
-0.004
,-0.004
 -0.004
m-0.003
 are-0.003
Token friends
Token ID2460
Feature activation+8.21e-03

Pos logit contributions
 tunes+0.048
 films+0.045
 tune+0.044
 melodies+0.043
 songs+0.041
Neg logit contributions
 -0.024
-0.024
,-0.022
 she-0.021
 are-0.021
Token went
Token ID1816
Feature activation+8.33e-01

Pos logit contributions
en+0.000
 tunes+0.000
 films+0.000
rated+0.000
phony+0.000
Neg logit contributions
 -0.000
 she-0.000
m-0.000
-0.000
ossible-0.000
Token to
Token ID284
Feature activation+4.81e-01

Pos logit contributions
 films+0.045
 tunes+0.043
phony+0.040
en+0.039
 tune+0.039
Neg logit contributions
-0.025
 -0.024
m-0.023
 are-0.023
 forgive-0.021
Token the
Token ID262
Feature activation-2.23e-01

Pos logit contributions
 tunes+0.031
 films+0.030
phony+0.028
 melodies+0.028
 tune+0.028
Neg logit contributions
-0.011
,-0.010
 -0.010
 she-0.009
m-0.009
Token park
Token ID3952
Feature activation-8.70e-01

Pos logit contributions
 +0.007
,+0.007
+0.007
 she+0.006
 are+0.006
Neg logit contributions
 tunes-0.012
 films-0.012
phony-0.011
 melodies-0.011
 tune-0.011
Token.
Token ID13
Feature activation-1.47e-01

Pos logit contributions
+0.023
 are+0.020
 +0.019
m+0.018
ossible+0.017
Neg logit contributions
 tunes-0.050
 films-0.047
 tune-0.046
phony-0.046
en-0.044
Token Ben
Token ID3932
Feature activation+1.38e+00

Pos logit contributions
 +0.004
,+0.004
+0.003
+0.003
 are+0.003
Neg logit contributions
 tunes-0.011
izing-0.010
 melodies-0.010
 tune-0.010
rated-0.010
Token But
Token ID887
Feature activation-1.27e+00

Pos logit contributions
m+0.015
,+0.015
 are+0.014
+0.013
ossible+0.013
Neg logit contributions
 tunes-0.043
 films-0.043
rily-0.041
phony-0.040
rated-0.040
Token he
Token ID339
Feature activation-1.81e-01

Pos logit contributions
ossible+0.037
 +0.037
+0.037
m+0.037
 are+0.036
Neg logit contributions
 tunes-0.067
 films-0.063
rily-0.062
ening-0.062
rated-0.062
Token knew
Token ID2993
Feature activation-5.70e-01

Pos logit contributions
 +0.005
,+0.004
ossible+0.004
 are+0.004
+0.004
Neg logit contributions
 tunes-0.010
rated-0.010
 tune-0.009
en-0.009
 melodies-0.009
Token there
Token ID612
Feature activation-4.14e-01

Pos logit contributions
 +0.012
ossible+0.012
m+0.011
+0.011
 are+0.011
Neg logit contributions
 tunes-0.034
 films-0.033
rily-0.032
lights-0.032
ening-0.031
Token would
Token ID561
Feature activation+1.57e+00

Pos logit contributions
ossible+0.010
 +0.009
m+0.008
+0.008
,+0.008
Neg logit contributions
 tunes-0.025
 films-0.023
rated-0.022
 melodies-0.022
ening-0.022
Token be
Token ID307
Feature activation+1.14e+00

Pos logit contributions
 tunes+0.091
 films+0.083
 tune+0.083
 melodies+0.082
en+0.080
Neg logit contributions
 -0.042
ossible-0.039
m-0.038
,-0.037
-0.037
Token other
Token ID584
Feature activation-1.14e+00

Pos logit contributions
 tunes+0.065
 films+0.062
 melodies+0.059
rated+0.059
phony+0.058
Neg logit contributions
 -0.034
-0.033
,-0.032
 she-0.029
m-0.029
Token days
Token ID1528
Feature activation+3.94e-01

Pos logit contributions
 +0.045
+0.043
,+0.043
 she+0.040
m+0.039
Neg logit contributions
 tunes-0.055
 films-0.051
 melodies-0.048
rated-0.048
 tune-0.048
Token full
Token ID1336
Feature activation+2.36e-01

Pos logit contributions
 tunes+0.023
 films+0.021
ening+0.021
phony+0.021
 tune+0.021
Neg logit contributions
 -0.011
-0.010
,-0.010
ossible-0.009
 are-0.009
Token of
Token ID286
Feature activation-5.56e-01

Pos logit contributions
 tunes+0.014
 films+0.013
 tune+0.013
ening+0.013
 melodies+0.013
Neg logit contributions
 -0.006
-0.005
ossible-0.005
m-0.005
 are-0.005
Token fun
Token ID1257
Feature activation-3.68e-01

Pos logit contributions
 +0.023
+0.023
,+0.023
 she+0.021
m+0.021
Neg logit contributions
 tunes-0.025
 films-0.024
 melodies-0.022
ening-0.022
phony-0.022
Token,
Token ID11
Feature activation+4.14e-01

Pos logit contributions
 tunes+0.025
 films+0.024
rily+0.023
 songs+0.023
lights+0.023
Neg logit contributions
-0.008
my-0.006
m-0.006
ossible-0.006
 -0.006
Token "
Token ID366
Feature activation+1.89e+00

Pos logit contributions
 tunes+0.021
 films+0.020
 tune+0.020
rily+0.019
phony+0.019
Neg logit contributions
-0.013
ossible-0.012
 are-0.012
m-0.011
,-0.011
TokenOK
Token ID11380
Feature activation+7.81e-03

Pos logit contributions
 tunes+0.086
 films+0.077
 tune+0.076
 melodies+0.075
rated+0.075
Neg logit contributions
,-0.079
 -0.078
-0.077
 are-0.072
 she-0.072
Token,
Token ID11
Feature activation+9.16e-01

Pos logit contributions
 tunes+0.001
 films+0.001
rated+0.001
rily+0.001
phony+0.001
Neg logit contributions
-0.000
 -0.000
m-0.000
ossible-0.000
 she-0.000
Token Mom
Token ID11254
Feature activation-1.41e-01

Pos logit contributions
 tunes+0.048
 films+0.046
 tune+0.044
 melodies+0.043
rily+0.043
Neg logit contributions
 -0.025
-0.024
m-0.023
ossible-0.023
,-0.023
Token.
Token ID13
Feature activation+1.05e+00

Pos logit contributions
+0.003
 +0.003
ossible+0.003
 she+0.003
 are+0.003
Neg logit contributions
 tunes-0.008
en-0.008
rated-0.008
ening-0.007
 films-0.007
Token I
Token ID314
Feature activation-8.26e-01

Pos logit contributions
 tunes+0.042
 films+0.039
phony+0.037
rated+0.037
ksh+0.037
Neg logit contributions
m-0.042
,-0.041
 -0.040
-0.040
 sc-0.039
Token will
Token ID481
Feature activation+2.35e-01

Pos logit contributions
 she+0.024
 +0.022
on+0.022
 he+0.019
,+0.019
Neg logit contributions
rated-0.043
 tunes-0.043
 tune-0.040
phony-0.038
 melodies-0.038
Token do
Token ID466
Feature activation+3.64e-01

Pos logit contributions
 tunes+0.013
 tune+0.012
 films+0.012
 melodies+0.012
phony+0.011
Neg logit contributions
m-0.007
 -0.007
-0.007
ossible-0.006
 she-0.006
Token it
Token ID340
Feature activation+1.24e+00

Pos logit contributions
 tunes+0.021
 films+0.020
rated+0.020
phony+0.020
rily+0.019
Neg logit contributions
,-0.010
 -0.010
-0.010
 she-0.009
 are-0.009
Token.
Token ID13
Feature activation+5.36e-01

Pos logit contributions
 tunes+0.074
 films+0.070
rated+0.068
 melodies+0.068
rily+0.068
Neg logit contributions
 -0.032
-0.032
,-0.029
ossible-0.027
 she-0.027
Token myself
Token ID3589
Feature activation-1.06e+00

Pos logit contributions
 +0.057
+0.057
,+0.056
 she+0.053
m+0.052
Neg logit contributions
 tunes-0.065
 films-0.061
phony-0.058
 melodies-0.057
ening-0.057
Token then
Token ID788
Feature activation+1.13e+00

Pos logit contributions
 +0.025
+0.022
,+0.022
 she+0.020
m+0.020
Neg logit contributions
 tunes-0.067
 films-0.064
rated-0.062
rily-0.062
phony-0.061
Token!
Token ID0
Feature activation+8.89e-03

Pos logit contributions
 tunes+0.068
 films+0.063
rated+0.062
 melodies+0.061
 tune+0.061
Neg logit contributions
 -0.029
-0.026
,-0.026
m-0.026
ossible-0.025
Tokenâ
Token ID22940
Feature activation+1.97e-01

Pos logit contributions
 tunes+0.000
 films+0.000
phony+0.000
rily+0.000
 tune+0.000
Neg logit contributions
,-0.000
 are-0.000
m-0.000
 she-0.000
 pr-0.000
Token
Token ID26391
Feature activation+2.19e+00

Pos logit contributions
 tunes+0.007
rated+0.006
phony+0.006
 melodies+0.006
ening+0.006
Neg logit contributions
,-0.010
 -0.010
 she-0.010
-0.010
 are-0.009
Token
Token ID220
Feature activation-3.31e-02

Pos logit contributions
phony+0.106
 tunes+0.104
 films+0.097
uders+0.094
rated+0.092
Neg logit contributions
m-0.067
 forgive-0.065
,-0.063
 she-0.062
 help-0.062
Token
Token ID198
Feature activation-2.05e-01

Pos logit contributions
,+0.001
 are+0.000
 she+0.000
m+0.000
 +0.000
Neg logit contributions
 tunes-0.002
 films-0.002
phony-0.002
rily-0.002
vet-0.002
Token
Token ID198
Feature activation-1.46e-02

Pos logit contributions
,+0.005
 are+0.004
 she+0.004
 pr+0.004
m+0.004
Neg logit contributions
 tunes-0.012
 films-0.011
phony-0.011
 chords-0.011
 tune-0.011
TokenThe
Token ID464
Feature activation-1.17e+00

Pos logit contributions
,+0.001
 are+0.001
 +0.001
 help+0.000
 she+0.000
Neg logit contributions
 tunes-0.001
 chords-0.001
phony-0.001
 tune-0.001
 films-0.001
Token cat
Token ID3797
Feature activation-5.54e-01

Pos logit contributions
 +0.043
,+0.040
 are+0.039
+0.039
ossible+0.039
Neg logit contributions
 tunes-0.059
 films-0.055
 tune-0.053
 melodies-0.052
rated-0.051
Token smiled
Token ID13541
Feature activation-6.90e-01

Pos logit contributions
 +0.016
ossible+0.015
m+0.015
 are+0.014
 forgive+0.013
Neg logit contributions
 tunes-0.030
en-0.027
rated-0.027
 films-0.027
 tune-0.027
Token to
Token ID284
Feature activation+7.82e-01

Pos logit contributions
 tunes+0.034
 films+0.032
 tune+0.031
en+0.031
rily+0.031
Neg logit contributions
 -0.024
m-0.022
-0.022
 are-0.021
ossible-0.021
Token the
Token ID262
Feature activation-4.33e-01

Pos logit contributions
 tunes+0.054
 films+0.051
rated+0.051
phony+0.050
ening+0.050
Neg logit contributions
,-0.015
 -0.015
-0.014
 she-0.013
 are-0.011
Token lake
Token ID13546
Feature activation-1.51e-02

Pos logit contributions
 +0.020
,+0.020
+0.020
 she+0.019
 are+0.018
Neg logit contributions
 tunes-0.018
 films-0.017
phony-0.016
 tune-0.015
 melodies-0.015
Token and
Token ID290
Feature activation+7.64e-01

Pos logit contributions
+0.000
 +0.000
m+0.000
ossible+0.000
 are+0.000
Neg logit contributions
 tunes-0.001
rily-0.001
 films-0.001
 tune-0.001
en-0.001
Token put
Token ID1234
Feature activation+3.68e-01

Pos logit contributions
 tunes+0.039
 films+0.036
 tune+0.035
phony+0.035
rated+0.035
Neg logit contributions
-0.026
 -0.025
m-0.025
,-0.025
ossible-0.024
Token it
Token ID340
Feature activation+5.93e-01

Pos logit contributions
 tunes+0.018
rily+0.017
 films+0.017
ening+0.017
en+0.016
Neg logit contributions
-0.013
 -0.012
,-0.012
m-0.012
ossible-0.012
Token in
Token ID287
Feature activation+6.87e-01

Pos logit contributions
 tunes+0.030
rily+0.029
rated+0.028
lights+0.028
 films+0.028
Neg logit contributions
-0.020
 -0.020
 are-0.017
ossible-0.017
 she-0.017
Token the
Token ID262
Feature activation-2.25e-01

Pos logit contributions
 tunes+0.038
 films+0.036
rily+0.035
 tune+0.035
en+0.034
Neg logit contributions
-0.019
m-0.019
 are-0.018
 -0.017
 she-0.017
Token water
Token ID1660
Feature activation-5.32e-01

Pos logit contributions
,+0.009
+0.009
 +0.009
 she+0.009
m+0.009
Neg logit contributions
 tunes-0.010
 films-0.009
 tune-0.009
rated-0.009
phony-0.009
Token.
Token ID13
Feature activation-1.80e-02

Pos logit contributions
+0.012
 +0.011
 she+0.010
ossible+0.010
 are+0.010
Neg logit contributions
 tunes-0.033
rily-0.032
 films-0.031
ening-0.031
en-0.030
Token He
Token ID679
Feature activation+4.39e-01

Pos logit contributions
,+0.001
+0.001
m+0.001
 are+0.001
 +0.001
Neg logit contributions
 tunes-0.001
 films-0.001
phony-0.001
rily-0.001
 tune-0.001
Token,
Token ID11
Feature activation+7.27e-01

Pos logit contributions
 films+0.086
 tunes+0.086
phony+0.083
ening+0.082
rated+0.081
Neg logit contributions
-0.013
 -0.010
 forgive-0.006
m-0.006
 she-0.005
Token they
Token ID484
Feature activation+3.68e-01

Pos logit contributions
 tunes+0.045
 films+0.044
phony+0.041
 melodies+0.041
lights+0.041
Neg logit contributions
-0.018
 -0.016
m-0.015
,-0.014
ossible-0.014
Token wanted
Token ID2227
Feature activation+3.19e-01

Pos logit contributions
 tunes+0.023
 films+0.022
rated+0.021
phony+0.021
 melodies+0.021
Neg logit contributions
 -0.009
-0.009
,-0.008
 she-0.008
m-0.007
Token to
Token ID284
Feature activation+2.63e-01

Pos logit contributions
 tunes+0.022
 films+0.021
rated+0.021
phony+0.021
rily+0.021
Neg logit contributions
,-0.006
 -0.006
-0.006
 she-0.005
 are-0.004
Token go
Token ID467
Feature activation+4.46e-02

Pos logit contributions
 tunes+0.015
 films+0.014
phony+0.013
 melodies+0.013
 tune+0.013
Neg logit contributions
 -0.007
m-0.007
,-0.007
-0.007
 she-0.007
Token to
Token ID284
Feature activation-1.12e-01

Pos logit contributions
 tunes+0.003
 films+0.003
phony+0.003
rated+0.003
 melodies+0.003
Neg logit contributions
 -0.001
-0.001
,-0.001
 she-0.001
 are-0.001
Token the
Token ID262
Feature activation-4.36e-01

Pos logit contributions
,+0.002
 +0.002
+0.002
 she+0.002
 are+0.002
Neg logit contributions
 tunes-0.008
rated-0.008
rily-0.008
ening-0.007
lights-0.007
Token park
Token ID3952
Feature activation-1.09e+00

Pos logit contributions
 +0.015
,+0.015
+0.015
 she+0.014
 are+0.013
Neg logit contributions
 tunes-0.023
 films-0.022
phony-0.021
 melodies-0.020
 tune-0.020
Token.
Token ID13
Feature activation+5.32e-01

Pos logit contributions
+0.023
 +0.023
,+0.019
 are+0.019
 she+0.018
Neg logit contributions
 tunes-0.070
 films-0.067
phony-0.064
 tune-0.064
rated-0.063
Token Tim
Token ID5045
Feature activation-5.57e-01

Pos logit contributions
 tunes+0.030
 films+0.028
phony+0.027
rated+0.027
 melodies+0.027
Neg logit contributions
-0.016
,-0.016
 -0.016
 she-0.015
 are-0.014
Token was
Token ID373
Feature activation-9.48e-02

Pos logit contributions
 +0.012
+0.012
,+0.011
 she+0.011
 are+0.011
Neg logit contributions
 tunes-0.034
 films-0.032
rated-0.032
ening-0.031
en-0.031
Token said
Token ID531
Feature activation-4.72e-01

Pos logit contributions
,+0.015
 +0.015
+0.015
 she+0.013
 are+0.012
Neg logit contributions
 tunes-0.064
 films-0.062
phony-0.060
rated-0.060
 melodies-0.060
Token proudly
Token ID23017
Feature activation+1.42e+00

Pos logit contributions
ossible+0.012
+0.011
 are+0.011
 +0.011
m+0.010
Neg logit contributions
rily-0.027
 tunes-0.027
 films-0.025
 tune-0.025
ening-0.025
Token.
Token ID13
Feature activation+1.87e-01

Pos logit contributions
 tunes+0.082
rily+0.079
ening+0.079
en+0.077
 films+0.077
Neg logit contributions
 are-0.035
 -0.035
-0.033
ossible-0.032
'm-0.031
Token
Token ID198
Feature activation+4.80e-01

Pos logit contributions
 films+0.011
 tunes+0.011
rily+0.011
phony+0.010
 tune+0.010
Neg logit contributions
-0.004
m-0.004
 are-0.004
,-0.004
 -0.003
Token
Token ID198
Feature activation+7.10e-01

Pos logit contributions
 tunes+0.031
 films+0.031
phony+0.029
rily+0.029
 tune+0.028
Neg logit contributions
-0.007
,-0.007
m-0.007
 are-0.006
 -0.006
TokenHer
Token ID9360
Feature activation-7.89e-01

Pos logit contributions
 tunes+0.033
 films+0.031
 tune+0.030
phony+0.029
 chords+0.029
Neg logit contributions
 -0.025
,-0.025
m-0.025
 are-0.024
my-0.022
Token mom
Token ID1995
Feature activation-1.45e+00

Pos logit contributions
+0.019
 +0.019
,+0.016
 are+0.016
m+0.016
Neg logit contributions
 tunes-0.049
 films-0.049
 melodies-0.045
lights-0.045
 tune-0.045
Token and
Token ID290
Feature activation+1.89e-01

Pos logit contributions
 +0.042
 are+0.041
ossible+0.041
+0.040
 am+0.039
Neg logit contributions
 tunes-0.076
en-0.071
 tune-0.070
 films-0.069
phony-0.069
Token dad
Token ID9955
Feature activation+8.90e-01

Pos logit contributions
 tunes+0.011
 films+0.010
 tune+0.010
 melodies+0.010
phony+0.010
Neg logit contributions
-0.006
 -0.006
,-0.006
m-0.005
 are-0.005
Token came
Token ID1625
Feature activation+3.41e-01

Pos logit contributions
 tunes+0.047
en+0.044
 films+0.043
 tune+0.043
phony+0.043
Neg logit contributions
-0.025
 -0.025
ossible-0.023
 she-0.023
 am-0.022
Token and
Token ID290
Feature activation-1.04e+00

Pos logit contributions
 tunes+0.019
 films+0.019
phony+0.018
rily+0.018
 tune+0.018
Neg logit contributions
 -0.010
-0.010
,-0.009
 she-0.009
 are-0.009
Token
Token ID198
Feature activation+4.33e-01

Pos logit contributions
 tunes+0.013
 films+0.012
phony+0.012
rated+0.012
 melodies+0.012
Neg logit contributions
-0.002
,-0.002
 -0.002
 she-0.001
 are-0.001
Token
Token ID198
Feature activation+7.41e-01

Pos logit contributions
 tunes+0.032
 films+0.031
phony+0.030
rated+0.030
 melodies+0.030
Neg logit contributions
-0.004
,-0.004
 -0.004
 she-0.003
 are-0.003
TokenMom
Token ID29252
Feature activation-2.36e+00

Pos logit contributions
 tunes+0.039
 films+0.037
phony+0.036
 tune+0.035
 melodies+0.035
Neg logit contributions
 -0.023
,-0.022
-0.021
 are-0.021
 she-0.021
Token hears
Token ID23180
Feature activation+5.54e-02

Pos logit contributions
 +0.098
+0.098
,+0.095
 she+0.092
 are+0.087
Neg logit contributions
 tunes-0.101
 films-0.098
ening-0.091
phony-0.090
rated-0.089
Token the
Token ID262
Feature activation-9.59e-01

Pos logit contributions
 tunes+0.003
 films+0.003
phony+0.003
rated+0.003
 melodies+0.003
Neg logit contributions
,-0.002
 -0.002
-0.002
 she-0.001
 are-0.001
Token noise
Token ID7838
Feature activation+5.67e-01

Pos logit contributions
+0.055
 +0.053
,+0.053
 she+0.050
ossible+0.049
Neg logit contributions
 tunes-0.029
 films-0.028
 tune-0.024
 melodies-0.024
rated-0.023
Token and
Token ID290
Feature activation-2.94e-01

Pos logit contributions
 tunes+0.035
 films+0.034
ening+0.032
 tune+0.032
 melodies+0.032
Neg logit contributions
 -0.014
-0.013
,-0.012
ossible-0.012
 she-0.011
Token comes
Token ID2058
Feature activation+2.24e-01

Pos logit contributions
+0.011
 +0.011
,+0.011
 she+0.010
m+0.010
Neg logit contributions
 tunes-0.014
 films-0.014
phony-0.013
rated-0.013
ening-0.013
Token to
Token ID284
Feature activation-1.12e+00

Pos logit contributions
 tunes+0.015
 films+0.014
phony+0.014
rated+0.014
 melodies+0.014
Neg logit contributions
 -0.005
-0.005
,-0.005
 she-0.004
 are-0.004
Token the
Token ID262
Feature activation-1.14e+00

Pos logit contributions
,+0.024
 +0.023
+0.022
 she+0.020
 are+0.019
Neg logit contributions
 tunes-0.076
 films-0.072
rated-0.071
rily-0.071
phony-0.070
Token room
Token ID2119
Feature activation-1.01e+00

Pos logit contributions
+0.036
 +0.036
,+0.035
 she+0.032
 are+0.031
Neg logit contributions
 tunes-0.063
 films-0.061
phony-0.057
 tune-0.057
 melodies-0.056
Token went
Token ID1816
Feature activation+5.69e-01

Pos logit contributions
 tunes+0.085
 films+0.082
rated+0.080
phony+0.079
 melodies+0.078
Neg logit contributions
 -0.030
-0.029
,-0.028
 she-0.026
m-0.024
Token to
Token ID284
Feature activation+1.93e-01

Pos logit contributions
 films+0.033
 tunes+0.032
phony+0.030
 melodies+0.029
 tune+0.029
Neg logit contributions
-0.016
 -0.015
,-0.014
 are-0.013
m-0.013
Token the
Token ID262
Feature activation-3.82e-01

Pos logit contributions
 tunes+0.013
rated+0.013
 films+0.013
rily+0.013
ening+0.012
Neg logit contributions
 -0.004
,-0.004
-0.004
 she-0.003
 are-0.003
Token park
Token ID3952
Feature activation-1.22e+00

Pos logit contributions
,+0.010
 +0.010
+0.010
 she+0.009
 are+0.008
Neg logit contributions
 tunes-0.023
 films-0.022
rated-0.022
phony-0.022
rily-0.021
Token to
Token ID284
Feature activation+5.79e-01

Pos logit contributions
+0.037
 +0.032
 are+0.030
m+0.028
,+0.027
Neg logit contributions
 tunes-0.067
 films-0.063
phony-0.062
 tune-0.062
rated-0.059
Token play
Token ID711
Feature activation-1.98e+00

Pos logit contributions
 tunes+0.037
 films+0.036
phony+0.035
 melodies+0.034
rated+0.034
Neg logit contributions
-0.012
,-0.012
 -0.012
 she-0.011
m-0.010
Token.
Token ID13
Feature activation-4.83e-01

Pos logit contributions
+0.039
 +0.037
,+0.035
 she+0.031
 are+0.030
Neg logit contributions
 tunes-0.133
 films-0.128
phony-0.123
 tune-0.122
 melodies-0.122
Token
Token ID220
Feature activation+1.59e-02

Pos logit contributions
 +0.012
,+0.011
+0.011
 she+0.010
 are+0.009
Neg logit contributions
 tunes-0.032
 films-0.030
rated-0.029
 melodies-0.029
ening-0.029
Token
Token ID198
Feature activation-1.69e-01

Pos logit contributions
 tunes+0.001
 films+0.001
rily+0.001
phony+0.001
rated+0.001
Neg logit contributions
-0.000
,-0.000
 -0.000
 she-0.000
 are-0.000
Token
Token ID198
Feature activation+1.52e-01

Pos logit contributions
 +0.001
,+0.001
+0.001
 she+0.001
 are+0.000
Neg logit contributions
 tunes-0.014
 films-0.013
rated-0.013
phony-0.013
 melodies-0.013
TokenWhile
Token ID3633
Feature activation+8.96e-01

Pos logit contributions
 tunes+0.008
 films+0.007
rily+0.007
rated+0.007
phony+0.007
Neg logit contributions
-0.006
 -0.006
,-0.006
 she-0.005
 are-0.005
Token asked
Token ID1965
Feature activation+5.69e-01

Pos logit contributions
,+0.022
 +0.021
+0.021
 she+0.019
 are+0.017
Neg logit contributions
 tunes-0.047
 films-0.046
rated-0.044
phony-0.044
 melodies-0.043
Token.
Token ID13
Feature activation-7.99e-01

Pos logit contributions
 tunes+0.032
rily+0.031
en+0.030
 tune+0.030
 films+0.029
Neg logit contributions
ossible-0.014
 are-0.014
-0.013
m-0.013
ll-0.013
Token
Token ID220
Feature activation-3.36e-01

Pos logit contributions
 are+0.013
+0.012
m+0.011
,+0.010
 forgive+0.009
Neg logit contributions
 tunes-0.050
rily-0.049
 films-0.048
phony-0.047
 tune-0.047
Token
Token ID198
Feature activation-5.58e-01

Pos logit contributions
 are+0.006
m+0.005
,+0.005
ossible+0.005
+0.005
Neg logit contributions
rily-0.020
 films-0.019
 tunes-0.019
lights-0.019
rated-0.018
Token
Token ID198
Feature activation-3.26e-01

Pos logit contributions
 are+0.012
,+0.010
 she+0.010
m+0.009
+0.009
Neg logit contributions
 tunes-0.033
 films-0.031
 tune-0.031
phony-0.031
rily-0.030
TokenMom
Token ID29252
Feature activation-2.44e+00

Pos logit contributions
 are+0.011
,+0.011
 she+0.010
m+0.010
 +0.010
Neg logit contributions
 tunes-0.016
 tune-0.015
phony-0.014
 films-0.014
 chords-0.014
Token smiled
Token ID13541
Feature activation-1.04e+00

Pos logit contributions
 am+0.080
 are+0.078
ossible+0.073
'm+0.073
 she+0.070
Neg logit contributions
en-0.116
 tunes-0.111
ening-0.106
ples-0.102
le-0.102
Token.
Token ID13
Feature activation-2.88e-01

Pos logit contributions
 are+0.026
 +0.025
ossible+0.024
+0.024
 am+0.022
Neg logit contributions
rily-0.061
 tunes-0.061
ening-0.058
en-0.057
 tune-0.057
Token "
Token ID366
Feature activation+1.91e-01

Pos logit contributions
 are+0.010
ossible+0.009
+0.009
 forgive+0.009
m+0.009
Neg logit contributions
 tunes-0.013
rily-0.012
 films-0.012
phony-0.012
ksh-0.012
TokenNo
Token ID2949
Feature activation-6.34e-02

Pos logit contributions
 tunes+0.010
 films+0.009
 melodies+0.009
 tune+0.009
phony+0.009
Neg logit contributions
,-0.007
 -0.006
-0.006
 are-0.006
 she-0.006
Token,
Token ID11
Feature activation-1.18e-01

Pos logit contributions
 +0.001
+0.001
ossible+0.001
m+0.001
 she+0.001
Neg logit contributions
 tunes-0.005
 films-0.004
ening-0.004
 melodies-0.004
en-0.004
Token Every
Token ID3887
Feature activation-1.23e+00

Pos logit contributions
,+0.014
+0.014
m+0.014
 are+0.013
 forgive+0.012
Neg logit contributions
 tunes-0.026
 films-0.025
phony-0.025
vet-0.024
rily-0.024
Token day
Token ID1110
Feature activation+8.82e-02

Pos logit contributions
 +0.044
,+0.044
+0.044
 she+0.040
 are+0.039
Neg logit contributions
 tunes-0.063
 films-0.060
phony-0.057
rated-0.057
 melodies-0.056
Token Mia
Token ID32189
Feature activation-4.97e-01

Pos logit contributions
 tunes+0.006
 films+0.006
en+0.006
phony+0.006
 melodies+0.006
Neg logit contributions
ossible-0.001
 -0.001
-0.001
 forgive-0.001
m-0.001
Token would
Token ID561
Feature activation+7.44e-01

Pos logit contributions
 +0.015
,+0.014
+0.014
 she+0.013
m+0.013
Neg logit contributions
 tunes-0.028
 films-0.027
 melodies-0.025
rated-0.025
en-0.025
Token blink
Token ID21019
Feature activation+6.26e-01

Pos logit contributions
 tunes+0.039
 tune+0.036
en+0.035
 melodies+0.035
 films+0.034
Neg logit contributions
m-0.024
 -0.024
-0.022
ossible-0.022
,-0.021
Token at
Token ID379
Feature activation-2.32e+00

Pos logit contributions
 tunes+0.040
 films+0.038
rily+0.037
ening+0.037
 melodies+0.037
Neg logit contributions
-0.014
 -0.014
,-0.012
m-0.012
ossible-0.012
Token the
Token ID262
Feature activation-2.84e-01

Pos logit contributions
,+0.047
 +0.043
 she+0.039
+0.037
 are+0.034
Neg logit contributions
 tunes-0.159
phony-0.158
rated-0.157
 films-0.153
rily-0.151
Token radio
Token ID5243
Feature activation-5.45e-01

Pos logit contributions
,+0.014
+0.014
 +0.014
 are+0.013
 she+0.013
Neg logit contributions
rily-0.010
 tunes-0.010
phony-0.010
vet-0.010
rated-0.010
Token and
Token ID290
Feature activation+2.29e-01

Pos logit contributions
 +0.011
+0.011
ossible+0.010
m+0.010
 are+0.009
Neg logit contributions
 tunes-0.036
 films-0.034
 tune-0.033
 melodies-0.033
ening-0.032
Token smile
Token ID8212
Feature activation+1.20e+00

Pos logit contributions
 tunes+0.013
 films+0.012
 melodies+0.012
 tune+0.011
ening+0.011
Neg logit contributions
 -0.007
-0.007
m-0.007
,-0.007
ossible-0.007
Token.
Token ID13
Feature activation-6.16e-01

Pos logit contributions
 tunes+0.078
 tune+0.072
rily+0.072
 melodies+0.071
 films+0.071
Neg logit contributions
 -0.025
-0.025
ossible-0.023
 are-0.021
m-0.020
Token wants
Token ID3382
Feature activation-3.62e-01

Pos logit contributions
 films+0.013
 tunes+0.013
ening+0.012
en+0.012
rated+0.012
Neg logit contributions
 -0.008
-0.007
 she-0.007
 forgive-0.007
m-0.007
Token to
Token ID284
Feature activation-3.34e-01

Pos logit contributions
+0.007
 +0.007
,+0.007
 she+0.006
 are+0.006
Neg logit contributions
 tunes-0.024
 films-0.023
phony-0.022
 melodies-0.022
rated-0.022
Token build
Token ID1382
Feature activation+7.19e-02

Pos logit contributions
+0.010
 +0.010
,+0.010
 she+0.009
m+0.009
Neg logit contributions
 tunes-0.019
 films-0.018
phony-0.017
 melodies-0.017
 tune-0.017
Token a
Token ID257
Feature activation-8.58e-01

Pos logit contributions
 tunes+0.005
 films+0.004
rated+0.004
phony+0.004
rily+0.004
Neg logit contributions
,-0.002
 -0.002
-0.002
 she-0.001
 are-0.001
Token big
Token ID1263
Feature activation-8.45e-01

Pos logit contributions
+0.028
 +0.027
,+0.026
 she+0.024
m+0.024
Neg logit contributions
 tunes-0.046
 films-0.045
phony-0.043
 tune-0.042
rily-0.042
Token castle
Token ID16669
Feature activation-2.09e+00

Pos logit contributions
 +0.027
+0.027
,+0.027
 she+0.025
 are+0.024
Neg logit contributions
 tunes-0.046
 films-0.044
phony-0.042
rated-0.041
 melodies-0.041
Token and
Token ID290
Feature activation+6.74e-01

Pos logit contributions
 +0.047
+0.045
,+0.041
 she+0.038
m+0.038
Neg logit contributions
 tunes-0.134
 films-0.129
phony-0.123
rated-0.122
rily-0.122
Token Mia
Token ID32189
Feature activation-2.71e-01

Pos logit contributions
 tunes+0.040
 films+0.039
rated+0.037
phony+0.037
 melodies+0.036
Neg logit contributions
m-0.016
-0.016
,-0.016
 -0.016
ossible-0.014
Token wants
Token ID3382
Feature activation+3.02e-01

Pos logit contributions
 +0.008
 forgive+0.008
m+0.007
ossible+0.007
 she+0.007
Neg logit contributions
 tunes-0.013
rated-0.012
ening-0.012
en-0.012
 films-0.012
Token to
Token ID284
Feature activation-2.19e-02

Pos logit contributions
 tunes+0.019
 films+0.018
 melodies+0.017
 tune+0.017
phony+0.017
Neg logit contributions
-0.007
 -0.007
,-0.007
m-0.006
 are-0.006
Token build
Token ID1382
Feature activation+3.45e-01

Pos logit contributions
m+0.001
+0.001
 +0.001
,+0.001
 she+0.001
Neg logit contributions
 tunes-0.001
 films-0.001
phony-0.001
en-0.001
 melodies-0.001
Token
Token ID198
Feature activation-2.95e-01

Pos logit contributions
+0.006
m+0.005
 are+0.005
,+0.005
 pr+0.004
Neg logit contributions
 tunes-0.022
 films-0.022
phony-0.020
rily-0.020
 tune-0.020
Token
Token ID198
Feature activation+9.18e-02

Pos logit contributions
+0.004
,+0.003
 +0.003
 are+0.003
m+0.003
Neg logit contributions
 tunes-0.021
 films-0.020
phony-0.020
rily-0.019
 melodies-0.019
TokenMom
Token ID29252
Feature activation-3.07e+00

Pos logit contributions
 tunes+0.005
 films+0.005
phony+0.004
 tune+0.004
 melodies+0.004
Neg logit contributions
 -0.003
,-0.003
 are-0.003
m-0.003
-0.003
Tokenmy
Token ID1820
Feature activation-1.03e-01

Pos logit contributions
+0.061
 +0.054
 she+0.054
,+0.050
ossible+0.048
Neg logit contributions
 tunes-0.185
en-0.181
rated-0.178
ening-0.177
lights-0.174
Token was
Token ID373
Feature activation-5.88e-01

Pos logit contributions
+0.003
 +0.003
ossible+0.003
,+0.003
 she+0.003
Neg logit contributions
 tunes-0.005
 films-0.005
ening-0.005
 tune-0.005
en-0.005
Token angry
Token ID7954
Feature activation-2.85e+00

Pos logit contributions
+0.014
 +0.014
,+0.014
 are+0.012
 she+0.012
Neg logit contributions
 tunes-0.036
 films-0.035
phony-0.033
rily-0.033
 tune-0.033
Token.
Token ID13
Feature activation+2.03e-01

Pos logit contributions
 +0.061
+0.059
ossible+0.053
 are+0.048
,+0.044
Neg logit contributions
 tunes-0.182
 films-0.174
ening-0.170
en-0.166
 tune-0.165
Token She
Token ID1375
Feature activation-5.20e-01

Pos logit contributions
 tunes+0.015
 films+0.014
phony+0.014
rily+0.014
rated+0.014
Neg logit contributions
-0.003
,-0.002
 -0.002
 are-0.002
m-0.002
Token grabbed
Token ID13176
Feature activation+5.31e-02

Pos logit contributions
 +0.012
,+0.012
+0.012
 she+0.010
 are+0.010
Neg logit contributions
 tunes-0.033
 films-0.032
phony-0.031
rated-0.030
 melodies-0.030
Token Lily
Token ID20037
Feature activation-1.02e+00

Pos logit contributions
 tunes+0.003
rated+0.003
phony+0.003
 films+0.003
 melodies+0.003
Neg logit contributions
,-0.002
 -0.002
-0.001
 she-0.001
 are-0.001
Token and
Token ID290
Feature activation+3.51e-01

Pos logit contributions
 +0.030
'm+0.030
 am+0.027
 forgive+0.027
ossible+0.027
Neg logit contributions
 tunes-0.050
 films-0.049
en-0.048
ening-0.048
 tune-0.047