TOP ACTIVATIONS
MAX = 3.704

will take the rh in oc eros back to the zoo
him about the frog 's habitat , diet and how it
It is the rh in oc eros !" The
Sam hugged the rh in oc eros goodbye , then ran
had discovered the rh in oc eros , but they had
me see the rh in oc eros . It is still
go home , Spot said goodbye to his new friend .
, making his rh in oc eros charge at Lily 's
goodbye to the rh in oc eros . They say they
paid him for his av oc ados . Tom
will visit the rh in oc eros at the zoo .
They put the rh in oc eros in the truck .
bag , kissed his mom goodbye and set off on his
mom my . He said goodbye and went back to j
go home . Max said goodbye and thanked her for playing
bird welcomed the rh in oc eros to her home ,
learn about the rh in oc eros . They say goodbye
go home , he said goodbye to Kitty . He promised
It was the rh in oc eros . It lifted its
back to the rh in oc eros and showed it what

MIN ACTIVATIONS
MIN = -4.114

ow ! It hurts !" L ila says , holding her
box and saw a j igsaw puzzle inside . He paused
a bright pink cover and L izzy was very excited .
to the beach ?" asked L izzy .
It was a big j igsaw puzzle and she wanted to
jumped up and l icked Jas mine 's face .
's a stream !" said Jas mine .
I have some ice ?" L ila asked .
Look , a bird !" L ila says . " I
would take out a j igsaw puzzle and try to solve
. It 's not a good place to go ."
bee is angry and strikes L ila on the hand .
Stay away , dog !" L ila says . She uses
very long and flexible !" L ila said , bending the
look at that balloon !" L ila says . " It
to Mia 's house ?" L ila asked .
trick - or - t reating anymore . She sadly put
She is so pretty !" L ola says .
the salt , Mom ?" L ila asks .
butterfly turned and looked at L ola with its big eyes

INTERVAL 1.750 - 3.704
CONTAINS 0.287%

. The end . <|endoftext|> Harry and his dad were walking
R uffy w agged his tail . Tim picked
was so excited and waved goodbye to the kind old man
other dog with his big ears and his fierce bark .
friend ." R oc co nodded and said ,

INTERVAL -0.205 - 1.750
CONTAINS 65.036%

, marvel ing at her beauty . The bear
we have some water , please ?" Lia asked . She
to playing in the park . They are not scared of
! Thank you , Sara !" They are happy and go
believed in magic . <|endoftext|> Anna and Ben are friends .

INTERVAL -2.160 - -0.205
CONTAINS 34.602%

did . In fact , this small creature did not like
his eyes and saw that they had arrived at their floor
careful with food , if not it could get stuck in
S ara and Lily felt ashamed . They knew
Everything looked so beautiful . His mom had even packed nap

INTERVAL -4.114 - -2.160
CONTAINS 0.075%

. Sarah was a really good singer and dancer too .
the fridge not to get cookies , but to talk to
was a little girl called L ola . She loved to
me . You are a good brother ." Mia says .
" Help ! Help !" L ila and Tom shout .
Token will
Token ID481
Feature activation+1.73e+00

Pos logit contributions
cut+0.013
ho+0.013
ir+0.012
itt+0.011
bed+0.010
Neg logit contributions
 Alice-0.017
 Samantha-0.017
 Debbie-0.016
 Lib-0.016
 Kara-0.016
Token take
Token ID1011
Feature activation+5.75e-01

Pos logit contributions
itt+0.047
ho+0.047
oken+0.046
cut+0.045
ir+0.045
Neg logit contributions
 Samantha-0.045
 Alice-0.045
 avail-0.043
 Debbie-0.043
 Lib-0.043
Token the
Token ID262
Feature activation+7.91e-01

Pos logit contributions
 can+0.012
ho+0.012
ir+0.012
itt+0.012
oken+0.012
Neg logit contributions
oph-0.018
ibi-0.018
 Debbie-0.018
 avail-0.017
 Samantha-0.017
Token rh
Token ID9529
Feature activation+1.78e+00

Pos logit contributions
ir+0.021
ho+0.021
oken+0.020
cut+0.020
itt+0.020
Neg logit contributions
 Samantha-0.020
 Debbie-0.020
 Alice-0.020
 avail-0.020
 Lib-0.020
Tokenin
Token ID259
Feature activation+6.15e-01

Pos logit contributions
oken+0.043
ho+0.039
 sincere+0.037
 can+0.037
ir+0.037
Neg logit contributions
 avail-0.048
 Debbie-0.048
 Lib-0.048
oph-0.047
 Alice-0.047
Tokenoc
Token ID420
Feature activation+3.70e+00

Pos logit contributions
oken+0.003
ho+0.003
ir+0.003
 sincere+0.002
cut+0.002
Neg logit contributions
 Samantha-0.029
 Debbie-0.029
 Alice-0.029
 Lib-0.028
 avail-0.028
Tokeneros
Token ID27498
Feature activation+9.48e-01

Pos logit contributions
oken+0.103
 can+0.098
Did+0.095
 cannot+0.092
 affect+0.092
Neg logit contributions
 avail-0.092
 Lib-0.091
 Samantha-0.088
 Debbie-0.085
 Alice-0.085
Token back
Token ID736
Feature activation+5.13e-02

Pos logit contributions
ho+0.023
ir+0.022
oken+0.022
cut+0.022
itt+0.021
Neg logit contributions
 Samantha-0.027
 Alice-0.027
 Debbie-0.027
 Lib-0.026
 avail-0.026
Token to
Token ID284
Feature activation+1.90e-01

Pos logit contributions
ho+0.001
cut+0.001
ir+0.001
oken+0.001
itt+0.001
Neg logit contributions
 Samantha-0.002
 Debbie-0.002
 Alice-0.002
 avail-0.002
oph-0.002
Token the
Token ID262
Feature activation+5.40e-01

Pos logit contributions
ho+0.005
cut+0.005
ir+0.005
oken+0.005
itt+0.005
Neg logit contributions
 Samantha-0.005
 Debbie-0.005
 Alice-0.005
 avail-0.005
 Lib-0.005
Token zoo
Token ID26626
Feature activation+5.24e-01

Pos logit contributions
ir+0.014
ho+0.014
oken+0.013
cut+0.013
itt+0.013
Neg logit contributions
 Samantha-0.014
 Alice-0.014
 Debbie-0.014
 avail-0.014
 Lib-0.014
Token him
Token ID683
Feature activation+1.02e-01

Pos logit contributions
 avail+0.032
oph+0.031
 Debbie+0.031
ibi+0.030
 Samantha+0.030
Neg logit contributions
ir-0.011
ho-0.011
oken-0.010
 can-0.010
cut-0.010
Token about
Token ID546
Feature activation-2.45e-01

Pos logit contributions
ho+0.003
cut+0.003
ir+0.003
 can+0.003
oken+0.003
Neg logit contributions
 Samantha-0.003
 Debbie-0.003
 avail-0.003
oph-0.002
 Alice-0.002
Token the
Token ID262
Feature activation+6.15e-01

Pos logit contributions
 Samantha+0.006
 Alice+0.006
 Debbie+0.005
 avail+0.005
 Lib+0.005
Neg logit contributions
ho-0.007
oken-0.007
ir-0.007
itt-0.007
cut-0.007
Token frog
Token ID21264
Feature activation-3.03e-01

Pos logit contributions
ir+0.017
itt+0.017
ho+0.017
ended+0.016
oken+0.016
Neg logit contributions
 Samantha-0.014
 avail-0.014
 Alice-0.014
 Lib-0.014
 Debbie-0.014
Token's
Token ID338
Feature activation+1.26e+00

Pos logit contributions
 Samantha+0.009
 Alice+0.009
 Debbie+0.009
 Lib+0.009
oph+0.009
Neg logit contributions
ho-0.006
ir-0.006
oken-0.006
cut-0.006
itt-0.006
Token habitat
Token ID20018
Feature activation+3.69e+00

Pos logit contributions
Did+0.037
ir+0.037
itt+0.037
can+0.036
ho+0.035
Neg logit contributions
 Samantha-0.027
 Alice-0.026
 Debbie-0.025
 avail-0.025
 Mary-0.025
Token,
Token ID11
Feature activation+7.47e-01

Pos logit contributions
cut+0.107
ho+0.105
itt+0.101
cient+0.099
 sincere+0.098
Neg logit contributions
 Samantha-0.083
oph-0.083
 Alice-0.082
 Clara-0.081
 Sophia-0.080
Token diet
Token ID5496
Feature activation+1.75e+00

Pos logit contributions
ir+0.019
oken+0.019
ho+0.019
itt+0.019
ighters+0.017
Neg logit contributions
 Samantha-0.021
 Alice-0.021
 Mary-0.020
 Debbie-0.020
 Lib-0.019
Token and
Token ID290
Feature activation-5.89e-01

Pos logit contributions
 sincere+0.037
oken+0.037
ir+0.037
ho+0.036
cut+0.035
Neg logit contributions
 Samantha-0.052
 Alice-0.051
 Debbie-0.051
oph-0.050
 Lib-0.050
Token how
Token ID703
Feature activation+1.88e-01

Pos logit contributions
 Samantha+0.016
 Alice+0.015
 Debbie+0.015
 Lib+0.015
 avail+0.015
Neg logit contributions
ir-0.016
ho-0.015
oken-0.015
itt-0.015
cut-0.014
Token it
Token ID340
Feature activation-6.14e-02

Pos logit contributions
ho+0.005
ir+0.005
cut+0.005
oken+0.005
itt+0.005
Neg logit contributions
 Samantha-0.005
 Debbie-0.005
 Alice-0.005
 avail-0.005
 Lib-0.005
TokenIt
Token ID1026
Feature activation-1.68e-01

Pos logit contributions
oken+0.037
ir+0.035
ho+0.035
cut+0.034
 sincere+0.033
Neg logit contributions
 Debbie-0.030
 Samantha-0.030
 Alice-0.029
 Lib-0.029
 Carly-0.028
Token is
Token ID318
Feature activation+1.40e-01

Pos logit contributions
 Alice+0.005
 Debbie+0.005
 Samantha+0.005
 Lib+0.005
 avail+0.005
Neg logit contributions
ir-0.003
ho-0.003
oken-0.003
itt-0.003
cut-0.003
Token the
Token ID262
Feature activation+8.76e-01

Pos logit contributions
 can+0.003
 or+0.003
cut+0.003
 sincere+0.003
ho+0.003
Neg logit contributions
oph-0.004
 Debbie-0.004
 avail-0.004
 Samantha-0.004
ibi-0.004
Token rh
Token ID9529
Feature activation+2.19e+00

Pos logit contributions
ir+0.023
ho+0.023
itt+0.022
oken+0.022
cut+0.021
Neg logit contributions
 Debbie-0.023
 avail-0.023
 Samantha-0.023
 Lib-0.022
 Alice-0.022
Tokenin
Token ID259
Feature activation+6.91e-01

Pos logit contributions
oken+0.065
ho+0.061
 Acc+0.060
 Did+0.059
 sincere+0.056
Neg logit contributions
 avail-0.043
 Debbie-0.043
 Lib-0.042
 Robbie-0.041
 possible-0.041
Tokenoc
Token ID420
Feature activation+3.68e+00

Pos logit contributions
oken+0.012
ir+0.009
 sincere+0.009
ho+0.008
ighters+0.008
Neg logit contributions
 Lib-0.026
 Debbie-0.025
 Alice-0.025
 Samantha-0.025
 Robbie-0.025
Tokeneros
Token ID27498
Feature activation+1.04e+00

Pos logit contributions
Did+0.100
oken+0.100
 can+0.098
 cannot+0.096
 forgive+0.094
Neg logit contributions
 Lib-0.090
 avail-0.087
 Samantha-0.084
 flavours-0.084
 Debbie-0.082
Token!"
Token ID2474
Feature activation+4.32e-01

Pos logit contributions
oken+0.027
ho+0.027
ir+0.027
cut+0.026
itt+0.026
Neg logit contributions
 Alice-0.027
 Debbie-0.027
 Samantha-0.027
 Lib-0.026
 avail-0.026
Token
Token ID198
Feature activation+2.02e-01

Pos logit contributions
ho+0.008
 can+0.008
ir+0.008
itt+0.008
cut+0.008
Neg logit contributions
 avail-0.014
 Samantha-0.014
oph-0.014
 Debbie-0.014
 Lib-0.014
Token
Token ID198
Feature activation+4.85e-02

Pos logit contributions
ho+0.004
ir+0.004
cut+0.004
 can+0.004
itt+0.004
Neg logit contributions
 avail-0.006
 Samantha-0.006
oph-0.006
 Debbie-0.006
 Alice-0.006
TokenThe
Token ID464
Feature activation+3.63e-01

Pos logit contributions
ho+0.001
cut+0.001
ir+0.001
oken+0.001
itt+0.001
Neg logit contributions
 Samantha-0.001
 Debbie-0.001
 avail-0.001
 Alice-0.001
oph-0.001
Token Sam
Token ID3409
Feature activation+1.02e-01

Pos logit contributions
 Debbie+0.003
oph+0.003
 Lib+0.003
 Samantha+0.003
 avail+0.003
Neg logit contributions
ho-0.003
ir-0.003
cut-0.003
 can-0.003
 sincere-0.003
Token hugged
Token ID41130
Feature activation+3.90e-01

Pos logit contributions
oken+0.002
itt+0.002
cut+0.002
ho+0.002
ir+0.002
Neg logit contributions
 Samantha-0.003
 Alice-0.003
 Debbie-0.003
oph-0.003
 Lib-0.003
Token the
Token ID262
Feature activation+9.74e-01

Pos logit contributions
ho+0.010
oken+0.010
cut+0.010
itt+0.010
ir+0.010
Neg logit contributions
 Samantha-0.011
 Alice-0.011
 Debbie-0.010
 Carly-0.010
 avail-0.010
Token rh
Token ID9529
Feature activation+1.68e+00

Pos logit contributions
oken+0.027
ir+0.027
ho+0.027
itt+0.026
ighters+0.025
Neg logit contributions
 Samantha-0.024
 Alice-0.023
 Lib-0.023
 Debbie-0.023
 avail-0.022
Tokenin
Token ID259
Feature activation+3.93e-01

Pos logit contributions
oken+0.050
 Did+0.042
ho+0.042
 Acc+0.041
 can+0.041
Neg logit contributions
 Lib-0.037
 Debbie-0.036
isi-0.036
oph-0.035
 Alice-0.035
Tokenoc
Token ID420
Feature activation+3.63e+00

Pos logit contributions
oken+0.004
ir+0.003
 sincere+0.003
ho+0.003
 affect+0.002
Neg logit contributions
 Alice-0.017
 Lib-0.017
 Samantha-0.017
 Debbie-0.017
ibi-0.016
Tokeneros
Token ID27498
Feature activation+7.58e-01

Pos logit contributions
oken+0.101
 can+0.097
Did+0.093
 cannot+0.093
 affect+0.091
Neg logit contributions
 Lib-0.093
 Samantha-0.090
 avail-0.087
 Debbie-0.086
oph-0.085
Token goodbye
Token ID24829
Feature activation+3.24e+00

Pos logit contributions
ho+0.012
cut+0.011
ir+0.011
 can+0.010
itt+0.010
Neg logit contributions
 Samantha-0.029
 avail-0.029
 Debbie-0.028
oph-0.028
 Alice-0.028
Token,
Token ID11
Feature activation+6.60e-01

Pos logit contributions
oken+0.067
ir+0.063
cut+0.062
itt+0.062
ho+0.060
Neg logit contributions
 Samantha-0.106
 Alice-0.106
 Lib-0.104
 Debbie-0.103
oph-0.100
Token then
Token ID788
Feature activation+4.76e-01

Pos logit contributions
oken+0.016
ir+0.016
itt+0.016
ho+0.015
cut+0.015
Neg logit contributions
 Samantha-0.020
 Alice-0.020
 Debbie-0.019
 Lib-0.019
 Carly-0.018
Token ran
Token ID4966
Feature activation-1.91e-01

Pos logit contributions
ho+0.012
ir+0.012
oken+0.012
cut+0.012
itt+0.011
Neg logit contributions
 Samantha-0.013
 Debbie-0.013
 Alice-0.013
 Lib-0.013
 avail-0.013
Token had
Token ID550
Feature activation+1.01e+00

Pos logit contributions
 Samantha+0.005
oph+0.005
 Alice+0.005
 Lib+0.005
 Debbie+0.005
Neg logit contributions
ir-0.003
oken-0.003
itt-0.003
cut-0.003
ho-0.003
Token discovered
Token ID5071
Feature activation-7.83e-01

Pos logit contributions
ho+0.020
cut+0.019
ir+0.019
oken+0.019
 can+0.019
Neg logit contributions
 Debbie-0.033
 Samantha-0.033
 avail-0.033
oph-0.032
 Alice-0.032
Token the
Token ID262
Feature activation+1.76e-01

Pos logit contributions
 Samantha+0.020
 Debbie+0.020
 Alice+0.020
 Lib+0.019
 avail+0.019
Neg logit contributions
ho-0.021
ir-0.021
cut-0.020
oken-0.020
itt-0.020
Token rh
Token ID9529
Feature activation+2.05e+00

Pos logit contributions
ir+0.005
itt+0.005
ho+0.005
oken+0.005
bed+0.005
Neg logit contributions
 Lib-0.004
 Samantha-0.004
 Alice-0.004
 Debbie-0.004
 avail-0.004
Tokenin
Token ID259
Feature activation+6.85e-01

Pos logit contributions
oken+0.060
 Acc+0.056
ho+0.055
 Did+0.054
 sincere+0.052
Neg logit contributions
isi-0.040
 Lib-0.040
 Debbie-0.040
 Alice-0.039
 prizes-0.039
Tokenoc
Token ID420
Feature activation+3.59e+00

Pos logit contributions
oken+0.014
ir+0.012
 sincere+0.011
onel+0.010
ighters+0.010
Neg logit contributions
 Lib-0.023
 flavors-0.022
 Alice-0.022
 flavours-0.022
 Mary-0.022
Tokeneros
Token ID27498
Feature activation+1.14e+00

Pos logit contributions
oken+0.109
 can+0.108
Did+0.107
 cannot+0.105
 forgive+0.100
Neg logit contributions
 Lib-0.078
 flavours-0.078
 flavors-0.073
 Samantha-0.072
 avail-0.069
Token,
Token ID11
Feature activation-4.85e-01

Pos logit contributions
oken+0.024
ir+0.024
itt+0.023
ho+0.023
cut+0.021
Neg logit contributions
 Alice-0.036
 Samantha-0.034
 Lib-0.034
 Debbie-0.033
 Mary-0.033
Token but
Token ID475
Feature activation+6.04e-01

Pos logit contributions
 Alice+0.015
 Samantha+0.015
 Mary+0.015
 Debbie+0.014
 Lib+0.014
Neg logit contributions
oken-0.012
ir-0.012
itt-0.011
ho-0.011
cut-0.010
Token they
Token ID484
Feature activation-2.87e-01

Pos logit contributions
ir+0.019
oken+0.019
ho+0.018
cut+0.018
itt+0.018
Neg logit contributions
 Samantha-0.014
 Alice-0.014
 Debbie-0.014
 Lib-0.013
 Carly-0.013
Token had
Token ID550
Feature activation+8.87e-01

Pos logit contributions
 Samantha+0.008
 Alice+0.008
 Lib+0.008
 Debbie+0.007
 Megan+0.007
Neg logit contributions
oken-0.007
ir-0.007
itt-0.007
 sy-0.006
ho-0.006
Token me
Token ID502
Feature activation+1.13e+00

Pos logit contributions
ho+0.014
ir+0.014
 can+0.013
cut+0.013
'+0.013
Neg logit contributions
 Debbie-0.025
 Samantha-0.025
 avail-0.025
 Lib-0.024
oph-0.024
Token see
Token ID766
Feature activation+4.08e-01

Pos logit contributions
ho+0.029
cut+0.028
ir+0.025
 sincere+0.025
Did+0.025
Neg logit contributions
 Samantha-0.032
 Alice-0.031
 Debbie-0.031
 avail-0.030
 Lib-0.030
Token the
Token ID262
Feature activation+1.11e+00

Pos logit contributions
ho+0.012
oken+0.012
itt+0.011
cut+0.011
ir+0.011
Neg logit contributions
 Samantha-0.010
 Alice-0.010
 Debbie-0.010
 Lib-0.010
 avail-0.010
Token rh
Token ID9529
Feature activation+1.85e+00

Pos logit contributions
ir+0.030
oken+0.030
ho+0.030
itt+0.029
 sincere+0.028
Neg logit contributions
 Samantha-0.027
 Lib-0.027
 Alice-0.026
 Debbie-0.026
 avail-0.026
Tokenin
Token ID259
Feature activation+6.26e-01

Pos logit contributions
oken+0.050
ho+0.044
 Acc+0.043
 sincere+0.042
 Did+0.042
Neg logit contributions
 Lib-0.044
 Debbie-0.044
 avail-0.044
oph-0.044
isi-0.043
Tokenoc
Token ID420
Feature activation+3.59e+00

Pos logit contributions
oken+0.008
 sincere+0.006
ir+0.006
ho+0.005
itt+0.004
Neg logit contributions
 Lib-0.026
 Alice-0.026
 Debbie-0.026
 Samantha-0.026
 Mary-0.025
Tokeneros
Token ID27498
Feature activation+9.25e-01

Pos logit contributions
oken+0.102
 can+0.098
Did+0.094
 cannot+0.093
 affect+0.091
Neg logit contributions
 Lib-0.089
 avail-0.083
 Samantha-0.082
oph-0.080
 flavours-0.080
Token.
Token ID13
Feature activation+7.04e-01

Pos logit contributions
oken+0.027
ho+0.027
itt+0.026
ir+0.026
 sincere+0.025
Neg logit contributions
 Alice-0.021
 Debbie-0.020
 Samantha-0.020
 Lib-0.020
 Clara-0.019
Token It
Token ID632
Feature activation+1.39e-01

Pos logit contributions
ho+0.016
ir+0.015
oken+0.015
cut+0.015
itt+0.014
Neg logit contributions
 Samantha-0.022
 Alice-0.022
 Debbie-0.022
 Lib-0.021
 avail-0.021
Token is
Token ID318
Feature activation-4.62e-01

Pos logit contributions
ir+0.003
oken+0.003
ho+0.003
itt+0.002
cut+0.002
Neg logit contributions
 Alice-0.005
 Debbie-0.005
 Samantha-0.004
 Lib-0.004
 Kara-0.004
Token still
Token ID991
Feature activation+2.78e-01

Pos logit contributions
oph+0.016
 avail+0.015
 Debbie+0.015
ibi+0.015
 Samantha+0.014
Neg logit contributions
 can-0.010
cut-0.009
 or-0.009
 sincere-0.009
 cannot-0.008
Token go
Token ID467
Feature activation-1.47e-01

Pos logit contributions
 Debbie+0.012
 Alice+0.011
 Lib+0.011
 Samantha+0.011
oph+0.011
Neg logit contributions
ir-0.007
 can-0.006
ho-0.006
oken-0.006
 forgive-0.006
Token home
Token ID1363
Feature activation-8.21e-03

Pos logit contributions
 Samantha+0.004
 Debbie+0.004
 avail+0.004
 Alice+0.004
oph+0.004
Neg logit contributions
ho-0.003
ir-0.003
cut-0.003
oken-0.003
 can-0.003
Token,
Token ID11
Feature activation-2.67e-01

Pos logit contributions
 Samantha+0.000
 Debbie+0.000
 Alice+0.000
 avail+0.000
 Lib+0.000
Neg logit contributions
ho-0.000
cut-0.000
ir-0.000
oken-0.000
itt-0.000
Token Spot
Token ID15899
Feature activation+3.63e-01

Pos logit contributions
oph+0.007
 avail+0.007
ibi+0.007
 Debbie+0.007
isi+0.006
Neg logit contributions
ho-0.007
 can-0.007
 or-0.007
 cannot-0.007
 sincere-0.007
Token said
Token ID531
Feature activation+5.31e-01

Pos logit contributions
ho+0.006
ir+0.006
cut+0.006
oken+0.006
itt+0.005
Neg logit contributions
 Samantha-0.013
 Debbie-0.013
 Alice-0.013
 avail-0.013
oph-0.013
Token goodbye
Token ID24829
Feature activation+3.57e+00

Pos logit contributions
ir+0.013
oken+0.013
cut+0.013
ho+0.013
itt+0.012
Neg logit contributions
 Alice-0.015
 Samantha-0.015
 Debbie-0.015
 Lib-0.015
 avail-0.014
Token to
Token ID284
Feature activation-5.22e-01

Pos logit contributions
ir+0.077
oken+0.077
cut+0.075
itt+0.074
ho+0.074
Neg logit contributions
 Samantha-0.111
 Alice-0.110
 Debbie-0.110
 Lib-0.109
oph-0.107
Token his
Token ID465
Feature activation+4.88e-01

Pos logit contributions
oph+0.017
 Debbie+0.016
 Samantha+0.016
ibi+0.016
 Lib+0.016
Neg logit contributions
 can-0.010
 sincere-0.010
 forgive-0.010
ir-0.010
cut-0.010
Token new
Token ID649
Feature activation+6.65e-01

Pos logit contributions
ho+0.016
cut+0.016
itt+0.015
ir+0.015
oken+0.015
Neg logit contributions
 avail-0.010
 Samantha-0.010
 Alice-0.010
oph-0.009
ibi-0.009
Token friend
Token ID1545
Feature activation-4.60e-01

Pos logit contributions
ho+0.020
cut+0.020
oken+0.019
ir+0.019
itt+0.019
Neg logit contributions
 Samantha-0.015
 avail-0.015
 Debbie-0.015
 Alice-0.015
oph-0.014
Token.
Token ID13
Feature activation+4.44e-01

Pos logit contributions
 Samantha+0.017
 Debbie+0.017
 avail+0.017
oph+0.016
 Alice+0.016
Neg logit contributions
ho-0.007
cut-0.007
ir-0.007
 can-0.007
oken-0.007
Token,
Token ID11
Feature activation-3.49e-01

Pos logit contributions
ho+0.001
cut+0.001
ir+0.001
 can+0.001
 sincere+0.001
Neg logit contributions
oph-0.003
 avail-0.003
 Debbie-0.003
 Samantha-0.003
 Lib-0.003
Token making
Token ID1642
Feature activation-2.36e-01

Pos logit contributions
 Alice+0.010
 Samantha+0.010
 Debbie+0.010
 Lib+0.009
 avail+0.009
Neg logit contributions
ho-0.008
oken-0.008
ir-0.008
itt-0.008
 can-0.008
Token his
Token ID465
Feature activation-1.83e-01

Pos logit contributions
oph+0.008
 Lib+0.008
 Samantha+0.008
bi+0.007
 avail+0.007
Neg logit contributions
ir-0.005
oken-0.004
 can-0.004
e-0.004
 or-0.004
Token rh
Token ID9529
Feature activation+2.02e+00

Pos logit contributions
 Samantha+0.006
 Debbie+0.006
 avail+0.005
 Alice+0.005
oph+0.005
Neg logit contributions
ho-0.004
cut-0.004
ir-0.004
oken-0.004
itt-0.004
Tokenin
Token ID259
Feature activation+5.36e-01

Pos logit contributions
oken+0.048
ho+0.048
 Did+0.046
 can+0.045
 Acc+0.043
Neg logit contributions
 avail-0.053
isi-0.051
 Debbie-0.051
oph-0.050
 Alice-0.050
Tokenoc
Token ID420
Feature activation+3.56e+00

Pos logit contributions
ho+0.003
oc+0.003
cut+0.003
ir+0.003
 can+0.002
Neg logit contributions
 Samantha-0.026
 Debbie-0.025
 avail-0.025
oph-0.025
 Alice-0.025
Tokeneros
Token ID27498
Feature activation+1.17e+00

Pos logit contributions
 can+0.094
oken+0.091
Did+0.090
'+0.086
 cannot+0.085
Neg logit contributions
 avail-0.092
 Lib-0.089
 Samantha-0.088
 Debbie-0.087
oph-0.086
Token charge
Token ID3877
Feature activation+4.00e-01

Pos logit contributions
ho+0.023
ir+0.022
cut+0.022
 can+0.021
oc+0.021
Neg logit contributions
 Samantha-0.040
 Debbie-0.040
oph-0.039
 avail-0.039
 Lib-0.038
Token at
Token ID379
Feature activation+9.94e-02

Pos logit contributions
ho+0.009
cut+0.009
ir+0.008
 sincere+0.008
oc+0.008
Neg logit contributions
 Alice-0.012
 Samantha-0.012
 Lib-0.012
 Debbie-0.012
 Carly-0.011
Token Lily
Token ID20037
Feature activation-1.93e+00

Pos logit contributions
ho+0.003
ir+0.002
cut+0.002
oken+0.002
 can+0.002
Neg logit contributions
 Samantha-0.003
 Debbie-0.003
oph-0.003
 avail-0.003
 Alice-0.003
Token's
Token ID338
Feature activation+5.27e-01

Pos logit contributions
 Alice+0.055
 Samantha+0.054
 Debbie+0.054
 Lib+0.053
 avail+0.053
Neg logit contributions
ho-0.045
oken-0.044
ir-0.044
cut-0.043
itt-0.043
Token goodbye
Token ID24829
Feature activation+3.18e+00

Pos logit contributions
oken+0.019
ir+0.019
ho+0.019
oc+0.018
cut+0.018
Neg logit contributions
 Alice-0.016
 Samantha-0.015
 Lib-0.015
 Debbie-0.015
 Carly-0.015
Token to
Token ID284
Feature activation-5.54e-01

Pos logit contributions
oken+0.084
ir+0.082
cut+0.079
 sincere+0.079
itt+0.077
Neg logit contributions
 Alice-0.086
 Samantha-0.084
 Debbie-0.083
 Lib-0.083
 Mary-0.080
Token the
Token ID262
Feature activation+1.95e-01

Pos logit contributions
 Samantha+0.014
 Alice+0.014
 Debbie+0.014
 avail+0.013
 Lib+0.013
Neg logit contributions
ho-0.016
cut-0.015
oken-0.015
ir-0.015
itt-0.015
Token rh
Token ID9529
Feature activation+1.60e+00

Pos logit contributions
ir+0.006
oken+0.006
itt+0.005
ho+0.005
ighters+0.005
Neg logit contributions
 Alice-0.004
 Lib-0.004
 Samantha-0.004
 Debbie-0.004
 avail-0.004
Tokenin
Token ID259
Feature activation+1.04e-01

Pos logit contributions
oken+0.043
ho+0.037
 can+0.037
 Did+0.036
 affect+0.036
Neg logit contributions
 Debbie-0.040
 Lib-0.039
oph-0.039
 avail-0.039
 Alice-0.038
Tokenoc
Token ID420
Feature activation+3.56e+00

Pos logit contributions
oken+0.002
 sincere+0.001
ir+0.001
onel+0.001
 affect+0.001
Neg logit contributions
 Lib-0.004
 Alice-0.004
 Debbie-0.004
 Samantha-0.004
 Mary-0.004
Tokeneros
Token ID27498
Feature activation+4.53e-01

Pos logit contributions
oken+0.103
 can+0.097
 affect+0.092
Did+0.091
 cannot+0.091
Neg logit contributions
 Lib-0.089
 Samantha-0.085
 avail-0.084
 Debbie-0.083
 Alice-0.083
Token.
Token ID13
Feature activation+4.10e-01

Pos logit contributions
oken+0.013
ir+0.012
ho+0.012
itt+0.012
 sincere+0.012
Neg logit contributions
 Alice-0.012
 Lib-0.011
 Debbie-0.011
 Samantha-0.011
 Mary-0.011
Token They
Token ID1119
Feature activation-4.62e-01

Pos logit contributions
ho+0.010
ir+0.009
 can+0.009
cut+0.009
oken+0.009
Neg logit contributions
 Debbie-0.012
 avail-0.012
 Samantha-0.012
oph-0.012
 Alice-0.011
Token say
Token ID910
Feature activation+7.64e-01

Pos logit contributions
 Samantha+0.014
 Debbie+0.014
 Alice+0.014
 avail+0.014
oph+0.014
Neg logit contributions
ho-0.010
ir-0.010
cut-0.010
oken-0.010
itt-0.010
Token they
Token ID484
Feature activation+3.45e-01

Pos logit contributions
oken+0.022
ir+0.021
ho+0.021
cut+0.021
oc+0.021
Neg logit contributions
 Alice-0.020
 Samantha-0.019
 Lib-0.019
 Debbie-0.019
 Carly-0.018
Token paid
Token ID3432
Feature activation-7.80e-01

Pos logit contributions
 Samantha+0.014
 Alice+0.014
 Lib+0.013
 Debbie+0.013
 Kara+0.013
Neg logit contributions
itt-0.014
oken-0.014
cut-0.014
ir-0.013
 sincere-0.013
Token him
Token ID683
Feature activation-2.96e-02

Pos logit contributions
oph+0.022
 Samantha+0.022
 Debbie+0.022
 avail+0.022
 Lib+0.021
Neg logit contributions
ho-0.019
cut-0.018
ir-0.018
 can-0.018
oken-0.018
Token for
Token ID329
Feature activation+1.21e+00

Pos logit contributions
 Samantha+0.001
 Debbie+0.001
 Alice+0.001
 avail+0.001
 Lib+0.001
Neg logit contributions
ho-0.001
ir-0.001
cut-0.001
oken-0.001
itt-0.001
Token his
Token ID465
Feature activation+9.47e-01

Pos logit contributions
ir+0.028
ho+0.028
oken+0.028
 sincere+0.027
 can+0.027
Neg logit contributions
oph-0.035
 Debbie-0.034
ibi-0.033
 Samantha-0.033
 Lib-0.032
Token av
Token ID1196
Feature activation+7.92e-01

Pos logit contributions
ir+0.027
ho+0.027
cut+0.026
oken+0.026
itt+0.025
Neg logit contributions
 Samantha-0.022
 avail-0.022
 Alice-0.022
 Debbie-0.022
 Lib-0.021
Tokenoc
Token ID420
Feature activation+3.54e+00

Pos logit contributions
ho+0.002
cut+0.001
ir+0.001
oken+0.000
itt+0.000
Neg logit contributions
 Samantha-0.040
 Debbie-0.040
 Alice-0.040
 avail-0.039
 Lib-0.039
Tokenados
Token ID22484
Feature activation-9.16e-01

Pos logit contributions
 can+0.114
oken+0.112
ir+0.111
Did+0.111
ho+0.109
Neg logit contributions
 avail-0.069
 Samantha-0.068
 Lib-0.066
oph-0.066
 Alice-0.063
Token.
Token ID13
Feature activation+5.99e-01

Pos logit contributions
 Samantha+0.026
 Alice+0.026
 Debbie+0.026
 avail+0.025
 Lib+0.025
Neg logit contributions
ho-0.022
ir-0.021
oken-0.021
cut-0.021
itt-0.020
Token
Token ID198
Feature activation+5.85e-01

Pos logit contributions
 can+0.011
ho+0.011
itt+0.011
cut+0.011
 sincere+0.011
Neg logit contributions
 avail-0.020
oph-0.020
 Debbie-0.019
 Samantha-0.019
 Lib-0.018
Token
Token ID198
Feature activation+9.54e-02

Pos logit contributions
ho+0.013
ir+0.012
cut+0.012
itt+0.012
oken+0.012
Neg logit contributions
 avail-0.018
 Samantha-0.018
 Debbie-0.018
oph-0.018
 Alice-0.018
TokenTom
Token ID13787
Feature activation+6.17e-01

Pos logit contributions
ho+0.002
cut+0.002
ir+0.002
oken+0.002
itt+0.002
Neg logit contributions
 Samantha-0.003
 Debbie-0.003
 avail-0.003
 Alice-0.003
oph-0.003
Token will
Token ID481
Feature activation+1.57e+00

Pos logit contributions
ho+0.008
ir+0.007
cut+0.007
itt+0.007
bed+0.007
Neg logit contributions
 Alice-0.011
 Samantha-0.010
 Debbie-0.010
 Carly-0.010
 Lib-0.010
Token visit
Token ID3187
Feature activation+1.36e+00

Pos logit contributions
itt+0.044
ho+0.043
oken+0.042
cut+0.042
ir+0.040
Neg logit contributions
 Samantha-0.040
 Alice-0.040
 Debbie-0.040
 avail-0.039
 Lib-0.039
Token the
Token ID262
Feature activation+3.69e-01

Pos logit contributions
oken+0.040
cut+0.040
itt+0.040
ho+0.039
ir+0.038
Neg logit contributions
 Alice-0.034
 Samantha-0.033
 Lib-0.032
 avail-0.032
 Debbie-0.032
Token rh
Token ID9529
Feature activation+1.66e+00

Pos logit contributions
ir+0.011
oken+0.010
itt+0.010
ho+0.010
cut+0.009
Neg logit contributions
 Alice-0.009
 Lib-0.009
 avail-0.009
 Samantha-0.009
 Debbie-0.009
Tokenin
Token ID259
Feature activation+2.14e-01

Pos logit contributions
oken+0.043
ho+0.039
 can+0.037
 Did+0.037
 sincere+0.037
Neg logit contributions
 avail-0.042
 Debbie-0.042
 Lib-0.041
oph-0.041
 Alice-0.040
Tokenoc
Token ID420
Feature activation+3.53e+00

Pos logit contributions
oken+0.003
 sincere+0.003
ir+0.002
itt+0.002
ho+0.002
Neg logit contributions
 Lib-0.008
 Alice-0.008
 Debbie-0.008
 Kara-0.008
 Samantha-0.008
Tokeneros
Token ID27498
Feature activation+5.52e-01

Pos logit contributions
oken+0.101
 can+0.097
 cannot+0.091
Did+0.091
 affect+0.091
Neg logit contributions
 Lib-0.087
 avail-0.084
 Samantha-0.082
 Debbie-0.080
 Alice-0.080
Token at
Token ID379
Feature activation+6.22e-01

Pos logit contributions
oken+0.016
ir+0.015
itt+0.015
ho+0.015
 sincere+0.015
Neg logit contributions
 Alice-0.013
 Lib-0.013
 Debbie-0.012
 Samantha-0.012
 Mary-0.012
Token the
Token ID262
Feature activation-1.35e-01

Pos logit contributions
ho+0.016
ir+0.015
cut+0.015
 can+0.015
oken+0.015
Neg logit contributions
 Samantha-0.017
 Debbie-0.017
oph-0.016
 avail-0.016
 Alice-0.016
Token zoo
Token ID26626
Feature activation+1.61e-01

Pos logit contributions
 Debbie+0.004
 Alice+0.004
 Samantha+0.004
 avail+0.004
 Lib+0.004
Neg logit contributions
ir-0.003
ho-0.003
oken-0.003
cut-0.003
itt-0.003
Token.
Token ID13
Feature activation+4.33e-01

Pos logit contributions
oken+0.005
ir+0.004
ho+0.004
itt+0.004
cut+0.004
Neg logit contributions
 Alice-0.004
 Lib-0.004
 Debbie-0.004
 Samantha-0.004
 Mary-0.004
Token They
Token ID1119
Feature activation-2.23e-01

Pos logit contributions
ho+0.007
cut+0.007
ir+0.007
oken+0.007
itt+0.007
Neg logit contributions
 Samantha-0.009
 Debbie-0.009
 Alice-0.009
 avail-0.009
oph-0.008
Token put
Token ID1234
Feature activation+1.22e-01

Pos logit contributions
 Debbie+0.006
 Alice+0.006
 Samantha+0.006
 Lib+0.006
 Carly+0.006
Neg logit contributions
oken-0.005
ho-0.005
ir-0.005
itt-0.005
cut-0.005
Token the
Token ID262
Feature activation+7.22e-01

Pos logit contributions
ho+0.003
cut+0.003
 can+0.003
ir+0.003
itt+0.003
Neg logit contributions
oph-0.004
 Debbie-0.003
 Samantha-0.003
 avail-0.003
 Alice-0.003
Token rh
Token ID9529
Feature activation+1.75e+00

Pos logit contributions
oken+0.020
ir+0.020
ho+0.019
itt+0.019
 sincere+0.018
Neg logit contributions
 Samantha-0.017
 Debbie-0.017
 avail-0.017
 Alice-0.017
 Lib-0.017
Tokenin
Token ID259
Feature activation+5.02e-01

Pos logit contributions
oken+0.043
ho+0.040
 sincere+0.038
ir+0.037
 can+0.037
Neg logit contributions
 Debbie-0.046
 avail-0.046
 Lib-0.045
oph-0.045
 Alice-0.045
Tokenoc
Token ID420
Feature activation+3.52e+00

Pos logit contributions
oken+0.008
 sincere+0.008
ir+0.007
ho+0.007
 affect+0.006
Neg logit contributions
 Alice-0.018
 Debbie-0.018
 Lib-0.018
 Samantha-0.018
 avail-0.017
Tokeneros
Token ID27498
Feature activation+7.23e-01

Pos logit contributions
oken+0.102
 can+0.096
Did+0.095
 affect+0.093
 cannot+0.091
Neg logit contributions
 Lib-0.082
 avail-0.082
 Samantha-0.079
 flavours-0.077
oph-0.077
Token in
Token ID287
Feature activation+4.94e-01

Pos logit contributions
ho+0.013
ir+0.013
oken+0.013
cut+0.012
 sincere+0.012
Neg logit contributions
 Samantha-0.024
 Alice-0.024
 Debbie-0.024
 Lib-0.024
 avail-0.023
Token the
Token ID262
Feature activation+5.01e-01

Pos logit contributions
ho+0.014
cut+0.013
ir+0.013
oken+0.013
itt+0.013
Neg logit contributions
 Samantha-0.012
 Debbie-0.012
 Alice-0.012
 avail-0.012
 Lib-0.012
Token truck
Token ID7779
Feature activation-3.89e-01

Pos logit contributions
ho+0.013
ir+0.012
cut+0.012
oken+0.012
itt+0.012
Neg logit contributions
 Samantha-0.014
 Debbie-0.013
 Alice-0.013
 avail-0.013
 Lib-0.013
Token.
Token ID13
Feature activation+4.03e-01

Pos logit contributions
 Alice+0.009
 Lib+0.009
 Debbie+0.009
 Samantha+0.009
 Mary+0.009
Neg logit contributions
oken-0.012
ir-0.011
 sincere-0.011
ho-0.010
 affect-0.010
Token bag
Token ID6131
Feature activation+6.38e-01

Pos logit contributions
 Samantha+0.005
 Debbie+0.005
 Alice+0.005
 avail+0.005
 Lib+0.005
Neg logit contributions
ho-0.007
ir-0.007
cut-0.007
itt-0.007
oken-0.007
Token,
Token ID11
Feature activation-1.05e-01

Pos logit contributions
ir+0.014
oken+0.014
 sincere+0.014
 affect+0.014
ho+0.013
Neg logit contributions
 Lib-0.020
 Alice-0.020
 Samantha-0.019
ibi-0.019
 Debbie-0.019
Token kissed
Token ID27946
Feature activation+6.82e-01

Pos logit contributions
 Samantha+0.003
 Debbie+0.003
 Alice+0.003
 Lib+0.003
 Carly+0.003
Neg logit contributions
ir-0.003
ho-0.003
cut-0.003
oken-0.003
itt-0.003
Token his
Token ID465
Feature activation+2.02e-02

Pos logit contributions
oken+0.024
 affect+0.024
ir+0.023
 sincere+0.023
itt+0.023
Neg logit contributions
 Samantha-0.014
 Alice-0.013
 Debbie-0.013
 Robbie-0.012
 Carly-0.012
Token mom
Token ID1995
Feature activation-1.04e+00

Pos logit contributions
ho+0.001
ir+0.001
cut+0.001
oken+0.001
itt+0.001
Neg logit contributions
 Samantha-0.001
 Debbie-0.001
 Alice-0.001
 avail-0.001
 Lib-0.000
Token goodbye
Token ID24829
Feature activation+3.51e+00

Pos logit contributions
 Samantha+0.038
 Alice+0.038
 Debbie+0.037
 Lib+0.037
 Carly+0.036
Neg logit contributions
oken-0.017
ir-0.017
ho-0.017
 sincere-0.017
cut-0.016
Token and
Token ID290
Feature activation+7.10e-01

Pos logit contributions
oken+0.065
ir+0.064
itt+0.062
cut+0.060
 sincere+0.060
Neg logit contributions
 Alice-0.122
 Samantha-0.122
 Debbie-0.121
 Lib-0.120
oph-0.117
Token set
Token ID900
Feature activation+2.23e-01

Pos logit contributions
ho+0.016
cut+0.016
ir+0.015
 can+0.015
oken+0.015
Neg logit contributions
 Debbie-0.021
 Samantha-0.021
 avail-0.021
oph-0.020
 Alice-0.020
Token off
Token ID572
Feature activation-1.59e-01

Pos logit contributions
ir+0.006
'+0.005
cut+0.005
 fix+0.005
 can+0.005
Neg logit contributions
oph-0.006
 Samantha-0.006
 Debbie-0.006
isi-0.006
 Lib-0.006
Token on
Token ID319
Feature activation-1.08e+00

Pos logit contributions
oph+0.005
 Samantha+0.005
 Debbie+0.005
 avail+0.005
 Alice+0.005
Neg logit contributions
ho-0.003
ir-0.003
cut-0.003
 can-0.003
oken-0.003
Token his
Token ID465
Feature activation+3.21e-01

Pos logit contributions
oph+0.032
 Samantha+0.031
 Debbie+0.031
ibi+0.030
 Lib+0.029
Neg logit contributions
ir-0.024
 can-0.024
oken-0.024
ho-0.023
 sincere-0.023
Token mom
Token ID1995
Feature activation-9.60e-01

Pos logit contributions
 Alice+0.007
 Samantha+0.007
 Debbie+0.007
 avail+0.007
 Lib+0.007
Neg logit contributions
itt-0.005
ho-0.005
oken-0.005
ir-0.005
cut-0.005
Tokenmy
Token ID1820
Feature activation-2.61e-01

Pos logit contributions
 Alice+0.031
 Samantha+0.031
 Debbie+0.030
 Mary+0.030
 Lib+0.030
Neg logit contributions
oken-0.019
ho-0.019
 sincere-0.018
itt-0.018
ir-0.018
Token.
Token ID13
Feature activation+6.89e-01

Pos logit contributions
 Alice+0.009
 Samantha+0.009
 Debbie+0.009
 Mary+0.008
 Lib+0.008
Neg logit contributions
oken-0.005
ho-0.005
 sincere-0.005
itt-0.005
ir-0.005
Token He
Token ID679
Feature activation+6.88e-02

Pos logit contributions
ho+0.012
cut+0.011
itt+0.011
ir+0.011
 can+0.011
Neg logit contributions
 avail-0.025
oph-0.023
 Debbie-0.023
 Samantha-0.023
 Alice-0.023
Token said
Token ID531
Feature activation+8.25e-01

Pos logit contributions
oken+0.002
ir+0.002
ho+0.002
itt+0.002
cut+0.002
Neg logit contributions
oph-0.002
 Debbie-0.002
 Alice-0.002
 Samantha-0.002
 Carly-0.002
Token goodbye
Token ID24829
Feature activation+3.50e+00

Pos logit contributions
oken+0.021
ir+0.020
 affect+0.019
itt+0.019
oc+0.018
Neg logit contributions
 Alice-0.024
 Lib-0.024
 Debbie-0.023
 Samantha-0.023
 Clara-0.023
Token and
Token ID290
Feature activation+5.31e-01

Pos logit contributions
oken+0.093
ir+0.090
 sincere+0.086
itt+0.086
cut+0.085
Neg logit contributions
 Alice-0.099
 Samantha-0.098
 Debbie-0.096
 Lib-0.096
 Mary-0.093
Token went
Token ID1816
Feature activation-3.39e-01

Pos logit contributions
ho+0.012
cut+0.011
 can+0.010
ir+0.010
oc+0.010
Neg logit contributions
 avail-0.017
 Debbie-0.017
 Samantha-0.016
oph-0.016
 Alice-0.016
Token back
Token ID736
Feature activation-1.55e-01

Pos logit contributions
oph+0.010
 avail+0.010
 Samantha+0.010
 Debbie+0.010
 Alice+0.010
Neg logit contributions
ho-0.008
ir-0.008
'-0.007
 can-0.007
itt-0.007
Token to
Token ID284
Feature activation-6.01e-01

Pos logit contributions
oph+0.005
 avail+0.005
 Debbie+0.005
 Samantha+0.005
 Alice+0.005
Neg logit contributions
ho-0.003
 can-0.003
ir-0.003
cut-0.003
itt-0.003
Token j
Token ID474
Feature activation-7.26e-01

Pos logit contributions
 Samantha+0.016
 Debbie+0.016
 avail+0.016
 Alice+0.016
oph+0.015
Neg logit contributions
ho-0.016
cut-0.015
ir-0.015
oken-0.015
 can-0.015
Token go
Token ID467
Feature activation-1.04e-01

Pos logit contributions
ir+0.003
ho+0.003
oken+0.003
cut+0.003
 can+0.003
Neg logit contributions
 Debbie-0.004
 Samantha-0.004
 Lib-0.004
 Alice-0.004
oph-0.004
Token home
Token ID1363
Feature activation+2.87e-01

Pos logit contributions
 Debbie+0.003
 Samantha+0.003
 avail+0.003
oph+0.003
 Alice+0.003
Neg logit contributions
ho-0.003
ir-0.003
 can-0.003
cut-0.003
oken-0.003
Token.
Token ID13
Feature activation-1.62e-02

Pos logit contributions
ho+0.006
ir+0.006
cut+0.006
itt+0.006
oken+0.006
Neg logit contributions
 Alice-0.009
 Samantha-0.009
 Debbie-0.009
 Lib-0.009
 Clara-0.009
Token Max
Token ID5436
Feature activation+1.03e+00

Pos logit contributions
 avail+0.000
 Debbie+0.000
 Samantha+0.000
 Lib+0.000
oph+0.000
Neg logit contributions
 can-0.000
 Did-0.000
 cannot-0.000
ho-0.000
 sincere-0.000
Token said
Token ID531
Feature activation+8.37e-01

Pos logit contributions
ho+0.020
cut+0.019
ir+0.019
 can+0.018
oken+0.018
Neg logit contributions
 Samantha-0.034
 Debbie-0.034
 avail-0.034
 Alice-0.033
oph-0.033
Token goodbye
Token ID24829
Feature activation+3.48e+00

Pos logit contributions
ir+0.018
oken+0.017
cut+0.017
itt+0.016
ho+0.016
Neg logit contributions
 Alice-0.028
 Samantha-0.027
 Lib-0.027
 Debbie-0.027
 Megan-0.026
Token and
Token ID290
Feature activation+7.22e-01

Pos logit contributions
ho+0.062
ir+0.062
oken+0.061
cut+0.061
itt+0.058
Neg logit contributions
 Samantha-0.121
 Alice-0.120
 Debbie-0.120
 Lib-0.118
oph-0.117
Token thanked
Token ID26280
Feature activation+1.06e+00

Pos logit contributions
 can+0.015
ho+0.014
 forgive+0.014
cut+0.014
 could+0.013
Neg logit contributions
 Debbie-0.023
 avail-0.023
 Samantha-0.022
 Lib-0.022
oph-0.022
Token her
Token ID607
Feature activation-4.39e-01

Pos logit contributions
ir+0.030
ho+0.030
 can+0.030
cut+0.029
 sincere+0.028
Neg logit contributions
 Debbie-0.025
 Samantha-0.025
oph-0.025
 Lib-0.024
 avail-0.024
Token for
Token ID329
Feature activation+8.80e-01

Pos logit contributions
 Samantha+0.015
 Debbie+0.015
 avail+0.015
oph+0.015
 Alice+0.015
Neg logit contributions
ho-0.008
cut-0.008
 can-0.008
ir-0.008
oken-0.007
Token playing
Token ID2712
Feature activation-1.15e-01

Pos logit contributions
ho+0.021
ir+0.020
cut+0.020
oken+0.020
itt+0.019
Neg logit contributions
 Samantha-0.026
 Alice-0.026
 Debbie-0.025
 avail-0.025
 Lib-0.025
Token bird
Token ID6512
Feature activation+8.78e-01

Pos logit contributions
ir+0.014
oken+0.014
ho+0.014
itt+0.013
ighters+0.013
Neg logit contributions
 Samantha-0.015
 Lib-0.014
 Alice-0.014
 Debbie-0.014
 avail-0.014
Token welcomed
Token ID15846
Feature activation-5.10e-02

Pos logit contributions
oken+0.021
itt+0.020
ir+0.019
ho+0.019
ighters+0.018
Neg logit contributions
 Alice-0.026
oph-0.026
 Debbie-0.025
 Samantha-0.025
 Lib-0.025
Token the
Token ID262
Feature activation+4.60e-01

Pos logit contributions
 Samantha+0.001
 Debbie+0.001
 Alice+0.001
 avail+0.001
 Lib+0.001
Neg logit contributions
ho-0.001
cut-0.001
oken-0.001
ir-0.001
itt-0.001
Token rh
Token ID9529
Feature activation+1.76e+00

Pos logit contributions
ir+0.012
oken+0.012
itt+0.012
ho+0.012
ighters+0.012
Neg logit contributions
 Samantha-0.011
 Lib-0.011
 Alice-0.011
 Debbie-0.011
 avail-0.010
Tokenin
Token ID259
Feature activation+6.50e-01

Pos logit contributions
oken+0.053
 Acc+0.046
 Did+0.045
ho+0.045
 affect+0.044
Neg logit contributions
 Lib-0.036
oph-0.036
isi-0.036
 Debbie-0.036
bi-0.035
Tokenoc
Token ID420
Feature activation+3.48e+00

Pos logit contributions
oken+0.009
 sincere+0.008
ir+0.007
 affect+0.007
ighters+0.006
Neg logit contributions
 Lib-0.026
 Alice-0.026
 Samantha-0.026
 Debbie-0.025
bi-0.025
Tokeneros
Token ID27498
Feature activation+7.97e-01

Pos logit contributions
oken+0.101
 can+0.097
 affect+0.092
Did+0.091
 cannot+0.091
Neg logit contributions
 Lib-0.088
 Samantha-0.082
 avail-0.079
oph-0.078
 Debbie-0.078
Token to
Token ID284
Feature activation-5.68e-01

Pos logit contributions
ho+0.016
ir+0.015
oken+0.015
cut+0.015
itt+0.015
Neg logit contributions
 Alice-0.026
 Samantha-0.026
 Debbie-0.026
 Lib-0.025
oph-0.025
Token her
Token ID607
Feature activation-6.39e-01

Pos logit contributions
 Samantha+0.015
 avail+0.015
 Alice+0.015
 Debbie+0.015
 Lib+0.015
Neg logit contributions
ho-0.015
itt-0.015
cut-0.015
oken-0.015
ir-0.014
Token home
Token ID1363
Feature activation+4.74e-01

Pos logit contributions
 Alice+0.016
 Lib+0.016
 Samantha+0.016
 avail+0.016
 Debbie+0.015
Neg logit contributions
itt-0.018
ir-0.018
ho-0.017
oken-0.017
cut-0.016
Token,
Token ID11
Feature activation+3.15e-01

Pos logit contributions
oken+0.014
ir+0.014
itt+0.014
ho+0.013
cut+0.013
Neg logit contributions
 Alice-0.012
 Lib-0.011
 Mary-0.011
 Samantha-0.011
 Clara-0.011
Token learn
Token ID2193
Feature activation-1.14e+00

Pos logit contributions
 Alice+0.014
 Debbie+0.014
 Samantha+0.013
 Lib+0.013
 Carly+0.013
Neg logit contributions
ir-0.014
ho-0.014
itt-0.014
oken-0.013
bed-0.013
Token about
Token ID546
Feature activation+5.35e-02

Pos logit contributions
 Samantha+0.030
 Debbie+0.030
 Alice+0.030
 avail+0.030
oph+0.029
Neg logit contributions
ho-0.030
cut-0.029
ir-0.029
oken-0.028
itt-0.028
Token the
Token ID262
Feature activation-8.39e-02

Pos logit contributions
ho+0.001
cut+0.001
ir+0.001
 can+0.001
oken+0.001
Neg logit contributions
 Samantha-0.001
 Debbie-0.001
oph-0.001
 avail-0.001
 Alice-0.001
Token rh
Token ID9529
Feature activation+1.75e+00

Pos logit contributions
 Alice+0.002
 Lib+0.002
 avail+0.002
 Samantha+0.002
 Debbie+0.002
Neg logit contributions
ir-0.002
oken-0.002
itt-0.002
ho-0.002
ended-0.002
Tokenin
Token ID259
Feature activation+2.82e-01

Pos logit contributions
oken+0.048
 Acc+0.042
ho+0.042
 Did+0.041
 can+0.041
Neg logit contributions
 avail-0.041
 Debbie-0.041
 Lib-0.041
oph-0.039
isi-0.039
Tokenoc
Token ID420
Feature activation+3.48e+00

Pos logit contributions
oken+0.005
 sincere+0.004
ir+0.004
onel+0.004
 Acc+0.003
Neg logit contributions
 Lib-0.010
 Alice-0.010
 Debbie-0.010
 Samantha-0.010
 Mary-0.010
Tokeneros
Token ID27498
Feature activation+8.90e-01

Pos logit contributions
oken+0.101
 can+0.098
Did+0.093
 cannot+0.093
ighters+0.091
Neg logit contributions
 Lib-0.083
 avail-0.081
 Samantha-0.078
 flavours-0.077
 Alice-0.076
Token.
Token ID13
Feature activation+2.45e-01

Pos logit contributions
oken+0.024
ir+0.023
ho+0.023
itt+0.022
 sincere+0.021
Neg logit contributions
 Alice-0.023
 Lib-0.022
 Samantha-0.022
 Debbie-0.021
 Mary-0.021
Token They
Token ID1119
Feature activation-2.72e-01

Pos logit contributions
ho+0.007
cut+0.007
ir+0.007
oken+0.007
itt+0.007
Neg logit contributions
 Samantha-0.006
 Alice-0.006
 Debbie-0.006
 Lib-0.006
 Carly-0.006
Token say
Token ID910
Feature activation+6.42e-01

Pos logit contributions
 Alice+0.007
 Debbie+0.007
 Samantha+0.007
 Lib+0.007
 Carly+0.007
Neg logit contributions
ho-0.007
itt-0.007
ir-0.007
oken-0.007
can-0.006
Token goodbye
Token ID24829
Feature activation+3.18e+00

Pos logit contributions
oken+0.019
ir+0.019
ho+0.019
oc+0.018
cut+0.018
Neg logit contributions
 Alice-0.016
 Samantha-0.015
 Lib-0.015
 Debbie-0.015
 Carly-0.015
Token go
Token ID467
Feature activation-1.78e-01

Pos logit contributions
ho+0.001
ir+0.001
 can+0.001
oken+0.001
cut+0.001
Neg logit contributions
 Debbie-0.002
 Samantha-0.002
oph-0.002
 avail-0.002
 Lib-0.002
Token home
Token ID1363
Feature activation-1.15e-01

Pos logit contributions
 Debbie+0.005
 Samantha+0.005
 avail+0.005
oph+0.005
 Alice+0.005
Neg logit contributions
ho-0.004
ir-0.004
cut-0.004
oken-0.004
 can-0.004
Token,
Token ID11
Feature activation-2.04e-01

Pos logit contributions
 Samantha+0.004
 Debbie+0.004
 Alice+0.004
 avail+0.004
 Lib+0.004
Neg logit contributions
ho-0.002
cut-0.002
ir-0.002
oken-0.002
itt-0.001
Token he
Token ID339
Feature activation+8.93e-01

Pos logit contributions
oph+0.007
 avail+0.007
die+0.006
isi+0.006
igsaw+0.006
Neg logit contributions
 or-0.004
 can-0.004
ho-0.004
cut-0.004
 and-0.004
Token said
Token ID531
Feature activation+5.40e-01

Pos logit contributions
ho+0.022
ir+0.022
cut+0.022
oken+0.022
itt+0.021
Neg logit contributions
 Samantha-0.025
 Debbie-0.025
 Alice-0.024
oph-0.024
 Lib-0.024
Token goodbye
Token ID24829
Feature activation+3.48e+00

Pos logit contributions
ir+0.011
oken+0.011
cut+0.011
ho+0.011
itt+0.010
Neg logit contributions
 Alice-0.018
 Samantha-0.018
 Debbie-0.018
 Lib-0.017
 avail-0.017
Token to
Token ID284
Feature activation-7.20e-01

Pos logit contributions
ho+0.068
ir+0.068
cut+0.067
oken+0.067
itt+0.064
Neg logit contributions
 Samantha-0.114
 Debbie-0.113
 Alice-0.113
 Lib-0.111
oph-0.110
Token Kitty
Token ID21168
Feature activation-1.32e+00

Pos logit contributions
oph+0.022
 avail+0.019
mber+0.019
 Debbie+0.019
ibi+0.019
Neg logit contributions
 can-0.017
cut-0.017
ho-0.017
ir-0.016
 or-0.016
Token.
Token ID13
Feature activation+2.24e-01

Pos logit contributions
 Samantha+0.049
 Alice+0.049
 Debbie+0.049
 Lib+0.048
 avail+0.048
Neg logit contributions
ho-0.021
ir-0.019
cut-0.019
oken-0.019
itt-0.018
Token He
Token ID679
Feature activation+3.00e-02

Pos logit contributions
 Did+0.004
ho+0.004
 can+0.003
oc+0.003
 Don+0.003
Neg logit contributions
 avail-0.009
 possible-0.008
oph-0.008
die-0.007
 Debbie-0.007
Token promised
Token ID8072
Feature activation+1.22e+00

Pos logit contributions
ho+0.001
cut+0.001
 can+0.001
ir+0.001
oken+0.001
Neg logit contributions
 avail-0.001
 Samantha-0.001
 Debbie-0.001
 Alice-0.001
 Lib-0.001
Token It
Token ID632
Feature activation+1.42e-01

Pos logit contributions
 Alice+0.009
 Samantha+0.009
 Debbie+0.009
 Mary+0.009
 Lib+0.009
Neg logit contributions
cut-0.010
ho-0.010
ir-0.009
 affect-0.009
oken-0.009
Token was
Token ID373
Feature activation-4.10e-01

Pos logit contributions
ir+0.003
oken+0.003
itt+0.003
ho+0.003
cut+0.003
Neg logit contributions
 Alice-0.004
 Debbie-0.004
 Samantha-0.004
 Megan-0.004
 Lib-0.004
Token the
Token ID262
Feature activation+3.62e-01

Pos logit contributions
 Samantha+0.012
 Debbie+0.012
 avail+0.011
 Alice+0.011
oph+0.011
Neg logit contributions
ho-0.010
cut-0.010
ir-0.010
oken-0.009
itt-0.009
Token rh
Token ID9529
Feature activation+2.07e+00

Pos logit contributions
ir+0.010
ho+0.010
itt+0.009
oc+0.009
oken+0.009
Neg logit contributions
 avail-0.009
 Debbie-0.009
 Samantha-0.009
 Alice-0.009
 Lib-0.009
Tokenin
Token ID259
Feature activation+7.30e-01

Pos logit contributions
oken+0.059
ho+0.056
 Acc+0.054
 Did+0.054
 Don+0.051
Neg logit contributions
 Debbie-0.044
 avail-0.044
oph-0.042
 Lib-0.041
 Megan-0.041
Tokenoc
Token ID420
Feature activation+3.47e+00

Pos logit contributions
oken+0.019
ir+0.019
ho+0.017
 sincere+0.017
ighters+0.017
Neg logit contributions
 Debbie-0.018
 Lib-0.018
 Alice-0.018
 Samantha-0.018
 avail-0.017
Tokeneros
Token ID27498
Feature activation+1.14e+00

Pos logit contributions
Did+0.095
oken+0.093
 can+0.093
 cannot+0.091
 affect+0.090
Neg logit contributions
 Lib-0.085
 avail-0.081
 flavours-0.081
 Samantha-0.079
 Debbie-0.077
Token.
Token ID13
Feature activation-2.68e-01

Pos logit contributions
ir+0.027
ho+0.027
oken+0.027
cut+0.026
itt+0.026
Neg logit contributions
 Alice-0.032
 Samantha-0.032
 Debbie-0.032
 Lib-0.031
 avail-0.031
Token It
Token ID632
Feature activation+4.00e-01

Pos logit contributions
 Samantha+0.008
 Debbie+0.008
 Alice+0.008
 Lib+0.007
 Carly+0.007
Neg logit contributions
ho-0.007
cut-0.007
ir-0.007
oken-0.006
 affect-0.006
Token lifted
Token ID13663
Feature activation+3.42e-02

Pos logit contributions
oken+0.010
ir+0.009
itt+0.009
 affect+0.009
cut+0.009
Neg logit contributions
 Debbie-0.011
 Alice-0.011
 Samantha-0.011
 Megan-0.011
 Lib-0.011
Token its
Token ID663
Feature activation+3.54e-01

Pos logit contributions
ho+0.001
cut+0.001
ir+0.001
oken+0.001
itt+0.001
Neg logit contributions
 Samantha-0.001
 Debbie-0.001
 Alice-0.001
 avail-0.001
oph-0.001
Token back
Token ID736
Feature activation-2.30e-01

Pos logit contributions
ho+0.001
cut+0.001
ir+0.001
 can+0.001
'+0.001
Neg logit contributions
 avail-0.001
 Samantha-0.001
 Debbie-0.001
oph-0.001
 Alice-0.001
Token to
Token ID284
Feature activation-7.75e-01

Pos logit contributions
 avail+0.008
oph+0.008
 Samantha+0.008
 Debbie+0.008
 Alice+0.008
Neg logit contributions
ho-0.004
 can-0.004
ir-0.004
cut-0.004
itt-0.004
Token the
Token ID262
Feature activation+7.10e-01

Pos logit contributions
oph+0.021
 Debbie+0.021
 Samantha+0.021
 avail+0.021
 Alice+0.021
Neg logit contributions
ho-0.019
ir-0.019
cut-0.019
 can-0.018
oken-0.018
Token rh
Token ID9529
Feature activation+1.72e+00

Pos logit contributions
ho+0.018
ir+0.017
cut+0.017
oken+0.017
itt+0.016
Neg logit contributions
 Samantha-0.020
 Debbie-0.020
 Alice-0.019
 avail-0.019
oph-0.019
Tokenin
Token ID259
Feature activation+7.39e-01

Pos logit contributions
oken+0.036
ho+0.036
 can+0.034
ir+0.033
 sincere+0.033
Neg logit contributions
oph-0.052
 Debbie-0.052
 avail-0.050
 Alice-0.050
 Lib-0.050
Tokenoc
Token ID420
Feature activation+3.46e+00

Pos logit contributions
ho+0.003
oken+0.003
ir+0.003
cut+0.002
 sincere+0.002
Neg logit contributions
 Samantha-0.035
 Debbie-0.035
 Alice-0.035
oph-0.035
 Lib-0.035
Tokeneros
Token ID27498
Feature activation+7.31e-01

Pos logit contributions
oken+0.081
 can+0.080
ir+0.075
 affect+0.075
 sincere+0.074
Neg logit contributions
 Lib-0.100
 Samantha-0.099
 avail-0.097
 Debbie-0.097
oph-0.096
Token and
Token ID290
Feature activation+4.00e-01

Pos logit contributions
ho+0.014
oken+0.014
ir+0.014
cut+0.014
itt+0.013
Neg logit contributions
 Alice-0.024
 Samantha-0.024
 Debbie-0.023
 Lib-0.023
oph-0.023
Token showed
Token ID3751
Feature activation-8.67e-02

Pos logit contributions
ho+0.008
cut+0.007
 can+0.007
oc+0.007
itt+0.006
Neg logit contributions
 avail-0.014
 Debbie-0.013
oph-0.013
 Samantha-0.013
 Alice-0.013
Token it
Token ID340
Feature activation+1.36e-01

Pos logit contributions
 Debbie+0.003
 avail+0.003
oph+0.003
ibi+0.003
 Samantha+0.003
Neg logit contributions
cut-0.002
ho-0.002
ir-0.002
 can-0.002
oc-0.002
Token what
Token ID644
Feature activation-3.43e-01

Pos logit contributions
ho+0.002
 can+0.002
cut+0.002
ir+0.002
 forgive+0.002
Neg logit contributions
 avail-0.005
oph-0.005
 Debbie-0.005
 Samantha-0.005
 Alice-0.005
Token ow
Token ID12334
Feature activation+1.34e+00

Pos logit contributions
 Samantha+0.002
 Debbie+0.002
 Alice+0.002
 Kara+0.002
 Lib+0.002
Neg logit contributions
cut-0.002
oken-0.002
ho-0.002
ir-0.002
itt-0.002
Token!
Token ID0
Feature activation+1.67e-01

Pos logit contributions
 can+0.032
ho+0.032
ir+0.031
oc+0.029
itt+0.029
Neg logit contributions
 Debbie-0.043
 Samantha-0.042
 Alice-0.042
 Lib-0.042
oph-0.041
Token It
Token ID632
Feature activation+1.21e-01

Pos logit contributions
cut+0.005
oken+0.004
ho+0.004
ir+0.004
 affect+0.004
Neg logit contributions
 Samantha-0.005
 Alice-0.005
 Debbie-0.004
 Kara-0.004
 Mary-0.004
Token hurts
Token ID20406
Feature activation+1.33e+00

Pos logit contributions
oken+0.003
cut+0.003
ir+0.003
itt+0.003
 sincere+0.003
Neg logit contributions
 Alice-0.003
 Debbie-0.003
 Samantha-0.003
 Kara-0.003
 Megan-0.003
Token!"
Token ID2474
Feature activation-4.02e-01

Pos logit contributions
ho+0.038
ir+0.037
cut+0.037
 can+0.036
itt+0.035
Neg logit contributions
 Debbie-0.032
 avail-0.032
 Samantha-0.032
oph-0.031
 Lib-0.031
Token L
Token ID406
Feature activation-3.34e+00

Pos logit contributions
 avail+0.009
oph+0.009
 Debbie+0.009
 Samantha+0.009
 Lib+0.009
Neg logit contributions
ho-0.012
ir-0.011
 can-0.011
oken-0.011
itt-0.011
Tokenila
Token ID10102
Feature activation+8.46e-02

Pos logit contributions
 avail+0.129
 Alice+0.123
 Samantha+0.123
 Debbie+0.121
die+0.121
Neg logit contributions
ir-0.054
ho-0.050
itt-0.047
oken-0.046
 can-0.045
Token says
Token ID1139
Feature activation-7.00e-01

Pos logit contributions
ho+0.002
cut+0.002
ir+0.002
oken+0.002
itt+0.002
Neg logit contributions
 Alice-0.002
 Samantha-0.002
 Debbie-0.002
 Lib-0.002
 Kara-0.002
Token,
Token ID11
Feature activation-3.21e-01

Pos logit contributions
 Alice+0.022
 Samantha+0.022
 Debbie+0.021
 Lib+0.021
 Mary+0.021
Neg logit contributions
cut-0.015
oken-0.014
ir-0.014
ho-0.014
oc-0.014
Token holding
Token ID4769
Feature activation-2.85e-01

Pos logit contributions
 Samantha+0.010
 Alice+0.010
 Debbie+0.010
 avail+0.009
 Lib+0.009
Neg logit contributions
ho-0.007
ir-0.007
oken-0.007
itt-0.007
 can-0.007
Token her
Token ID607
Feature activation-3.95e-01

Pos logit contributions
oph+0.008
 Debbie+0.008
 avail+0.008
 Lib+0.008
 Samantha+0.008
Neg logit contributions
ir-0.007
 can-0.007
oken-0.007
ho-0.007
 sincere-0.006
Token box
Token ID3091
Feature activation-6.16e-01

Pos logit contributions
ho+0.004
cut+0.004
ir+0.004
oken+0.004
itt+0.004
Neg logit contributions
 Samantha-0.007
 Debbie-0.007
 Alice-0.007
 avail-0.007
oph-0.007
Token and
Token ID290
Feature activation-3.69e-01

Pos logit contributions
 Alice+0.022
 Lib+0.022
 Samantha+0.022
oph+0.022
 Debbie+0.022
Neg logit contributions
oken-0.011
 affect-0.010
cut-0.010
ho-0.010
ir-0.010
Token saw
Token ID2497
Feature activation-9.87e-03

Pos logit contributions
 Samantha+0.012
 Debbie+0.012
 avail+0.012
 Alice+0.012
oph+0.012
Neg logit contributions
ho-0.007
cut-0.007
ir-0.007
oken-0.007
 can-0.007
Token a
Token ID257
Feature activation-5.93e-01

Pos logit contributions
 Samantha+0.000
 Debbie+0.000
 avail+0.000
oph+0.000
 Alice+0.000
Neg logit contributions
ho-0.000
cut-0.000
ir-0.000
 can-0.000
oken-0.000
Token j
Token ID474
Feature activation-1.17e+00

Pos logit contributions
 Samantha+0.017
 Debbie+0.017
 avail+0.016
 Alice+0.016
oph+0.016
Neg logit contributions
ho-0.014
cut-0.014
ir-0.014
oken-0.013
 can-0.013
Tokenigsaw
Token ID45636
Feature activation-3.32e+00

Pos logit contributions
 Debbie+0.035
 Samantha+0.035
 Alice+0.034
 avail+0.034
 Lib+0.033
Neg logit contributions
ir-0.028
ho-0.028
itt-0.027
oken-0.026
cut-0.026
Token puzzle
Token ID15027
Feature activation-1.03e+00

Pos logit contributions
 Samantha+0.119
 Debbie+0.116
 Kara+0.113
ibi+0.111
 Carly+0.111
Neg logit contributions
cut-0.066
 can-0.065
'-0.062
ho-0.061
oc-0.061
Token inside
Token ID2641
Feature activation-7.36e-01

Pos logit contributions
 Samantha+0.038
 Debbie+0.037
 avail+0.036
oph+0.036
 Kara+0.036
Neg logit contributions
ho-0.018
cut-0.018
 can-0.018
'-0.017
ir-0.016
Token.
Token ID13
Feature activation+1.03e-01

Pos logit contributions
 Samantha+0.022
 Debbie+0.021
 avail+0.021
 Alice+0.021
oph+0.021
Neg logit contributions
ho-0.017
cut-0.017
ir-0.016
oken-0.016
 can-0.016
Token He
Token ID679
Feature activation-3.88e-02

Pos logit contributions
 can+0.002
itt+0.002
ho+0.002
cut+0.002
 Did+0.002
Neg logit contributions
 avail-0.003
oph-0.003
 Debbie-0.003
 Samantha-0.003
ibi-0.003
Token paused
Token ID24487
Feature activation+1.09e-01

Pos logit contributions
 Samantha+0.001
 Debbie+0.001
 avail+0.001
 Alice+0.001
oph+0.001
Neg logit contributions
ho-0.001
cut-0.001
ir-0.001
oken-0.001
itt-0.001
Token a
Token ID257
Feature activation-5.51e-01

Pos logit contributions
 avail+0.006
 Samantha+0.006
ibi+0.006
 Debbie+0.006
oph+0.006
Neg logit contributions
ho-0.005
 can-0.005
ir-0.005
cut-0.005
itt-0.005
Token bright
Token ID6016
Feature activation-2.25e-01

Pos logit contributions
 Samantha+0.015
 Debbie+0.015
 avail+0.015
 Alice+0.015
oph+0.014
Neg logit contributions
ho-0.014
cut-0.014
ir-0.013
oken-0.013
 can-0.013
Token pink
Token ID11398
Feature activation-8.95e-01

Pos logit contributions
 Samantha+0.008
 Debbie+0.008
 avail+0.008
oph+0.007
 Kara+0.007
Neg logit contributions
 can-0.004
ho-0.004
cut-0.004
ir-0.004
oc-0.004
Token cover
Token ID3002
Feature activation-1.28e-01

Pos logit contributions
 Samantha+0.026
 Debbie+0.025
 Kara+0.025
 avail+0.025
oph+0.025
Neg logit contributions
ho-0.021
 can-0.021
cut-0.020
oc-0.020
ir-0.019
Token and
Token ID290
Feature activation-3.75e-01

Pos logit contributions
 Samantha+0.004
 Debbie+0.004
 Alice+0.004
 avail+0.004
oph+0.004
Neg logit contributions
ho-0.003
cut-0.002
ir-0.002
oken-0.002
itt-0.002
Token L
Token ID406
Feature activation-3.28e+00

Pos logit contributions
oph+0.011
 Samantha+0.011
 avail+0.011
 Debbie+0.011
ibi+0.011
Neg logit contributions
 can-0.008
ho-0.008
cut-0.008
oc-0.007
 sincere-0.007
Tokenizzy
Token ID40593
Feature activation-8.33e-01

Pos logit contributions
 Samantha+0.112
 avail+0.111
 Debbie+0.110
 Alice+0.110
 Lib+0.110
Neg logit contributions
ir-0.062
ho-0.061
oken-0.060
 can-0.058
itt-0.057
Token was
Token ID373
Feature activation-1.14e+00

Pos logit contributions
 Samantha+0.027
 Debbie+0.027
 avail+0.026
 Alice+0.026
oph+0.026
Neg logit contributions
ho-0.017
cut-0.016
ir-0.016
oken-0.016
 can-0.016
Token very
Token ID845
Feature activation-3.26e-01

Pos logit contributions
 avail+0.034
 Samantha+0.034
oph+0.034
 Debbie+0.033
 Alice+0.033
Neg logit contributions
ho-0.025
cut-0.025
 can-0.024
ir-0.024
oken-0.024
Token excited
Token ID6568
Feature activation+2.76e-01

Pos logit contributions
 Samantha+0.010
 Debbie+0.010
 Alice+0.010
 avail+0.010
oph+0.010
Neg logit contributions
ho-0.007
cut-0.007
ir-0.007
oken-0.007
itt-0.007
Token.
Token ID13
Feature activation-1.92e-01

Pos logit contributions
 or+0.003
 and+0.003
 can+0.003
ho+0.002
,+0.002
Neg logit contributions
 Debbie-0.012
Whe-0.012
 avail-0.012
 Kara-0.012
 Samantha-0.012
Token to
Token ID284
Feature activation+6.02e-02

Pos logit contributions
ho+0.013
cut+0.013
ir+0.013
oken+0.012
itt+0.012
Neg logit contributions
 Samantha-0.015
 Lib-0.015
 Alice-0.015
 Debbie-0.015
 avail-0.015
Token the
Token ID262
Feature activation+3.52e-01

Pos logit contributions
 can+0.001
ir+0.001
 or+0.001
oken+0.001
'+0.001
Neg logit contributions
oph-0.002
ibi-0.002
 Debbie-0.002
 Samantha-0.002
 Alice-0.002
Token beach
Token ID10481
Feature activation-4.22e-02

Pos logit contributions
ho+0.011
cut+0.010
ir+0.010
oken+0.010
 can+0.010
Neg logit contributions
 Samantha-0.008
 Debbie-0.008
 Alice-0.008
 avail-0.008
oph-0.008
Token?"
Token ID1701
Feature activation-4.35e-01

Pos logit contributions
 Alice+0.001
 Samantha+0.001
 Debbie+0.001
 Lib+0.001
oph+0.001
Neg logit contributions
oken-0.001
ho-0.001
itt-0.001
ir-0.001
cut-0.001
Token asked
Token ID1965
Feature activation-1.25e+00

Pos logit contributions
 Debbie+0.012
oph+0.012
 avail+0.012
 Samantha+0.012
 Alice+0.011
Neg logit contributions
ho-0.011
 can-0.011
ir-0.011
cut-0.010
oken-0.010
Token L
Token ID406
Feature activation-3.28e+00

Pos logit contributions
oph+0.041
 Debbie+0.040
 avail+0.040
 Samantha+0.039
 Lib+0.038
Neg logit contributions
 can-0.027
ir-0.025
ho-0.025
 or-0.023
 forgive-0.023
Tokenizzy
Token ID40593
Feature activation-5.69e-01

Pos logit contributions
 Samantha+0.101
 Debbie+0.101
 Alice+0.099
 avail+0.099
oph+0.098
Neg logit contributions
ho-0.072
cut-0.069
ir-0.068
oken-0.066
itt-0.065
Token.
Token ID13
Feature activation-6.29e-01

Pos logit contributions
 Samantha+0.017
 Debbie+0.017
 avail+0.017
 Alice+0.017
oph+0.017
Neg logit contributions
ho-0.013
cut-0.012
ir-0.012
oken-0.012
 can-0.011
Token
Token ID220
Feature activation-9.73e-01

Pos logit contributions
 Samantha+0.018
 Debbie+0.018
 Alice+0.018
 Lib+0.018
 avail+0.017
Neg logit contributions
ho-0.015
cut-0.015
ir-0.015
oken-0.015
itt-0.014
Token
Token ID198
Feature activation-1.64e-01

Pos logit contributions
 avail+0.035
oph+0.034
 Debbie+0.034
 Samantha+0.033
 Lib+0.033
Neg logit contributions
 can-0.016
ir-0.016
ho-0.016
itt-0.015
'-0.015
Token
Token ID198
Feature activation-3.90e-01

Pos logit contributions
 avail+0.005
 Samantha+0.005
 Debbie+0.005
oph+0.005
 Alice+0.005
Neg logit contributions
ho-0.003
ir-0.003
 can-0.003
cut-0.003
'-0.003
Token It
Token ID632
Feature activation-2.03e-01

Pos logit contributions
 avail+0.009
 Samantha+0.009
 Debbie+0.009
ibi+0.009
 Lib+0.009
Neg logit contributions
 can-0.007
ir-0.006
ho-0.006
'-0.006
oken-0.006
Token was
Token ID373
Feature activation-6.57e-02

Pos logit contributions
 Samantha+0.007
 Alice+0.007
 Debbie+0.007
oph+0.007
 Lib+0.007
Neg logit contributions
ho-0.004
ir-0.004
cut-0.004
oken-0.004
itt-0.003
Token a
Token ID257
Feature activation-1.14e-01

Pos logit contributions
 Samantha+0.002
 Debbie+0.002
 avail+0.002
oph+0.002
 Alice+0.002
Neg logit contributions
ho-0.002
cut-0.002
ir-0.002
oken-0.002
 can-0.002
Token big
Token ID1263
Feature activation-3.50e-01

Pos logit contributions
 Samantha+0.003
 Debbie+0.003
 Alice+0.003
 avail+0.003
oph+0.003
Neg logit contributions
ho-0.003
ir-0.003
cut-0.003
oken-0.003
itt-0.003
Token j
Token ID474
Feature activation-5.24e-01

Pos logit contributions
 Samantha+0.011
 Alice+0.011
 avail+0.011
 Debbie+0.011
oph+0.010
Neg logit contributions
ho-0.007
ir-0.007
oken-0.007
cut-0.007
itt-0.007
Tokenigsaw
Token ID45636
Feature activation-3.24e+00

Pos logit contributions
 Debbie+0.015
 Samantha+0.015
 avail+0.015
 Alice+0.015
 Lib+0.014
Neg logit contributions
ir-0.013
ho-0.013
itt-0.012
oken-0.012
cut-0.012
Token puzzle
Token ID15027
Feature activation-5.92e-01

Pos logit contributions
 Samantha+0.109
 Debbie+0.106
 Kara+0.103
 Carly+0.102
 avail+0.101
Neg logit contributions
cut-0.071
 can-0.070
'-0.069
ho-0.068
oc-0.066
Token and
Token ID290
Feature activation-1.29e+00

Pos logit contributions
 Samantha+0.021
 Debbie+0.021
 avail+0.021
oph+0.021
 Alice+0.020
Neg logit contributions
ho-0.010
cut-0.010
 can-0.009
ir-0.009
itt-0.009
Token she
Token ID673
Feature activation-4.06e-01

Pos logit contributions
 avail+0.034
 Samantha+0.034
 Debbie+0.034
oph+0.033
ibi+0.033
Neg logit contributions
 can-0.032
ho-0.032
cut-0.030
 sincere-0.030
ir-0.030
Token wanted
Token ID2227
Feature activation+4.78e-02

Pos logit contributions
 Alice+0.013
 Debbie+0.013
 Samantha+0.013
oph+0.013
 Lib+0.013
Neg logit contributions
ir-0.008
oken-0.008
ho-0.008
cut-0.007
itt-0.007
Token to
Token ID284
Feature activation+4.72e-02

Pos logit contributions
ho+0.001
 can+0.001
ir+0.001
cut+0.001
oken+0.001
Neg logit contributions
 Samantha-0.002
 Debbie-0.002
 avail-0.002
oph-0.002
 Alice-0.002
Token jumped
Token ID11687
Feature activation-1.23e+00

Pos logit contributions
 avail+0.003
oph+0.003
isi+0.003
 possible+0.003
 Debbie+0.003
Neg logit contributions
ho-0.002
 can-0.002
 sw-0.002
bed-0.001
 realized-0.001
Token up
Token ID510
Feature activation+1.72e-01

Pos logit contributions
 Samantha+0.035
 avail+0.035
 Debbie+0.035
oph+0.034
 Alice+0.034
Neg logit contributions
ho-0.031
ir-0.029
cut-0.029
 can-0.028
oken-0.028
Token and
Token ID290
Feature activation-3.82e-01

Pos logit contributions
ho+0.003
ir+0.003
cut+0.002
 can+0.002
oken+0.002
Neg logit contributions
 avail-0.006
 Debbie-0.006
 Samantha-0.006
oph-0.006
 Alice-0.006
Token l
Token ID300
Feature activation-1.44e+00

Pos logit contributions
 avail+0.012
oph+0.012
 Samantha+0.011
 Debbie+0.011
isi+0.011
Neg logit contributions
ho-0.009
 can-0.009
cut-0.008
 sincere-0.007
'-0.007
Tokenicked
Token ID9484
Feature activation-9.63e-01

Pos logit contributions
 Samantha+0.048
 Debbie+0.048
 Alice+0.047
 Lib+0.047
oph+0.047
Neg logit contributions
ho-0.027
cut-0.027
ir-0.025
 can-0.025
oken-0.025
Token Jas
Token ID21961
Feature activation-3.21e+00

Pos logit contributions
 Samantha+0.022
 Debbie+0.021
 Alice+0.021
 avail+0.021
oph+0.021
Neg logit contributions
ho-0.029
cut-0.028
ir-0.028
oken-0.028
itt-0.027
Tokenmine
Token ID3810
Feature activation-1.58e-01

Pos logit contributions
 Samantha+0.105
 Debbie+0.105
 Alice+0.104
oph+0.102
 avail+0.102
Neg logit contributions
ho-0.065
cut-0.062
oken-0.060
ir-0.060
itt-0.059
Token's
Token ID338
Feature activation+6.70e-01

Pos logit contributions
 avail+0.007
 Samantha+0.007
 Debbie+0.007
oph+0.007
 Lib+0.007
Neg logit contributions
ir-0.002
'-0.002
ho-0.002
oc-0.002
e-0.002
Token face
Token ID1986
Feature activation+1.71e-01

Pos logit contributions
ho+0.014
cut+0.014
ir+0.014
oken+0.013
itt+0.013
Neg logit contributions
 Samantha-0.021
 Debbie-0.021
 Alice-0.021
 avail-0.020
oph-0.020
Token.
Token ID13
Feature activation+2.50e-01

Pos logit contributions
ho+0.004
cut+0.004
ir+0.004
oken+0.004
itt+0.004
Neg logit contributions
 Samantha-0.005
 Debbie-0.005
 Alice-0.005
 avail-0.005
oph-0.005
Token
Token ID198
Feature activation+2.91e-01

Pos logit contributions
ho+0.006
ir+0.006
 can+0.005
itt+0.005
cut+0.005
Neg logit contributions
 avail-0.007
 Debbie-0.007
 Samantha-0.007
 Alice-0.007
oph-0.007
Token's
Token ID338
Feature activation-1.34e-01

Pos logit contributions
 avail+0.013
ibi+0.013
oph+0.013
 Debbie+0.013
 Samantha+0.012
Neg logit contributions
 can-0.004
ho-0.004
'-0.003
 or-0.003
cut-0.003
Token a
Token ID257
Feature activation+7.58e-02

Pos logit contributions
 Debbie+0.004
oph+0.004
 Samantha+0.004
 avail+0.003
 Alice+0.003
Neg logit contributions
ho-0.003
 can-0.003
oken-0.003
ir-0.003
cut-0.003
Token stream
Token ID4269
Feature activation+4.43e-01

Pos logit contributions
ho+0.002
cut+0.002
 can+0.002
ir+0.002
oken+0.002
Neg logit contributions
 Samantha-0.002
 Debbie-0.002
 Alice-0.002
oph-0.002
 Lib-0.002
Token!"
Token ID2474
Feature activation-7.75e-01

Pos logit contributions
ho+0.012
oken+0.011
ir+0.011
 sincere+0.011
cut+0.011
Neg logit contributions
 Samantha-0.011
 Alice-0.011
 Debbie-0.011
 Lib-0.011
 avail-0.011
Token said
Token ID531
Feature activation-1.91e+00

Pos logit contributions
 avail+0.026
die+0.026
oph+0.025
isi+0.025
 Debbie+0.025
Neg logit contributions
 can-0.014
ir-0.013
'-0.013
 could-0.013
 cannot-0.013
Token Jas
Token ID21961
Feature activation-3.19e+00

Pos logit contributions
die+0.054
isi+0.048
 possible+0.047
 avail+0.046
oph+0.046
Neg logit contributions
ir-0.047
ho-0.046
bed-0.044
 named-0.044
'-0.044
Tokenmine
Token ID3810
Feature activation+5.54e-01

Pos logit contributions
 Samantha+0.101
 Debbie+0.099
 Alice+0.098
oph+0.098
 Lib+0.097
Neg logit contributions
ho-0.071
cut-0.068
itt-0.066
oken-0.066
oc-0.065
Token.
Token ID13
Feature activation-7.89e-01

Pos logit contributions
ho+0.011
ir+0.011
cut+0.010
oken+0.010
 can+0.010
Neg logit contributions
 Samantha-0.018
 Debbie-0.018
 Alice-0.018
 avail-0.018
oph-0.018
Token
Token ID220
Feature activation-7.73e-01

Pos logit contributions
 avail+0.026
oph+0.026
 Debbie+0.026
 Samantha+0.025
 Alice+0.025
Neg logit contributions
ho-0.015
 can-0.015
cut-0.014
itt-0.014
ir-0.014
Token
Token ID198
Feature activation-4.11e-01

Pos logit contributions
 avail+0.027
oph+0.027
 Debbie+0.026
 Samantha+0.026
 Lib+0.026
Neg logit contributions
ir-0.013
ho-0.013
 can-0.013
cut-0.013
itt-0.013
Token
Token ID198
Feature activation-5.89e-01

Pos logit contributions
 avail+0.013
oph+0.013
 Samantha+0.013
 Debbie+0.013
 Lib+0.013
Neg logit contributions
ho-0.008
ir-0.008
 can-0.008
cut-0.008
'-0.007
Token I
Token ID314
Feature activation-1.57e-02

Pos logit contributions
ho+0.006
oken+0.006
cut+0.006
ir+0.006
oc+0.006
Neg logit contributions
 Alice-0.006
 Samantha-0.006
 Debbie-0.006
 avail-0.006
 Kara-0.006
Token have
Token ID423
Feature activation+3.13e-01

Pos logit contributions
 Alice+0.000
 Samantha+0.000
 Kara+0.000
ibi+0.000
 Debbie+0.000
Neg logit contributions
ho-0.000
oken-0.000
ir-0.000
itt-0.000
cut-0.000
Token some
Token ID617
Feature activation+3.82e-01

Pos logit contributions
ho+0.008
 can+0.007
ir+0.007
cut+0.007
oken+0.007
Neg logit contributions
 Debbie-0.009
 Samantha-0.008
oph-0.008
 avail-0.008
 Alice-0.008
Token ice
Token ID4771
Feature activation-1.36e-01

Pos logit contributions
ho+0.014
ir+0.014
cut+0.013
oken+0.013
itt+0.013
Neg logit contributions
 Samantha-0.006
 avail-0.006
 Alice-0.006
 Debbie-0.006
 Lib-0.006
Token?"
Token ID1701
Feature activation-2.37e-01

Pos logit contributions
 avail+0.002
 Samantha+0.002
 Debbie+0.002
 Carly+0.002
 Alice+0.002
Neg logit contributions
ho-0.005
 can-0.005
cut-0.005
itt-0.005
ir-0.005
Token L
Token ID406
Feature activation-3.18e+00

Pos logit contributions
oph+0.007
 avail+0.007
isi+0.007
 Debbie+0.006
 Lib+0.006
Neg logit contributions
 can-0.005
ho-0.005
ir-0.005
 sincere-0.005
 cannot-0.005
Tokenila
Token ID10102
Feature activation-8.42e-02

Pos logit contributions
 avail+0.115
 Alice+0.112
 Samantha+0.112
 Debbie+0.109
 Lib+0.108
Neg logit contributions
ir-0.058
ho-0.054
oken-0.050
itt-0.050
 can-0.049
Token asked
Token ID1965
Feature activation-2.91e-01

Pos logit contributions
 Samantha+0.002
 Debbie+0.002
 Alice+0.002
 avail+0.002
oph+0.002
Neg logit contributions
ho-0.002
cut-0.002
ir-0.002
oken-0.002
itt-0.002
Token.
Token ID13
Feature activation-3.81e-01

Pos logit contributions
 Samantha+0.009
 Debbie+0.009
 avail+0.009
 Alice+0.009
oph+0.009
Neg logit contributions
ho-0.006
ir-0.006
cut-0.006
oken-0.006
 can-0.006
Token
Token ID198
Feature activation+1.63e-01

Pos logit contributions
 Samantha+0.010
 Alice+0.010
 Debbie+0.010
 Lib+0.010
oph+0.010
Neg logit contributions
cut-0.010
ho-0.010
ir-0.010
oken-0.010
oc-0.009
Token
Token ID198
Feature activation-1.36e-01

Pos logit contributions
ho+0.004
cut+0.004
ir+0.004
oken+0.004
itt+0.004
Neg logit contributions
 Samantha-0.005
 Debbie-0.005
 Alice-0.005
oph-0.004
 avail-0.004
TokenLook
Token ID8567
Feature activation-6.26e-01

Pos logit contributions
 avail+0.017
 Debbie+0.017
 Samantha+0.017
 Alice+0.016
oph+0.016
Neg logit contributions
ho-0.012
ir-0.012
cut-0.012
 can-0.011
Did-0.011
Token,
Token ID11
Feature activation+1.22e-01

Pos logit contributions
 avail+0.030
 Debbie+0.029
ibi+0.029
 Samantha+0.028
oph+0.028
Neg logit contributions
 can-0.004
ho-0.004
'-0.003
 or-0.003
ir-0.003
Token a
Token ID257
Feature activation-1.32e-01

Pos logit contributions
 can+0.003
ho+0.003
 or+0.003
 cannot+0.003
'+0.003
Neg logit contributions
 avail-0.003
oph-0.003
 Debbie-0.003
ibi-0.003
 Samantha-0.003
Token bird
Token ID6512
Feature activation+4.81e-01

Pos logit contributions
 Samantha+0.004
 Debbie+0.004
 Alice+0.004
 avail+0.004
 Lib+0.004
Neg logit contributions
oken-0.003
ho-0.003
ir-0.003
cut-0.003
itt-0.003
Token!"
Token ID2474
Feature activation-5.07e-01

Pos logit contributions
oken+0.016
ho+0.015
itt+0.015
ir+0.015
cut+0.015
Neg logit contributions
 Alice-0.009
 Samantha-0.009
 Debbie-0.009
 Lib-0.009
 avail-0.009
Token L
Token ID406
Feature activation-3.17e+00

Pos logit contributions
die+0.015
bi+0.015
 avail+0.014
oph+0.014
 possible+0.014
Neg logit contributions
 can-0.012
 or-0.012
 cannot-0.011
ir-0.011
ho-0.011
Tokenila
Token ID10102
Feature activation-6.61e-02

Pos logit contributions
 avail+0.120
 Samantha+0.114
 Alice+0.114
 Debbie+0.113
 Lib+0.110
Neg logit contributions
ir-0.055
ho-0.050
itt-0.046
oken-0.046
 can-0.045
Token says
Token ID1139
Feature activation-8.41e-01

Pos logit contributions
 avail+0.002
 Samantha+0.002
 Debbie+0.002
oph+0.002
 Lib+0.002
Neg logit contributions
ho-0.001
 can-0.001
'-0.001
ir-0.001
cut-0.001
Token.
Token ID13
Feature activation-5.92e-01

Pos logit contributions
 Samantha+0.027
 Debbie+0.027
 avail+0.027
oph+0.027
 Alice+0.026
Neg logit contributions
ho-0.017
ir-0.016
cut-0.016
oken-0.016
 can-0.016
Token "
Token ID366
Feature activation+9.30e-01

Pos logit contributions
 avail+0.021
oph+0.021
 Debbie+0.020
 Samantha+0.019
ibi+0.019
Neg logit contributions
ho-0.011
 can-0.010
itt-0.010
'-0.010
 cannot-0.009
TokenI
Token ID40
Feature activation-2.17e-01

Pos logit contributions
oken+0.026
cut+0.025
ho+0.025
ir+0.024
 sincere+0.024
Neg logit contributions
 Debbie-0.023
 Samantha-0.023
 Alice-0.023
 Lib-0.023
oph-0.022
Token would
Token ID561
Feature activation+9.75e-01

Pos logit contributions
ir+0.022
oken+0.020
ho+0.019
 sincere+0.019
 affect+0.018
Neg logit contributions
 Alice-0.020
 Lib-0.019
oph-0.019
 Debbie-0.019
 Samantha-0.019
Token take
Token ID1011
Feature activation+5.21e-01

Pos logit contributions
ir+0.026
ho+0.026
oken+0.026
itt+0.026
cut+0.024
Neg logit contributions
 Alice-0.025
 avail-0.024
 Debbie-0.024
 Lib-0.023
 Samantha-0.023
Token out
Token ID503
Feature activation-4.99e-01

Pos logit contributions
 can+0.012
oken+0.012
ir+0.012
itt+0.012
cut+0.012
Neg logit contributions
 Samantha-0.015
 Debbie-0.015
oph-0.015
ibi-0.015
 avail-0.014
Token a
Token ID257
Feature activation-5.28e-01

Pos logit contributions
 Alice+0.010
 Samantha+0.010
 Debbie+0.010
 Lib+0.009
 avail+0.009
Neg logit contributions
ho-0.017
cut-0.016
 affect-0.015
ir-0.015
oken-0.015
Token j
Token ID474
Feature activation-6.02e-01

Pos logit contributions
 Samantha+0.016
 Debbie+0.016
oph+0.016
 avail+0.015
 Lib+0.015
Neg logit contributions
 can-0.012
ho-0.012
cut-0.011
ir-0.011
'-0.011
Tokenigsaw
Token ID45636
Feature activation-3.16e+00

Pos logit contributions
 Samantha+0.015
 Debbie+0.015
 Alice+0.015
 avail+0.015
oph+0.015
Neg logit contributions
ho-0.016
ir-0.016
cut-0.016
oken-0.016
itt-0.015
Token puzzle
Token ID15027
Feature activation-7.20e-01

Pos logit contributions
 Samantha+0.102
 Debbie+0.098
 Kara+0.095
 Carly+0.094
 avail+0.093
Neg logit contributions
cut-0.073
 can-0.072
ho-0.072
itt-0.070
'-0.070
Token and
Token ID290
Feature activation-1.52e-02

Pos logit contributions
 Samantha+0.025
 Debbie+0.025
 Alice+0.025
 avail+0.025
oph+0.024
Neg logit contributions
ho-0.013
ir-0.012
oken-0.012
cut-0.012
 sincere-0.011
Token try
Token ID1949
Feature activation+2.88e-01

Pos logit contributions
 avail+0.000
 Alice+0.000
 Samantha+0.000
 Debbie+0.000
 Lib+0.000
Neg logit contributions
ho-0.000
ir-0.000
oken-0.000
cut-0.000
itt-0.000
Token to
Token ID284
Feature activation+1.09e-01

Pos logit contributions
ho+0.006
ir+0.006
cut+0.006
 can+0.006
oken+0.006
Neg logit contributions
 Samantha-0.009
 Debbie-0.009
 avail-0.009
oph-0.009
 Alice-0.009
Token solve
Token ID8494
Feature activation-1.24e+00

Pos logit contributions
ho+0.002
ir+0.002
oken+0.002
cut+0.002
itt+0.002
Neg logit contributions
 avail-0.004
 Alice-0.004
 Samantha-0.004
 Debbie-0.004
oph-0.004
Token.
Token ID13
Feature activation+5.57e-01

Pos logit contributions
 Debbie+0.022
 avail+0.022
 Samantha+0.022
oph+0.022
 Lib+0.022
Neg logit contributions
ho-0.015
cut-0.015
 can-0.014
ir-0.014
oc-0.014
Token It
Token ID632
Feature activation-2.82e-01

Pos logit contributions
ir+0.016
oken+0.015
ho+0.015
cut+0.015
 affect+0.015
Neg logit contributions
 Alice-0.015
 Samantha-0.015
 Mary-0.014
 Debbie-0.014
 Kara-0.014
Token's
Token ID338
Feature activation-5.41e-01

Pos logit contributions
 Alice+0.008
 Mary+0.007
 Debbie+0.007
 Kara+0.007
 Samantha+0.007
Neg logit contributions
ir-0.007
ho-0.007
oken-0.007
itt-0.007
 sincere-0.007
Token not
Token ID407
Feature activation-4.84e-01

Pos logit contributions
 Alice+0.015
 Samantha+0.015
 Debbie+0.015
 Lib+0.014
 Kara+0.014
Neg logit contributions
ho-0.014
ir-0.014
itt-0.014
cut-0.013
oken-0.013
Token a
Token ID257
Feature activation-3.32e-01

Pos logit contributions
oph+0.014
 Debbie+0.014
 avail+0.014
 Samantha+0.014
 Lib+0.013
Neg logit contributions
 can-0.011
ho-0.011
cut-0.011
ir-0.011
oken-0.011
Token good
Token ID922
Feature activation-3.12e+00

Pos logit contributions
 Alice+0.008
 Samantha+0.007
 Debbie+0.007
 Lib+0.007
 avail+0.007
Neg logit contributions
ir-0.010
oken-0.010
itt-0.009
cut-0.009
ended-0.009
Token place
Token ID1295
Feature activation+2.22e-01

Pos logit contributions
 Samantha+0.067
 Debbie+0.066
 Alice+0.066
 avail+0.065
 Lib+0.064
Neg logit contributions
ho-0.097
ir-0.095
cut-0.094
oken-0.092
itt-0.092
Token to
Token ID284
Feature activation-3.07e-01

Pos logit contributions
ir+0.006
 sincere+0.006
cut+0.006
ho+0.006
oken+0.006
Neg logit contributions
 Alice-0.006
 Samantha-0.006
 Lib-0.005
 Mary-0.005
 Debbie-0.005
Token go
Token ID467
Feature activation-4.87e-01

Pos logit contributions
 Alice+0.008
 avail+0.008
 Samantha+0.007
 Mary+0.007
 Sue+0.007
Neg logit contributions
itt-0.009
ho-0.009
cut-0.008
oken-0.008
 sincere-0.008
Token."
Token ID526
Feature activation-1.26e-01

Pos logit contributions
 Alice+0.013
 Samantha+0.012
 Lib+0.012
 avail+0.012
 Kara+0.011
Neg logit contributions
itt-0.014
ho-0.014
cut-0.013
ir-0.013
oken-0.013
Token
Token ID198
Feature activation-1.18e-01

Pos logit contributions
 Samantha+0.004
 Debbie+0.004
 avail+0.004
 Alice+0.004
oph+0.004
Neg logit contributions
ho-0.003
ir-0.003
cut-0.003
 can-0.003
oken-0.003
Token bee
Token ID20697
Feature activation+1.45e+00

Pos logit contributions
ho+0.004
ir+0.004
cut+0.004
oken+0.004
oc+0.004
Neg logit contributions
 Debbie-0.005
 Samantha-0.005
 avail-0.005
 Alice-0.005
oph-0.005
Token is
Token ID318
Feature activation-2.60e-01

Pos logit contributions
oken+0.040
cut+0.038
ended+0.038
itt+0.036
 affect+0.036
Neg logit contributions
 Alice-0.034
 Megan-0.033
 Debbie-0.033
 Mary-0.033
 Rosa-0.032
Token angry
Token ID7954
Feature activation+3.84e-01

Pos logit contributions
 Samantha+0.008
 Debbie+0.008
 Alice+0.008
 avail+0.008
 Lib+0.008
Neg logit contributions
ho-0.006
cut-0.005
ir-0.005
oken-0.005
itt-0.005
Token and
Token ID290
Feature activation-3.42e-01

Pos logit contributions
ho+0.006
 can+0.006
ir+0.005
cut+0.005
oken+0.005
Neg logit contributions
 Samantha-0.015
 Debbie-0.015
 avail-0.014
oph-0.014
 Lib-0.014
Token strikes
Token ID8956
Feature activation-5.02e-01

Pos logit contributions
 Samantha+0.010
 Debbie+0.010
 Alice+0.010
 Lib+0.010
 avail+0.010
Neg logit contributions
oken-0.008
ir-0.008
ho-0.008
cut-0.008
itt-0.007
Token L
Token ID406
Feature activation-3.11e+00

Pos logit contributions
 Samantha+0.012
 Debbie+0.011
 Alice+0.011
 Lib+0.011
 avail+0.011
Neg logit contributions
cut-0.015
ho-0.015
oken-0.014
ir-0.014
itt-0.014
Tokenila
Token ID10102
Feature activation-6.72e-01

Pos logit contributions
 avail+0.116
 Alice+0.113
 Samantha+0.113
 Debbie+0.111
 Lib+0.110
Neg logit contributions
ir-0.054
ho-0.052
 can-0.048
itt-0.048
oken-0.047
Token on
Token ID319
Feature activation-7.58e-01

Pos logit contributions
 Alice+0.018
 Debbie+0.017
 Megan+0.017
 Samantha+0.017
 Kara+0.017
Neg logit contributions
oken-0.017
 sincere-0.016
cut-0.015
itt-0.015
ho-0.015
Token the
Token ID262
Feature activation-2.82e-01

Pos logit contributions
 avail+0.021
oph+0.021
 Debbie+0.021
 Samantha+0.021
 Lib+0.020
Neg logit contributions
ho-0.019
ir-0.018
 can-0.018
cut-0.017
oken-0.017
Token hand
Token ID1021
Feature activation-2.56e-01

Pos logit contributions
 Debbie+0.006
 Alice+0.006
 Samantha+0.006
 avail+0.006
 Lib+0.006
Neg logit contributions
oc-0.008
ir-0.008
oken-0.008
itt-0.008
ho-0.008
Token.
Token ID13
Feature activation-5.08e-01

Pos logit contributions
 Alice+0.006
 Samantha+0.006
 Debbie+0.006
 Mary+0.006
 Lib+0.006
Neg logit contributions
oken-0.008
 sincere-0.007
cut-0.007
ir-0.007
itt-0.007
TokenStay
Token ID25681
Feature activation+4.78e-01

Pos logit contributions
 Samantha+0.016
 Debbie+0.016
 Alice+0.016
 avail+0.016
 Lib+0.015
Neg logit contributions
ho-0.013
ir-0.012
cut-0.012
oken-0.012
itt-0.012
Token away
Token ID1497
Feature activation+7.05e-01

Pos logit contributions
ho+0.011
ir+0.011
itt+0.011
oken+0.011
cut+0.011
Neg logit contributions
 Samantha-0.014
 Debbie-0.014
 Alice-0.014
 avail-0.014
 Lib-0.014
Token,
Token ID11
Feature activation-4.82e-01

Pos logit contributions
ho+0.010
cut+0.009
 can+0.009
ir+0.009
 or+0.009
Neg logit contributions
 avail-0.027
 Debbie-0.027
oph-0.027
 Samantha-0.027
 Lib-0.026
Token dog
Token ID3290
Feature activation+6.67e-01

Pos logit contributions
 Samantha+0.009
 Alice+0.009
 Debbie+0.009
 Kara+0.008
 Carly+0.008
Neg logit contributions
cut-0.017
ho-0.017
oken-0.017
itt-0.016
ir-0.016
Token!"
Token ID2474
Feature activation-1.12e-01

Pos logit contributions
ho+0.016
 can+0.016
cut+0.016
oc+0.016
 or+0.015
Neg logit contributions
 avail-0.021
 Debbie-0.020
oph-0.020
 Samantha-0.020
ibi-0.019
Token L
Token ID406
Feature activation-3.11e+00

Pos logit contributions
die+0.003
oph+0.003
 avail+0.003
bi+0.003
isi+0.003
Neg logit contributions
 can-0.003
 or-0.003
ho-0.003
 cannot-0.003
ir-0.003
Tokenila
Token ID10102
Feature activation+4.01e-02

Pos logit contributions
 avail+0.123
 Alice+0.116
 Samantha+0.115
die+0.114
 Debbie+0.114
Neg logit contributions
ir-0.050
ho-0.046
itt-0.043
 can-0.042
'-0.041
Token says
Token ID1139
Feature activation-7.81e-01

Pos logit contributions
ho+0.001
cut+0.001
ir+0.001
oken+0.001
itt+0.001
Neg logit contributions
 Samantha-0.001
 Debbie-0.001
 Alice-0.001
 avail-0.001
 Lib-0.001
Token.
Token ID13
Feature activation-3.65e-01

Pos logit contributions
 Alice+0.021
 Samantha+0.020
 Debbie+0.020
 Lib+0.020
 avail+0.019
Neg logit contributions
cut-0.021
ho-0.020
ir-0.020
oken-0.020
oc-0.020
Token She
Token ID1375
Feature activation+5.62e-02

Pos logit contributions
 Samantha+0.011
 Debbie+0.011
 avail+0.011
 Alice+0.011
oph+0.011
Neg logit contributions
ho-0.008
cut-0.008
ir-0.008
oken-0.008
 can-0.007
Token uses
Token ID3544
Feature activation+4.82e-01

Pos logit contributions
ho+0.001
cut+0.001
ir+0.001
itt+0.001
oken+0.001
Neg logit contributions
 Debbie-0.002
 Alice-0.002
 Samantha-0.002
 Carly-0.001
 Kara-0.001
Token very
Token ID845
Feature activation+3.90e-01

Pos logit contributions
 Debbie+0.008
 avail+0.008
 Carly+0.007
oph+0.007
ibi+0.007
Neg logit contributions
 can-0.005
ho-0.005
 or-0.005
ir-0.005
 sincere-0.005
Token long
Token ID890
Feature activation+9.36e-01

Pos logit contributions
ho+0.010
ir+0.010
cut+0.010
 can+0.010
oken+0.009
Neg logit contributions
 Debbie-0.011
 Samantha-0.011
 Alice-0.010
 avail-0.010
oph-0.010
Token and
Token ID290
Feature activation-8.51e-01

Pos logit contributions
ho+0.022
cut+0.022
ir+0.022
 can+0.021
oken+0.021
Neg logit contributions
 Samantha-0.027
 Debbie-0.027
 avail-0.026
oph-0.026
 Alice-0.026
Token flexible
Token ID12846
Feature activation-9.49e-02

Pos logit contributions
 Samantha+0.024
 Debbie+0.024
 avail+0.024
 Alice+0.024
oph+0.024
Neg logit contributions
ho-0.021
cut-0.020
ir-0.020
 can-0.020
oken-0.019
Token!"
Token ID2474
Feature activation-4.17e-01

Pos logit contributions
 Debbie+0.003
 Samantha+0.003
oph+0.003
 avail+0.003
 Carly+0.003
Neg logit contributions
 can-0.002
ho-0.002
cut-0.002
ir-0.002
 or-0.002
Token L
Token ID406
Feature activation-3.10e+00

Pos logit contributions
die+0.014
 avail+0.013
bi+0.013
 possible+0.012
oph+0.012
Neg logit contributions
 can-0.009
 or-0.009
 cannot-0.008
ir-0.008
'-0.008
Tokenila
Token ID10102
Feature activation+5.94e-02

Pos logit contributions
 avail+0.128
die+0.123
 Alice+0.122
 Samantha+0.119
 Debbie+0.118
Neg logit contributions
ir-0.046
ho-0.036
'-0.036
oken-0.035
itt-0.034
Token said
Token ID531
Feature activation-8.45e-01

Pos logit contributions
ho+0.001
cut+0.001
ir+0.001
'+0.001
 can+0.001
Neg logit contributions
 avail-0.002
 Debbie-0.002
 Samantha-0.002
 Carly-0.002
 Alice-0.002
Token,
Token ID11
Feature activation-5.91e-01

Pos logit contributions
oph+0.032
 Debbie+0.031
 avail+0.031
 Samantha+0.031
 Carly+0.030
Neg logit contributions
ho-0.013
'-0.013
ir-0.012
 can-0.011
itt-0.011
Token bending
Token ID30808
Feature activation-6.15e-01

Pos logit contributions
 Samantha+0.017
 Debbie+0.017
 Alice+0.017
 avail+0.016
 Lib+0.016
Neg logit contributions
ho-0.014
ir-0.014
cut-0.014
oken-0.014
 can-0.013
Token the
Token ID262
Feature activation+1.30e-01

Pos logit contributions
 Debbie+0.016
 Samantha+0.016
 avail+0.016
oph+0.016
 Alice+0.016
Neg logit contributions
ho-0.016
ir-0.016
cut-0.015
 can-0.015
oken-0.015
Token look
Token ID804
Feature activation-5.14e-01

Pos logit contributions
 avail+0.009
oph+0.009
 Samantha+0.009
 Debbie+0.009
ibi+0.009
Neg logit contributions
 can-0.007
ho-0.007
ir-0.006
 cannot-0.006
itt-0.006
Token at
Token ID379
Feature activation-2.50e-01

Pos logit contributions
 Samantha+0.023
 avail+0.023
 Debbie+0.023
oph+0.022
 Alice+0.022
Neg logit contributions
ho-0.005
 can-0.005
ir-0.004
oken-0.004
cut-0.004
Token that
Token ID326
Feature activation+4.09e-01

Pos logit contributions
 avail+0.008
 Debbie+0.008
 Samantha+0.008
oph+0.008
ibi+0.008
Neg logit contributions
ho-0.005
ir-0.005
 can-0.005
cut-0.004
oken-0.004
Token balloon
Token ID21190
Feature activation-3.99e-01

Pos logit contributions
'+0.010
 can+0.010
ho+0.010
 or+0.009
cut+0.009
Neg logit contributions
ibi-0.012
 Samantha-0.012
oph-0.011
 avail-0.011
 Kara-0.011
Token!"
Token ID2474
Feature activation-4.13e-01

Pos logit contributions
 Samantha+0.012
 avail+0.011
 Debbie+0.011
oph+0.011
ibi+0.011
Neg logit contributions
 can-0.011
ho-0.010
cut-0.010
'-0.009
ir-0.009
Token L
Token ID406
Feature activation-3.09e+00

Pos logit contributions
 avail+0.012
oph+0.012
die+0.012
isi+0.012
bi+0.012
Neg logit contributions
 can-0.009
ir-0.009
 or-0.008
ho-0.008
 cannot-0.008
Tokenila
Token ID10102
Feature activation+2.49e-01

Pos logit contributions
 avail+0.122
 Alice+0.118
 Samantha+0.115
 Debbie+0.114
die+0.113
Neg logit contributions
ir-0.052
ho-0.045
itt-0.042
'-0.041
oken-0.040
Token says
Token ID1139
Feature activation-6.64e-01

Pos logit contributions
 can+0.005
'+0.004
ho+0.004
ir+0.004
 or+0.004
Neg logit contributions
 avail-0.009
 Samantha-0.008
 Debbie-0.008
oph-0.008
 Lib-0.008
Token.
Token ID13
Feature activation-6.19e-01

Pos logit contributions
 Alice+0.018
 Samantha+0.018
 Debbie+0.018
 avail+0.018
 Lib+0.018
Neg logit contributions
ho-0.017
cut-0.016
ir-0.016
oken-0.016
itt-0.015
Token "
Token ID366
Feature activation+1.12e+00

Pos logit contributions
 avail+0.021
oph+0.021
 Samantha+0.020
 Lib+0.020
 Debbie+0.020
Neg logit contributions
ho-0.012
 can-0.012
itt-0.011
'-0.011
ir-0.011
TokenIt
Token ID1026
Feature activation-3.91e-01

Pos logit contributions
oken+0.032
ir+0.031
cut+0.031
ho+0.030
 sincere+0.030
Neg logit contributions
 Debbie-0.027
 Samantha-0.027
 Alice-0.027
 Lib-0.027
oph-0.026
Token to
Token ID284
Feature activation-1.92e-01

Pos logit contributions
ho+0.013
cut+0.012
itt+0.012
ir+0.012
oken+0.012
Neg logit contributions
 Alice-0.014
 Samantha-0.013
 Debbie-0.013
 Lib-0.013
 Kara-0.013
Token Mia
Token ID32189
Feature activation-8.56e-01

Pos logit contributions
oph+0.006
 Debbie+0.006
 Samantha+0.006
 avail+0.006
 Lib+0.005
Neg logit contributions
ir-0.004
 can-0.004
ho-0.004
oken-0.004
cut-0.004
Token's
Token ID338
Feature activation+3.84e-01

Pos logit contributions
 Alice+0.025
 Debbie+0.025
 Samantha+0.025
oph+0.024
 avail+0.024
Neg logit contributions
ho-0.021
itt-0.020
cut-0.020
oken-0.020
ir-0.019
Token house
Token ID2156
Feature activation+3.14e-01

Pos logit contributions
ho+0.006
 can+0.006
oken+0.005
ir+0.005
 sincere+0.005
Neg logit contributions
 Samantha-0.014
 Debbie-0.014
oph-0.014
 avail-0.014
 Lib-0.014
Token?"
Token ID1701
Feature activation-3.05e-01

Pos logit contributions
ho+0.008
itt+0.007
cut+0.007
ir+0.007
oken+0.007
Neg logit contributions
 Alice-0.009
 Samantha-0.009
 Debbie-0.009
oph-0.009
 Lib-0.009
Token L
Token ID406
Feature activation-3.09e+00

Pos logit contributions
oph+0.009
 avail+0.009
 Debbie+0.008
isi+0.008
 Lib+0.008
Neg logit contributions
 can-0.007
ho-0.007
ir-0.007
 sincere-0.006
 or-0.006
Tokenila
Token ID10102
Feature activation-5.36e-02

Pos logit contributions
 avail+0.113
 Alice+0.110
 Samantha+0.109
 Debbie+0.108
 Lib+0.107
Neg logit contributions
ir-0.057
ho-0.052
itt-0.048
oken-0.048
 can-0.047
Token asked
Token ID1965
Feature activation-3.52e-01

Pos logit contributions
 Samantha+0.001
 Debbie+0.001
 Alice+0.001
 avail+0.001
oph+0.001
Neg logit contributions
ho-0.001
cut-0.001
ir-0.001
oken-0.001
itt-0.001
Token.
Token ID13
Feature activation-5.81e-01

Pos logit contributions
 Debbie+0.012
 Samantha+0.011
oph+0.011
 avail+0.011
 Lib+0.011
Neg logit contributions
ho-0.007
ir-0.007
 can-0.006
oken-0.006
cut-0.006
Token
Token ID198
Feature activation+1.55e-02

Pos logit contributions
 Samantha+0.013
 Alice+0.013
 Debbie+0.013
 Lib+0.012
 avail+0.012
Neg logit contributions
cut-0.018
ho-0.017
ir-0.017
oken-0.017
oc-0.016
Token
Token ID198
Feature activation-2.72e-01

Pos logit contributions
cut+0.000
ho+0.000
oken+0.000
ir+0.000
 affect+0.000
Neg logit contributions
 Samantha-0.000
 Alice-0.000
 Debbie-0.000
oph-0.000
 Carly-0.000
Token trick
Token ID6908
Feature activation-1.17e-01

Pos logit contributions
 Samantha+0.006
 Debbie+0.006
 Alice+0.006
 avail+0.006
oph+0.006
Neg logit contributions
ho-0.005
cut-0.005
ir-0.005
oken-0.004
itt-0.004
Token-
Token ID12
Feature activation-2.77e-02

Pos logit contributions
 Samantha+0.003
 Debbie+0.003
 avail+0.003
 Alice+0.003
oph+0.003
Neg logit contributions
ho-0.003
cut-0.003
ir-0.003
oken-0.003
itt-0.003
Tokenor
Token ID273
Feature activation-5.97e-01

Pos logit contributions
 Samantha+0.001
 Debbie+0.001
 Alice+0.001
 avail+0.001
oph+0.001
Neg logit contributions
ho-0.001
ir-0.001
cut-0.001
oken-0.001
 can-0.001
Token-
Token ID12
Feature activation+1.90e-02

Pos logit contributions
 Samantha+0.016
 Debbie+0.016
 avail+0.016
oph+0.016
 Alice+0.016
Neg logit contributions
ho-0.015
cut-0.015
ir-0.015
oken-0.014
itt-0.014
Tokent
Token ID83
Feature activation-2.60e-01

Pos logit contributions
ho+0.001
ir+0.001
cut+0.001
oken+0.001
 can+0.001
Neg logit contributions
 Samantha-0.000
 Debbie-0.000
 Alice-0.000
 avail-0.000
 Lib-0.000
Tokenreating
Token ID34567
Feature activation-3.08e+00

Pos logit contributions
oph+0.007
 Debbie+0.007
 avail+0.007
 Samantha+0.006
 Lib+0.006
Neg logit contributions
cut-0.007
 can-0.007
ir-0.007
ho-0.007
 or-0.007
Token anymore
Token ID7471
Feature activation-1.10e+00

Pos logit contributions
 Samantha+0.100
 Debbie+0.100
 Alice+0.099
 avail+0.098
 Lib+0.097
Neg logit contributions
ho-0.062
ir-0.059
cut-0.059
oken-0.058
itt-0.056
Token.
Token ID13
Feature activation+2.32e-01

Pos logit contributions
 Samantha+0.033
 Debbie+0.033
 avail+0.033
 Alice+0.033
oph+0.032
Neg logit contributions
ho-0.025
cut-0.024
ir-0.024
oken-0.023
 can-0.023
Token She
Token ID1375
Feature activation-2.64e-01

Pos logit contributions
ho+0.004
 can+0.004
itt+0.004
 Did+0.004
 sincere+0.004
Neg logit contributions
 avail-0.008
 Debbie-0.007
oph-0.007
 Samantha-0.007
ibi-0.007
Token sadly
Token ID21098
Feature activation+1.12e+00

Pos logit contributions
 Samantha+0.008
 Alice+0.008
 Debbie+0.008
oph+0.008
 Kara+0.008
Neg logit contributions
ir-0.006
ho-0.006
cut-0.005
itt-0.005
oken-0.005
Token put
Token ID1234
Feature activation+5.50e-02

Pos logit contributions
ho+0.026
ir+0.025
cut+0.025
oken+0.024
itt+0.024
Neg logit contributions
 Samantha-0.033
 Debbie-0.033
 Alice-0.033
 avail-0.032
oph-0.032
Token She
Token ID1375
Feature activation-2.25e-01

Pos logit contributions
ho+0.004
ir+0.004
oken+0.004
cut+0.004
itt+0.003
Neg logit contributions
 Alice-0.004
 Samantha-0.004
 Debbie-0.004
 Lib-0.004
 Kara-0.004
Token is
Token ID318
Feature activation-7.06e-01

Pos logit contributions
 Alice+0.007
 Debbie+0.007
 Samantha+0.007
 Lib+0.007
 Clara+0.007
Neg logit contributions
ho-0.005
ir-0.005
cut-0.005
oken-0.005
 affect-0.004
Token so
Token ID523
Feature activation+3.96e-01

Pos logit contributions
 avail+0.021
oph+0.021
 Samantha+0.021
 Debbie+0.021
 Alice+0.020
Neg logit contributions
ho-0.015
 can-0.015
 sincere-0.015
cut-0.015
ir-0.015
Token pretty
Token ID2495
Feature activation-4.48e-01

Pos logit contributions
ho+0.011
cut+0.011
ir+0.011
oken+0.010
itt+0.010
Neg logit contributions
 Samantha-0.010
 Debbie-0.010
 Alice-0.010
 avail-0.009
 Lib-0.009
Token!"
Token ID2474
Feature activation-9.08e-02

Pos logit contributions
 avail+0.018
oph+0.017
 Samantha+0.017
ibi+0.017
 Debbie+0.017
Neg logit contributions
 or-0.006
 can-0.006
cut-0.006
ho-0.006
'-0.005
Token L
Token ID406
Feature activation-3.08e+00

Pos logit contributions
 avail+0.002
oph+0.002
isi+0.002
 Debbie+0.002
 Samantha+0.002
Neg logit contributions
 can-0.002
ho-0.002
ir-0.002
'-0.002
 sincere-0.002
Tokenola
Token ID5708
Feature activation-7.43e-01

Pos logit contributions
 avail+0.112
 Samantha+0.108
 Alice+0.106
 Lib+0.102
 Heidi+0.102
Neg logit contributions
ir-0.060
oken-0.055
'-0.053
 can-0.053
ho-0.051
Token says
Token ID1139
Feature activation-6.55e-01

Pos logit contributions
 avail+0.021
 Samantha+0.021
 Debbie+0.020
oph+0.020
 Lib+0.019
Neg logit contributions
 can-0.018
ho-0.018
cut-0.017
'-0.017
ir-0.017
Token.
Token ID13
Feature activation-4.64e-01

Pos logit contributions
 Alice+0.018
 Samantha+0.018
 Debbie+0.018
 Lib+0.018
 avail+0.017
Neg logit contributions
cut-0.016
ho-0.016
ir-0.016
oken-0.016
oc-0.015
Token
Token ID198
Feature activation+3.85e-02

Pos logit contributions
 avail+0.014
 Samantha+0.014
 Debbie+0.014
oph+0.014
 Alice+0.014
Neg logit contributions
ho-0.010
 can-0.010
ir-0.010
itt-0.009
cut-0.009
Token
Token ID198
Feature activation-2.70e-01

Pos logit contributions
ho+0.001
cut+0.001
ir+0.001
oken+0.001
itt+0.001
Neg logit contributions
 Samantha-0.001
 Debbie-0.001
 Alice-0.001
 avail-0.001
oph-0.001
Token the
Token ID262
Feature activation+1.07e+00

Pos logit contributions
 Samantha+0.009
 Debbie+0.008
 Alice+0.008
 avail+0.008
 Lib+0.008
Neg logit contributions
ho-0.010
cut-0.010
oken-0.010
ir-0.010
 sincere-0.009
Token salt
Token ID8268
Feature activation-1.79e-01

Pos logit contributions
ho+0.035
ir+0.035
 sincere+0.034
oken+0.033
cut+0.033
Neg logit contributions
 Debbie-0.021
 Samantha-0.021
 avail-0.020
oph-0.020
 Alice-0.019
Token,
Token ID11
Feature activation-6.50e-01

Pos logit contributions
 Samantha+0.005
 Debbie+0.005
 Alice+0.005
 Lib+0.005
 Clara+0.005
Neg logit contributions
oken-0.004
ir-0.004
ho-0.004
 sincere-0.004
oc-0.004
Token Mom
Token ID11254
Feature activation-7.68e-01

Pos logit contributions
 Samantha+0.013
 Debbie+0.013
 Alice+0.012
oph+0.012
 avail+0.012
Neg logit contributions
ho-0.022
cut-0.022
ir-0.021
oken-0.021
itt-0.021
Token?"
Token ID1701
Feature activation-3.96e-01

Pos logit contributions
 Samantha+0.027
 Debbie+0.027
 avail+0.027
oph+0.027
 Carly+0.027
Neg logit contributions
 can-0.013
cut-0.012
ho-0.012
'-0.012
 or-0.012
Token L
Token ID406
Feature activation-3.08e+00

Pos logit contributions
oph+0.011
 avail+0.010
 Debbie+0.010
isi+0.010
 Alice+0.010
Neg logit contributions
 can-0.010
ho-0.009
ir-0.009
 or-0.009
 sincere-0.009
Tokenila
Token ID10102
Feature activation+2.76e-02

Pos logit contributions
 avail+0.113
 Alice+0.111
 Samantha+0.110
 Debbie+0.109
 Lib+0.108
Neg logit contributions
ir-0.055
ho-0.051
itt-0.047
oken-0.046
 can-0.046
Token asks
Token ID7893
Feature activation-1.51e-01

Pos logit contributions
 can+0.000
'+0.000
cut+0.000
ir+0.000
ho+0.000
Neg logit contributions
 Samantha-0.001
 avail-0.001
 Debbie-0.001
oph-0.001
 Lib-0.001
Token.
Token ID13
Feature activation-5.49e-01

Pos logit contributions
 Samantha+0.005
 Debbie+0.005
 avail+0.005
oph+0.004
 Alice+0.004
Neg logit contributions
ho-0.003
ir-0.003
cut-0.003
oken-0.003
 can-0.003
Token
Token ID198
Feature activation+8.73e-02

Pos logit contributions
 Samantha+0.015
 Debbie+0.015
 Alice+0.015
 avail+0.015
oph+0.015
Neg logit contributions
ho-0.014
cut-0.013
ir-0.013
oken-0.013
itt-0.012
Token
Token ID198
Feature activation-2.40e-01

Pos logit contributions
ho+0.002
cut+0.002
ir+0.002
oken+0.002
itt+0.002
Neg logit contributions
 Samantha-0.002
 Debbie-0.002
 Alice-0.002
 avail-0.002
oph-0.002
Token butterfly
Token ID35113
Feature activation+1.03e+00

Pos logit contributions
ho+0.011
cut+0.011
ir+0.011
oken+0.010
itt+0.010
Neg logit contributions
 Samantha-0.011
 Debbie-0.011
 Alice-0.011
 avail-0.011
oph-0.011
Token turned
Token ID2900
Feature activation-1.27e-01

Pos logit contributions
oken+0.033
itt+0.031
ir+0.029
ighters+0.029
 affect+0.029
Neg logit contributions
 Alice-0.023
oph-0.022
 Samantha-0.022
 Debbie-0.021
 Lib-0.021
Token and
Token ID290
Feature activation-8.77e-01

Pos logit contributions
 Samantha+0.003
 Alice+0.003
 Lib+0.003
 Debbie+0.003
 Carly+0.003
Neg logit contributions
itt-0.004
oken-0.004
cut-0.004
oc-0.003
 affect-0.003
Token looked
Token ID3114
Feature activation-6.84e-01

Pos logit contributions
 Samantha+0.026
 Debbie+0.026
 avail+0.025
 Alice+0.025
oph+0.025
Neg logit contributions
ho-0.021
ir-0.020
cut-0.019
 can-0.019
oken-0.019
Token at
Token ID379
Feature activation-8.88e-01

Pos logit contributions
 Samantha+0.021
 Alice+0.021
 Lib+0.020
 Debbie+0.020
oph+0.020
Neg logit contributions
itt-0.016
ho-0.015
cut-0.015
 affect-0.014
oken-0.014
Token L
Token ID406
Feature activation-3.06e+00

Pos logit contributions
 Samantha+0.023
 Debbie+0.022
 avail+0.022
 Alice+0.022
oph+0.022
Neg logit contributions
ho-0.024
ir-0.024
 can-0.023
oken-0.023
cut-0.022
Tokenola
Token ID5708
Feature activation-1.35e+00

Pos logit contributions
 Samantha+0.095
 avail+0.095
 Alice+0.094
 Debbie+0.091
 Lib+0.091
Neg logit contributions
ir-0.069
ho-0.067
oken-0.067
 can-0.064
itt-0.063
Token with
Token ID351
Feature activation-2.20e-01

Pos logit contributions
 Debbie+0.044
 Samantha+0.043
 Alice+0.043
 Lib+0.042
 avail+0.042
Neg logit contributions
ho-0.027
cut-0.026
oken-0.026
ir-0.026
itt-0.025
Token its
Token ID663
Feature activation+8.83e-01

Pos logit contributions
 Samantha+0.006
 Alice+0.006
 Debbie+0.006
 avail+0.006
 Lib+0.006
Neg logit contributions
ho-0.006
cut-0.006
itt-0.006
oken-0.005
ir-0.005
Token big
Token ID1263
Feature activation-2.23e-01

Pos logit contributions
cut+0.025
itt+0.024
oc+0.024
Did+0.023
ho+0.023
Neg logit contributions
 avail-0.022
 Samantha-0.022
ibi-0.021
 Alice-0.021
 Debbie-0.020
Token eyes
Token ID2951
Feature activation+1.36e+00

Pos logit contributions
 avail+0.006
 Samantha+0.006
 Debbie+0.006
 Alice+0.006
oph+0.006
Neg logit contributions
cut-0.006
itt-0.006
ho-0.006
Did-0.006
ighters-0.006
Token.
Token ID13
Feature activation+8.48e-02

Pos logit contributions
 Samantha+0.012
 Debbie+0.012
 Alice+0.012
 avail+0.012
oph+0.011
Neg logit contributions
ho-0.013
cut-0.013
ir-0.013
oken-0.012
itt-0.012
Token The
Token ID383
Feature activation+3.63e-01

Pos logit contributions
itt+0.002
 can+0.002
ho+0.002
 Did+0.002
oken+0.001
Neg logit contributions
 avail-0.003
 Debbie-0.003
oph-0.003
isi-0.003
 Samantha-0.003
Token end
Token ID886
Feature activation-4.74e-01

Pos logit contributions
ho+0.011
ir+0.011
oc+0.010
cut+0.010
itt+0.010
Neg logit contributions
 Samantha-0.009
 Alice-0.008
 avail-0.008
 Debbie-0.008
oph-0.008
Token.
Token ID13
Feature activation+2.34e-01

Pos logit contributions
 avail+0.018
 Debbie+0.018
oph+0.018
 Samantha+0.018
 Lib+0.018
Neg logit contributions
 can-0.008
 forgive-0.007
'-0.007
ho-0.007
oc-0.007
Token<|endoftext|>
Token ID50256
Feature activation-1.34e+00

Pos logit contributions
itt+0.003
 can+0.003
 forgive+0.003
 Did+0.003
oc+0.003
Neg logit contributions
 avail-0.010
 flavours-0.009
 Debbie-0.009
 flavors-0.009
oph-0.008
TokenHarry
Token ID18308
Feature activation+2.05e+00

Pos logit contributions
 avail+0.046
oph+0.044
 Samantha+0.044
 Debbie+0.043
ibi+0.042
Neg logit contributions
 can-0.027
'-0.025
itt-0.025
ho-0.025
ir-0.024
Token and
Token ID290
Feature activation+1.13e+00

Pos logit contributions
ho+0.041
ir+0.040
oken+0.040
cut+0.039
itt+0.039
Neg logit contributions
 Samantha-0.068
 Debbie-0.067
 Alice-0.067
oph-0.066
 Carly-0.064
Token his
Token ID465
Feature activation+2.87e-01

Pos logit contributions
ho+0.029
 can+0.028
cut+0.027
 forgive+0.026
'+0.026
Neg logit contributions
 avail-0.030
oph-0.030
 Debbie-0.029
ibi-0.028
 Kara-0.028
Token dad
Token ID9955
Feature activation+1.07e+00

Pos logit contributions
ho+0.007
 can+0.007
itt+0.007
ir+0.006
oken+0.006
Neg logit contributions
 avail-0.008
 Debbie-0.008
 Samantha-0.008
 Lib-0.008
oph-0.008
Token were
Token ID547
Feature activation-3.23e-01

Pos logit contributions
cut+0.027
ir+0.027
ho+0.027
oken+0.026
itt+0.026
Neg logit contributions
 Samantha-0.031
oph-0.030
 Alice-0.029
 Debbie-0.029
 Lib-0.029
Token walking
Token ID6155
Feature activation+2.26e-01

Pos logit contributions
 Samantha+0.011
 Debbie+0.011
 Alice+0.010
 Lib+0.010
oph+0.010
Neg logit contributions
ho-0.007
cut-0.006
ir-0.006
oken-0.006
 can-0.006
Token R
Token ID371
Feature activation-3.15e-01

Pos logit contributions
ho+0.001
oc+0.001
 can+0.001
cut+0.001
itt+0.001
Neg logit contributions
 avail-0.001
oph-0.001
 Samantha-0.001
 possible-0.001
 flavours-0.001
Tokenuffy
Token ID15352
Feature activation+6.67e-01

Pos logit contributions
 Debbie+0.008
 Samantha+0.008
 Alice+0.008
oph+0.008
 avail+0.008
Neg logit contributions
ho-0.009
cut-0.008
oken-0.008
ir-0.008
 can-0.008
Token w
Token ID266
Feature activation-4.92e-01

Pos logit contributions
ho+0.016
cut+0.016
ir+0.016
oken+0.015
itt+0.015
Neg logit contributions
 Samantha-0.019
 Debbie-0.019
 avail-0.018
 Alice-0.018
oph-0.018
Tokenagged
Token ID14655
Feature activation-1.02e+00

Pos logit contributions
 Samantha+0.014
 Debbie+0.014
oph+0.014
 Alice+0.014
ibi+0.014
Neg logit contributions
ho-0.012
cut-0.011
 can-0.011
oc-0.010
 sincere-0.010
Token his
Token ID465
Feature activation+1.26e+00

Pos logit contributions
oph+0.043
 Debbie+0.040
 avail+0.038
 Samantha+0.038
 Lib+0.038
Neg logit contributions
 and-0.013
cut-0.012
ho-0.012
 sincere-0.012
bed-0.012
Token tail
Token ID7894
Feature activation+1.99e+00

Pos logit contributions
ho+0.027
ir+0.027
cut+0.026
oken+0.026
 sincere+0.026
Neg logit contributions
 Debbie-0.039
 Samantha-0.039
oph-0.039
 avail-0.039
 Lib-0.039
Token.
Token ID13
Feature activation-9.93e-03

Pos logit contributions
ho+0.040
oken+0.039
ir+0.038
cut+0.038
itt+0.037
Neg logit contributions
 Alice-0.065
 Samantha-0.063
 Lib-0.063
 Debbie-0.062
oph-0.062
Token
Token ID198
Feature activation+1.52e-01

Pos logit contributions
 avail+0.000
oph+0.000
 possible+0.000
 Samantha+0.000
isi+0.000
Neg logit contributions
ho-0.000
 can-0.000
 Did-0.000
oc-0.000
 Don-0.000
Token
Token ID198
Feature activation-1.09e-01

Pos logit contributions
ho+0.003
ir+0.003
cut+0.003
 can+0.003
itt+0.003
Neg logit contributions
 avail-0.005
 Samantha-0.005
 Debbie-0.005
oph-0.005
 Alice-0.005
TokenTim
Token ID14967
Feature activation+8.68e-02

Pos logit contributions
 Samantha+0.003
 Debbie+0.003
 Alice+0.003
 avail+0.003
oph+0.003
Neg logit contributions
ho-0.003
cut-0.003
ir-0.003
oken-0.003
itt-0.003
Token picked
Token ID6497
Feature activation-1.58e+00

Pos logit contributions
oken+0.002
ir+0.002
ho+0.002
cut+0.001
itt+0.001
Neg logit contributions
oph-0.003
 Alice-0.003
 Samantha-0.003
 Debbie-0.003
 Lib-0.003
Token was
Token ID373
Feature activation-6.03e-01

Pos logit contributions
oken+0.008
ir+0.008
ho+0.007
 affect+0.007
cut+0.007
Neg logit contributions
 Alice-0.011
 Samantha-0.011
 Debbie-0.011
oph-0.011
 Lib-0.011
Token so
Token ID523
Feature activation-3.58e-01

Pos logit contributions
 avail+0.019
oph+0.018
 Debbie+0.018
 Samantha+0.018
ibi+0.017
Neg logit contributions
ho-0.013
cut-0.012
 sincere-0.012
ir-0.012
oken-0.012
Token excited
Token ID6568
Feature activation+1.14e-01

Pos logit contributions
 Samantha+0.011
 Debbie+0.011
 Alice+0.010
 Lib+0.010
oph+0.010
Neg logit contributions
ho-0.008
cut-0.008
ir-0.008
oken-0.008
itt-0.008
Token and
Token ID290
Feature activation-6.09e-01

Pos logit contributions
ho+0.001
 can+0.001
 or+0.001
'+0.001
cut+0.001
Neg logit contributions
 Debbie-0.004
 avail-0.004
 Samantha-0.004
 Kara-0.004
ibi-0.004
Token waved
Token ID26834
Feature activation+8.59e-01

Pos logit contributions
 Samantha+0.018
 Debbie+0.017
 avail+0.017
 Alice+0.017
oph+0.017
Neg logit contributions
ho-0.015
cut-0.014
ir-0.014
oken-0.014
itt-0.013
Token goodbye
Token ID24829
Feature activation+2.56e+00

Pos logit contributions
ho+0.021
cut+0.021
ir+0.021
oken+0.021
itt+0.020
Neg logit contributions
 Samantha-0.024
 Debbie-0.024
 Alice-0.024
 avail-0.023
 Lib-0.023
Token to
Token ID284
Feature activation-1.20e+00

Pos logit contributions
oken+0.056
ir+0.056
ho+0.056
cut+0.055
itt+0.054
Neg logit contributions
 Samantha-0.078
 Debbie-0.077
 Alice-0.077
 Lib-0.076
 avail-0.075
Token the
Token ID262
Feature activation+6.58e-01

Pos logit contributions
 Samantha+0.032
 Debbie+0.032
 Alice+0.031
 avail+0.031
oph+0.031
Neg logit contributions
ho-0.031
ir-0.030
cut-0.030
oken-0.029
 can-0.029
Token kind
Token ID1611
Feature activation-4.91e-01

Pos logit contributions
ho+0.018
ir+0.018
itt+0.018
oken+0.017
cut+0.017
Neg logit contributions
 Samantha-0.016
 Debbie-0.016
 Lib-0.016
 Alice-0.016
 Carly-0.016
Token old
Token ID1468
Feature activation-6.07e-01

Pos logit contributions
 Samantha+0.013
 Debbie+0.013
 Alice+0.013
oph+0.012
 avail+0.012
Neg logit contributions
ho-0.013
ir-0.013
cut-0.013
oken-0.013
itt-0.012
Token man
Token ID582
Feature activation-2.82e-01

Pos logit contributions
 Samantha+0.016
 Debbie+0.016
 Alice+0.016
 avail+0.015
 Lib+0.015
Neg logit contributions
ho-0.016
ir-0.016
cut-0.016
oken-0.015
itt-0.015
Token other
Token ID584
Feature activation-1.23e-01

Pos logit contributions
ir+0.010
ho+0.010
oken+0.010
cut+0.010
itt+0.010
Neg logit contributions
 Samantha-0.011
 Debbie-0.011
 Alice-0.011
 avail-0.011
 Lib-0.011
Token dog
Token ID3290
Feature activation+1.59e-01

Pos logit contributions
 Samantha+0.003
 Debbie+0.003
 Alice+0.003
 avail+0.003
oph+0.003
Neg logit contributions
ho-0.003
cut-0.003
ir-0.003
oken-0.003
itt-0.003
Token with
Token ID351
Feature activation+5.35e-01

Pos logit contributions
oken+0.004
ir+0.004
ho+0.003
cut+0.003
itt+0.003
Neg logit contributions
 Alice-0.005
 Samantha-0.005
 Debbie-0.005
 Lib-0.005
 Carly-0.004
Token his
Token ID465
Feature activation+5.45e-01

Pos logit contributions
ho+0.016
cut+0.015
ir+0.015
oken+0.015
itt+0.015
Neg logit contributions
 Samantha-0.013
 Debbie-0.013
 Alice-0.013
 avail-0.012
 Lib-0.012
Token big
Token ID1263
Feature activation+2.81e-01

Pos logit contributions
ho+0.014
ir+0.014
cut+0.014
oken+0.013
itt+0.013
Neg logit contributions
 Samantha-0.015
 Alice-0.015
 Debbie-0.014
 avail-0.014
ibi-0.014
Token ears
Token ID11368
Feature activation+1.84e+00

Pos logit contributions
cut+0.007
ho+0.007
itt+0.007
oken+0.007
ighters+0.007
Neg logit contributions
 Samantha-0.008
 Alice-0.008
 avail-0.007
 Debbie-0.007
 Carly-0.007
Token and
Token ID290
Feature activation-9.59e-02

Pos logit contributions
oken+0.046
ho+0.045
cut+0.044
ir+0.044
itt+0.043
Neg logit contributions
 Samantha-0.050
 Alice-0.050
 Debbie-0.049
 Lib-0.048
 Mary-0.046
Token his
Token ID465
Feature activation-6.94e-02

Pos logit contributions
oph+0.003
 Samantha+0.003
 Debbie+0.003
 avail+0.003
 Alice+0.003
Neg logit contributions
ho-0.002
cut-0.002
 can-0.002
ir-0.002
 sincere-0.002
Token fierce
Token ID14800
Feature activation+6.21e-01

Pos logit contributions
 Alice+0.002
 Samantha+0.002
 Debbie+0.002
ibi+0.002
 Carly+0.002
Neg logit contributions
ir-0.002
oken-0.002
cut-0.002
ho-0.002
itt-0.002
Token bark
Token ID21405
Feature activation+2.34e-01

Pos logit contributions
cut+0.015
itt+0.014
ho+0.014
ir+0.014
oken+0.014
Neg logit contributions
 Samantha-0.018
 Alice-0.017
 Debbie-0.017
 avail-0.017
ibi-0.017
Token.
Token ID13
Feature activation-1.59e-01

Pos logit contributions
oken+0.006
cut+0.006
ho+0.005
itt+0.005
 sincere+0.005
Neg logit contributions
 Alice-0.006
 Samantha-0.006
 Debbie-0.006
 Lib-0.006
oph-0.006
Token friend
Token ID1545
Feature activation+1.82e-01

Pos logit contributions
ho+0.029
cut+0.028
 can+0.028
ir+0.027
oken+0.027
Neg logit contributions
 Samantha-0.031
 avail-0.031
 Debbie-0.031
oph-0.030
 Alice-0.030
Token."
Token ID526
Feature activation+6.00e-01

Pos logit contributions
ho+0.005
ir+0.004
oken+0.004
cut+0.004
itt+0.004
Neg logit contributions
 Alice-0.005
 Samantha-0.005
 Debbie-0.005
 Lib-0.005
 avail-0.005
Token
Token ID198
Feature activation+7.50e-01

Pos logit contributions
ho+0.012
 can+0.011
cut+0.011
itt+0.011
ir+0.011
Neg logit contributions
 avail-0.021
oph-0.020
 Debbie-0.019
 Samantha-0.019
 Alice-0.019
Token
Token ID198
Feature activation+5.47e-01

Pos logit contributions
ho+0.017
cut+0.016
ir+0.016
oken+0.016
itt+0.016
Neg logit contributions
 Samantha-0.023
 Debbie-0.022
 avail-0.022
 Alice-0.022
oph-0.022
TokenR
Token ID49
Feature activation-1.68e-01

Pos logit contributions
oken+0.015
ho+0.015
ir+0.015
 can+0.014
cut+0.014
Neg logit contributions
 Debbie-0.014
 Samantha-0.014
 Alice-0.013
 Kara-0.013
 Carly-0.013
Tokenoc
Token ID420
Feature activation+2.71e+00

Pos logit contributions
 Samantha+0.004
oph+0.004
 Debbie+0.004
ibi+0.004
 Alice+0.004
Neg logit contributions
ho-0.005
cut-0.005
 can-0.004
oken-0.004
ir-0.004
Tokenco
Token ID1073
Feature activation-5.18e-01

Pos logit contributions
oken+0.068
 affect+0.064
 can+0.063
ir+0.061
Did+0.060
Neg logit contributions
 Lib-0.075
 Debbie-0.075
 Samantha-0.074
 Alice-0.073
 Kara-0.072
Token nodded
Token ID14464
Feature activation-4.65e-01

Pos logit contributions
 Samantha+0.016
 Alice+0.016
 Lib+0.016
 Debbie+0.016
oph+0.015
Neg logit contributions
ho-0.011
oken-0.011
 affect-0.011
 can-0.011
 sincere-0.011
Token and
Token ID290
Feature activation+5.13e-01

Pos logit contributions
 Alice+0.015
 Samantha+0.014
 Debbie+0.014
 Mary+0.014
 Clara+0.014
Neg logit contributions
itt-0.011
oken-0.011
ho-0.010
cut-0.010
 affect-0.010
Token said
Token ID531
Feature activation+7.80e-01

Pos logit contributions
ho+0.008
 forgive+0.008
 can+0.008
cut+0.007
 sw+0.007
Neg logit contributions
 avail-0.020
oph-0.018
ibi-0.018
 Debbie-0.018
 possible-0.018
Token,
Token ID11
Feature activation+5.17e-01

Pos logit contributions
oken+0.014
ir+0.013
cut+0.013
ho+0.013
 affect+0.013
Neg logit contributions
 Alice-0.028
 Samantha-0.027
 Debbie-0.027
 Lib-0.027
 Megan-0.026
Token,
Token ID11
Feature activation-2.59e-01

Pos logit contributions
 Samantha+0.002
 Lib+0.002
 Alice+0.002
 Debbie+0.002
 Mary+0.002
Neg logit contributions
oken-0.002
itt-0.002
oc-0.002
ir-0.002
cut-0.002
Token marvel
Token ID22672
Feature activation-6.53e-01

Pos logit contributions
 Alice+0.007
 Debbie+0.007
 Samantha+0.007
 Mary+0.007
 Lib+0.006
Neg logit contributions
ir-0.007
oken-0.007
itt-0.007
oc-0.007
ubby-0.007
Tokening
Token ID278
Feature activation+2.74e-01

Pos logit contributions
 Debbie+0.020
 Samantha+0.020
 Alice+0.020
oph+0.020
 Lib+0.019
Neg logit contributions
cut-0.014
ho-0.014
oken-0.013
ir-0.013
itt-0.013
Token at
Token ID379
Feature activation-4.70e-01

Pos logit contributions
itt+0.005
oc+0.005
oken+0.005
 affect+0.005
cut+0.005
Neg logit contributions
 Samantha-0.009
 Alice-0.009
 Lib-0.009
 Debbie-0.009
 Carly-0.008
Token her
Token ID607
Feature activation-9.15e-01

Pos logit contributions
 Samantha+0.012
 Alice+0.011
 Debbie+0.011
 avail+0.011
 Lib+0.011
Neg logit contributions
ho-0.013
cut-0.013
oken-0.013
ir-0.013
itt-0.013
Token beauty
Token ID8737
Feature activation+7.79e-01

Pos logit contributions
 Samantha+0.026
 Debbie+0.026
 avail+0.025
 Alice+0.025
oph+0.025
Neg logit contributions
ho-0.023
ir-0.022
cut-0.022
oken-0.021
 can-0.021
Token.
Token ID13
Feature activation-5.55e-02

Pos logit contributions
ho+0.015
cut+0.014
ir+0.014
 can+0.014
oken+0.013
Neg logit contributions
 Samantha-0.026
 Debbie-0.026
 avail-0.026
oph-0.025
 Alice-0.025
Token
Token ID198
Feature activation-6.43e-02

Pos logit contributions
 avail+0.002
oph+0.002
 Debbie+0.002
 Samantha+0.002
ibi+0.002
Neg logit contributions
itt-0.001
 can-0.001
 Did-0.001
ho-0.001
ir-0.001
Token
Token ID198
Feature activation-3.80e-01

Pos logit contributions
 avail+0.002
 Samantha+0.002
oph+0.002
 Debbie+0.002
 Alice+0.002
Neg logit contributions
ho-0.001
ir-0.001
cut-0.001
itt-0.001
 can-0.001
TokenThe
Token ID464
Feature activation+2.56e-01

Pos logit contributions
 Samantha+0.012
 Debbie+0.012
 Alice+0.011
oph+0.011
 avail+0.011
Neg logit contributions
ho-0.008
ir-0.008
cut-0.008
oken-0.008
 can-0.008
Token bear
Token ID6842
Feature activation+1.32e+00

Pos logit contributions
ho+0.007
ir+0.007
itt+0.007
ighters+0.007
oken+0.007
Neg logit contributions
 Samantha-0.006
 Alice-0.006
 avail-0.006
 Kara-0.006
 Lib-0.006
Token we
Token ID356
Feature activation-2.63e-01

Pos logit contributions
oken+0.034
ir+0.032
onel+0.030
cut+0.030
oc+0.030
Neg logit contributions
 Alice-0.022
 Samantha-0.020
 Lib-0.020
 Debbie-0.019
 Stella-0.019
Token have
Token ID423
Feature activation+3.43e-01

Pos logit contributions
 Samantha+0.006
 Alice+0.006
 Kara+0.006
 Stella+0.006
 Clara+0.006
Neg logit contributions
cut-0.008
oken-0.008
itt-0.008
Did-0.008
oc-0.007
Token some
Token ID617
Feature activation+3.42e-01

Pos logit contributions
 can+0.008
ho+0.008
ir+0.007
oken+0.007
cut+0.007
Neg logit contributions
 Debbie-0.010
oph-0.010
 avail-0.010
 Samantha-0.010
 Carly-0.010
Token water
Token ID1660
Feature activation+4.43e-02

Pos logit contributions
ho+0.012
cut+0.011
ir+0.011
itt+0.011
oc+0.011
Neg logit contributions
 Samantha-0.006
 Alice-0.006
 avail-0.006
 Debbie-0.006
 Kara-0.006
Token,
Token ID11
Feature activation-5.33e-01

Pos logit contributions
oken+0.001
itt+0.001
ir+0.001
oc+0.001
cut+0.001
Neg logit contributions
 Samantha-0.001
 Alice-0.001
oph-0.001
 Lib-0.001
 Sophie-0.001
Token please
Token ID3387
Feature activation+6.92e-01

Pos logit contributions
 Alice+0.016
 Samantha+0.015
 Mary+0.015
 Debbie+0.014
 Sophia+0.014
Neg logit contributions
itt-0.014
ho-0.014
oken-0.014
ir-0.014
cut-0.013
Token?"
Token ID1701
Feature activation+1.96e-01

Pos logit contributions
ho+0.023
itt+0.023
oken+0.022
ir+0.022
onel+0.022
Neg logit contributions
 Samantha-0.015
 Alice-0.015
 avail-0.014
 Mary-0.014
 Debbie-0.014
Token Lia
Token ID49520
Feature activation-1.19e+00

Pos logit contributions
ho+0.006
ir+0.006
oken+0.006
cut+0.006
itt+0.006
Neg logit contributions
 Samantha-0.004
 Alice-0.004
 Debbie-0.004
 Lib-0.004
 avail-0.004
Token asked
Token ID1965
Feature activation-8.86e-02

Pos logit contributions
 Debbie+0.030
 Alice+0.030
 Samantha+0.030
oph+0.029
 Lib+0.029
Neg logit contributions
ho-0.033
oken-0.032
itt-0.032
cut-0.031
ir-0.031
Token.
Token ID13
Feature activation-3.43e-01

Pos logit contributions
 Alice+0.002
 Samantha+0.002
 avail+0.002
 Debbie+0.002
 Lib+0.002
Neg logit contributions
cut-0.002
ho-0.002
oken-0.002
oc-0.002
ir-0.002
Token She
Token ID1375
Feature activation-2.14e-01

Pos logit contributions
 Alice+0.008
 Samantha+0.008
 Debbie+0.008
 Lib+0.008
 Carly+0.008
Neg logit contributions
oken-0.010
cut-0.010
ir-0.010
ho-0.010
oc-0.009
Token to
Token ID284
Feature activation-7.01e-01

Pos logit contributions
 avail+0.006
 Samantha+0.006
 Debbie+0.006
oph+0.006
 Alice+0.006
Neg logit contributions
ho-0.003
ir-0.003
cut-0.003
 can-0.003
itt-0.003
Token playing
Token ID2712
Feature activation-2.96e-02

Pos logit contributions
 Samantha+0.019
 Debbie+0.019
oph+0.019
 avail+0.019
 Alice+0.018
Neg logit contributions
ho-0.018
ir-0.017
cut-0.017
oken-0.017
 can-0.017
Token in
Token ID287
Feature activation+2.46e-01

Pos logit contributions
 Alice+0.001
 Samantha+0.001
 Lib+0.001
 Sophia+0.001
 Clara+0.001
Neg logit contributions
ho-0.001
ir-0.001
oken-0.001
itt-0.001
cut-0.001
Token the
Token ID262
Feature activation+5.72e-01

Pos logit contributions
ho+0.005
ir+0.005
 can+0.005
cut+0.005
oken+0.005
Neg logit contributions
 Debbie-0.007
oph-0.007
 Samantha-0.007
 avail-0.007
 Lib-0.007
Token park
Token ID3952
Feature activation-7.82e-01

Pos logit contributions
 can+0.014
ho+0.014
cut+0.013
oken+0.013
'+0.013
Neg logit contributions
 Debbie-0.016
 Samantha-0.016
 avail-0.016
oph-0.016
 Lib-0.016
Token.
Token ID13
Feature activation+2.27e-01

Pos logit contributions
 Alice+0.019
 Lib+0.018
 Sophia+0.018
 Mary+0.017
 Samantha+0.017
Neg logit contributions
oken-0.024
itt-0.022
ir-0.022
cut-0.022
ho-0.021
Token They
Token ID1119
Feature activation-2.15e-01

Pos logit contributions
 can+0.004
 cannot+0.003
 Did+0.003
 or+0.003
ho+0.003
Neg logit contributions
 avail-0.009
oph-0.009
isi-0.008
 Samantha-0.008
 Lib-0.008
Token are
Token ID389
Feature activation-5.93e-01

Pos logit contributions
 Alice+0.006
 Debbie+0.006
 Samantha+0.006
 Lib+0.006
 Carly+0.006
Neg logit contributions
itt-0.005
ir-0.005
cut-0.005
oken-0.005
ho-0.005
Token not
Token ID407
Feature activation-4.50e-01

Pos logit contributions
oph+0.023
 avail+0.022
 Debbie+0.021
 Samantha+0.021
 Lib+0.021
Neg logit contributions
 sincere-0.010
cut-0.009
 can-0.009
ho-0.009
oken-0.008
Token scared
Token ID12008
Feature activation+1.03e+00

Pos logit contributions
oph+0.021
 avail+0.020
 Debbie+0.020
 Carly+0.019
 Lib+0.019
Neg logit contributions
 or-0.005
 sincere-0.004
 can-0.004
 sad-0.004
 mean-0.004
Token of
Token ID286
Feature activation-3.15e-01

Pos logit contributions
ho+0.023
cut+0.023
itt+0.022
ir+0.022
oken+0.021
Neg logit contributions
 Alice-0.031
 Samantha-0.031
 Debbie-0.030
 avail-0.030
 Lib-0.030
Token!
Token ID0
Feature activation+1.04e+00

Pos logit contributions
oph+0.017
 Samantha+0.017
 avail+0.017
 Debbie+0.017
 Lib+0.016
Neg logit contributions
 can-0.011
 or-0.010
ho-0.010
'-0.010
ir-0.009
Token Thank
Token ID6952
Feature activation+9.45e-01

Pos logit contributions
ho+0.019
 can+0.018
cut+0.017
ir+0.017
itt+0.017
Neg logit contributions
 Samantha-0.036
 Debbie-0.036
 avail-0.036
oph-0.036
 Lib-0.035
Token you
Token ID345
Feature activation+8.80e-01

Pos logit contributions
ho+0.013
ir+0.012
itt+0.012
'+0.012
 can+0.012
Neg logit contributions
 Samantha-0.037
 Debbie-0.037
oph-0.037
 avail-0.036
 Alice-0.036
Token,
Token ID11
Feature activation+5.90e-01

Pos logit contributions
ho+0.014
 can+0.013
ir+0.013
'+0.012
cut+0.012
Neg logit contributions
 avail-0.033
 Samantha-0.033
 Debbie-0.032
 Lib-0.032
oph-0.032
Token Sara
Token ID24799
Feature activation+4.56e-01

Pos logit contributions
 can+0.018
ho+0.018
ir+0.018
 forgive+0.017
cut+0.017
Neg logit contributions
 avail-0.013
 Samantha-0.012
 Debbie-0.012
oph-0.012
ibi-0.012
Token!"
Token ID2474
Feature activation+6.71e-01

Pos logit contributions
 can+0.009
oc+0.009
'+0.008
cut+0.008
 or+0.008
Neg logit contributions
 avail-0.017
 Samantha-0.016
oph-0.016
 Debbie-0.015
 Lib-0.015
Token They
Token ID1119
Feature activation-1.75e-01

Pos logit contributions
ho+0.012
itt+0.011
 can+0.011
 or+0.010
ir+0.010
Neg logit contributions
 avail-0.027
oph-0.023
 Samantha-0.023
 Debbie-0.023
 Alice-0.022
Token are
Token ID389
Feature activation-4.36e-01

Pos logit contributions
 avail+0.006
 Samantha+0.006
 Debbie+0.006
 Alice+0.006
 Lib+0.006
Neg logit contributions
ho-0.004
 can-0.004
ir-0.003
cut-0.003
 forgive-0.003
Token happy
Token ID3772
Feature activation+3.84e-02

Pos logit contributions
oph+0.020
 avail+0.020
igsaw+0.018
 flavours+0.018
 Samantha+0.018
Neg logit contributions
 still-0.005
 sincere-0.005
 sorry-0.005
 no-0.005
 can-0.004
Token and
Token ID290
Feature activation-2.43e-01

Pos logit contributions
ho+0.001
ir+0.001
cut+0.001
 can+0.001
oken+0.001
Neg logit contributions
 Samantha-0.001
 Debbie-0.001
 avail-0.001
 Alice-0.001
oph-0.001
Token go
Token ID467
Feature activation+6.96e-02

Pos logit contributions
 avail+0.009
oph+0.009
 Samantha+0.008
 Debbie+0.008
 Alice+0.008
Neg logit contributions
ho-0.005
 can-0.005
 cannot-0.004
 forgive-0.004
 sincere-0.004
Token believed
Token ID4762
Feature activation-5.96e-01

Pos logit contributions
 Samantha+0.008
oph+0.008
 Debbie+0.008
 Alice+0.008
 Lib+0.008
Neg logit contributions
oken-0.007
itt-0.007
cut-0.007
ir-0.007
ho-0.007
Token in
Token ID287
Feature activation+1.38e-01

Pos logit contributions
 Alice+0.014
 Samantha+0.014
 Debbie+0.013
 Carly+0.013
 Mary+0.013
Neg logit contributions
cut-0.018
oc-0.018
ho-0.018
oken-0.017
 affect-0.017
Token magic
Token ID5536
Feature activation+6.69e-01

Pos logit contributions
ho+0.004
cut+0.004
ir+0.004
itt+0.004
oken+0.004
Neg logit contributions
 Samantha-0.003
 Alice-0.003
 Debbie-0.003
 avail-0.003
 Carly-0.003
Token.
Token ID13
Feature activation+1.20e-01

Pos logit contributions
ho+0.019
itt+0.019
ir+0.019
Did+0.018
 sincere+0.018
Neg logit contributions
 avail-0.015
 treats-0.014
 Alice-0.014
 flavours-0.014
 Mary-0.014
Token<|endoftext|>
Token ID50256
Feature activation-1.40e+00

Pos logit contributions
 can+0.002
ho+0.002
itt+0.002
ir+0.002
oken+0.002
Neg logit contributions
 avail-0.004
oph-0.004
 Debbie-0.004
 Samantha-0.004
 Lib-0.004
TokenAnna
Token ID31160
Feature activation+1.35e+00

Pos logit contributions
 avail+0.042
 Samantha+0.042
oph+0.041
 Debbie+0.041
ibi+0.040
Neg logit contributions
 can-0.033
ir-0.031
itt-0.031
ho-0.031
'-0.031
Token and
Token ID290
Feature activation+1.11e+00

Pos logit contributions
ir+0.027
oken+0.027
cut+0.027
ho+0.026
itt+0.025
Neg logit contributions
 Alice-0.045
 Samantha-0.045
 Debbie-0.045
oph-0.043
 Mary-0.043
Token Ben
Token ID3932
Feature activation+1.77e+00

Pos logit contributions
ho+0.029
 can+0.028
ir+0.027
'+0.027
 sincere+0.027
Neg logit contributions
 avail-0.030
oph-0.029
 Debbie-0.028
ibi-0.028
 Lib-0.027
Token are
Token ID389
Feature activation+2.12e-01

Pos logit contributions
cut+0.033
ho+0.032
ir+0.032
itt+0.031
oken+0.031
Neg logit contributions
 Samantha-0.061
 Debbie-0.060
 Alice-0.060
 Lib-0.059
oph-0.059
Token friends
Token ID2460
Feature activation-3.14e-01

Pos logit contributions
cut+0.006
ho+0.006
 affect+0.006
oc+0.006
ir+0.005
Neg logit contributions
 Samantha-0.006
 Alice-0.006
 Debbie-0.006
 Lib-0.005
 Carly-0.005
Token.
Token ID13
Feature activation-1.72e-01

Pos logit contributions
 Samantha+0.010
 Debbie+0.010
 Alice+0.009
 avail+0.009
oph+0.009
Neg logit contributions
ho-0.007
cut-0.007
ir-0.007
oken-0.006
itt-0.006
Token did
Token ID750
Feature activation-7.61e-01

Pos logit contributions
ho+0.006
ir+0.006
oken+0.006
cut+0.005
itt+0.005
Neg logit contributions
 Alice-0.007
 Samantha-0.007
 Debbie-0.007
 Lib-0.007
 avail-0.007
Token.
Token ID13
Feature activation-4.83e-02

Pos logit contributions
 Alice+0.020
 Samantha+0.019
 avail+0.019
 Debbie+0.019
 Lib+0.019
Neg logit contributions
ho-0.021
itt-0.020
ir-0.020
cut-0.019
oken-0.019
Token In
Token ID554
Feature activation-6.31e-01

Pos logit contributions
 Samantha+0.002
 Debbie+0.002
 avail+0.002
oph+0.001
 Alice+0.001
Neg logit contributions
ho-0.001
ir-0.001
cut-0.001
 can-0.001
itt-0.001
Token fact
Token ID1109
Feature activation-6.30e-02

Pos logit contributions
 Alice+0.015
 Samantha+0.015
 Debbie+0.014
 avail+0.014
 Mary+0.014
Neg logit contributions
ho-0.019
oc-0.018
ighters-0.018
oken-0.018
cut-0.018
Token,
Token ID11
Feature activation-5.46e-01

Pos logit contributions
 Alice+0.002
 Samantha+0.002
 Mary+0.002
 Debbie+0.002
 Lib+0.002
Neg logit contributions
ho-0.001
cut-0.001
Did-0.001
oken-0.001
 affect-0.001
Token this
Token ID428
Feature activation-4.04e-01

Pos logit contributions
 Alice+0.011
 Samantha+0.010
 Mary+0.010
 avail+0.010
 Carly+0.009
Neg logit contributions
oken-0.019
ho-0.019
ir-0.019
itt-0.018
cut-0.018
Token small
Token ID1402
Feature activation-1.87e-01

Pos logit contributions
 Alice+0.011
 Debbie+0.011
 Samantha+0.011
oph+0.011
 avail+0.011
Neg logit contributions
ho-0.010
ir-0.010
cut-0.010
itt-0.009
oken-0.009
Token creature
Token ID7185
Feature activation+1.59e+00

Pos logit contributions
 avail+0.004
 Alice+0.004
 Mary+0.004
 Debbie+0.004
 Samantha+0.004
Neg logit contributions
itt-0.005
oken-0.005
ighters-0.005
Did-0.005
ir-0.005
Token did
Token ID750
Feature activation-8.64e-01

Pos logit contributions
oken+0.043
ir+0.042
itt+0.041
ho+0.039
ighters+0.038
Neg logit contributions
 Alice-0.042
 Samantha-0.040
 Mary-0.040
 Debbie-0.039
oph-0.039
Token not
Token ID407
Feature activation-5.49e-01

Pos logit contributions
 Alice+0.024
 Samantha+0.024
 avail+0.024
 Debbie+0.024
 Lib+0.023
Neg logit contributions
ho-0.022
itt-0.021
ir-0.021
cut-0.021
oken-0.020
Token like
Token ID588
Feature activation-8.52e-01

Pos logit contributions
 Samantha+0.017
 Alice+0.016
 avail+0.016
 Debbie+0.016
 Lib+0.016
Neg logit contributions
ho-0.012
itt-0.012
cut-0.012
ir-0.012
oken-0.012
Token his
Token ID465
Feature activation-5.37e-01

Pos logit contributions
oph+0.040
 Debbie+0.039
 Samantha+0.039
 avail+0.038
ibi+0.038
Neg logit contributions
ho-0.032
cut-0.032
ir-0.031
 can-0.031
 sincere-0.029
Token eyes
Token ID2951
Feature activation+1.39e+00

Pos logit contributions
 Debbie+0.013
 Alice+0.013
 Samantha+0.013
 Carly+0.013
 avail+0.013
Neg logit contributions
ho-0.015
cut-0.015
ir-0.015
Did-0.014
itt-0.014
Token and
Token ID290
Feature activation-4.44e-01

Pos logit contributions
oken+0.034
 affect+0.033
itt+0.032
cut+0.032
ighters+0.032
Neg logit contributions
 Alice-0.042
 Lib-0.041
 Samantha-0.040
 Mary-0.040
oph-0.040
Token saw
Token ID2497
Feature activation-2.97e-01

Pos logit contributions
 Samantha+0.014
 avail+0.014
 Debbie+0.014
oph+0.013
 Alice+0.013
Neg logit contributions
ho-0.010
cut-0.009
ir-0.009
 can-0.009
itt-0.009
Token that
Token ID326
Feature activation+2.71e-02

Pos logit contributions
 Samantha+0.007
 Alice+0.007
 Debbie+0.007
 Lib+0.007
 avail+0.006
Neg logit contributions
ho-0.009
ir-0.009
oken-0.008
 affect-0.008
cut-0.008
Token they
Token ID484
Feature activation-5.83e-01

Pos logit contributions
ho+0.001
ir+0.001
oken+0.001
cut+0.001
itt+0.001
Neg logit contributions
 Samantha-0.001
 Debbie-0.001
 Alice-0.001
 avail-0.001
 Lib-0.001
Token had
Token ID550
Feature activation+4.87e-01

Pos logit contributions
 Alice+0.017
 Samantha+0.017
 Debbie+0.016
 Lib+0.016
 Kara+0.016
Neg logit contributions
itt-0.013
oken-0.013
ho-0.013
cut-0.013
ir-0.012
Token arrived
Token ID5284
Feature activation-6.40e-01

Pos logit contributions
ho+0.012
cut+0.011
ir+0.011
oken+0.011
itt+0.011
Neg logit contributions
 Samantha-0.014
 Debbie-0.014
 Alice-0.014
 avail-0.014
oph-0.013
Token at
Token ID379
Feature activation+1.16e-02

Pos logit contributions
 Samantha+0.019
 Debbie+0.019
 Alice+0.019
 Lib+0.019
 avail+0.019
Neg logit contributions
ho-0.015
ir-0.014
cut-0.014
oken-0.014
itt-0.014
Token their
Token ID511
Feature activation-5.64e-01

Pos logit contributions
ho+0.000
cut+0.000
ir+0.000
oken+0.000
itt+0.000
Neg logit contributions
 Samantha-0.000
 Debbie-0.000
 Alice-0.000
 avail-0.000
 Lib-0.000
Token floor
Token ID4314
Feature activation+4.97e-01

Pos logit contributions
 Samantha+0.014
 Alice+0.014
 Debbie+0.014
 avail+0.013
 Lib+0.013
Neg logit contributions
ho-0.016
ir-0.016
cut-0.016
itt-0.015
oken-0.015
Token careful
Token ID8161
Feature activation+5.54e-01

Pos logit contributions
ho+0.010
ir+0.010
cut+0.010
oken+0.009
itt+0.009
Neg logit contributions
 Samantha-0.015
 Debbie-0.015
 Alice-0.015
 avail-0.015
 Lib-0.015
Token with
Token ID351
Feature activation+1.46e-01

Pos logit contributions
ho+0.010
ir+0.010
oken+0.009
itt+0.009
 sincere+0.009
Neg logit contributions
 Alice-0.019
 Samantha-0.019
 Debbie-0.019
 Lib-0.018
 avail-0.018
Token food
Token ID2057
Feature activation-3.97e-01

Pos logit contributions
ho+0.005
ir+0.004
cut+0.004
oken+0.004
itt+0.004
Neg logit contributions
 Samantha-0.003
 Alice-0.003
 Debbie-0.003
 avail-0.003
 Lib-0.003
Token,
Token ID11
Feature activation+4.15e-02

Pos logit contributions
 Alice+0.011
 Samantha+0.011
 Debbie+0.011
 Lib+0.011
 Clara+0.010
Neg logit contributions
ir-0.010
oken-0.010
ho-0.010
 sincere-0.009
itt-0.009
Token if
Token ID611
Feature activation+2.89e-01

Pos logit contributions
ir+0.001
ho+0.001
itt+0.001
cut+0.001
oken+0.001
Neg logit contributions
 Samantha-0.001
 Alice-0.001
 Mary-0.001
 Debbie-0.001
 Clara-0.001
Token not
Token ID407
Feature activation-1.50e+00

Pos logit contributions
ir+0.007
ho+0.007
cut+0.007
oken+0.007
oc+0.006
Neg logit contributions
 Samantha-0.008
 Debbie-0.008
 Alice-0.008
 avail-0.008
 Lib-0.008
Token it
Token ID340
Feature activation-3.19e-01

Pos logit contributions
oph+0.050
 Debbie+0.049
 Samantha+0.049
 avail+0.049
 Alice+0.048
Neg logit contributions
ho-0.030
 can-0.028
cut-0.028
ir-0.027
 sincere-0.026
Token could
Token ID714
Feature activation+6.25e-01

Pos logit contributions
 Alice+0.011
 Samantha+0.010
 Debbie+0.010
 Lib+0.010
 Kara+0.010
Neg logit contributions
ir-0.006
oken-0.006
ho-0.006
cut-0.006
itt-0.006
Token get
Token ID651
Feature activation+1.25e-01

Pos logit contributions
itt+0.017
oken+0.016
ir+0.015
ho+0.015
 Did+0.015
Neg logit contributions
 Samantha-0.017
 Alice-0.016
 avail-0.016
 Lib-0.015
 Debbie-0.015
Token stuck
Token ID7819
Feature activation+3.61e-01

Pos logit contributions
ho+0.003
ir+0.003
cut+0.003
itt+0.003
oc+0.003
Neg logit contributions
 Samantha-0.003
 Alice-0.003
 Debbie-0.003
 avail-0.003
 Lib-0.003
Token in
Token ID287
Feature activation+3.97e-01

Pos logit contributions
 sincere+0.008
ho+0.008
itt+0.008
oken+0.008
oc+0.008
Neg logit contributions
 Alice-0.010
 Samantha-0.010
 Lib-0.010
 avail-0.010
 Clara-0.010
Token
Token ID198
Feature activation+3.73e-01

Pos logit contributions
 avail+0.011
oph+0.011
 Debbie+0.010
 Samantha+0.010
 Alice+0.010
Neg logit contributions
ho-0.007
itt-0.006
 can-0.006
ir-0.006
'-0.006
Token
Token ID198
Feature activation-2.58e-02

Pos logit contributions
ho+0.007
ir+0.007
cut+0.007
 can+0.007
itt+0.007
Neg logit contributions
 avail-0.013
 Samantha-0.012
 Debbie-0.012
oph-0.012
 Alice-0.012
TokenS
Token ID50
Feature activation-1.68e+00

Pos logit contributions
 avail+0.001
 Debbie+0.001
 Samantha+0.001
 Alice+0.001
oph+0.001
Neg logit contributions
ho-0.001
ir-0.001
cut-0.001
'-0.001
itt-0.001
Tokenara
Token ID3301
Feature activation+6.23e-01

Pos logit contributions
 Samantha+0.034
 Debbie+0.034
 Alice+0.033
 avail+0.033
 Lib+0.032
Neg logit contributions
ho-0.056
ir-0.054
cut-0.053
oken-0.053
 can-0.053
Token and
Token ID290
Feature activation+3.08e-01

Pos logit contributions
ir+0.016
oken+0.016
ho+0.015
cut+0.015
 affect+0.015
Neg logit contributions
oph-0.018
 Alice-0.017
 Debbie-0.017
 Samantha-0.017
 Clara-0.016
Token Lily
Token ID20037
Feature activation-2.89e-01

Pos logit contributions
ho+0.007
 can+0.007
 or+0.006
ir+0.006
 cannot+0.006
Neg logit contributions
 avail-0.010
oph-0.009
 flavours-0.009
 Lib-0.009
 Debbie-0.009
Token felt
Token ID2936
Feature activation+1.77e-01

Pos logit contributions
oph+0.008
 Samantha+0.008
 Alice+0.008
 Debbie+0.008
 Lib+0.008
Neg logit contributions
ir-0.007
cut-0.007
ho-0.007
oken-0.007
itt-0.007
Token ashamed
Token ID22461
Feature activation-3.65e-02

Pos logit contributions
ho+0.003
ir+0.002
cut+0.002
itt+0.002
oken+0.002
Neg logit contributions
 Samantha-0.007
 Alice-0.007
 Debbie-0.007
 avail-0.007
 Lib-0.007
Token.
Token ID13
Feature activation-4.11e-01

Pos logit contributions
 Alice+0.001
 Samantha+0.001
 Sophie+0.001
oph+0.001
 Rebecca+0.001
Neg logit contributions
cut-0.001
ho-0.001
itt-0.001
cient-0.001
ir-0.001
Token They
Token ID1119
Feature activation-6.32e-01

Pos logit contributions
 avail+0.016
isi+0.014
oph+0.014
 possible+0.014
 Debbie+0.014
Neg logit contributions
ho-0.007
 can-0.007
 or-0.007
itt-0.006
ir-0.006
Token knew
Token ID2993
Feature activation-8.93e-01

Pos logit contributions
 Samantha+0.019
 Alice+0.019
oph+0.019
 Debbie+0.019
 Lib+0.018
Neg logit contributions
ho-0.014
ir-0.014
cut-0.014
itt-0.014
oken-0.014
Token Everything
Token ID11391
Feature activation-5.94e-01

Pos logit contributions
 avail+0.022
 Debbie+0.021
 Samantha+0.021
oph+0.021
 Alice+0.020
Neg logit contributions
ho-0.015
ir-0.014
cut-0.014
 can-0.014
itt-0.014
Token looked
Token ID3114
Feature activation-2.79e-01

Pos logit contributions
 Alice+0.018
 Samantha+0.018
 Debbie+0.018
 Lib+0.018
oph+0.017
Neg logit contributions
ho-0.013
ir-0.013
itt-0.013
oken-0.012
cut-0.012
Token so
Token ID523
Feature activation+1.32e-01

Pos logit contributions
 Samantha+0.006
 Alice+0.006
 avail+0.006
 Lib+0.006
oph+0.006
Neg logit contributions
itt-0.009
ho-0.009
cut-0.009
ir-0.008
oc-0.008
Token beautiful
Token ID4950
Feature activation-9.33e-02

Pos logit contributions
ho+0.005
cut+0.005
ir+0.004
itt+0.004
oc+0.004
Neg logit contributions
 Samantha-0.002
 Alice-0.002
 Debbie-0.002
 Lib-0.002
 avail-0.002
Token.
Token ID13
Feature activation-6.24e-01

Pos logit contributions
 Samantha+0.003
 Debbie+0.003
 Alice+0.003
 avail+0.003
oph+0.003
Neg logit contributions
ho-0.002
ir-0.002
cut-0.002
oken-0.002
itt-0.002
Token His
Token ID2399
Feature activation-3.67e-01

Pos logit contributions
 avail+0.019
 Debbie+0.018
 Samantha+0.018
oph+0.018
 Lib+0.017
Neg logit contributions
ho-0.014
 can-0.014
ir-0.013
itt-0.013
cut-0.013
Token mom
Token ID1995
Feature activation+7.31e-02

Pos logit contributions
 Samantha+0.009
 Debbie+0.009
 Alice+0.009
 avail+0.009
 Carly+0.008
Neg logit contributions
ir-0.010
cut-0.010
ho-0.010
oken-0.010
itt-0.010
Token had
Token ID550
Feature activation+2.66e-01

Pos logit contributions
oken+0.002
ho+0.002
ir+0.002
itt+0.002
cut+0.002
Neg logit contributions
 Samantha-0.002
 Debbie-0.002
 Alice-0.002
oph-0.002
 Lib-0.002
Token even
Token ID772
Feature activation-2.18e-01

Pos logit contributions
ho+0.007
cut+0.006
ir+0.006
itt+0.006
oken+0.006
Neg logit contributions
 Samantha-0.008
 Debbie-0.008
 Alice-0.008
oph-0.007
 Lib-0.007
Token packed
Token ID11856
Feature activation-4.80e-01

Pos logit contributions
 Samantha+0.006
 Debbie+0.006
 Alice+0.005
 avail+0.005
oph+0.005
Neg logit contributions
ho-0.006
ir-0.006
cut-0.006
itt-0.006
oken-0.006
Token nap
Token ID25422
Feature activation-2.18e-01

Pos logit contributions
 Debbie+0.013
 Samantha+0.013
 avail+0.013
oph+0.013
ibi+0.013
Neg logit contributions
cut-0.011
ho-0.011
 can-0.011
ir-0.011
oken-0.011
Token.
Token ID13
Feature activation-4.39e-01

Pos logit contributions
 Alice+0.005
 Debbie+0.004
 Samantha+0.004
 Lib+0.004
 Carly+0.004
Neg logit contributions
cut-0.004
itt-0.003
oken-0.003
ir-0.003
ho-0.003
Token Sarah
Token ID10490
Feature activation+1.66e-01

Pos logit contributions
 Debbie+0.010
 Samantha+0.010
 avail+0.010
 Alice+0.010
oph+0.010
Neg logit contributions
ho-0.013
ir-0.012
cut-0.012
oken-0.012
 can-0.012
Token was
Token ID373
Feature activation-3.52e-01

Pos logit contributions
ir+0.004
oken+0.004
itt+0.004
cut+0.004
ho+0.004
Neg logit contributions
 Alice-0.005
 Samantha-0.005
 Debbie-0.005
oph-0.005
 Kara-0.005
Token a
Token ID257
Feature activation-3.89e-01

Pos logit contributions
 avail+0.012
oph+0.012
 Samantha+0.012
 Debbie+0.011
 Lib+0.011
Neg logit contributions
oken-0.007
 sincere-0.007
ho-0.007
ir-0.006
cut-0.006
Token really
Token ID1107
Feature activation-6.24e-02

Pos logit contributions
 Alice+0.011
 Samantha+0.011
 Debbie+0.011
oph+0.011
 avail+0.011
Neg logit contributions
ir-0.009
ho-0.009
cut-0.009
oken-0.008
itt-0.008
Token good
Token ID922
Feature activation-2.25e+00

Pos logit contributions
 Samantha+0.002
 Debbie+0.002
 avail+0.002
oph+0.002
 Lib+0.002
Neg logit contributions
ho-0.001
 can-0.001
ir-0.001
oken-0.001
 sincere-0.001
Token singer
Token ID14015
Feature activation-4.87e-01

Pos logit contributions
 Samantha+0.069
 Debbie+0.067
 Lib+0.066
oph+0.064
 Alice+0.064
Neg logit contributions
 can-0.051
ho-0.049
 or-0.048
'-0.047
oken-0.047
Token and
Token ID290
Feature activation-6.34e-01

Pos logit contributions
 Alice+0.016
 Samantha+0.015
 Debbie+0.015
 Lib+0.015
oph+0.015
Neg logit contributions
oken-0.010
ho-0.010
ir-0.010
cut-0.010
itt-0.010
Token dancer
Token ID38619
Feature activation-6.13e-01

Pos logit contributions
 Samantha+0.014
 Alice+0.014
 Debbie+0.013
 avail+0.013
oph+0.013
Neg logit contributions
ho-0.020
ir-0.019
cut-0.019
oken-0.019
itt-0.019
Token too
Token ID1165
Feature activation-1.98e-01

Pos logit contributions
 Alice+0.020
 Samantha+0.019
 Debbie+0.019
 Carly+0.019
 Lib+0.019
Neg logit contributions
oken-0.013
ho-0.013
ir-0.013
cut-0.012
itt-0.012
Token.
Token ID13
Feature activation+3.73e-02

Pos logit contributions
 Samantha+0.006
 Debbie+0.006
 Alice+0.006
 avail+0.006
oph+0.006
Neg logit contributions
ho-0.004
cut-0.004
ir-0.004
oken-0.004
itt-0.004
Token the
Token ID262
Feature activation+9.51e-01

Pos logit contributions
 Alice+0.006
 Samantha+0.006
 avail+0.006
 Debbie+0.006
 Lib+0.006
Neg logit contributions
ho-0.010
cut-0.009
ir-0.009
itt-0.009
oken-0.009
Token fridge
Token ID25772
Feature activation-4.73e-01

Pos logit contributions
ir+0.031
 affect+0.030
ug+0.029
ho+0.028
ighters+0.028
Neg logit contributions
 Samantha-0.021
oph-0.020
 Alice-0.019
 Lib-0.019
 Mary-0.019
Token not
Token ID407
Feature activation-6.35e-01

Pos logit contributions
oph+0.014
 Alice+0.014
 Lib+0.014
 Mary+0.014
 Samantha+0.014
Neg logit contributions
oken-0.011
ir-0.010
 sincere-0.010
ighters-0.009
 grant-0.009
Token to
Token ID284
Feature activation+2.46e-01

Pos logit contributions
 Samantha+0.019
 Debbie+0.019
 Alice+0.019
 avail+0.019
 Lib+0.018
Neg logit contributions
ho-0.014
ir-0.014
cut-0.014
itt-0.014
oken-0.013
Token get
Token ID651
Feature activation-2.56e-02

Pos logit contributions
itt+0.007
ho+0.007
cut+0.007
oken+0.006
ighters+0.006
Neg logit contributions
 avail-0.006
 Samantha-0.006
 Alice-0.006
 Lib-0.006
 Debbie-0.006
Token cookies
Token ID14746
Feature activation-2.21e+00

Pos logit contributions
 Samantha+0.001
 Debbie+0.001
 Alice+0.001
 avail+0.001
 Lib+0.001
Neg logit contributions
ho-0.001
cut-0.001
ir-0.001
oken-0.001
itt-0.001
Token,
Token ID11
Feature activation+3.68e-01

Pos logit contributions
 Samantha+0.070
 Alice+0.070
 Lib+0.069
 Debbie+0.069
oph+0.068
Neg logit contributions
ho-0.045
ir-0.045
oken-0.045
cut-0.043
itt-0.043
Token but
Token ID475
Feature activation+4.61e-01

Pos logit contributions
ir+0.010
ho+0.010
itt+0.010
cut+0.010
oken+0.010
Neg logit contributions
 Alice-0.010
 Samantha-0.010
 Mary-0.010
 Lib-0.010
 Sophie-0.009
Token to
Token ID284
Feature activation-1.77e-01

Pos logit contributions
ir+0.014
oken+0.014
itt+0.013
ho+0.013
cut+0.013
Neg logit contributions
 Samantha-0.011
 Alice-0.011
 avail-0.011
 Lib-0.011
 Debbie-0.011
Token talk
Token ID1561
Feature activation+7.01e-02

Pos logit contributions
 Samantha+0.006
 avail+0.006
 Debbie+0.005
 Alice+0.005
 Lib+0.005
Neg logit contributions
ho-0.004
itt-0.004
ir-0.004
oken-0.004
cut-0.004
Token to
Token ID284
Feature activation-1.45e+00

Pos logit contributions
ir+0.002
itt+0.002
ho+0.002
oken+0.002
cut+0.002
Neg logit contributions
 Alice-0.002
 Samantha-0.002
 Debbie-0.002
 Lib-0.002
 Mary-0.002
Token was
Token ID373
Feature activation+3.27e-01

Pos logit contributions
 Samantha+0.016
 Debbie+0.016
 Alice+0.016
 avail+0.016
 Lib+0.016
Neg logit contributions
ho-0.008
cut-0.008
ir-0.008
oken-0.008
itt-0.007
Token a
Token ID257
Feature activation-6.53e-01

Pos logit contributions
ir+0.006
itt+0.006
oken+0.006
e+0.006
 can+0.006
Neg logit contributions
 avail-0.011
 Debbie-0.011
oph-0.011
 Carly-0.010
 Samantha-0.010
Token little
Token ID1310
Feature activation-3.37e-02

Pos logit contributions
 Samantha+0.020
 Debbie+0.020
 avail+0.020
 Alice+0.020
oph+0.019
Neg logit contributions
ho-0.014
cut-0.014
ir-0.014
oken-0.013
 can-0.013
Token girl
Token ID2576
Feature activation+5.16e-01

Pos logit contributions
 Debbie+0.001
 avail+0.001
 Samantha+0.001
oph+0.001
 Lib+0.001
Neg logit contributions
ho-0.001
 can-0.001
cut-0.001
ir-0.001
itt-0.001
Token called
Token ID1444
Feature activation+1.53e-03

Pos logit contributions
oken+0.009
 affect+0.009
cut+0.008
ir+0.008
ho+0.008
Neg logit contributions
 Samantha-0.018
 Alice-0.018
 Mary-0.018
 Lib-0.017
 Debbie-0.017
Token L
Token ID406
Feature activation-2.34e+00

Pos logit contributions
ho+0.000
ir+0.000
itt+0.000
 can+0.000
 or+0.000
Neg logit contributions
 avail-0.000
ibi-0.000
oph-0.000
 flavours-0.000
 Debbie-0.000
Tokenola
Token ID5708
Feature activation-4.98e-01

Pos logit contributions
 Samantha+0.068
 Debbie+0.068
 avail+0.068
 Alice+0.067
 Lib+0.066
Neg logit contributions
ho-0.055
ir-0.053
cut-0.052
oken-0.052
itt-0.051
Token.
Token ID13
Feature activation-6.13e-01

Pos logit contributions
 Samantha+0.014
 Debbie+0.014
 Alice+0.014
 Lib+0.014
oph+0.014
Neg logit contributions
ho-0.012
cut-0.011
ir-0.011
oken-0.011
itt-0.011
Token She
Token ID1375
Feature activation-3.53e-01

Pos logit contributions
 Debbie+0.018
 Samantha+0.018
 avail+0.018
 Alice+0.017
oph+0.017
Neg logit contributions
ho-0.014
 can-0.014
ir-0.014
oken-0.014
cut-0.013
Token loved
Token ID6151
Feature activation+7.89e-01

Pos logit contributions
 Alice+0.012
 Debbie+0.012
 Samantha+0.012
 Lib+0.012
oph+0.012
Neg logit contributions
ir-0.006
ho-0.006
cut-0.006
oken-0.006
itt-0.006
Token to
Token ID284
Feature activation+7.28e-02

Pos logit contributions
ho+0.015
cut+0.013
ir+0.013
 can+0.013
itt+0.013
Neg logit contributions
 avail-0.027
 Debbie-0.027
 Samantha-0.027
oph-0.027
 Alice-0.026
Token me
Token ID502
Feature activation+1.35e+00

Pos logit contributions
 can+0.016
'+0.015
 or+0.015
e+0.013
ir+0.013
Neg logit contributions
 Lib-0.059
 Debbie-0.057
ibi-0.056
 Samantha-0.056
 Kara-0.055
Token.
Token ID13
Feature activation+1.15e+00

Pos logit contributions
ir+0.033
ho+0.033
cut+0.032
 can+0.032
'+0.031
Neg logit contributions
 Debbie-0.038
oph-0.038
 Samantha-0.038
 Alice-0.037
 avail-0.037
Token You
Token ID921
Feature activation+7.44e-01

Pos logit contributions
ho+0.027
ir+0.026
oken+0.026
cut+0.026
 affect+0.024
Neg logit contributions
 Alice-0.034
 Samantha-0.034
 Debbie-0.033
 Lib-0.033
 Carly-0.033
Token are
Token ID389
Feature activation-5.10e-01

Pos logit contributions
ho+0.013
ir+0.012
cut+0.012
itt+0.011
oken+0.011
Neg logit contributions
 Alice-0.026
 Samantha-0.026
oph-0.026
 Lib-0.025
 Debbie-0.025
Token a
Token ID257
Feature activation+4.65e-02

Pos logit contributions
oph+0.022
ibi+0.021
 avail+0.020
 Samantha+0.020
 Debbie+0.019
Neg logit contributions
 can-0.008
e-0.008
 sincere-0.008
 sorry-0.007
 cannot-0.007
Token good
Token ID922
Feature activation-2.21e+00

Pos logit contributions
ho+0.001
 can+0.001
cut+0.001
 sincere+0.001
oc+0.001
Neg logit contributions
 Samantha-0.001
 Debbie-0.001
 avail-0.001
oph-0.001
 Lib-0.001
Token brother
Token ID3956
Feature activation+3.92e-01

Pos logit contributions
 Debbie+0.065
 Samantha+0.065
 avail+0.064
oph+0.064
 Lib+0.064
Neg logit contributions
ho-0.052
 can-0.050
ir-0.050
cut-0.049
itt-0.048
Token."
Token ID526
Feature activation+8.91e-01

Pos logit contributions
 can+0.008
cut+0.008
 or+0.008
'+0.007
oc+0.007
Neg logit contributions
 avail-0.013
 Samantha-0.013
 Debbie-0.013
oph-0.013
 Carly-0.013
Token Mia
Token ID32189
Feature activation-5.21e-01

Pos logit contributions
 can+0.026
 cannot+0.024
ho+0.023
itt+0.023
ir+0.023
Neg logit contributions
 avail-0.024
oph-0.023
 Samantha-0.023
ibi-0.022
 Debbie-0.022
Token says
Token ID1139
Feature activation-4.00e-01

Pos logit contributions
 avail+0.018
 Samantha+0.017
 Debbie+0.016
oph+0.016
 Lib+0.015
Neg logit contributions
 can-0.012
'-0.011
 or-0.011
ho-0.010
ir-0.010
Token.
Token ID13
Feature activation-3.70e-01

Pos logit contributions
 Samantha+0.011
 Debbie+0.011
 Alice+0.011
 avail+0.011
 Lib+0.011
Neg logit contributions
ho-0.010
cut-0.009
ir-0.009
oken-0.009
itt-0.009
Token"
Token ID1
Feature activation-7.18e-01

Pos logit contributions
 Samantha+0.005
 avail+0.005
 Debbie+0.005
 Alice+0.005
oph+0.005
Neg logit contributions
ho-0.003
ir-0.003
cut-0.003
itt-0.003
 can-0.003
TokenHelp
Token ID22087
Feature activation+7.72e-01

Pos logit contributions
 Samantha+0.020
 Debbie+0.020
 Alice+0.020
 avail+0.020
oph+0.020
Neg logit contributions
ho-0.018
ir-0.017
cut-0.017
oken-0.017
itt-0.017
Token!
Token ID0
Feature activation+1.79e-01

Pos logit contributions
ho+0.016
ir+0.016
 can+0.015
cut+0.015
oken+0.015
Neg logit contributions
 Debbie-0.024
 Samantha-0.024
oph-0.024
 Lib-0.024
 avail-0.023
Token Help
Token ID10478
Feature activation+1.08e+00

Pos logit contributions
cut+0.003
ho+0.003
ir+0.003
oken+0.003
 affect+0.003
Neg logit contributions
 Samantha-0.006
 Alice-0.006
 Debbie-0.006
 Kara-0.006
 Carly-0.006
Token!"
Token ID2474
Feature activation+8.20e-02

Pos logit contributions
 can+0.028
ho+0.027
'+0.026
ir+0.025
 or+0.025
Neg logit contributions
 avail-0.030
 Lib-0.029
 Debbie-0.029
oph-0.029
 Samantha-0.029
Token L
Token ID406
Feature activation-2.57e+00

Pos logit contributions
ho+0.003
cut+0.003
ir+0.003
oken+0.003
itt+0.002
Neg logit contributions
 Samantha-0.002
 Alice-0.002
 Debbie-0.002
 Lib-0.002
 avail-0.002
Tokenila
Token ID10102
Feature activation+3.22e-01

Pos logit contributions
 Samantha+0.079
 Debbie+0.079
 Alice+0.077
 avail+0.077
oph+0.077
Neg logit contributions
ho-0.057
cut-0.055
ir-0.054
oken-0.053
 can-0.052
Token and
Token ID290
Feature activation+3.96e-01

Pos logit contributions
ho+0.008
ir+0.007
oken+0.007
cut+0.007
 sincere+0.007
Neg logit contributions
 Alice-0.010
 Debbie-0.009
 Samantha-0.009
 Carly-0.009
 Kara-0.009
Token Tom
Token ID4186
Feature activation-3.23e-01

Pos logit contributions
ho+0.010
ir+0.009
 can+0.009
 sincere+0.009
oken+0.009
Neg logit contributions
oph-0.012
 avail-0.011
 Lib-0.011
 Samantha-0.011
 Debbie-0.011
Token shout
Token ID9191
Feature activation+2.34e-01

Pos logit contributions
 Samantha+0.008
 Alice+0.008
 Debbie+0.008
 Kara+0.008
 Carly+0.008
Neg logit contributions
ho-0.009
cut-0.009
itt-0.008
oken-0.008
ir-0.008
Token.
Token ID13
Feature activation+1.25e-01

Pos logit contributions
ho+0.007
cut+0.007
itt+0.006
ir+0.006
oken+0.006
Neg logit contributions
 Alice-0.006
 Samantha-0.005
 Debbie-0.005
 Lib-0.005
 Kara-0.005