TOP ACTIVATIONS
MAX = 5.619

. She wrote about how grand baby is always smiling ,
herself regularly to see how much she was growing . <|endoftext|>
help Daisy always remember how much she was loved .
above and thought about how much better its garden was .
was amazed to see how wide it was .
wanted to always remember how much it had meant to her
look back and see how far she had come . Eventually
stopped and spoke about how much fun they had playing hockey
It helps me measure how much food we have . â
can help us see how much we have to share with
They want to see what dad bought them . Dad opens
extreme kindness and just how much he cared for the family
was surprised to see how bright they looked ! He decided
forest didn 't like how boss y Billy was . They
't wait to see what Santa had brought them . They
Everyone was amazed at how wide the hedge was !
they liked and what topp ings would go best .
the lake and test how far it could go . But
table is , and how wide the door is . He
you will think of how much I love being your friend

MIN ACTIVATIONS
MIN = -5.681

, you â ll always have your memories of
it was Tim my 's fault but Tim my felt guilty
told her it was her fault because she dropped her toy
and we â ll complete this together . â
awn â s fault . The
and said it was their fault . Mom said they were
ol anda and her sun fl owers started to dance together
, we â ll mention it to the grown
very happy with his sun fl owers . He visited them
and look at their snow fl akes . S
sw am away from O ll ie . O ll ie
en vious of their sun fl owers . He does not
and I â ll ride it ! â
are happy with their snow fl akes . They glue them
He said it was her fault for not moving .
wealthy prince and his sun fl owers lived happily ever after
Tom , it was my fault ," Mia said , as
No , it 's your fault , you were too close
to wear his flip - fl ops so he wouldn 't
garden and admire his sun fl owers . They also see

INTERVAL 2.794 - 5.619
CONTAINS 0.042%

But then he saw something - a box ! Curious ,
explain to the cow how much it loved milk . The
as she saw how much fun it was .
night , Post remembered how nice it felt to take a
its own success . Every time it did something , it

INTERVAL -0.031 - 2.794
CONTAINS 51.894%

learned from his wise old grandfather . He hopped over to
bounced so high that she ended up in the air !
to reach out and grab a big branch from the tree
L ily and Ben thanked the fairy and hugged her
and didn 't understand why his friends were being mean .

INTERVAL -2.856 - -0.031
CONTAINS 48.000%

Help , help !" Lily cries . " I fell in
the paintings . He told them stories about his life and
good friends . The bird lived in the tree
J enny went to the dance rehears als ,
! You 're the best sister ever !" he said ,

INTERVAL -5.681 - -2.856
CONTAINS 0.064%

You â ve really improved . â
ily went to the books he lf and found an ancient
It is a stupid robot anyway . Watch this !" He
got their own z ig zag . It looked like they
to go play in the nearest tree . He thought he
Token.
Token ID13
Feature activation+7.06e-03

Pos logit contributions
 Sorry+0.010
chard+0.010
ually+0.009
apped+0.009
ap+0.008
Neg logit contributions
 adult-0.010
<-0.009
 ambulance-0.009
 stret-0.009
coon-0.009
Token She
Token ID1375
Feature activation-5.15e-02

Pos logit contributions
 Sorry+0.000
chard+0.000
ually+0.000
apped+0.000
ap+0.000
Neg logit contributions
 adult-0.000
 ambulance-0.000
co-0.000
<-0.000
 stret-0.000
Token wrote
Token ID2630
Feature activation+6.71e-01

Pos logit contributions
 adult+0.002
 ambulance+0.002
co+0.002
<+0.002
 sheep+0.002
Neg logit contributions
 Sorry-0.002
chard-0.002
ually-0.002
ap-0.002
apped-0.002
Token about
Token ID546
Feature activation+4.41e-01

Pos logit contributions
 Sorry+0.027
chard+0.027
ually+0.025
apped+0.024
rac+0.023
Neg logit contributions
 adult-0.026
 ambulance-0.023
<-0.022
co-0.022
 stret-0.022
Token how
Token ID703
Feature activation-1.01e+00

Pos logit contributions
 Sorry+0.014
chard+0.014
ually+0.012
apped+0.012
ap+0.012
Neg logit contributions
 adult-0.021
 ambulance-0.019
co-0.018
 stret-0.018
<-0.018
Token grand
Token ID4490
Feature activation+4.69e+00

Pos logit contributions
 adult+0.037
 ambulance+0.031
<+0.030
Sweet+0.030
 sheep+0.030
Neg logit contributions
 Sorry-0.043
chard-0.042
ually-0.041
ishment-0.040
apped-0.039
Tokenbaby
Token ID40252
Feature activation+2.21e+00

Pos logit contributions
 Sorry+0.145
chard+0.128
apped+0.128
ually+0.127
ap+0.124
Neg logit contributions
 adult-0.212
 stret-0.201
<-0.194
 ambulance-0.189
coon-0.187
Token is
Token ID318
Feature activation+4.20e-01

Pos logit contributions
 Sorry+0.097
chard+0.095
ually+0.087
apped+0.085
ap+0.084
Neg logit contributions
 adult-0.081
 ambulance-0.070
co-0.070
 wolf-0.067
coon-0.066
Token always
Token ID1464
Feature activation+1.31e+00

Pos logit contributions
 Sorry+0.021
chard+0.019
ually+0.018
apped+0.018
rac+0.017
Neg logit contributions
 adult-0.014
 ambulance-0.012
 warning-0.011
 living-0.011
 sheep-0.011
Token smiling
Token ID16755
Feature activation-3.34e-01

Pos logit contributions
 Sorry+0.066
icker+0.062
ually+0.061
apped+0.060
chard+0.059
Neg logit contributions
 adult-0.035
 living-0.034
 warning-0.033
 ambulance-0.030
 children-0.030
Token,
Token ID11
Feature activation-4.38e-01

Pos logit contributions
 adult+0.010
 ambulance+0.009
 wolf+0.009
 sheep+0.008
co+0.008
Neg logit contributions
 Sorry-0.016
chard-0.016
ually-0.015
arium-0.014
rac-0.014
Token herself
Token ID5223
Feature activation-1.75e+00

Pos logit contributions
 Sorry+0.038
chard+0.037
ually+0.034
apped+0.033
arium+0.032
Neg logit contributions
 adult-0.035
 ambulance-0.030
co-0.029
 stret-0.028
coon-0.028
Token regularly
Token ID7987
Feature activation+3.01e-01

Pos logit contributions
 adult+0.068
 ambulance+0.060
co+0.058
<+0.057
 stret+0.057
Neg logit contributions
 Sorry-0.071
chard-0.070
ually-0.064
apped-0.063
ap-0.061
Token to
Token ID284
Feature activation+4.51e-01

Pos logit contributions
 Sorry+0.013
chard+0.013
apped+0.012
ually+0.012
rac+0.011
Neg logit contributions
 adult-0.011
 ambulance-0.009
co-0.009
 stret-0.009
<-0.009
Token see
Token ID766
Feature activation+1.63e-01

Pos logit contributions
 Sorry+0.018
chard+0.018
apped+0.017
ually+0.016
arium+0.016
Neg logit contributions
 adult-0.018
 ambulance-0.016
 stret-0.015
co-0.015
<-0.015
Token how
Token ID703
Feature activation-5.64e-01

Pos logit contributions
chard+0.006
 Sorry+0.006
ually+0.005
nam+0.005
ap+0.005
Neg logit contributions
 adult-0.007
<-0.006
 ambulance-0.006
co-0.006
coon-0.006
Token much
Token ID881
Feature activation+4.61e+00

Pos logit contributions
 adult+0.023
 ambulance+0.021
<+0.020
co+0.020
 stret+0.019
Neg logit contributions
 Sorry-0.022
chard-0.021
ually-0.020
apped-0.019
ap-0.019
Token she
Token ID673
Feature activation+1.69e+00

Pos logit contributions
 Sorry+0.179
chard+0.177
ually+0.162
apped+0.159
ap+0.152
Neg logit contributions
 adult-0.190
 ambulance-0.166
co-0.161
<-0.160
 stret-0.159
Token was
Token ID373
Feature activation+6.89e-01

Pos logit contributions
 Sorry+0.074
chard+0.072
apped+0.067
ually+0.066
ap+0.064
Neg logit contributions
 adult-0.062
 ambulance-0.053
co-0.052
<-0.051
coon-0.050
Token growing
Token ID3957
Feature activation+5.84e-02

Pos logit contributions
 Sorry+0.030
chard+0.029
apped+0.027
ually+0.027
rac+0.026
Neg logit contributions
 adult-0.026
 ambulance-0.022
co-0.021
<-0.021
 stret-0.021
Token.
Token ID13
Feature activation+3.19e-01

Pos logit contributions
 Sorry+0.003
chard+0.002
ually+0.002
apped+0.002
rac+0.002
Neg logit contributions
 adult-0.002
 ambulance-0.002
co-0.002
 wolf-0.002
<-0.002
Token<|endoftext|>
Token ID50256
Feature activation+1.03e+00

Pos logit contributions
 Sorry+0.013
chard+0.012
apped+0.011
ually+0.011
ap+0.011
Neg logit contributions
 adult-0.013
 ambulance-0.012
co-0.011
 stret-0.011
coon-0.011
Token help
Token ID1037
Feature activation-6.72e-01

Pos logit contributions
 adult+0.017
 ambulance+0.015
co+0.014
<+0.014
 stret+0.014
Neg logit contributions
 Sorry-0.019
chard-0.019
ually-0.017
apped-0.017
ap-0.016
Token Daisy
Token ID40355
Feature activation-3.00e-01

Pos logit contributions
 adult+0.026
 ambulance+0.023
<+0.023
co+0.022
 stret+0.022
Neg logit contributions
 Sorry-0.027
chard-0.027
ually-0.024
ap-0.023
apped-0.023
Token always
Token ID1464
Feature activation+1.28e+00

Pos logit contributions
 adult+0.012
 ambulance+0.010
<+0.010
 stret+0.010
co+0.010
Neg logit contributions
 Sorry-0.013
chard-0.012
ually-0.011
apped-0.011
rac-0.011
Token remember
Token ID3505
Feature activation-4.51e-01

Pos logit contributions
 Sorry+0.058
chard+0.057
apped+0.056
ually+0.053
arium+0.052
Neg logit contributions
 adult-0.043
 warning-0.038
 ambulance-0.037
 wolf-0.035
 stret-0.035
Token how
Token ID703
Feature activation-7.76e-01

Pos logit contributions
 adult+0.018
 ambulance+0.016
co+0.016
<+0.016
 stret+0.016
Neg logit contributions
 Sorry-0.018
chard-0.017
ually-0.016
apped-0.016
ap-0.015
Token much
Token ID881
Feature activation+4.57e+00

Pos logit contributions
 adult+0.029
 ambulance+0.025
co+0.024
<+0.024
 stret+0.024
Neg logit contributions
 Sorry-0.033
chard-0.033
ually-0.030
apped-0.030
ap-0.029
Token she
Token ID673
Feature activation+1.80e+00

Pos logit contributions
 Sorry+0.182
chard+0.181
ually+0.169
apped+0.167
rac+0.158
Neg logit contributions
 adult-0.182
 ambulance-0.157
 wolf-0.151
<-0.151
co-0.150
Token was
Token ID373
Feature activation+8.29e-01

Pos logit contributions
 Sorry+0.076
chard+0.072
apped+0.067
ually+0.066
ap+0.065
Neg logit contributions
 adult-0.069
 ambulance-0.059
co-0.059
<-0.058
 wolf-0.057
Token loved
Token ID6151
Feature activation+5.07e-01

Pos logit contributions
 Sorry+0.037
chard+0.035
ually+0.033
apped+0.033
rac+0.032
Neg logit contributions
 adult-0.030
 ambulance-0.026
<-0.025
co-0.024
 stret-0.024
Token.
Token ID13
Feature activation+2.29e-01

Pos logit contributions
chard+0.020
 Sorry+0.020
ually+0.018
apped+0.018
ap+0.017
Neg logit contributions
 adult-0.020
 ambulance-0.018
<-0.017
 stret-0.017
co-0.017
Token
Token ID198
Feature activation-4.77e-02

Pos logit contributions
 Sorry+0.009
chard+0.008
apped+0.008
ually+0.008
arium+0.008
Neg logit contributions
 adult-0.010
 ambulance-0.009
 stret-0.009
co-0.009
coon-0.008
Token above
Token ID2029
Feature activation+7.14e-02

Pos logit contributions
 Sorry+0.042
chard+0.041
ually+0.038
apped+0.037
rac+0.036
Neg logit contributions
 adult-0.034
 ambulance-0.030
co-0.029
 wolf-0.028
 sheep-0.028
Token and
Token ID290
Feature activation+6.63e-01

Pos logit contributions
 Sorry+0.003
chard+0.003
ually+0.003
nam+0.003
apped+0.003
Neg logit contributions
 adult-0.003
 wolf-0.002
 ambulance-0.002
 sheep-0.002
 wool-0.002
Token thought
Token ID1807
Feature activation-1.19e+00

Pos logit contributions
 Sorry+0.027
ap+0.024
chard+0.024
ually+0.024
rac+0.024
Neg logit contributions
 adult-0.025
 sheep-0.023
 wolf-0.023
 warning-0.022
 ambulance-0.021
Token about
Token ID546
Feature activation+1.45e+00

Pos logit contributions
 adult+0.056
 ambulance+0.050
co+0.049
 stret+0.049
<+0.048
Neg logit contributions
 Sorry-0.039
chard-0.039
ually-0.035
apped-0.034
ap-0.032
Token how
Token ID703
Feature activation-7.10e-02

Pos logit contributions
 Sorry+0.051
chard+0.051
ually+0.046
apped+0.044
ap+0.043
Neg logit contributions
 adult-0.064
 ambulance-0.058
co-0.057
 stret-0.057
<-0.056
Token much
Token ID881
Feature activation+4.53e+00

Pos logit contributions
 adult+0.003
<+0.002
 ambulance+0.002
 wolf+0.002
 sheep+0.002
Neg logit contributions
 Sorry-0.003
chard-0.003
ually-0.003
apped-0.003
ishment-0.003
Token better
Token ID1365
Feature activation+2.90e+00

Pos logit contributions
chard+0.176
 Sorry+0.175
ually+0.170
apped+0.164
rac+0.155
Neg logit contributions
 adult-0.184
 wolf-0.157
 ambulance-0.156
 sheep-0.154
 truth-0.150
Token its
Token ID663
Feature activation+5.52e-01

Pos logit contributions
 Sorry+0.123
chard+0.121
apped+0.118
ually+0.116
icker+0.109
Neg logit contributions
 adult-0.104
 sheep-0.091
 wolf-0.091
 ambulance-0.086
 wool-0.086
Token garden
Token ID11376
Feature activation+2.12e-01

Pos logit contributions
chard+0.021
 Sorry+0.019
ually+0.017
 collection+0.017
nam+0.017
Neg logit contributions
co-0.022
coon-0.021
 stret-0.020
 adult-0.020
Sweet-0.019
Token was
Token ID373
Feature activation+1.34e-01

Pos logit contributions
 Sorry+0.010
chard+0.010
ually+0.009
apped+0.009
rac+0.009
Neg logit contributions
 adult-0.007
 ambulance-0.006
co-0.005
 wolf-0.005
 sheep-0.005
Token.
Token ID13
Feature activation+6.88e-01

Pos logit contributions
 Sorry+0.006
chard+0.006
ually+0.005
apped+0.005
rac+0.005
Neg logit contributions
 adult-0.005
 ambulance-0.004
<-0.004
co-0.004
 stret-0.004
Token was
Token ID373
Feature activation+1.55e-01

Pos logit contributions
 Sorry+0.026
chard+0.026
ually+0.024
apped+0.023
ap+0.022
Neg logit contributions
 adult-0.024
 ambulance-0.021
co-0.020
<-0.020
 stret-0.020
Token amazed
Token ID24562
Feature activation+7.66e-01

Pos logit contributions
 Sorry+0.006
chard+0.006
ually+0.006
apped+0.005
ap+0.005
Neg logit contributions
 adult-0.006
 ambulance-0.006
<-0.005
co-0.005
 stret-0.005
Token to
Token ID284
Feature activation+1.04e+00

Pos logit contributions
 Sorry+0.038
chard+0.037
ually+0.035
apped+0.033
rac+0.032
Neg logit contributions
 adult-0.024
 ambulance-0.020
 wolf-0.019
co-0.019
 sheep-0.019
Token see
Token ID766
Feature activation+3.47e-01

Pos logit contributions
 Sorry+0.039
chard+0.038
ually+0.034
apped+0.034
ap+0.032
Neg logit contributions
 adult-0.045
 ambulance-0.040
<-0.038
co-0.038
 stret-0.038
Token how
Token ID703
Feature activation-6.85e-01

Pos logit contributions
chard+0.012
 Sorry+0.012
ually+0.011
apped+0.011
ap+0.010
Neg logit contributions
 adult-0.015
 ambulance-0.014
co-0.013
<-0.013
 stret-0.013
Token wide
Token ID3094
Feature activation+4.53e+00

Pos logit contributions
 adult+0.025
 ambulance+0.021
<+0.021
 wolf+0.020
 sheep+0.020
Neg logit contributions
 Sorry-0.030
chard-0.029
ually-0.028
apped-0.026
ap-0.026
Token it
Token ID340
Feature activation+1.88e+00

Pos logit contributions
 Sorry+0.175
chard+0.170
ually+0.159
apped+0.156
ap+0.150
Neg logit contributions
 adult-0.189
 ambulance-0.166
 wolf-0.162
 sheep-0.159
co-0.158
Token was
Token ID373
Feature activation+7.25e-01

Pos logit contributions
 Sorry+0.090
chard+0.089
ually+0.082
apped+0.081
ap+0.079
Neg logit contributions
 adult-0.061
 ambulance-0.051
co-0.050
<-0.049
coon-0.048
Token.
Token ID13
Feature activation+4.40e-01

Pos logit contributions
 Sorry+0.029
chard+0.029
ually+0.027
apped+0.026
ap+0.025
Neg logit contributions
 adult-0.029
 ambulance-0.025
co-0.024
<-0.024
 stret-0.024
Token
Token ID198
Feature activation+4.48e-02

Pos logit contributions
 Sorry+0.018
chard+0.017
ually+0.016
apped+0.016
ap+0.015
Neg logit contributions
 adult-0.018
 ambulance-0.015
<-0.015
co-0.015
 stret-0.014
Token
Token ID198
Feature activation+1.32e-01

Pos logit contributions
chard+0.002
 Sorry+0.001
apped+0.001
ually+0.001
ap+0.001
Neg logit contributions
 adult-0.002
 ambulance-0.002
 stret-0.002
 wolf-0.002
 ponies-0.002
Token wanted
Token ID2227
Feature activation-1.14e+00

Pos logit contributions
 adult+0.010
<+0.008
 ambulance+0.008
 sheep+0.008
 wolf+0.008
Neg logit contributions
chard-0.013
 Sorry-0.013
ishment-0.011
apped-0.011
ually-0.011
Token to
Token ID284
Feature activation+3.34e-01

Pos logit contributions
 adult+0.046
 ambulance+0.041
 stret+0.041
co+0.040
coon+0.040
Neg logit contributions
chard-0.044
 Sorry-0.043
ually-0.039
apped-0.038
ap-0.037
Token always
Token ID1464
Feature activation+1.75e+00

Pos logit contributions
 Sorry+0.014
chard+0.014
ually+0.012
apped+0.012
rac+0.012
Neg logit contributions
 adult-0.013
 ambulance-0.011
<-0.011
 stret-0.011
co-0.011
Token remember
Token ID3505
Feature activation+7.96e-02

Pos logit contributions
chard+0.077
 Sorry+0.076
apped+0.072
ually+0.069
rac+0.066
Neg logit contributions
 adult-0.063
 ambulance-0.055
 warning-0.053
 stret-0.053
<-0.052
Token how
Token ID703
Feature activation-7.34e-01

Pos logit contributions
 Sorry+0.003
chard+0.003
ually+0.003
apped+0.002
ap+0.002
Neg logit contributions
 adult-0.004
 ambulance-0.003
co-0.003
<-0.003
 stret-0.003
Token much
Token ID881
Feature activation+4.52e+00

Pos logit contributions
 adult+0.028
 ambulance+0.024
<+0.023
 wolf+0.023
 sheep+0.022
Neg logit contributions
 Sorry-0.031
chard-0.031
ually-0.028
apped-0.028
ap-0.027
Token it
Token ID340
Feature activation+1.97e+00

Pos logit contributions
 Sorry+0.182
chard+0.180
ually+0.168
apped+0.166
ap+0.157
Neg logit contributions
 adult-0.180
 ambulance-0.156
 wolf-0.149
<-0.149
co-0.149
Token had
Token ID550
Feature activation+1.80e+00

Pos logit contributions
 Sorry+0.089
chard+0.088
apped+0.082
ually+0.080
ap+0.078
Neg logit contributions
 adult-0.068
 ambulance-0.058
co-0.056
<-0.056
 wolf-0.056
Token meant
Token ID4001
Feature activation+7.93e-01

Pos logit contributions
 Sorry+0.082
chard+0.079
ually+0.074
apped+0.073
ap+0.070
Neg logit contributions
 adult-0.064
 ambulance-0.054
<-0.051
co-0.051
 wolf-0.051
Token to
Token ID284
Feature activation+4.03e-01

Pos logit contributions
 Sorry+0.031
chard+0.031
ually+0.028
apped+0.028
ap+0.027
Neg logit contributions
 adult-0.032
 ambulance-0.028
co-0.027
<-0.027
 stret-0.027
Token her
Token ID607
Feature activation+4.38e-01

Pos logit contributions
 Sorry+0.018
chard+0.018
apped+0.017
ually+0.016
arium+0.016
Neg logit contributions
 adult-0.015
 ambulance-0.012
<-0.012
co-0.012
coon-0.011
Token look
Token ID804
Feature activation+5.66e-01

Pos logit contributions
 Sorry+0.027
chard+0.027
apped+0.024
ually+0.024
arium+0.023
Neg logit contributions
 adult-0.030
 ambulance-0.026
co-0.026
<-0.026
 stret-0.025
Token back
Token ID736
Feature activation+8.01e-01

Pos logit contributions
 Sorry+0.022
chard+0.022
ually+0.020
apped+0.020
ap+0.019
Neg logit contributions
 adult-0.023
 ambulance-0.020
co-0.019
<-0.019
 stret-0.019
Token and
Token ID290
Feature activation+3.42e-01

Pos logit contributions
chard+0.031
 Sorry+0.031
ually+0.028
apped+0.028
ap+0.027
Neg logit contributions
 adult-0.033
 ambulance-0.029
 stret-0.028
co-0.028
<-0.028
Token see
Token ID766
Feature activation-5.73e-02

Pos logit contributions
 Sorry+0.011
chard+0.011
apped+0.010
ually+0.010
kin+0.009
Neg logit contributions
 adult-0.016
 ambulance-0.014
co-0.013
 wolf-0.013
<-0.013
Token how
Token ID703
Feature activation-5.06e-01

Pos logit contributions
 adult+0.002
<+0.002
 ambulance+0.002
 stret+0.002
co+0.002
Neg logit contributions
chard-0.002
 Sorry-0.002
ually-0.002
apped-0.002
ap-0.002
Token far
Token ID1290
Feature activation+4.48e+00

Pos logit contributions
 adult+0.020
 ambulance+0.017
<+0.017
co+0.017
 wolf+0.016
Neg logit contributions
 Sorry-0.020
chard-0.019
ually-0.018
apped-0.018
ap-0.017
Token she
Token ID673
Feature activation+1.35e+00

Pos logit contributions
 Sorry+0.171
chard+0.166
ually+0.155
apped+0.155
ap+0.148
Neg logit contributions
 adult-0.187
 ambulance-0.168
co-0.162
 wolf-0.161
 stret-0.159
Token had
Token ID550
Feature activation+1.38e+00

Pos logit contributions
 Sorry+0.060
chard+0.058
ually+0.054
apped+0.053
ap+0.052
Neg logit contributions
 adult-0.049
 ambulance-0.042
co-0.041
<-0.040
coon-0.040
Token come
Token ID1282
Feature activation+2.07e-01

Pos logit contributions
 Sorry+0.059
chard+0.057
ually+0.053
apped+0.052
ap+0.050
Neg logit contributions
 adult-0.053
 ambulance-0.045
co-0.045
<-0.044
coon-0.043
Token.
Token ID13
Feature activation+3.48e-01

Pos logit contributions
 Sorry+0.008
chard+0.008
ually+0.007
apped+0.007
ap+0.007
Neg logit contributions
 adult-0.008
 ambulance-0.007
<-0.007
co-0.007
 stret-0.007
Token Eventually
Token ID16178
Feature activation-8.00e-01

Pos logit contributions
 Sorry+0.012
chard+0.012
apped+0.011
ually+0.011
ap+0.010
Neg logit contributions
 adult-0.016
 ambulance-0.014
 stret-0.014
coon-0.014
co-0.014
Token stopped
Token ID5025
Feature activation+1.56e-01

Pos logit contributions
 adult+0.031
 ambulance+0.027
 sheep+0.027
co+0.026
 wolf+0.025
Neg logit contributions
 Sorry-0.029
chard-0.028
ually-0.026
rac-0.025
kins-0.024
Token and
Token ID290
Feature activation+1.86e-01

Pos logit contributions
 Sorry+0.006
chard+0.006
ap+0.006
apped+0.006
ually+0.006
Neg logit contributions
 adult-0.006
 ambulance-0.005
co-0.005
coon-0.005
<-0.005
Token spoke
Token ID5158
Feature activation+5.95e-01

Pos logit contributions
 Sorry+0.007
kins+0.007
chard+0.007
kin+0.007
ually+0.007
Neg logit contributions
 adult-0.007
 ambulance-0.006
 sheep-0.006
 stret-0.006
 wolf-0.006
Token about
Token ID546
Feature activation-1.48e-01

Pos logit contributions
 Sorry+0.026
chard+0.026
ually+0.024
apped+0.023
rac+0.022
Neg logit contributions
 adult-0.021
 ambulance-0.019
co-0.018
<-0.017
 stret-0.017
Token how
Token ID703
Feature activation-6.26e-01

Pos logit contributions
 adult+0.007
co+0.006
 ambulance+0.006
coon+0.006
<+0.006
Neg logit contributions
 Sorry-0.005
chard-0.005
ually-0.004
apped-0.004
ap-0.004
Token much
Token ID881
Feature activation+4.48e+00

Pos logit contributions
 adult+0.025
 ambulance+0.022
<+0.021
co+0.021
 sheep+0.021
Neg logit contributions
 Sorry-0.025
chard-0.025
ually-0.023
apped-0.022
ishment-0.021
Token fun
Token ID1257
Feature activation+3.39e+00

Pos logit contributions
 Sorry+0.156
chard+0.155
ually+0.143
apped+0.138
rac+0.131
Neg logit contributions
 adult-0.204
 ambulance-0.180
co-0.174
<-0.172
 sheep-0.172
Token they
Token ID484
Feature activation+9.92e-01

Pos logit contributions
 Sorry+0.094
chard+0.093
ually+0.081
apped+0.079
ap+0.074
Neg logit contributions
 adult-0.177
 ambulance-0.161
co-0.157
<-0.156
 stret-0.156
Token had
Token ID550
Feature activation+1.19e+00

Pos logit contributions
 Sorry+0.042
chard+0.040
ually+0.038
apped+0.037
ap+0.037
Neg logit contributions
 adult-0.038
co-0.032
 ambulance-0.032
<-0.032
coon-0.032
Token playing
Token ID2712
Feature activation-9.52e-01

Pos logit contributions
 Sorry+0.055
chard+0.053
ually+0.048
rac+0.046
apped+0.046
Neg logit contributions
 adult-0.042
co-0.036
 ambulance-0.035
<-0.035
 warning-0.034
Token hockey
Token ID12217
Feature activation+3.65e-01

Pos logit contributions
 adult+0.039
 ambulance+0.035
co+0.035
<+0.034
 stret+0.034
Neg logit contributions
 Sorry-0.037
chard-0.036
ually-0.033
apped-0.032
rac-0.031
TokenIt
Token ID1026
Feature activation-1.33e-01

Pos logit contributions
 adult+0.017
 ambulance+0.015
 stret+0.014
coon+0.014
 wolf+0.014
Neg logit contributions
 Sorry-0.021
chard-0.020
ually-0.019
apped-0.019
ap-0.018
Token helps
Token ID5419
Feature activation+9.81e-02

Pos logit contributions
 adult+0.005
 ambulance+0.004
co+0.004
<+0.004
 stret+0.004
Neg logit contributions
 Sorry-0.006
chard-0.006
ually-0.005
apped-0.005
ap-0.005
Token me
Token ID502
Feature activation+3.93e-01

Pos logit contributions
 Sorry+0.004
chard+0.004
ually+0.004
apped+0.004
ap+0.004
Neg logit contributions
 adult-0.004
 ambulance-0.003
co-0.003
<-0.003
 stret-0.003
Token measure
Token ID3953
Feature activation+7.30e-01

Pos logit contributions
 Sorry+0.016
chard+0.016
apped+0.015
ually+0.014
arium+0.014
Neg logit contributions
 adult-0.016
 ambulance-0.013
<-0.013
co-0.013
 stret-0.013
Token how
Token ID703
Feature activation-7.56e-01

Pos logit contributions
 Sorry+0.030
chard+0.030
apped+0.027
ually+0.027
arium+0.026
Neg logit contributions
 adult-0.029
 ambulance-0.025
co-0.024
 sheep-0.023
<-0.023
Token much
Token ID881
Feature activation+4.47e+00

Pos logit contributions
 adult+0.028
 ambulance+0.023
<+0.022
 wolf+0.022
 sheep+0.022
Neg logit contributions
 Sorry-0.032
chard-0.032
apped-0.031
ually-0.030
ishment-0.029
Token food
Token ID2057
Feature activation+1.31e+00

Pos logit contributions
chard+0.172
 Sorry+0.170
ually+0.158
apped+0.156
ap+0.145
Neg logit contributions
 adult-0.187
 ambulance-0.162
<-0.155
co-0.155
 sheep-0.153
Token we
Token ID356
Feature activation+1.05e+00

Pos logit contributions
chard+0.058
 Sorry+0.058
apped+0.054
ually+0.054
arium+0.050
Neg logit contributions
 adult-0.046
 ambulance-0.039
co-0.038
<-0.037
 stret-0.036
Token have
Token ID423
Feature activation-9.89e-02

Pos logit contributions
 Sorry+0.044
chard+0.043
apped+0.040
ually+0.039
arium+0.038
Neg logit contributions
 adult-0.039
 ambulance-0.033
<-0.032
co-0.032
coon-0.032
Token.
Token ID13
Feature activation+5.14e-01

Pos logit contributions
 adult+0.004
 ambulance+0.003
co+0.003
<+0.003
 stret+0.003
Neg logit contributions
 Sorry-0.004
chard-0.004
apped-0.004
ually-0.004
rac-0.004
Tokenâ
Token ID22940
Feature activation+1.29e+00

Pos logit contributions
 Sorry+0.019
chard+0.018
apped+0.017
ually+0.017
ap+0.016
Neg logit contributions
 adult-0.021
coon-0.019
 ambulance-0.019
co-0.019
 stret-0.019
Token can
Token ID460
Feature activation+2.19e-02

Pos logit contributions
chard+0.015
 Sorry+0.015
ually+0.013
apped+0.013
rac+0.013
Neg logit contributions
 adult-0.012
 ambulance-0.010
co-0.010
 sheep-0.010
coon-0.009
Token help
Token ID1037
Feature activation+1.96e-02

Pos logit contributions
 Sorry+0.001
chard+0.001
ually+0.001
ap+0.001
apped+0.001
Neg logit contributions
 adult-0.001
 ambulance-0.001
 stret-0.001
<-0.001
co-0.001
Token us
Token ID514
Feature activation+7.71e-01

Pos logit contributions
 Sorry+0.001
chard+0.001
ually+0.001
apped+0.001
ap+0.001
Neg logit contributions
 adult-0.001
 ambulance-0.001
<-0.001
co-0.001
 stret-0.001
Token see
Token ID766
Feature activation+1.13e-01

Pos logit contributions
 Sorry+0.031
chard+0.031
apped+0.028
ually+0.028
rac+0.026
Neg logit contributions
 adult-0.031
 ambulance-0.027
<-0.025
 stret-0.025
co-0.025
Token how
Token ID703
Feature activation-6.41e-01

Pos logit contributions
 Sorry+0.005
chard+0.005
ually+0.004
apped+0.004
ap+0.004
Neg logit contributions
 adult-0.004
 ambulance-0.004
co-0.004
 stret-0.004
<-0.004
Token much
Token ID881
Feature activation+4.47e+00

Pos logit contributions
 adult+0.025
 ambulance+0.021
<+0.020
 sheep+0.020
 wolf+0.020
Neg logit contributions
 Sorry-0.026
chard-0.026
ually-0.025
apped-0.024
rac-0.023
Token we
Token ID356
Feature activation+1.75e+00

Pos logit contributions
chard+0.164
 Sorry+0.162
ually+0.152
apped+0.150
rac+0.142
Neg logit contributions
 adult-0.193
 ambulance-0.170
 wolf-0.162
 sheep-0.162
co-0.159
Token have
Token ID423
Feature activation+6.23e-01

Pos logit contributions
 Sorry+0.076
chard+0.074
apped+0.071
ually+0.068
rac+0.066
Neg logit contributions
 adult-0.062
 ambulance-0.052
<-0.052
co-0.050
coon-0.050
Token to
Token ID284
Feature activation+3.84e-01

Pos logit contributions
 Sorry+0.027
chard+0.027
ually+0.025
apped+0.025
rac+0.024
Neg logit contributions
 adult-0.023
 ambulance-0.019
<-0.019
co-0.018
 warning-0.018
Token share
Token ID2648
Feature activation+6.53e-01

Pos logit contributions
 Sorry+0.016
chard+0.016
apped+0.016
ually+0.015
arium+0.014
Neg logit contributions
 adult-0.014
 ambulance-0.012
<-0.012
 stret-0.012
co-0.011
Token with
Token ID351
Feature activation+9.66e-01

Pos logit contributions
 Sorry+0.028
chard+0.028
ually+0.026
apped+0.026
rac+0.024
Neg logit contributions
 adult-0.024
 ambulance-0.021
co-0.020
 sheep-0.020
 stret-0.020
Token They
Token ID1119
Feature activation+4.24e-01

Pos logit contributions
 adult+0.001
 ambulance+0.001
<+0.001
co+0.001
 stret+0.001
Neg logit contributions
chard-0.001
 Sorry-0.001
ually-0.001
apped-0.001
ap-0.001
Token want
Token ID765
Feature activation-1.10e+00

Pos logit contributions
 Sorry+0.018
chard+0.018
ually+0.017
apped+0.016
rac+0.016
Neg logit contributions
 adult-0.016
 ambulance-0.014
co-0.013
 stret-0.013
<-0.013
Token to
Token ID284
Feature activation+1.71e-01

Pos logit contributions
 adult+0.044
 ambulance+0.039
coon+0.039
<+0.038
 stret+0.038
Neg logit contributions
chard-0.043
 Sorry-0.043
ually-0.037
apped-0.037
ap-0.036
Token see
Token ID766
Feature activation-3.73e-02

Pos logit contributions
chard+0.007
 Sorry+0.007
apped+0.007
ually+0.007
arium+0.006
Neg logit contributions
 adult-0.007
 ambulance-0.006
 stret-0.006
co-0.006
<-0.005
Token what
Token ID644
Feature activation-6.27e-01

Pos logit contributions
 adult+0.002
<+0.002
coon+0.002
co+0.002
 ambulance+0.002
Neg logit contributions
chard-0.001
 Sorry-0.001
ually-0.001
apped-0.001
nam-0.001
Token dad
Token ID9955
Feature activation+4.46e+00

Pos logit contributions
 adult+0.024
 ambulance+0.022
co+0.020
 sheep+0.020
 wolf+0.020
Neg logit contributions
chard-0.026
 Sorry-0.025
ually-0.024
apped-0.023
ishment-0.022
Token bought
Token ID5839
Feature activation+5.76e-01

Pos logit contributions
 Sorry+0.204
chard+0.203
ually+0.187
apped+0.186
ap+0.177
Neg logit contributions
 adult-0.152
 ambulance-0.131
co-0.126
coon-0.121
<-0.121
Token them
Token ID606
Feature activation+2.41e-01

Pos logit contributions
 Sorry+0.026
chard+0.025
ually+0.023
apped+0.023
rac+0.022
Neg logit contributions
 adult-0.021
 ambulance-0.018
co-0.018
 stret-0.017
<-0.017
Token.
Token ID13
Feature activation+1.20e-01

Pos logit contributions
 Sorry+0.011
chard+0.011
ually+0.010
apped+0.010
rac+0.010
Neg logit contributions
 adult-0.008
 ambulance-0.007
co-0.007
<-0.007
 stret-0.007
Token Dad
Token ID17415
Feature activation+2.72e+00

Pos logit contributions
 Sorry+0.005
chard+0.005
ually+0.004
apped+0.004
ap+0.004
Neg logit contributions
 adult-0.005
 ambulance-0.004
co-0.004
 stret-0.004
<-0.004
Token opens
Token ID9808
Feature activation-8.17e-01

Pos logit contributions
 Sorry+0.099
chard+0.096
ually+0.091
apped+0.090
arium+0.089
Neg logit contributions
 adult-0.113
 stret-0.099
 ambulance-0.098
<-0.098
coon-0.097
Token extreme
Token ID3257
Feature activation-9.67e-01

Pos logit contributions
 adult+0.017
co+0.016
coon+0.016
Sweet+0.015
 stret+0.015
Neg logit contributions
 Sorry-0.018
chard-0.018
ap-0.017
ually-0.016
apped-0.016
Token kindness
Token ID23887
Feature activation-5.75e-01

Pos logit contributions
 adult+0.035
co+0.031
<+0.030
 ambulance+0.030
coon+0.030
Neg logit contributions
 Sorry-0.040
chard-0.039
ually-0.037
ap-0.036
apped-0.036
Token and
Token ID290
Feature activation+5.18e-01

Pos logit contributions
 adult+0.020
 ambulance+0.017
co+0.016
<+0.016
 stret+0.016
Neg logit contributions
 Sorry-0.026
chard-0.026
ually-0.024
apped-0.023
ap-0.023
Token just
Token ID655
Feature activation-1.44e-01

Pos logit contributions
chard+0.020
 Sorry+0.020
apped+0.019
ually+0.018
rac+0.018
Neg logit contributions
 adult-0.021
 ambulance-0.018
 wolf-0.018
 sheep-0.017
 warning-0.017
Token how
Token ID703
Feature activation-8.16e-01

Pos logit contributions
 adult+0.006
 ambulance+0.005
<+0.005
 wolf+0.005
co+0.005
Neg logit contributions
 Sorry-0.006
chard-0.006
apped-0.005
ually-0.005
rac-0.005
Token much
Token ID881
Feature activation+4.45e+00

Pos logit contributions
 adult+0.032
 ambulance+0.028
co+0.027
<+0.027
 stret+0.027
Neg logit contributions
 Sorry-0.033
chard-0.033
ually-0.030
apped-0.029
ap-0.028
Token he
Token ID339
Feature activation+1.86e+00

Pos logit contributions
 Sorry+0.165
chard+0.163
ually+0.149
apped+0.147
ap+0.140
Neg logit contributions
 adult-0.191
 ambulance-0.168
co-0.162
<-0.162
 stret-0.160
Token cared
Token ID19951
Feature activation+1.32e+00

Pos logit contributions
 Sorry+0.076
chard+0.075
apped+0.068
ually+0.068
ap+0.065
Neg logit contributions
 adult-0.073
 ambulance-0.063
co-0.062
<-0.061
coon-0.060
Token for
Token ID329
Feature activation+3.40e-01

Pos logit contributions
chard+0.053
 Sorry+0.053
ually+0.049
apped+0.048
ap+0.048
Neg logit contributions
 adult-0.049
co-0.043
 ambulance-0.043
<-0.042
 stret-0.042
Token the
Token ID262
Feature activation+4.40e-01

Pos logit contributions
chard+0.014
 Sorry+0.014
ually+0.012
apped+0.012
ap+0.012
Neg logit contributions
 adult-0.013
 ambulance-0.012
<-0.012
co-0.011
 stret-0.011
Token family
Token ID1641
Feature activation+1.22e+00

Pos logit contributions
chard+0.019
 Sorry+0.019
ually+0.016
nam+0.016
boat+0.016
Neg logit contributions
 adult-0.014
co-0.013
<-0.013
 stret-0.013
coon-0.013
Token was
Token ID373
Feature activation+2.99e-01

Pos logit contributions
chard+0.036
 Sorry+0.036
ually+0.033
apped+0.033
arium+0.032
Neg logit contributions
 adult-0.029
 ambulance-0.026
<-0.025
 sheep-0.025
 stret-0.025
Token surprised
Token ID6655
Feature activation+1.18e+00

Pos logit contributions
 Sorry+0.013
chard+0.013
ually+0.012
apped+0.011
rac+0.011
Neg logit contributions
 adult-0.011
 ambulance-0.010
<-0.009
co-0.009
 stret-0.009
Token to
Token ID284
Feature activation+5.77e-01

Pos logit contributions
 Sorry+0.043
chard+0.041
apped+0.038
ually+0.037
ap+0.037
Neg logit contributions
 adult-0.051
 stret-0.046
<-0.046
 ambulance-0.045
co-0.044
Token see
Token ID766
Feature activation+1.58e-01

Pos logit contributions
 Sorry+0.021
chard+0.021
ually+0.019
apped+0.018
ap+0.017
Neg logit contributions
 adult-0.026
 ambulance-0.023
co-0.022
<-0.022
 stret-0.022
Token how
Token ID703
Feature activation-4.62e-01

Pos logit contributions
 Sorry+0.005
chard+0.005
ually+0.005
apped+0.005
ap+0.004
Neg logit contributions
 adult-0.007
<-0.006
co-0.006
 ambulance-0.006
 stret-0.006
Token bright
Token ID6016
Feature activation+4.45e+00

Pos logit contributions
 adult+0.017
<+0.015
 ambulance+0.015
 sheep+0.014
 wolf+0.014
Neg logit contributions
chard-0.020
 Sorry-0.020
ually-0.018
apped-0.018
arium-0.017
Token they
Token ID484
Feature activation+1.86e+00

Pos logit contributions
 Sorry+0.180
chard+0.180
ually+0.166
apped+0.164
ap+0.156
Neg logit contributions
 adult-0.174
 ambulance-0.155
 wolf-0.149
co-0.149
 sheep-0.149
Token looked
Token ID3114
Feature activation+3.86e-01

Pos logit contributions
 Sorry+0.084
chard+0.079
apped+0.078
ap+0.076
ually+0.075
Neg logit contributions
 adult-0.060
co-0.054
coon-0.052
 sheep-0.051
<-0.051
Token!
Token ID0
Feature activation+8.22e-01

Pos logit contributions
 Sorry+0.018
chard+0.017
ually+0.016
apped+0.016
rac+0.015
Neg logit contributions
 adult-0.013
 ambulance-0.012
co-0.011
 stret-0.011
 sheep-0.011
Token He
Token ID679
Feature activation+9.88e-01

Pos logit contributions
 Sorry+0.033
chard+0.031
apped+0.029
ually+0.028
ap+0.027
Neg logit contributions
 adult-0.034
 ambulance-0.030
co-0.029
 stret-0.029
coon-0.029
Token decided
Token ID3066
Feature activation-3.95e-01

Pos logit contributions
 Sorry+0.037
chard+0.037
ually+0.033
apped+0.033
ap+0.031
Neg logit contributions
 adult-0.042
 ambulance-0.037
co-0.036
<-0.036
 stret-0.036
Token forest
Token ID8222
Feature activation-4.52e-01

Pos logit contributions
 adult+0.032
 ambulance+0.027
co+0.027
 stret+0.026
<+0.026
Neg logit contributions
 Sorry-0.046
chard-0.046
ually-0.042
apped-0.042
ap-0.041
Token didn
Token ID1422
Feature activation-9.49e-01

Pos logit contributions
 adult+0.016
 ambulance+0.014
co+0.013
 warning+0.013
 sheep+0.013
Neg logit contributions
 Sorry-0.021
chard-0.020
rac-0.018
nam-0.017
ishment-0.017
Token't
Token ID470
Feature activation-9.46e-01

Pos logit contributions
 adult+0.041
 sheep+0.038
 stret+0.038
co+0.037
ousy+0.037
Neg logit contributions
 Sorry-0.031
ap-0.028
 surely-0.027
 neither-0.026
chard-0.026
Token like
Token ID588
Feature activation-1.27e-01

Pos logit contributions
 adult+0.043
ousy+0.039
 ambulance+0.039
<+0.039
co+0.038
Neg logit contributions
chard-0.031
 Sorry-0.030
ually-0.026
ap-0.026
boat-0.024
Token how
Token ID703
Feature activation-9.41e-01

Pos logit contributions
 adult+0.006
<+0.005
co+0.005
 ambulance+0.005
 stret+0.005
Neg logit contributions
chard-0.004
 Sorry-0.004
ually-0.004
arium-0.003
ishment-0.003
Token boss
Token ID6478
Feature activation+4.44e+00

Pos logit contributions
 adult+0.039
 ambulance+0.034
<+0.033
co+0.033
 wolf+0.032
Neg logit contributions
 Sorry-0.036
chard-0.036
ually-0.033
apped-0.032
ap-0.031
Tokeny
Token ID88
Feature activation+2.55e+00

Pos logit contributions
 Sorry+0.179
chard+0.179
ually+0.170
arium+0.164
ap+0.163
Neg logit contributions
 adult-0.164
 ambulance-0.148
coon-0.147
 wolf-0.146
co-0.142
Token Billy
Token ID15890
Feature activation+2.18e+00

Pos logit contributions
 Sorry+0.099
chard+0.098
ually+0.089
apped+0.088
rac+0.084
Neg logit contributions
 adult-0.106
 ambulance-0.094
 wolf-0.090
coon-0.089
co-0.089
Token was
Token ID373
Feature activation-1.67e-01

Pos logit contributions
chard+0.088
 Sorry+0.087
ually+0.080
apped+0.077
ap+0.075
Neg logit contributions
 adult-0.085
 ambulance-0.074
<-0.073
 stret-0.072
co-0.070
Token.
Token ID13
Feature activation-8.71e-02

Pos logit contributions
 adult+0.007
 ambulance+0.006
co+0.006
<+0.006
 stret+0.006
Neg logit contributions
 Sorry-0.007
chard-0.006
ually-0.006
apped-0.006
ap-0.006
Token They
Token ID1119
Feature activation+1.45e-01

Pos logit contributions
 adult+0.004
 ambulance+0.003
<+0.003
 stret+0.003
co+0.003
Neg logit contributions
chard-0.003
 Sorry-0.003
ually-0.003
apped-0.003
kins-0.003
Token't
Token ID470
Feature activation-4.99e-01

Pos logit contributions
 adult+0.041
 stret+0.035
 ambulance+0.034
coon+0.033
 sheep+0.033
Neg logit contributions
 Sorry-0.045
chard-0.040
ap-0.039
apped-0.039
ually-0.039
Token wait
Token ID4043
Feature activation-5.83e-02

Pos logit contributions
 adult+0.024
 ambulance+0.022
co+0.021
<+0.021
 stret+0.021
Neg logit contributions
 Sorry-0.016
chard-0.015
apped-0.014
ually-0.014
rac-0.013
Token to
Token ID284
Feature activation-7.02e-01

Pos logit contributions
 adult+0.002
 ambulance+0.002
co+0.002
<+0.002
 stret+0.002
Neg logit contributions
 Sorry-0.003
chard-0.003
ually-0.002
apped-0.002
ap-0.002
Token see
Token ID766
Feature activation-1.18e-01

Pos logit contributions
 adult+0.029
 ambulance+0.025
co+0.024
<+0.024
 stret+0.024
Neg logit contributions
 Sorry-0.028
chard-0.027
ually-0.025
apped-0.024
ap-0.023
Token what
Token ID644
Feature activation-6.51e-01

Pos logit contributions
 adult+0.005
 ambulance+0.004
 stret+0.004
co+0.004
<+0.004
Neg logit contributions
 Sorry-0.005
chard-0.005
ually-0.004
apped-0.004
ap-0.004
Token Santa
Token ID8909
Feature activation+4.43e+00

Pos logit contributions
 adult+0.026
 ambulance+0.023
 wolf+0.023
 sheep+0.023
<+0.022
Neg logit contributions
 Sorry-0.025
chard-0.025
ually-0.024
apped-0.023
ap-0.022
Token had
Token ID550
Feature activation+1.23e+00

Pos logit contributions
 Sorry+0.202
chard+0.198
ually+0.182
apped+0.181
rac+0.178
Neg logit contributions
 adult-0.149
 ambulance-0.132
co-0.130
 wolf-0.127
 sheep-0.127
Token brought
Token ID3181
Feature activation+4.39e-01

Pos logit contributions
 Sorry+0.056
chard+0.053
ually+0.048
apped+0.046
rac+0.046
Neg logit contributions
 adult-0.044
co-0.039
 living-0.037
 ambulance-0.037
 warning-0.036
Token them
Token ID606
Feature activation-5.77e-01

Pos logit contributions
 Sorry+0.021
chard+0.021
ually+0.019
apped+0.019
rac+0.018
Neg logit contributions
 adult-0.014
 ambulance-0.012
co-0.012
 stret-0.011
 wolf-0.011
Token.
Token ID13
Feature activation+2.04e-01

Pos logit contributions
 adult+0.019
 ambulance+0.016
co+0.016
<+0.016
 stret+0.015
Neg logit contributions
 Sorry-0.028
chard-0.026
ually-0.025
apped-0.024
rac-0.024
Token They
Token ID1119
Feature activation+1.86e-01

Pos logit contributions
 Sorry+0.008
chard+0.008
ually+0.007
apped+0.007
ap+0.007
Neg logit contributions
 adult-0.008
 ambulance-0.007
<-0.007
co-0.007
 stret-0.007
Token Everyone
Token ID11075
Feature activation+5.38e-02

Pos logit contributions
 Sorry+0.013
chard+0.013
ually+0.012
apped+0.011
ap+0.011
Neg logit contributions
 adult-0.013
 ambulance-0.011
co-0.011
<-0.011
 stret-0.011
Token was
Token ID373
Feature activation-2.30e-01

Pos logit contributions
 Sorry+0.002
chard+0.002
ually+0.002
apped+0.002
ap+0.002
Neg logit contributions
 adult-0.002
 ambulance-0.002
co-0.002
<-0.002
coon-0.002
Token amazed
Token ID24562
Feature activation+6.46e-01

Pos logit contributions
 adult+0.009
 ambulance+0.008
co+0.008
 stret+0.008
<+0.008
Neg logit contributions
 Sorry-0.009
chard-0.009
ually-0.008
apped-0.008
arium-0.008
Token at
Token ID379
Feature activation+2.50e+00

Pos logit contributions
chard+0.031
 Sorry+0.031
ually+0.028
apped+0.027
rac+0.027
Neg logit contributions
 adult-0.021
 ambulance-0.018
co-0.017
 wolf-0.017
 sheep-0.017
Token how
Token ID703
Feature activation-9.21e-01

Pos logit contributions
 Sorry+0.089
chard+0.088
ually+0.079
apped+0.077
ap+0.074
Neg logit contributions
 adult-0.112
 ambulance-0.100
co-0.097
<-0.096
 stret-0.096
Token wide
Token ID3094
Feature activation+4.43e+00

Pos logit contributions
 adult+0.031
 ambulance+0.025
 wolf+0.024
<+0.024
 warning+0.024
Neg logit contributions
chard-0.042
 Sorry-0.041
ually-0.039
arium-0.038
apped-0.038
Token the
Token ID262
Feature activation+1.28e+00

Pos logit contributions
 Sorry+0.174
chard+0.172
ually+0.158
apped+0.157
ap+0.151
Neg logit contributions
 adult-0.180
 ambulance-0.160
 wolf-0.156
 sheep-0.154
co-0.151
Token hedge
Token ID19859
Feature activation+5.06e-01

Pos logit contributions
chard+0.054
 Sorry+0.053
ually+0.047
apped+0.046
ap+0.046
Neg logit contributions
 adult-0.045
 stret-0.041
co-0.041
 ambulance-0.040
<-0.039
Token was
Token ID373
Feature activation+4.88e-01

Pos logit contributions
 Sorry+0.024
chard+0.023
ually+0.021
apped+0.021
ap+0.020
Neg logit contributions
 adult-0.017
 ambulance-0.015
co-0.014
coon-0.014
<-0.014
Token!
Token ID0
Feature activation+5.97e-01

Pos logit contributions
 Sorry+0.021
chard+0.020
ually+0.019
apped+0.019
rac+0.018
Neg logit contributions
 adult-0.019
 ambulance-0.016
co-0.016
<-0.015
 stret-0.015
Token
Token ID198
Feature activation+7.15e-02

Pos logit contributions
 Sorry+0.024
chard+0.024
ually+0.022
apped+0.022
ap+0.021
Neg logit contributions
 adult-0.023
 ambulance-0.021
co-0.020
 stret-0.020
<-0.020
Token they
Token ID484
Feature activation-4.54e-01

Pos logit contributions
 Sorry+0.011
chard+0.011
ually+0.010
arium+0.010
apped+0.010
Neg logit contributions
 adult-0.007
co-0.006
 ambulance-0.006
<-0.006
 stret-0.006
Token liked
Token ID8288
Feature activation+5.10e-01

Pos logit contributions
 adult+0.018
 ambulance+0.016
co+0.016
<+0.016
 stret+0.015
Neg logit contributions
 Sorry-0.018
chard-0.017
ually-0.016
apped-0.016
ap-0.015
Token and
Token ID290
Feature activation+1.18e+00

Pos logit contributions
 Sorry+0.020
chard+0.020
ually+0.018
apped+0.018
ap+0.017
Neg logit contributions
 adult-0.021
 ambulance-0.018
co-0.017
<-0.017
 stret-0.017
Token what
Token ID644
Feature activation-3.40e-01

Pos logit contributions
 Sorry+0.045
chard+0.044
apped+0.041
ually+0.041
rac+0.039
Neg logit contributions
 adult-0.049
 ambulance-0.043
 stret-0.042
co-0.042
<-0.042
Token topp
Token ID23126
Feature activation+8.16e-02

Pos logit contributions
 adult+0.016
 ambulance+0.015
 wolf+0.014
 sheep+0.014
<+0.014
Neg logit contributions
 Sorry-0.011
chard-0.011
ually-0.010
apped-0.009
ap-0.009
Tokenings
Token ID654
Feature activation+4.41e+00

Pos logit contributions
chard+0.003
 Sorry+0.003
arium+0.002
ually+0.002
nam+0.002
Neg logit contributions
 adult-0.004
co-0.003
 ambulance-0.003
 stret-0.003
coon-0.003
Token would
Token ID561
Feature activation+1.60e+00

Pos logit contributions
 Sorry+0.163
chard+0.161
ually+0.149
apped+0.145
ap+0.139
Neg logit contributions
 adult-0.188
 ambulance-0.167
co-0.162
 stret-0.159
coon-0.158
Token go
Token ID467
Feature activation+1.56e+00

Pos logit contributions
 Sorry+0.064
chard+0.063
ually+0.058
apped+0.058
ap+0.055
Neg logit contributions
 adult-0.064
 ambulance-0.056
co-0.054
<-0.053
 stret-0.053
Token best
Token ID1266
Feature activation+6.00e-01

Pos logit contributions
 Sorry+0.071
chard+0.068
apped+0.065
ually+0.064
rac+0.062
Neg logit contributions
 adult-0.055
 ambulance-0.049
co-0.047
 stret-0.045
 wolf-0.042
Token.
Token ID13
Feature activation+3.51e-01

Pos logit contributions
 Sorry+0.025
chard+0.025
ually+0.023
apped+0.023
ap+0.022
Neg logit contributions
 adult-0.023
 ambulance-0.020
co-0.019
<-0.019
 stret-0.019
Token
Token ID220
Feature activation+5.13e-01

Pos logit contributions
 Sorry+0.014
chard+0.013
ually+0.012
apped+0.012
ap+0.012
Neg logit contributions
 adult-0.014
 ambulance-0.013
co-0.012
 stret-0.012
coon-0.012
Token the
Token ID262
Feature activation-4.13e-01

Pos logit contributions
 adult+0.010
 ambulance+0.009
 wolf+0.009
<+0.009
 sheep+0.009
Neg logit contributions
 Sorry-0.008
chard-0.008
ually-0.007
apped-0.007
ap-0.007
Token lake
Token ID13546
Feature activation-2.71e-01

Pos logit contributions
 adult+0.019
co+0.017
 stret+0.017
coon+0.017
Sweet+0.017
Neg logit contributions
chard-0.013
 Sorry-0.013
arium-0.012
-0.012
field-0.011
Token and
Token ID290
Feature activation+5.55e-01

Pos logit contributions
 adult+0.008
 ambulance+0.007
 wolf+0.006
 warning+0.006
co+0.006
Neg logit contributions
 Sorry-0.014
chard-0.013
rac-0.013
ually-0.013
apped-0.012
Token test
Token ID1332
Feature activation+1.20e+00

Pos logit contributions
 Sorry+0.022
chard+0.021
apped+0.021
rac+0.021
kins+0.020
Neg logit contributions
 adult-0.021
 warning-0.018
 ambulance-0.018
 wolf-0.018
 sheep-0.017
Token how
Token ID703
Feature activation-4.00e-01

Pos logit contributions
 Sorry+0.047
chard+0.046
ually+0.041
apped+0.040
rac+0.039
Neg logit contributions
 adult-0.050
 ambulance-0.044
 sheep-0.042
co-0.042
 stret-0.041
Token far
Token ID1290
Feature activation+4.41e+00

Pos logit contributions
 adult+0.014
<+0.013
 warning+0.012
 ambulance+0.012
 wolf+0.012
Neg logit contributions
 Sorry-0.017
chard-0.017
ishment-0.016
ually-0.015
apped-0.015
Token it
Token ID340
Feature activation+1.39e+00

Pos logit contributions
 Sorry+0.197
chard+0.197
apped+0.187
ually+0.185
ap+0.182
Neg logit contributions
 adult-0.144
 wolf-0.132
 ambulance-0.127
 sheep-0.125
coon-0.122
Token could
Token ID714
Feature activation+1.64e+00

Pos logit contributions
 Sorry+0.065
chard+0.063
ually+0.057
apped+0.057
ap+0.056
Neg logit contributions
 adult-0.047
co-0.041
 ambulance-0.040
<-0.039
coon-0.039
Token go
Token ID467
Feature activation+1.52e+00

Pos logit contributions
chard+0.057
 Sorry+0.057
ually+0.054
ap+0.052
apped+0.051
Neg logit contributions
 adult-0.070
 ambulance-0.062
 stret-0.061
co-0.061
Sweet-0.061
Token.
Token ID13
Feature activation+5.55e-01

Pos logit contributions
 Sorry+0.061
chard+0.060
ually+0.055
apped+0.054
ap+0.052
Neg logit contributions
 adult-0.061
 ambulance-0.053
co-0.051
<-0.051
 stret-0.051
Token But
Token ID887
Feature activation+5.42e-01

Pos logit contributions
chard+0.024
 Sorry+0.023
kins+0.022
ually+0.021
kin+0.021
Neg logit contributions
 adult-0.021
<-0.018
 ambulance-0.017
 wolf-0.016
co-0.016
Token table
Token ID3084
Feature activation+1.12e+00

Pos logit contributions
 adult+0.003
coon+0.003
 stret+0.003
<+0.003
 ambulance+0.003
Neg logit contributions
chard-0.004
 Sorry-0.004
nam-0.003
light-0.003
ually-0.003
Token is
Token ID318
Feature activation+3.30e-01

Pos logit contributions
chard+0.061
 Sorry+0.059
ually+0.055
apped+0.055
nam+0.054
Neg logit contributions
 adult-0.029
 ambulance-0.025
co-0.023
coon-0.021
 cord-0.020
Token,
Token ID11
Feature activation+3.32e-01

Pos logit contributions
 Sorry+0.015
chard+0.014
apped+0.014
ually+0.013
rac+0.013
Neg logit contributions
 adult-0.012
 ambulance-0.010
co-0.009
<-0.009
 wolf-0.009
Token and
Token ID290
Feature activation+5.93e-01

Pos logit contributions
 Sorry+0.015
chard+0.015
ually+0.014
apped+0.014
ap+0.013
Neg logit contributions
 adult-0.011
 ambulance-0.010
co-0.009
<-0.009
 stret-0.009
Token how
Token ID703
Feature activation-5.40e-01

Pos logit contributions
 Sorry+0.022
chard+0.021
ually+0.019
apped+0.019
ap+0.018
Neg logit contributions
 adult-0.026
 ambulance-0.023
co-0.022
 stret-0.022
<-0.022
Token wide
Token ID3094
Feature activation+4.40e+00

Pos logit contributions
 adult+0.020
 ambulance+0.018
<+0.017
 wolf+0.017
 sheep+0.017
Neg logit contributions
 Sorry-0.022
chard-0.022
apped-0.021
ually-0.020
ishment-0.020
Token the
Token ID262
Feature activation+7.58e-01

Pos logit contributions
chard+0.168
 Sorry+0.166
ually+0.156
apped+0.152
ap+0.142
Neg logit contributions
 adult-0.183
 ambulance-0.164
 sheep-0.158
 wolf-0.158
co-0.154
Token door
Token ID3420
Feature activation+1.58e+00

Pos logit contributions
chard+0.032
 Sorry+0.032
ually+0.029
nam+0.028
kins+0.028
Neg logit contributions
 adult-0.027
coon-0.023
co-0.023
 stret-0.023
<-0.022
Token is
Token ID318
Feature activation+7.31e-01

Pos logit contributions
chard+0.072
 Sorry+0.071
ually+0.065
apped+0.065
ap+0.062
Neg logit contributions
 adult-0.055
 ambulance-0.048
co-0.045
<-0.044
 stret-0.044
Token.
Token ID13
Feature activation+4.91e-01

Pos logit contributions
 Sorry+0.034
chard+0.032
apped+0.030
ually+0.030
ap+0.028
Neg logit contributions
 adult-0.026
 ambulance-0.022
<-0.021
co-0.021
 stret-0.021
Token He
Token ID679
Feature activation+8.99e-01

Pos logit contributions
 Sorry+0.018
chard+0.017
apped+0.017
ap+0.016
ually+0.015
Neg logit contributions
 adult-0.021
coon-0.019
co-0.019
 stret-0.019
 ambulance-0.019
Token you
Token ID345
Feature activation+5.55e-01

Pos logit contributions
chard+0.013
 Sorry+0.013
apped+0.013
ually+0.012
arium+0.012
Neg logit contributions
 adult-0.013
 ambulance-0.011
 sheep-0.011
 wolf-0.011
 warning-0.011
Token will
Token ID481
Feature activation-5.84e-01

Pos logit contributions
chard+0.027
 Sorry+0.026
apped+0.024
ually+0.024
ishment+0.023
Neg logit contributions
 adult-0.017
 ambulance-0.015
<-0.015
co-0.014
coon-0.014
Token think
Token ID892
Feature activation-1.55e+00

Pos logit contributions
 adult+0.022
 ambulance+0.019
<+0.019
co+0.019
 stret+0.019
Neg logit contributions
 Sorry-0.024
chard-0.024
ually-0.022
apped-0.022
rac-0.021
Token of
Token ID286
Feature activation+6.93e-01

Pos logit contributions
 adult+0.066
 ambulance+0.058
co+0.056
<+0.056
 stret+0.056
Neg logit contributions
 Sorry-0.058
chard-0.058
ually-0.052
apped-0.051
ap-0.049
Token how
Token ID703
Feature activation-6.20e-01

Pos logit contributions
 Sorry+0.026
chard+0.025
ually+0.023
apped+0.023
ap+0.021
Neg logit contributions
 adult-0.030
 ambulance-0.026
co-0.026
<-0.026
 stret-0.025
Token much
Token ID881
Feature activation+4.39e+00

Pos logit contributions
 adult+0.023
 ambulance+0.020
<+0.020
 wolf+0.019
 sheep+0.019
Neg logit contributions
chard-0.026
 Sorry-0.026
ually-0.024
apped-0.024
ishment-0.023
Token I
Token ID314
Feature activation+2.09e+00

Pos logit contributions
chard+0.174
 Sorry+0.171
ually+0.162
apped+0.160
rac+0.148
Neg logit contributions
 adult-0.178
 ambulance-0.153
<-0.147
 wolf-0.147
 sheep-0.146
Token love
Token ID1842
Feature activation+9.08e-01

Pos logit contributions
 Sorry+0.086
chard+0.085
ually+0.078
apped+0.077
ap+0.074
Neg logit contributions
 adult-0.081
 ambulance-0.071
co-0.068
<-0.068
 stret-0.067
Token being
Token ID852
Feature activation-6.18e-01

Pos logit contributions
 Sorry+0.032
chard+0.032
ually+0.029
apped+0.028
ap+0.027
Neg logit contributions
 adult-0.040
 ambulance-0.036
 stret-0.035
co-0.035
<-0.035
Token your
Token ID534
Feature activation-5.81e-01

Pos logit contributions
 adult+0.021
 ambulance+0.019
 sheep+0.019
 wolf+0.018
 warning+0.018
Neg logit contributions
 Sorry-0.028
ually-0.026
chard-0.026
apped-0.025
ishment-0.024
Token friend
Token ID1545
Feature activation-1.60e+00

Pos logit contributions
 adult+0.021
 ambulance+0.018
 stret+0.018
co+0.018
coon+0.018
Neg logit contributions
 Sorry-0.025
chard-0.025
ually-0.023
ap-0.022
apped-0.022
Token,
Token ID11
Feature activation-1.15e-01

Pos logit contributions
 adult+0.026
 ambulance+0.023
<+0.022
co+0.021
 stret+0.021
Neg logit contributions
 Sorry-0.030
chard-0.028
ually-0.027
apped-0.026
arium-0.025
Token you
Token ID345
Feature activation-1.43e-01

Pos logit contributions
 adult+0.005
 ambulance+0.004
co+0.004
<+0.004
coon+0.004
Neg logit contributions
 Sorry-0.005
chard-0.005
ually-0.004
ap-0.004
apped-0.004
Tokenâ
Token ID22940
Feature activation-9.19e-01

Pos logit contributions
 adult+0.005
 ambulance+0.004
<+0.004
co+0.004
coon+0.004
Neg logit contributions
chard-0.007
 Sorry-0.006
ually-0.006
apped-0.006
ishment-0.006
Token
Token ID26391
Feature activation-1.39e+00

Pos logit contributions
 adult+0.033
co+0.031
 ambulance+0.031
coon+0.029
 stret+0.027
Neg logit contributions
chard-0.041
 Sorry-0.040
nam-0.037
kins-0.036
apped-0.036
Token
Token ID8151
Feature activation-6.11e-01

Pos logit contributions
 adult+0.053
co+0.053
 stret+0.051
 ambulance+0.050
<+0.049
Neg logit contributions
chard-0.052
 Sorry-0.049
ishment-0.047
nam-0.046
apped-0.044
Tokenll
Token ID297
Feature activation-5.43e+00

Pos logit contributions
 adult+0.024
 ambulance+0.021
co+0.021
<+0.020
 stret+0.020
Neg logit contributions
 Sorry-0.024
chard-0.024
ually-0.022
apped-0.021
ishment-0.021
Token always
Token ID1464
Feature activation-3.09e-01

Pos logit contributions
 adult+0.221
 ambulance+0.194
<+0.189
co+0.188
 stret+0.188
Neg logit contributions
 Sorry-0.209
chard-0.205
ually-0.191
apped-0.184
ap-0.181
Token have
Token ID423
Feature activation-1.18e+00

Pos logit contributions
 adult+0.011
 ambulance+0.010
 warning+0.010
<+0.009
co+0.009
Neg logit contributions
chard-0.013
 Sorry-0.013
apped-0.012
ually-0.012
rac-0.011
Token your
Token ID534
Feature activation-6.47e-02

Pos logit contributions
 adult+0.042
 ambulance+0.036
<+0.035
co+0.034
 stret+0.034
Neg logit contributions
 Sorry-0.053
chard-0.052
ually-0.048
apped-0.047
ap-0.045
Token memories
Token ID9846
Feature activation-4.60e-01

Pos logit contributions
 adult+0.002
 ambulance+0.002
co+0.002
coon+0.002
<+0.002
Neg logit contributions
 Sorry-0.003
chard-0.003
ually-0.002
apped-0.002
ap-0.002
Token of
Token ID286
Feature activation-1.70e-01

Pos logit contributions
 adult+0.016
 ambulance+0.014
co+0.014
<+0.013
 stret+0.013
Neg logit contributions
 Sorry-0.021
chard-0.020
ually-0.019
apped-0.018
rac-0.018
Token it
Token ID340
Feature activation-3.24e-02

Pos logit contributions
 adult+0.016
 ambulance+0.014
co+0.014
<+0.014
 stret+0.014
Neg logit contributions
 Sorry-0.010
chard-0.010
ually-0.009
apped-0.009
ap-0.008
Token was
Token ID373
Feature activation-1.60e-01

Pos logit contributions
 adult+0.001
 stret+0.001
 reins+0.001
<+0.001
 ponies+0.001
Neg logit contributions
 Sorry-0.001
chard-0.001
ually-0.001
apped-0.001
 neither-0.001
Token Tim
Token ID5045
Feature activation-1.21e+00

Pos logit contributions
 adult+0.006
co+0.005
 ambulance+0.005
 stret+0.005
<+0.005
Neg logit contributions
 Sorry-0.007
chard-0.007
ually-0.006
apped-0.006
ap-0.006
Tokenmy
Token ID1820
Feature activation-1.15e+00

Pos logit contributions
 adult+0.045
<+0.043
 stret+0.042
Sweet+0.039
coon+0.038
Neg logit contributions
chard-0.045
ap-0.042
 Sorry-0.042
ually-0.042
apped-0.041
Token's
Token ID338
Feature activation-1.14e-01

Pos logit contributions
 adult+0.044
 ambulance+0.038
<+0.038
co+0.038
 stret+0.037
Neg logit contributions
 Sorry-0.047
chard-0.047
ually-0.043
apped-0.042
ap-0.041
Token fault
Token ID8046
Feature activation-5.35e+00

Pos logit contributions
coon+0.005
 stret+0.005
co+0.005
 adult+0.005
 ponies+0.005
Neg logit contributions
 Sorry-0.003
apped-0.003
ap-0.003
kin-0.003
icker-0.003
Token but
Token ID475
Feature activation-1.72e-01

Pos logit contributions
 adult+0.238
 stret+0.227
<+0.224
co+0.220
Sweet+0.218
Neg logit contributions
apped-0.168
 neither-0.159
ap-0.152
 Sorry-0.147
chard-0.146
Token Tim
Token ID5045
Feature activation-1.25e+00

Pos logit contributions
 adult+0.008
 ambulance+0.007
<+0.006
co+0.006
 stret+0.006
Neg logit contributions
 Sorry-0.006
chard-0.006
ually-0.006
apped-0.005
ap-0.005
Tokenmy
Token ID1820
Feature activation-8.94e-01

Pos logit contributions
 adult+0.039
 stret+0.033
<+0.033
 ambulance+0.032
coon+0.031
Neg logit contributions
 Sorry-0.058
chard-0.058
ually-0.055
apped-0.053
ap-0.053
Token felt
Token ID2936
Feature activation-8.42e-01

Pos logit contributions
 adult+0.033
 ambulance+0.029
co+0.028
<+0.028
coon+0.027
Neg logit contributions
 Sorry-0.038
chard-0.038
ually-0.035
apped-0.034
ap-0.033
Token guilty
Token ID6717
Feature activation-6.40e-01

Pos logit contributions
 adult+0.029
 ambulance+0.024
co+0.024
 stret+0.023
<+0.023
Neg logit contributions
 Sorry-0.039
chard-0.038
ually-0.035
apped-0.035
ap-0.034
Token told
Token ID1297
Feature activation-1.17e+00

Pos logit contributions
 adult+0.004
 ambulance+0.003
co+0.003
<+0.003
 stret+0.003
Neg logit contributions
 Sorry-0.003
chard-0.003
ually-0.003
apped-0.003
ap-0.003
Token her
Token ID607
Feature activation-4.53e-01

Pos logit contributions
 adult+0.053
co+0.050
 reins+0.048
 stret+0.047
<+0.047
Neg logit contributions
 Sorry-0.035
chard-0.033
 neither-0.031
ap-0.029
ishment-0.029
Token it
Token ID340
Feature activation-7.40e-01

Pos logit contributions
 adult+0.019
co+0.018
 stret+0.017
coon+0.017
 ambulance+0.017
Neg logit contributions
 Sorry-0.016
chard-0.015
apped-0.015
ually-0.014
ap-0.014
Token was
Token ID373
Feature activation-9.73e-01

Pos logit contributions
 adult+0.029
 stret+0.025
 ambulance+0.025
<+0.024
coon+0.024
Neg logit contributions
 Sorry-0.030
chard-0.029
ually-0.027
apped-0.027
ap-0.025
Token her
Token ID607
Feature activation-5.89e-01

Pos logit contributions
 adult+0.035
 ambulance+0.030
co+0.030
 stret+0.030
coon+0.029
Neg logit contributions
chard-0.042
 Sorry-0.042
ually-0.038
apped-0.037
ap-0.037
Token fault
Token ID8046
Feature activation-5.28e+00

Pos logit contributions
 adult+0.020
coon+0.018
co+0.018
ousy+0.018
<+0.018
Neg logit contributions
chard-0.025
 Sorry-0.025
ually-0.023
ap-0.023
apped-0.022
Token because
Token ID780
Feature activation-4.03e-01

Pos logit contributions
 adult+0.248
 stret+0.228
co+0.225
 ambulance+0.223
coon+0.218
Neg logit contributions
apped-0.162
chard-0.153
 neither-0.149
 Sorry-0.147
ap-0.144
Token she
Token ID673
Feature activation+4.68e-01

Pos logit contributions
 adult+0.020
 stret+0.019
co+0.018
 ambulance+0.018
<+0.018
Neg logit contributions
chard-0.012
 Sorry-0.012
apped-0.010
ually-0.010
kins-0.009
Token dropped
Token ID5710
Feature activation-2.39e-01

Pos logit contributions
 Sorry+0.015
chard+0.015
ually+0.014
apped+0.013
kins+0.012
Neg logit contributions
 adult-0.022
 stret-0.020
 ambulance-0.020
 reins-0.019
co-0.018
Token her
Token ID607
Feature activation+2.63e-01

Pos logit contributions
 adult+0.011
 ambulance+0.009
co+0.009
<+0.009
 stret+0.009
Neg logit contributions
 Sorry-0.008
chard-0.008
ually-0.007
apped-0.007
ap-0.007
Token toy
Token ID13373
Feature activation-1.35e+00

Pos logit contributions
chard+0.012
 Sorry+0.012
ually+0.011
apped+0.011
kins+0.011
Neg logit contributions
 adult-0.009
 ambulance-0.007
co-0.007
coon-0.007
<-0.007
Token and
Token ID290
Feature activation-1.09e-01

Pos logit contributions
chard+0.008
 Sorry+0.008
apped+0.007
ually+0.007
ishment+0.007
Neg logit contributions
 adult-0.007
 ambulance-0.006
<-0.006
 warning-0.006
 wolf-0.006
Token we
Token ID356
Feature activation+1.17e-01

Pos logit contributions
 adult+0.004
 ambulance+0.003
 wolf+0.003
 warning+0.003
 sheep+0.003
Neg logit contributions
chard-0.005
apped-0.005
 Sorry-0.005
ishment-0.004
arium-0.004
Tokenâ
Token ID22940
Feature activation-4.93e-01

Pos logit contributions
 Sorry+0.005
chard+0.005
ually+0.005
apped+0.005
ap+0.004
Neg logit contributions
 adult-0.004
 ambulance-0.004
co-0.004
 stret-0.004
<-0.004
Token
Token ID26391
Feature activation-1.08e+00

Pos logit contributions
 adult+0.019
co+0.018
 ambulance+0.018
coon+0.017
<+0.016
Neg logit contributions
chard-0.021
 Sorry-0.020
nam-0.019
kins-0.018
bel-0.018
Token
Token ID8151
Feature activation-1.17e-01

Pos logit contributions
co+0.040
 adult+0.038
 stret+0.037
<+0.037
 ambulance+0.037
Neg logit contributions
chard-0.044
 Sorry-0.039
ishment-0.038
nam-0.038
ateful-0.036
Tokenll
Token ID297
Feature activation-5.26e+00

Pos logit contributions
 adult+0.005
 ambulance+0.004
co+0.004
<+0.004
 stret+0.004
Neg logit contributions
 Sorry-0.004
chard-0.004
ually-0.004
ishment-0.004
apped-0.004
Token complete
Token ID1844
Feature activation+5.07e-01

Pos logit contributions
 adult+0.215
 ambulance+0.189
co+0.183
<+0.183
 stret+0.182
Neg logit contributions
 Sorry-0.203
chard-0.199
ually-0.186
apped-0.180
ap-0.176
Token this
Token ID428
Feature activation-8.28e-01

Pos logit contributions
chard+0.019
 Sorry+0.019
apped+0.017
ually+0.016
ishment+0.015
Neg logit contributions
 adult-0.022
 ambulance-0.019
<-0.019
co-0.019
 stret-0.019
Token together
Token ID1978
Feature activation-5.66e-01

Pos logit contributions
 adult+0.030
 ambulance+0.026
co+0.025
<+0.025
 stret+0.025
Neg logit contributions
 Sorry-0.036
chard-0.036
ually-0.033
apped-0.032
ap-0.031
Token.
Token ID13
Feature activation-9.01e-02

Pos logit contributions
 adult+0.020
 ambulance+0.017
co+0.017
<+0.017
 sheep+0.016
Neg logit contributions
chard-0.025
 Sorry-0.025
apped-0.022
ually-0.022
ishment-0.021
Tokenâ
Token ID22940
Feature activation+3.89e-01

Pos logit contributions
 adult+0.004
 ambulance+0.003
co+0.003
<+0.003
 stret+0.003
Neg logit contributions
 Sorry-0.004
chard-0.004
ually-0.003
apped-0.003
ap-0.003
Tokenawn
Token ID3832
Feature activation-1.67e+00

Pos logit contributions
 adult+0.042
 ambulance+0.038
 stret+0.038
 wolf+0.037
 sheep+0.036
Neg logit contributions
chard-0.036
 Sorry-0.035
ually-0.034
apped-0.033
ap-0.032
Tokenâ
Token ID22940
Feature activation-1.13e+00

Pos logit contributions
 adult+0.069
 ambulance+0.062
co+0.060
 stret+0.059
<+0.059
Neg logit contributions
 Sorry-0.064
chard-0.063
ually-0.057
apped-0.055
rac-0.054
Token
Token ID26391
Feature activation-1.76e+00

Pos logit contributions
 adult+0.044
 ambulance+0.039
co+0.039
<+0.037
coon+0.037
Neg logit contributions
chard-0.046
 Sorry-0.046
apped-0.040
ually-0.040
arium-0.039
Token
Token ID8151
Feature activation-8.78e-01

Pos logit contributions
 adult+0.059
co+0.057
<+0.054
 stret+0.053
 ambulance+0.052
Neg logit contributions
chard-0.077
 Sorry-0.074
ishment-0.067
ually-0.066
nam-0.065
Tokens
Token ID82
Feature activation-8.22e-01

Pos logit contributions
 adult+0.033
 ambulance+0.029
co+0.028
<+0.027
 stret+0.027
Neg logit contributions
 Sorry-0.037
chard-0.037
ually-0.034
apped-0.033
ap-0.032
Token fault
Token ID8046
Feature activation-5.22e+00

Pos logit contributions
 adult+0.033
 ambulance+0.029
co+0.028
 stret+0.028
<+0.028
Neg logit contributions
 Sorry-0.033
chard-0.032
ually-0.030
apped-0.029
ap-0.028
Token.
Token ID13
Feature activation-3.34e-01

Pos logit contributions
 adult+0.231
 ambulance+0.205
 stret+0.202
co+0.202
<+0.200
Neg logit contributions
 Sorry-0.180
chard-0.176
apped-0.170
ually-0.167
ap-0.161
Token
Token ID220
Feature activation-2.30e-01

Pos logit contributions
 adult+0.014
 ambulance+0.012
co+0.012
 stret+0.012
coon+0.012
Neg logit contributions
 Sorry-0.013
chard-0.012
ually-0.011
apped-0.011
ap-0.011
Token
Token ID198
Feature activation-4.98e-01

Pos logit contributions
 adult+0.009
 ambulance+0.008
 wolf+0.008
co+0.008
 stret+0.008
Neg logit contributions
 Sorry-0.009
chard-0.009
apped-0.009
ually-0.008
ap-0.008
Token
Token ID198
Feature activation-3.00e-01

Pos logit contributions
 adult+0.023
 ambulance+0.023
 wolf+0.021
 tele+0.021
 stret+0.020
Neg logit contributions
 Sorry-0.014
ually-0.014
chard-0.013
apped-0.013
-0.013
TokenThe
Token ID464
Feature activation+1.82e-01

Pos logit contributions
 adult+0.013
 ambulance+0.012
 wolf+0.012
 tele+0.012
 stret+0.011
Neg logit contributions
 Sorry-0.009
ually-0.009
chard-0.008
boat-0.008
ised-0.008
Token and
Token ID290
Feature activation-1.68e-01

Pos logit contributions
 adult+0.063
 ambulance+0.055
co+0.053
 stret+0.052
<+0.052
Neg logit contributions
 Sorry-0.074
chard-0.073
ually-0.067
apped-0.066
rac-0.063
Token said
Token ID531
Feature activation-7.84e-01

Pos logit contributions
 adult+0.005
 sheep+0.005
co+0.005
 sc+0.005
 children+0.005
Neg logit contributions
 Sorry-0.007
chard-0.007
ually-0.007
icker-0.007
arium-0.007
Token it
Token ID340
Feature activation-1.77e-01

Pos logit contributions
 adult+0.032
 ambulance+0.028
 warning+0.027
 sheep+0.027
 wolf+0.026
Neg logit contributions
chard-0.030
 Sorry-0.028
ually-0.028
ishment-0.027
arium-0.026
Token was
Token ID373
Feature activation-3.85e-01

Pos logit contributions
 adult+0.005
 ambulance+0.004
co+0.004
<+0.004
coon+0.004
Neg logit contributions
chard-0.009
 Sorry-0.009
ually-0.008
apped-0.008
rac-0.008
Token their
Token ID511
Feature activation-5.18e-01

Pos logit contributions
 adult+0.014
 ambulance+0.012
<+0.012
 wolf+0.011
 stret+0.011
Neg logit contributions
 Sorry-0.017
chard-0.016
ually-0.016
apped-0.015
rac-0.015
Token fault
Token ID8046
Feature activation-5.20e+00

Pos logit contributions
 adult+0.020
co+0.018
 ambulance+0.018
coon+0.018
 stret+0.018
Neg logit contributions
 Sorry-0.021
chard-0.021
ually-0.019
apped-0.018
ap-0.018
Token.
Token ID13
Feature activation+2.54e-01

Pos logit contributions
 adult+0.236
 stret+0.212
 ambulance+0.210
co+0.208
<+0.207
Neg logit contributions
 Sorry-0.172
chard-0.166
apped-0.165
ap-0.159
ually-0.157
Token Mom
Token ID11254
Feature activation+1.22e+00

Pos logit contributions
chard+0.010
 Sorry+0.010
ually+0.009
apped+0.009
kins+0.008
Neg logit contributions
 adult-0.010
 ambulance-0.009
<-0.009
co-0.009
 stret-0.009
Token said
Token ID531
Feature activation-5.49e-01

Pos logit contributions
 Sorry+0.036
chard+0.035
apped+0.034
ually+0.033
ap+0.031
Neg logit contributions
 adult-0.058
<-0.053
 stret-0.052
 ambulance-0.051
 wolf-0.050
Token they
Token ID484
Feature activation-1.22e-02

Pos logit contributions
 adult+0.028
 ambulance+0.025
<+0.025
co+0.025
 stret+0.025
Neg logit contributions
 Sorry-0.016
chard-0.016
ually-0.014
apped-0.013
ap-0.013
Token were
Token ID547
Feature activation-4.82e-01

Pos logit contributions
 adult+0.001
 ambulance+0.001
 stret+0.001
co+0.000
coon+0.000
Neg logit contributions
 Sorry-0.000
chard-0.000
ually-0.000
apped-0.000
ap-0.000
Tokenol
Token ID349
Feature activation-9.76e-01

Pos logit contributions
 adult+0.016
 stret+0.015
 wolf+0.015
 ambulance+0.015
coon+0.014
Neg logit contributions
 Sorry-0.012
apped-0.012
ap-0.011
ually-0.011
rac-0.010
Tokenanda
Token ID5282
Feature activation-1.31e+00

Pos logit contributions
 adult+0.040
 ambulance+0.036
<+0.035
 stret+0.035
coon+0.034
Neg logit contributions
 Sorry-0.037
chard-0.036
ually-0.034
apped-0.033
ap-0.032
Token and
Token ID290
Feature activation-5.38e-01

Pos logit contributions
 adult+0.054
 ambulance+0.047
co+0.046
 sheep+0.046
 wolf+0.045
Neg logit contributions
 Sorry-0.051
chard-0.051
ually-0.044
apped-0.044
nam-0.043
Token her
Token ID607
Feature activation-7.09e-01

Pos logit contributions
 adult+0.021
 sheep+0.018
 ambulance+0.017
 wolf+0.017
co+0.016
Neg logit contributions
chard-0.023
 Sorry-0.022
ually-0.020
apped-0.020
arium-0.019
Token sun
Token ID4252
Feature activation-1.15e+00

Pos logit contributions
 adult+0.025
coon+0.023
 stret+0.022
 ambulance+0.021
<+0.021
Neg logit contributions
 Sorry-0.030
chard-0.029
apped-0.027
ually-0.027
kins-0.027
Tokenfl
Token ID2704
Feature activation-5.19e+00

Pos logit contributions
 adult+0.037
 ambulance+0.031
<+0.030
 stret+0.030
co+0.029
Neg logit contributions
 Sorry-0.055
chard-0.054
ually-0.051
apped-0.050
ap-0.049
Tokenowers
Token ID3618
Feature activation-1.68e+00

Pos logit contributions
 adult+0.290
 ambulance+0.261
 stret+0.256
<+0.255
coon+0.253
Neg logit contributions
 Sorry-0.129
chard-0.120
apped-0.111
ually-0.109
ap-0.102
Token started
Token ID2067
Feature activation-5.28e-02

Pos logit contributions
 adult+0.065
 sheep+0.057
 ambulance+0.057
co+0.057
<+0.056
Neg logit contributions
 Sorry-0.068
chard-0.066
ually-0.058
ishment-0.057
apped-0.056
Token to
Token ID284
Feature activation-5.71e-01

Pos logit contributions
 adult+0.002
 ambulance+0.001
co+0.001
<+0.001
 sheep+0.001
Neg logit contributions
 Sorry-0.002
chard-0.002
ually-0.002
apped-0.002
rac-0.002
Token dance
Token ID9280
Feature activation+9.76e-01

Pos logit contributions
 adult+0.023
 ambulance+0.020
co+0.020
<+0.020
 sheep+0.019
Neg logit contributions
 Sorry-0.023
chard-0.022
apped-0.020
ually-0.020
rac-0.019
Token together
Token ID1978
Feature activation-5.36e-01

Pos logit contributions
 Sorry+0.043
chard+0.041
ually+0.038
apped+0.036
rac+0.036
Neg logit contributions
 adult-0.035
co-0.032
 ambulance-0.031
 stret-0.031
<-0.030
Token,
Token ID11
Feature activation-5.51e-01

Pos logit contributions
 adult+0.070
 ambulance+0.062
co+0.060
<+0.060
 stret+0.059
Neg logit contributions
 Sorry-0.056
chard-0.055
ually-0.050
apped-0.049
ap-0.046
Token we
Token ID356
Feature activation+3.14e-02

Pos logit contributions
 adult+0.025
co+0.022
 ambulance+0.022
<+0.022
coon+0.021
Neg logit contributions
 Sorry-0.019
chard-0.018
ually-0.016
apped-0.016
ap-0.016
Tokenâ
Token ID22940
Feature activation-6.37e-01

Pos logit contributions
 Sorry+0.001
chard+0.001
ually+0.001
apped+0.001
ap+0.001
Neg logit contributions
 adult-0.001
 ambulance-0.001
 stret-0.001
co-0.001
coon-0.001
Token
Token ID26391
Feature activation-1.18e+00

Pos logit contributions
 adult+0.021
co+0.020
 ambulance+0.019
coon+0.018
 stret+0.017
Neg logit contributions
chard-0.031
 Sorry-0.029
nam-0.028
apped-0.027
arium-0.027
Token
Token ID8151
Feature activation-2.30e-01

Pos logit contributions
co+0.045
 stret+0.042
 adult+0.042
<+0.041
 ambulance+0.040
Neg logit contributions
chard-0.048
 Sorry-0.042
ishment-0.041
nam-0.041
ateful-0.039
Tokenll
Token ID297
Feature activation-5.17e+00

Pos logit contributions
 adult+0.009
 ambulance+0.008
co+0.008
<+0.008
 stret+0.008
Neg logit contributions
 Sorry-0.009
chard-0.009
ually-0.008
apped-0.008
ap-0.007
Token mention
Token ID3068
Feature activation-3.47e-01

Pos logit contributions
 adult+0.212
 ambulance+0.186
<+0.182
co+0.181
 stret+0.180
Neg logit contributions
 Sorry-0.198
chard-0.192
ually-0.181
apped-0.174
ap-0.173
Token it
Token ID340
Feature activation-7.14e-02

Pos logit contributions
 adult+0.015
 ambulance+0.013
co+0.013
<+0.013
 stret+0.013
Neg logit contributions
 Sorry-0.013
chard-0.012
ually-0.011
apped-0.011
ap-0.011
Token to
Token ID284
Feature activation+1.76e-01

Pos logit contributions
 adult+0.003
 ambulance+0.002
co+0.002
<+0.002
 stret+0.002
Neg logit contributions
 Sorry-0.003
chard-0.003
ually-0.003
apped-0.003
ap-0.003
Token the
Token ID262
Feature activation+4.48e-01

Pos logit contributions
 Sorry+0.007
chard+0.007
ually+0.006
apped+0.006
ap+0.006
Neg logit contributions
 adult-0.007
 ambulance-0.006
co-0.006
<-0.006
 stret-0.006
Token grown
Token ID7334
Feature activation+9.87e-01

Pos logit contributions
chard+0.020
 Sorry+0.020
ap+0.017
nam+0.016
 league+0.016
Neg logit contributions
 adult-0.014
<-0.013
Sweet-0.013
ousy-0.013
co-0.012
Token very
Token ID845
Feature activation-6.31e-02

Pos logit contributions
 adult+0.004
 ambulance+0.004
<+0.004
 stret+0.004
 warning+0.004
Neg logit contributions
 Sorry-0.005
chard-0.005
ually-0.005
apped-0.005
rac-0.004
Token happy
Token ID3772
Feature activation-1.66e-01

Pos logit contributions
 adult+0.002
 warning+0.002
 ambulance+0.002
<+0.002
 sheep+0.001
Neg logit contributions
 Sorry-0.003
chard-0.003
ually-0.003
arium-0.003
ishment-0.003
Token with
Token ID351
Feature activation+7.79e-01

Pos logit contributions
 adult+0.006
 ambulance+0.005
co+0.005
<+0.005
 wolf+0.005
Neg logit contributions
 Sorry-0.007
chard-0.007
ually-0.007
apped-0.007
rac-0.007
Token his
Token ID465
Feature activation+2.70e-01

Pos logit contributions
 Sorry+0.028
chard+0.028
ually+0.025
apped+0.025
ap+0.024
Neg logit contributions
 adult-0.035
 ambulance-0.031
co-0.030
<-0.030
 stret-0.029
Token sun
Token ID4252
Feature activation-8.00e-01

Pos logit contributions
 Sorry+0.012
chard+0.012
ually+0.011
apped+0.011
ap+0.010
Neg logit contributions
 adult-0.010
 ambulance-0.008
co-0.008
 stret-0.008
<-0.008
Tokenfl
Token ID2704
Feature activation-5.12e+00

Pos logit contributions
 adult+0.034
 ambulance+0.030
co+0.029
<+0.029
 stret+0.029
Neg logit contributions
 Sorry-0.030
chard-0.030
ually-0.027
apped-0.027
ap-0.025
Tokenowers
Token ID3618
Feature activation-1.44e+00

Pos logit contributions
 adult+0.270
 ambulance+0.245
co+0.239
<+0.238
 stret+0.237
Neg logit contributions
 Sorry-0.139
chard-0.137
ually-0.120
apped-0.116
ap-0.109
Token.
Token ID13
Feature activation+2.22e-01

Pos logit contributions
 adult+0.048
 ambulance+0.041
 wolf+0.040
 sheep+0.040
co+0.039
Neg logit contributions
 Sorry-0.067
chard-0.065
ually-0.060
apped-0.059
rac-0.058
Token He
Token ID679
Feature activation+5.28e-01

Pos logit contributions
 Sorry+0.009
chard+0.009
ually+0.009
apped+0.008
ap+0.008
Neg logit contributions
 adult-0.008
 ambulance-0.007
<-0.007
co-0.007
 stret-0.007
Token visited
Token ID8672
Feature activation+1.17e+00

Pos logit contributions
 Sorry+0.021
chard+0.020
ually+0.018
apped+0.018
ap+0.017
Neg logit contributions
 adult-0.022
 ambulance-0.019
<-0.019
co-0.019
 stret-0.018
Token them
Token ID606
Feature activation-1.00e-01

Pos logit contributions
 Sorry+0.047
chard+0.045
ually+0.041
apped+0.041
ap+0.039
Neg logit contributions
 adult-0.049
 ambulance-0.042
co-0.041
 wolf-0.040
<-0.040
Token and
Token ID290
Feature activation-1.48e-01

Pos logit contributions
 adult+0.040
 ambulance+0.035
co+0.035
 stret+0.033
<+0.031
Neg logit contributions
 Sorry-0.075
chard-0.073
ually-0.067
apped-0.064
rac-0.063
Token look
Token ID804
Feature activation-1.34e-01

Pos logit contributions
 adult+0.005
 sheep+0.004
 ambulance+0.004
co+0.004
 stret+0.004
Neg logit contributions
chard-0.006
 Sorry-0.006
ually-0.006
apped-0.006
arium-0.006
Token at
Token ID379
Feature activation+1.80e+00

Pos logit contributions
 adult+0.004
 ambulance+0.004
 stret+0.004
co+0.004
 sheep+0.004
Neg logit contributions
 Sorry-0.006
chard-0.006
ually-0.006
apped-0.005
rac-0.005
Token their
Token ID511
Feature activation+8.31e-01

Pos logit contributions
 Sorry+0.069
chard+0.066
ually+0.061
apped+0.059
ap+0.056
Neg logit contributions
 adult-0.077
 ambulance-0.068
co-0.065
 sheep-0.064
<-0.064
Token snow
Token ID6729
Feature activation-1.48e+00

Pos logit contributions
 Sorry+0.034
chard+0.032
ually+0.031
ap+0.031
apped+0.030
Neg logit contributions
 adult-0.032
coon-0.028
 ambulance-0.028
co-0.027
 stret-0.027
Tokenfl
Token ID2704
Feature activation-5.09e+00

Pos logit contributions
 adult+0.051
 ambulance+0.045
co+0.045
 stret+0.043
coon+0.042
Neg logit contributions
chard-0.066
 Sorry-0.066
ually-0.060
apped-0.059
rac-0.057
Tokenakes
Token ID1124
Feature activation-1.63e+00

Pos logit contributions
 adult+0.245
 ambulance+0.228
co+0.226
<+0.220
 stret+0.219
Neg logit contributions
chard-0.161
 Sorry-0.153
ually-0.134
apped-0.125
arium-0.124
Token.
Token ID13
Feature activation-5.74e-02

Pos logit contributions
 adult+0.057
 ambulance+0.050
co+0.049
 stret+0.048
<+0.047
Neg logit contributions
 Sorry-0.073
chard-0.072
ually-0.066
apped-0.065
rac-0.063
Token
Token ID198
Feature activation-2.43e-01

Pos logit contributions
 adult+0.003
 ambulance+0.002
coon+0.002
co+0.002
 stret+0.002
Neg logit contributions
 Sorry-0.002
chard-0.002
apped-0.002
ually-0.002
ap-0.002
Token
Token ID198
Feature activation-2.70e-02

Pos logit contributions
 adult+0.012
 ambulance+0.011
 wolf+0.011
coon+0.010
 stret+0.010
Neg logit contributions
 Sorry-0.007
chard-0.007
ually-0.007
apped-0.006
ap-0.006
TokenS
Token ID50
Feature activation+3.89e-01

Pos logit contributions
 adult+0.001
 ambulance+0.001
 wolf+0.001
 stret+0.001
co+0.001
Neg logit contributions
 Sorry-0.001
chard-0.001
ually-0.001
apped-0.001
ap-0.001
Token sw
Token ID1509
Feature activation-2.18e+00

Pos logit contributions
 adult+0.078
 ambulance+0.069
 sheep+0.067
co+0.067
 wolf+0.066
Neg logit contributions
 Sorry-0.080
chard-0.078
ually-0.071
apped-0.069
rac-0.068
Tokenam
Token ID321
Feature activation+4.56e-02

Pos logit contributions
 adult+0.111
 ambulance+0.100
co+0.098
<+0.097
ousy+0.093
Neg logit contributions
chard-0.069
 Sorry-0.067
nam-0.058
ually-0.052
arium-0.051
Token away
Token ID1497
Feature activation-6.91e-01

Pos logit contributions
 Sorry+0.002
chard+0.002
ually+0.002
rac+0.002
apped+0.002
Neg logit contributions
 adult-0.002
 ambulance-0.002
co-0.001
<-0.001
 wolf-0.001
Token from
Token ID422
Feature activation-3.72e-01

Pos logit contributions
 adult+0.024
 warning+0.020
 ambulance+0.020
 wolf+0.019
co+0.019
Neg logit contributions
 Sorry-0.032
chard-0.031
rac-0.029
ually-0.028
apped-0.027
Token O
Token ID440
Feature activation-3.43e+00

Pos logit contributions
 adult+0.017
 ambulance+0.015
co+0.015
<+0.015
 stret+0.015
Neg logit contributions
 Sorry-0.013
chard-0.013
ually-0.012
apped-0.011
ap-0.011
Tokenll
Token ID297
Feature activation-5.04e+00

Pos logit contributions
 adult+0.096
 ambulance+0.079
co+0.076
<+0.075
 stret+0.073
Neg logit contributions
 Sorry-0.180
chard-0.178
ually-0.166
apped-0.164
ap-0.158
Tokenie
Token ID494
Feature activation-2.18e+00

Pos logit contributions
Sweet+0.244
 adult+0.240
co+0.240
 extermin+0.235
 stret+0.226
Neg logit contributions
boat-0.129
chard-0.128
ually-0.111
ams-0.110
ap-0.110
Token.
Token ID13
Feature activation-5.71e-01

Pos logit contributions
 adult+0.087
 ambulance+0.076
co+0.074
<+0.074
 stret+0.074
Neg logit contributions
 Sorry-0.086
chard-0.086
ually-0.078
apped-0.077
ap-0.074
Token O
Token ID440
Feature activation-2.26e+00

Pos logit contributions
 adult+0.025
 ambulance+0.023
 stret+0.023
co+0.022
coon+0.022
Neg logit contributions
 Sorry-0.021
chard-0.020
apped-0.019
ually-0.019
ap-0.017
Tokenll
Token ID297
Feature activation-3.70e+00

Pos logit contributions
 adult+0.074
 ambulance+0.064
 stret+0.062
 wolf+0.059
<+0.058
Neg logit contributions
 Sorry-0.105
chard-0.104
ually-0.099
apped-0.097
ap-0.096
Tokenie
Token ID494
Feature activation-9.50e-01

Pos logit contributions
 adult+0.173
 stret+0.165
 ambulance+0.161
co+0.159
Sweet+0.154
Neg logit contributions
chard-0.109
 Sorry-0.107
ually-0.104
apped-0.103
boat-0.101
Token en
Token ID551
Feature activation+1.48e+00

Pos logit contributions
 Sorry+0.001
chard+0.001
ually+0.001
apped+0.001
ap+0.001
Neg logit contributions
 adult-0.001
 ambulance-0.001
co-0.001
<-0.001
 stret-0.001
Tokenvious
Token ID1442
Feature activation-8.99e-01

Pos logit contributions
chard+0.073
 Sorry+0.073
ually+0.072
kins+0.065
field+0.063
Neg logit contributions
 adult-0.046
 ambulance-0.041
<-0.036
co-0.036
coon-0.035
Token of
Token ID286
Feature activation-1.41e-01

Pos logit contributions
 adult+0.028
 ambulance+0.023
 stret+0.023
<+0.023
co+0.023
Neg logit contributions
 Sorry-0.044
chard-0.043
apped-0.040
ually-0.040
ap-0.039
Token their
Token ID511
Feature activation+3.22e-01

Pos logit contributions
 adult+0.005
 ambulance+0.005
 stret+0.005
<+0.005
co+0.004
Neg logit contributions
chard-0.006
 Sorry-0.006
apped-0.005
ually-0.005
kins-0.005
Token sun
Token ID4252
Feature activation-1.03e+00

Pos logit contributions
 Sorry+0.012
chard+0.012
ually+0.011
apped+0.011
ap+0.010
Neg logit contributions
 adult-0.013
 ambulance-0.012
co-0.012
 stret-0.012
<-0.012
Tokenfl
Token ID2704
Feature activation-5.04e+00

Pos logit contributions
 adult+0.040
 ambulance+0.036
co+0.035
 sheep+0.035
 wolf+0.034
Neg logit contributions
 Sorry-0.042
chard-0.041
ually-0.037
apped-0.036
rac-0.035
Tokenowers
Token ID3618
Feature activation-1.41e+00

Pos logit contributions
 adult+0.222
 ambulance+0.207
co+0.207
ousy+0.198
<+0.197
Neg logit contributions
chard-0.176
 Sorry-0.170
ually-0.151
nam-0.145
arium-0.138
Token.
Token ID13
Feature activation-1.91e-01

Pos logit contributions
 adult+0.049
 ambulance+0.043
co+0.041
 wolf+0.041
 sheep+0.041
Neg logit contributions
 Sorry-0.064
chard-0.061
ually-0.058
apped-0.057
rac-0.055
Token He
Token ID679
Feature activation+6.67e-01

Pos logit contributions
 adult+0.007
 ambulance+0.006
co+0.006
<+0.006
 stret+0.006
Neg logit contributions
 Sorry-0.008
chard-0.008
ually-0.008
apped-0.007
ap-0.007
Token does
Token ID857
Feature activation-5.40e-01

Pos logit contributions
 Sorry+0.026
chard+0.025
ually+0.023
apped+0.023
ap+0.022
Neg logit contributions
 adult-0.027
 stret-0.024
 ambulance-0.023
co-0.023
coon-0.023
Token not
Token ID407
Feature activation-1.31e-01

Pos logit contributions
 adult+0.019
<+0.016
 warning+0.016
 ambulance+0.015
 stret+0.015
Neg logit contributions
 Sorry-0.025
chard-0.024
apped-0.023
ually-0.023
light-0.021
Token and
Token ID290
Feature activation+3.42e-01

Pos logit contributions
 adult+0.074
 ambulance+0.066
co+0.064
 stret+0.063
<+0.062
Neg logit contributions
 Sorry-0.077
chard-0.076
ually-0.069
apped-0.068
ap-0.066
Token I
Token ID314
Feature activation+6.16e-01

Pos logit contributions
chard+0.013
 Sorry+0.013
apped+0.012
ually+0.012
arium+0.012
Neg logit contributions
 adult-0.014
 ambulance-0.012
 stret-0.012
 sheep-0.012
 wolf-0.011
Tokenâ
Token ID22940
Feature activation-1.73e-01

Pos logit contributions
 Sorry+0.026
chard+0.026
ually+0.024
apped+0.023
ap+0.023
Neg logit contributions
 adult-0.023
 ambulance-0.020
co-0.019
<-0.019
 stret-0.019
Token
Token ID26391
Feature activation-9.32e-01

Pos logit contributions
 adult+0.007
 ambulance+0.006
co+0.006
coon+0.006
 stret+0.006
Neg logit contributions
chard-0.008
 Sorry-0.007
arium-0.007
apped-0.007
kins-0.007
Token
Token ID8151
Feature activation-1.02e-02

Pos logit contributions
 adult+0.033
co+0.033
 stret+0.032
<+0.031
 ambulance+0.031
Neg logit contributions
chard-0.038
 Sorry-0.036
ishment-0.033
ually-0.033
apped-0.032
Tokenll
Token ID297
Feature activation-5.02e+00

Pos logit contributions
 adult+0.000
 ambulance+0.000
co+0.000
<+0.000
 stret+0.000
Neg logit contributions
 Sorry-0.000
chard-0.000
ually-0.000
apped-0.000
ap-0.000
Token ride
Token ID6594
Feature activation+2.01e-01

Pos logit contributions
 adult+0.208
 ambulance+0.181
<+0.180
co+0.178
Sweet+0.177
Neg logit contributions
 Sorry-0.190
chard-0.182
ually-0.174
ap-0.165
apped-0.164
Token it
Token ID340
Feature activation-4.61e-01

Pos logit contributions
 Sorry+0.008
chard+0.007
ually+0.007
apped+0.007
arium+0.006
Neg logit contributions
 adult-0.008
 ambulance-0.008
co-0.007
<-0.007
 stret-0.007
Token!
Token ID0
Feature activation-3.21e-01

Pos logit contributions
 adult+0.018
 ambulance+0.016
co+0.016
<+0.015
 stret+0.015
Neg logit contributions
 Sorry-0.019
chard-0.019
ually-0.017
apped-0.017
ap-0.016
Tokenâ
Token ID22940
Feature activation+4.04e-01

Pos logit contributions
 adult+0.012
 ambulance+0.011
co+0.011
 stret+0.010
coon+0.010
Neg logit contributions
 Sorry-0.014
chard-0.013
ually-0.012
apped-0.012
ap-0.011
Token
Token ID26391
Feature activation-3.28e-01

Pos logit contributions
chard+0.019
 Sorry+0.018
apped+0.017
arium+0.016
nam+0.016
Neg logit contributions
 adult-0.014
co-0.013
 ambulance-0.012
coon-0.012
<-0.011
Token are
Token ID389
Feature activation-1.16e-01

Pos logit contributions
 Sorry+0.009
chard+0.009
ually+0.008
apped+0.008
rac+0.008
Neg logit contributions
 adult-0.008
 ambulance-0.007
<-0.007
co-0.007
 stret-0.007
Token happy
Token ID3772
Feature activation-5.50e-01

Pos logit contributions
 adult+0.004
 ambulance+0.003
<+0.003
 sheep+0.003
 warning+0.003
Neg logit contributions
 Sorry-0.005
chard-0.005
ually-0.005
arium-0.005
rac-0.005
Token with
Token ID351
Feature activation+5.44e-01

Pos logit contributions
 adult+0.022
 ambulance+0.019
co+0.019
<+0.018
 stret+0.018
Neg logit contributions
 Sorry-0.022
chard-0.022
ually-0.020
apped-0.020
ap-0.019
Token their
Token ID511
Feature activation+1.09e-02

Pos logit contributions
 Sorry+0.013
chard+0.013
ually+0.011
apped+0.011
ap+0.010
Neg logit contributions
 adult-0.030
 ambulance-0.027
co-0.027
<-0.027
coon-0.027
Token snow
Token ID6729
Feature activation-1.88e+00

Pos logit contributions
 Sorry+0.000
ap+0.000
kins+0.000
ually+0.000
light+0.000
Neg logit contributions
 adult-0.000
coon-0.000
co-0.000
iger-0.000
 stret-0.000
Tokenfl
Token ID2704
Feature activation-5.02e+00

Pos logit contributions
 adult+0.083
 ambulance+0.072
<+0.070
 stret+0.070
coon+0.068
Neg logit contributions
 Sorry-0.068
chard-0.066
ually-0.062
apped-0.062
ap-0.059
Tokenakes
Token ID1124
Feature activation-1.86e+00

Pos logit contributions
 adult+0.276
 ambulance+0.249
 stret+0.243
<+0.242
co+0.241
Neg logit contributions
 Sorry-0.129
chard-0.122
ually-0.109
apped-0.109
ap-0.102
Token.
Token ID13
Feature activation+6.75e-03

Pos logit contributions
 adult+0.071
 ambulance+0.062
co+0.059
<+0.059
 stret+0.059
Neg logit contributions
 Sorry-0.078
chard-0.077
ually-0.071
apped-0.070
ap-0.067
Token They
Token ID1119
Feature activation+1.17e-01

Pos logit contributions
chard+0.000
 Sorry+0.000
ually+0.000
kins+0.000
apped+0.000
Neg logit contributions
 adult-0.000
<-0.000
 ambulance-0.000
co-0.000
 stret-0.000
Token glue
Token ID22749
Feature activation+1.43e-01

Pos logit contributions
 Sorry+0.005
chard+0.005
ually+0.004
apped+0.004
ap+0.004
Neg logit contributions
 adult-0.005
 ambulance-0.004
co-0.004
<-0.004
 stret-0.004
Token them
Token ID606
Feature activation-1.06e+00

Pos logit contributions
chard+0.007
 Sorry+0.006
nam+0.006
ually+0.006
arium+0.006
Neg logit contributions
 adult-0.005
 ambulance-0.004
 sheep-0.004
 wool-0.004
co-0.004
Token He
Token ID679
Feature activation+7.45e-01

Pos logit contributions
 adult+0.018
 ambulance+0.016
 stret+0.016
co+0.016
coon+0.015
Neg logit contributions
 Sorry-0.020
chard-0.019
apped-0.018
ually-0.018
ap-0.017
Token said
Token ID531
Feature activation-6.54e-01

Pos logit contributions
 Sorry+0.023
chard+0.023
apped+0.020
ually+0.020
rac+0.019
Neg logit contributions
 adult-0.036
 ambulance-0.032
 stret-0.031
<-0.031
ousy-0.031
Token it
Token ID340
Feature activation-2.30e-01

Pos logit contributions
 adult+0.031
 ambulance+0.028
co+0.027
<+0.027
 stret+0.027
Neg logit contributions
 Sorry-0.021
chard-0.021
ually-0.019
apped-0.019
ap-0.018
Token was
Token ID373
Feature activation-3.17e-01

Pos logit contributions
 adult+0.010
 stret+0.010
 ambulance+0.009
 reins+0.009
<+0.009
Neg logit contributions
 Sorry-0.008
chard-0.007
apped-0.007
ually-0.007
 neither-0.007
Token her
Token ID607
Feature activation-6.47e-01

Pos logit contributions
 adult+0.013
 ambulance+0.012
co+0.011
<+0.011
 stret+0.011
Neg logit contributions
 Sorry-0.012
chard-0.012
ually-0.011
apped-0.011
ap-0.010
Token fault
Token ID8046
Feature activation-5.02e+00

Pos logit contributions
 adult+0.023
Sweet+0.023
co+0.022
<+0.022
ousy+0.022
Neg logit contributions
 Sorry-0.025
chard-0.024
ap-0.023
apped-0.023
ually-0.022
Token for
Token ID329
Feature activation-1.30e+00

Pos logit contributions
 adult+0.244
 stret+0.239
Sweet+0.229
co+0.228
<+0.223
Neg logit contributions
apped-0.144
ap-0.136
 neither-0.133
 together-0.119
 Sorry-0.116
Token not
Token ID407
Feature activation-6.70e-01

Pos logit contributions
 adult+0.049
 ambulance+0.042
co+0.041
<+0.040
 stret+0.040
Neg logit contributions
 Sorry-0.056
chard-0.055
ually-0.050
apped-0.050
ap-0.048
Token moving
Token ID3867
Feature activation-5.29e-01

Pos logit contributions
 adult+0.023
 ambulance+0.019
 warning+0.018
<+0.018
 stret+0.018
Neg logit contributions
 Sorry-0.032
chard-0.031
ually-0.029
apped-0.028
rac-0.028
Token.
Token ID13
Feature activation+1.19e-01

Pos logit contributions
 adult+0.020
 ambulance+0.018
co+0.017
 stret+0.017
<+0.016
Neg logit contributions
 Sorry-0.023
chard-0.022
ually-0.021
apped-0.020
rac-0.019
Token
Token ID198
Feature activation-4.49e-01

Pos logit contributions
 Sorry+0.005
chard+0.005
ually+0.005
apped+0.005
ap+0.005
Neg logit contributions
 adult-0.004
 ambulance-0.004
co-0.004
 stret-0.004
<-0.004
Token wealthy
Token ID11574
Feature activation-7.89e-01

Pos logit contributions
chard+0.007
 Sorry+0.007
ually+0.006
kins+0.006
apped+0.006
Neg logit contributions
 adult-0.005
 stret-0.004
co-0.004
 ambulance-0.004
coon-0.004
Token prince
Token ID19716
Feature activation+2.36e-01

Pos logit contributions
 adult+0.027
 ambulance+0.023
 wolf+0.022
co+0.021
<+0.021
Neg logit contributions
 Sorry-0.037
chard-0.036
ually-0.034
apped-0.033
ap-0.032
Token and
Token ID290
Feature activation-4.05e-01

Pos logit contributions
 Sorry+0.011
chard+0.011
ually+0.010
apped+0.010
nam+0.009
Neg logit contributions
 adult-0.008
 ambulance-0.007
co-0.007
 sheep-0.007
 wolf-0.007
Token his
Token ID465
Feature activation-2.60e-01

Pos logit contributions
 adult+0.017
 wolf+0.014
 sheep+0.014
 ambulance+0.013
 warning+0.013
Neg logit contributions
 Sorry-0.017
chard-0.016
ually-0.016
apped-0.015
ap-0.015
Token sun
Token ID4252
Feature activation-9.96e-01

Pos logit contributions
 adult+0.009
 ambulance+0.008
co+0.008
 stret+0.008
<+0.008
Neg logit contributions
chard-0.012
 Sorry-0.012
ually-0.011
apped-0.010
ap-0.010
Tokenfl
Token ID2704
Feature activation-5.01e+00

Pos logit contributions
 adult+0.038
 ambulance+0.034
co+0.033
<+0.032
 stret+0.032
Neg logit contributions
 Sorry-0.041
chard-0.041
ually-0.036
apped-0.035
rac-0.035
Tokenowers
Token ID3618
Feature activation-1.66e+00

Pos logit contributions
 adult+0.252
 ambulance+0.230
co+0.226
<+0.222
 stret+0.221
Neg logit contributions
chard-0.148
 Sorry-0.146
ually-0.126
apped-0.118
arium-0.114
Token lived
Token ID5615
Feature activation+9.21e-01

Pos logit contributions
 adult+0.064
 ambulance+0.056
 sheep+0.056
co+0.055
<+0.055
Neg logit contributions
 Sorry-0.069
chard-0.066
ually-0.057
ishment-0.057
apped-0.056
Token happily
Token ID18177
Feature activation-6.98e-01

Pos logit contributions
 Sorry+0.044
chard+0.037
ually+0.036
rac+0.036
ishment+0.035
Neg logit contributions
 adult-0.030
co-0.028
 living-0.027
 sheep-0.027
<-0.026
Token ever
Token ID1683
Feature activation-2.39e-01

Pos logit contributions
 adult+0.022
co+0.021
 sheep+0.020
 living+0.020
 ambulance+0.019
Neg logit contributions
 Sorry-0.032
chard-0.032
ishment-0.028
rac-0.027
ually-0.027
Token after
Token ID706
Feature activation-4.97e-01

Pos logit contributions
 adult+0.011
co+0.009
 sheep+0.009
 ambulance+0.009
 wolf+0.009
Neg logit contributions
 Sorry-0.009
chard-0.008
apped-0.008
ually-0.007
rac-0.007
Token Tom
Token ID4186
Feature activation+7.43e-01

Pos logit contributions
 adult+0.030
co+0.028
 stret+0.027
<+0.027
ousy+0.027
Neg logit contributions
 Sorry-0.020
chard-0.018
rac-0.015
apped-0.015
ap-0.015
Token,
Token ID11
Feature activation-1.15e+00

Pos logit contributions
 Sorry+0.027
chard+0.027
ually+0.024
apped+0.024
rac+0.023
Neg logit contributions
 adult-0.032
 ambulance-0.029
co-0.028
 stret-0.028
<-0.027
Token it
Token ID340
Feature activation-4.42e-01

Pos logit contributions
 adult+0.067
co+0.065
ousy+0.064
 stret+0.063
iger+0.062
Neg logit contributions
 Sorry-0.025
chard-0.022
ap-0.017
 neither-0.016
rac-0.016
Token was
Token ID373
Feature activation-5.84e-01

Pos logit contributions
 adult+0.017
 ambulance+0.015
 stret+0.014
co+0.014
<+0.014
Neg logit contributions
 Sorry-0.019
chard-0.018
ually-0.017
apped-0.016
ap-0.016
Token my
Token ID616
Feature activation-9.26e-01

Pos logit contributions
 adult+0.021
 ambulance+0.018
<+0.017
 stret+0.017
co+0.017
Neg logit contributions
 Sorry-0.026
chard-0.026
ually-0.024
apped-0.024
ap-0.023
Token fault
Token ID8046
Feature activation-4.99e+00

Pos logit contributions
 adult+0.032
co+0.029
coon+0.029
<+0.028
 stret+0.028
Neg logit contributions
 Sorry-0.041
chard-0.039
ap-0.037
apped-0.037
ually-0.036
Token,"
Token ID553
Feature activation-7.06e-01

Pos logit contributions
 adult+0.244
 stret+0.219
co+0.218
<+0.218
 ambulance+0.217
Neg logit contributions
 Sorry-0.149
chard-0.140
apped-0.136
ap-0.132
ually-0.130
Token Mia
Token ID32189
Feature activation-4.00e-01

Pos logit contributions
 adult+0.029
 ambulance+0.025
<+0.024
 sheep+0.024
co+0.024
Neg logit contributions
chard-0.028
 Sorry-0.027
ually-0.026
apped-0.024
ap-0.024
Token said
Token ID531
Feature activation-3.41e-01

Pos logit contributions
 adult+0.018
 ambulance+0.016
<+0.016
co+0.015
 stret+0.015
Neg logit contributions
 Sorry-0.014
chard-0.014
ually-0.012
apped-0.012
ap-0.012
Token,
Token ID11
Feature activation-3.00e-02

Pos logit contributions
 adult+0.014
 ambulance+0.012
 stret+0.012
co+0.012
 warning+0.012
Neg logit contributions
 Sorry-0.014
chard-0.013
ually-0.012
 league-0.012
arium-0.012
Token as
Token ID355
Feature activation+3.84e-01

Pos logit contributions
 adult+0.001
 ambulance+0.001
<+0.001
co+0.001
coon+0.001
Neg logit contributions
chard-0.001
 Sorry-0.001
ually-0.001
apped-0.001
ap-0.001
TokenNo
Token ID2949
Feature activation+7.02e-01

Pos logit contributions
 adult+0.009
 ambulance+0.008
 stret+0.008
<+0.008
ousy+0.007
Neg logit contributions
 Sorry-0.010
chard-0.010
ually-0.009
apped-0.009
rac-0.009
Token,
Token ID11
Feature activation-5.38e-01

Pos logit contributions
 Sorry+0.014
ap+0.010
chard+0.009
ually+0.009
+0.009
Neg logit contributions
 adult-0.040
<-0.039
co-0.039
 stret-0.038
Sweet-0.038
Token it
Token ID340
Feature activation-7.41e-02

Pos logit contributions
 adult+0.026
 ambulance+0.023
co+0.023
<+0.023
 stret+0.022
Neg logit contributions
 Sorry-0.017
chard-0.017
ually-0.015
apped-0.015
ap-0.014
Token's
Token ID338
Feature activation+7.60e-01

Pos logit contributions
 adult+0.003
 stret+0.002
<+0.002
 ambulance+0.002
co+0.002
Neg logit contributions
 Sorry-0.003
chard-0.003
ually-0.003
ap-0.003
apped-0.003
Token your
Token ID534
Feature activation-1.06e+00

Pos logit contributions
 Sorry+0.028
chard+0.027
ually+0.024
ap+0.024
rac+0.023
Neg logit contributions
 adult-0.033
 ambulance-0.029
co-0.029
coon-0.028
 stret-0.028
Token fault
Token ID8046
Feature activation-4.99e+00

Pos logit contributions
 adult+0.036
coon+0.034
co+0.033
Sweet+0.033
ousy+0.032
Neg logit contributions
 Sorry-0.045
ap-0.041
chard-0.039
apped-0.038
ually-0.037
Token,
Token ID11
Feature activation-3.78e-01

Pos logit contributions
 adult+0.243
<+0.215
 ambulance+0.215
 stret+0.213
co+0.213
Neg logit contributions
 Sorry-0.154
chard-0.143
apped-0.136
ually-0.135
ap-0.134
Token you
Token ID345
Feature activation-3.06e-01

Pos logit contributions
 adult+0.019
co+0.017
<+0.017
 stret+0.016
 ambulance+0.016
Neg logit contributions
 Sorry-0.012
chard-0.011
ap-0.009
rac-0.009
ually-0.009
Token were
Token ID547
Feature activation-8.77e-01

Pos logit contributions
 adult+0.012
 ambulance+0.011
<+0.010
co+0.010
 stret+0.010
Neg logit contributions
chard-0.012
 Sorry-0.012
ually-0.011
apped-0.011
ap-0.011
Token too
Token ID1165
Feature activation-1.37e+00

Pos logit contributions
 adult+0.034
 ambulance+0.029
<+0.029
co+0.029
 stret+0.029
Neg logit contributions
chard-0.036
 Sorry-0.035
ually-0.031
ap-0.031
apped-0.031
Token close
Token ID1969
Feature activation+6.23e-01

Pos logit contributions
 adult+0.048
 ambulance+0.041
 stret+0.039
co+0.039
 wolf+0.038
Neg logit contributions
chard-0.063
 Sorry-0.062
ually-0.058
apped-0.058
arium-0.056
Token to
Token ID284
Feature activation-4.43e-01

Pos logit contributions
 adult+0.014
 ambulance+0.012
<+0.012
co+0.011
 wolf+0.011
Neg logit contributions
 Sorry-0.012
chard-0.012
ually-0.010
apped-0.010
rac-0.010
Token wear
Token ID5806
Feature activation+1.08e+00

Pos logit contributions
 adult+0.016
 ambulance+0.014
 stret+0.013
co+0.013
<+0.013
Neg logit contributions
chard-0.020
 Sorry-0.019
apped-0.018
ually-0.018
arium-0.017
Token his
Token ID465
Feature activation+9.47e-01

Pos logit contributions
 Sorry+0.042
chard+0.041
ually+0.037
apped+0.037
ap+0.035
Neg logit contributions
 adult-0.046
 ambulance-0.041
 stret-0.040
<-0.040
co-0.040
Token flip
Token ID14283
Feature activation-1.29e-01

Pos logit contributions
 Sorry+0.041
chard+0.041
ually+0.038
apped+0.037
rac+0.036
Neg logit contributions
 adult-0.034
 ambulance-0.029
co-0.028
coon-0.028
<-0.028
Token-
Token ID12
Feature activation+1.15e+00

Pos logit contributions
 adult+0.005
 ambulance+0.004
co+0.004
 stret+0.004
<+0.004
Neg logit contributions
 Sorry-0.006
chard-0.006
ually-0.005
apped-0.005
arium-0.005
Tokenfl
Token ID2704
Feature activation-4.98e+00

Pos logit contributions
 Sorry+0.047
chard+0.045
ually+0.041
apped+0.041
ap+0.039
Neg logit contributions
 adult-0.046
co-0.041
 ambulance-0.041
<-0.039
 stret-0.039
Tokenops
Token ID2840
Feature activation-1.86e+00

Pos logit contributions
 adult+0.229
 ambulance+0.211
co+0.206
<+0.201
 stret+0.201
Neg logit contributions
chard-0.169
 Sorry-0.165
ually-0.149
apped-0.138
arium-0.135
Token so
Token ID523
Feature activation+7.37e-01

Pos logit contributions
 adult+0.065
 ambulance+0.056
co+0.054
 stret+0.053
<+0.053
Neg logit contributions
 Sorry-0.086
chard-0.082
ually-0.077
apped-0.075
rac-0.072
Token he
Token ID339
Feature activation+6.45e-01

Pos logit contributions
 Sorry+0.030
chard+0.030
arium+0.028
ishment+0.028
ually+0.028
Neg logit contributions
 adult-0.029
 ambulance-0.024
 wolf-0.024
 sheep-0.023
<-0.023
Token wouldn
Token ID3636
Feature activation-9.57e-01

Pos logit contributions
 Sorry+0.024
chard+0.024
ually+0.022
apped+0.021
ap+0.021
Neg logit contributions
 adult-0.027
 ambulance-0.024
co-0.023
<-0.023
 stret-0.023
Token't
Token ID470
Feature activation-1.44e+00

Pos logit contributions
 adult+0.036
 ambulance+0.031
co+0.031
 stret+0.031
<+0.030
Neg logit contributions
 Sorry-0.039
chard-0.036
ually-0.034
ap-0.034
apped-0.034
Token garden
Token ID11376
Feature activation-2.38e+00

Pos logit contributions
 adult+0.043
co+0.037
coon+0.037
Sweet+0.036
<+0.036
Neg logit contributions
 Sorry-0.047
chard-0.046
ap-0.042
apped-0.042
ually-0.041
Token and
Token ID290
Feature activation-4.24e-01

Pos logit contributions
 adult+0.083
 ambulance+0.074
co+0.070
 stret+0.067
 sheep+0.067
Neg logit contributions
 Sorry-0.107
chard-0.104
ually-0.096
apped-0.095
rac-0.094
Token admire
Token ID21099
Feature activation-6.09e-02

Pos logit contributions
 adult+0.017
 ambulance+0.016
 sheep+0.015
 stret+0.015
 wolf+0.015
Neg logit contributions
 Sorry-0.016
chard-0.015
ually-0.015
apped-0.014
rac-0.014
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 adult+0.003
 ambulance+0.002
co+0.002
<+0.002
 stret+0.002
Neg logit contributions
 Sorry-0.002
chard-0.002
ually-0.002
apped-0.002
ap-0.002
Token sun
Token ID4252
Feature activation-5.48e-01

Pos logit contributions
 Sorry+0.041
chard+0.040
ap+0.036
kins+0.036
ually+0.036
Neg logit contributions
 adult-0.037
co-0.034
coon-0.034
 stret-0.033
 ambulance-0.032
Tokenfl
Token ID2704
Feature activation-4.95e+00

Pos logit contributions
 adult+0.019
 ambulance+0.017
co+0.017
 sheep+0.016
 wolf+0.016
Neg logit contributions
 Sorry-0.024
chard-0.023
ually-0.022
apped-0.022
arium-0.021
Tokenowers
Token ID3618
Feature activation-9.66e-01

Pos logit contributions
 adult+0.233
 ambulance+0.217
co+0.215
<+0.207
 stret+0.207
Neg logit contributions
chard-0.155
 Sorry-0.152
ually-0.133
apped-0.124
arium-0.123
Token.
Token ID13
Feature activation-1.07e-01

Pos logit contributions
 adult+0.032
 ambulance+0.028
 sheep+0.027
co+0.027
 wolf+0.027
Neg logit contributions
 Sorry-0.044
chard-0.042
apped-0.040
ually-0.040
rac-0.038
Token They
Token ID1119
Feature activation+1.12e-01

Pos logit contributions
 adult+0.005
 ambulance+0.004
co+0.004
coon+0.004
 stret+0.004
Neg logit contributions
 Sorry-0.004
chard-0.004
ually-0.004
apped-0.004
ap-0.003
Token also
Token ID635
Feature activation+2.00e-02

Pos logit contributions
 Sorry+0.005
chard+0.004
apped+0.004
ually+0.004
rac+0.004
Neg logit contributions
 adult-0.004
 ambulance-0.004
 sheep-0.004
<-0.004
co-0.004
Token see
Token ID766
Feature activation-2.44e-01

Pos logit contributions
 Sorry+0.001
chard+0.001
apped+0.001
ually+0.001
rac+0.001
Neg logit contributions
 adult-0.001
 sheep-0.001
 ambulance-0.001
 living-0.001
<-0.001
Token But
Token ID887
Feature activation+2.89e-01

Pos logit contributions
 adult+0.005
 ambulance+0.004
co+0.004
 stret+0.004
<+0.004
Neg logit contributions
 Sorry-0.005
chard-0.005
ually-0.004
apped-0.004
ap-0.004
Token then
Token ID788
Feature activation-3.45e-01

Pos logit contributions
 Sorry+0.012
chard+0.011
apped+0.010
ually+0.010
ap+0.010
Neg logit contributions
 adult-0.012
 wolf-0.010
 ambulance-0.010
 warning-0.010
<-0.010
Token he
Token ID339
Feature activation+5.63e-01

Pos logit contributions
 adult+0.012
 wolf+0.012
 warning+0.010
 sheep+0.010
 ambulance+0.010
Neg logit contributions
chard-0.015
 Sorry-0.015
apped-0.015
icker-0.015
ually-0.014
Token saw
Token ID2497
Feature activation+4.55e-01

Pos logit contributions
 Sorry+0.021
chard+0.020
ually+0.018
apped+0.018
ap+0.017
Neg logit contributions
 adult-0.025
 ambulance-0.022
co-0.021
<-0.021
 wolf-0.021
Token something
Token ID1223
Feature activation+1.93e-01

Pos logit contributions
 Sorry+0.016
chard+0.016
ually+0.014
apped+0.014
ap+0.014
Neg logit contributions
 adult-0.021
 ambulance-0.018
<-0.018
 wolf-0.018
coon-0.018
Token-
Token ID12
Feature activation+2.88e+00

Pos logit contributions
chard+0.006
 Sorry+0.005
 filled+0.005
kins+0.005
 sideways+0.005
Neg logit contributions
 adult-0.009
Sweet-0.008
iger-0.008
 reins-0.008
co-0.008
Token a
Token ID257
Feature activation+8.46e-01

Pos logit contributions
chard+0.117
 Sorry+0.109
ually+0.104
apped+0.099
rac+0.096
Neg logit contributions
 adult-0.114
 ambulance-0.101
 stret-0.101
<-0.099
co-0.095
Token box
Token ID3091
Feature activation+2.79e-01

Pos logit contributions
chard+0.036
 Sorry+0.035
nam+0.029
kin+0.028
arium+0.028
Neg logit contributions
 adult-0.030
co-0.028
<-0.027
Sweet-0.027
coon-0.026
Token!
Token ID0
Feature activation+3.24e-01

Pos logit contributions
chard+0.011
 Sorry+0.010
apped+0.010
ually+0.010
ap+0.009
Neg logit contributions
 adult-0.012
 ambulance-0.010
<-0.010
 stret-0.010
co-0.010
Token Curious
Token ID44269
Feature activation-8.29e-02

Pos logit contributions
 Sorry+0.014
chard+0.014
ually+0.013
apped+0.013
ap+0.013
Neg logit contributions
 adult-0.012
<-0.010
 ambulance-0.010
 wolf-0.009
coon-0.009
Token,
Token ID11
Feature activation+5.84e-01

Pos logit contributions
 adult+0.005
 stret+0.004
co+0.004
<+0.004
 ambulance+0.004
Neg logit contributions
 Sorry-0.002
chard-0.002
apped-0.002
ually-0.001
ap-0.001
Token explain
Token ID4727
Feature activation+1.61e-01

Pos logit contributions
chard+0.017
 Sorry+0.016
ually+0.015
apped+0.015
ishment+0.015
Neg logit contributions
 adult-0.014
 ambulance-0.012
<-0.012
 stret-0.011
 sheep-0.011
Token to
Token ID284
Feature activation+2.80e-01

Pos logit contributions
 Sorry+0.006
chard+0.006
ually+0.005
apped+0.005
ap+0.005
Neg logit contributions
 adult-0.007
 ambulance-0.006
co-0.006
<-0.006
 stret-0.006
Token the
Token ID262
Feature activation+5.84e-01

Pos logit contributions
 Sorry+0.010
chard+0.010
ually+0.009
apped+0.009
ap+0.008
Neg logit contributions
 adult-0.013
 ambulance-0.012
<-0.011
 wolf-0.011
 stret-0.011
Token cow
Token ID9875
Feature activation+5.18e-01

Pos logit contributions
 Sorry+0.023
chard+0.021
ap+0.019
 grew+0.019
bye+0.018
Neg logit contributions
 stret-0.019
mo-0.019
coon-0.019
ogo-0.018
co-0.018
Token how
Token ID703
Feature activation-1.05e+00

Pos logit contributions
 Sorry+0.020
chard+0.020
ually+0.018
apped+0.018
ap+0.017
Neg logit contributions
 adult-0.021
 ambulance-0.018
co-0.018
<-0.018
 stret-0.018
Token much
Token ID881
Feature activation+3.88e+00

Pos logit contributions
 adult+0.037
 ambulance+0.032
<+0.031
 sheep+0.031
 wolf+0.031
Neg logit contributions
 Sorry-0.046
chard-0.045
ually-0.044
apped-0.043
ishment-0.042
Token it
Token ID340
Feature activation+1.44e+00

Pos logit contributions
chard+0.169
 Sorry+0.167
ually+0.159
apped+0.157
ap+0.148
Neg logit contributions
 adult-0.143
 ambulance-0.123
 wolf-0.118
 sheep-0.116
<-0.116
Token loved
Token ID6151
Feature activation+7.96e-01

Pos logit contributions
 Sorry+0.060
chard+0.057
ually+0.053
apped+0.052
ap+0.051
Neg logit contributions
 adult-0.056
 ambulance-0.049
co-0.049
<-0.048
 wolf-0.047
Token milk
Token ID7545
Feature activation+1.12e+00

Pos logit contributions
 Sorry+0.031
chard+0.031
ually+0.028
apped+0.027
ap+0.026
Neg logit contributions
 adult-0.032
 ambulance-0.029
<-0.028
 stret-0.028
co-0.028
Token.
Token ID13
Feature activation+4.56e-01

Pos logit contributions
 Sorry+0.052
chard+0.050
ually+0.048
apped+0.047
rac+0.046
Neg logit contributions
 adult-0.038
 ambulance-0.033
co-0.032
 sheep-0.031
 wolf-0.030
Token The
Token ID383
Feature activation-7.20e-04

Pos logit contributions
chard+0.019
 Sorry+0.018
kins+0.018
kin+0.017
ually+0.017
Neg logit contributions
 adult-0.018
 ambulance-0.016
 sheep-0.014
<-0.014
 wolf-0.014
Token as
Token ID355
Feature activation+6.45e-01

Pos logit contributions
 Sorry+0.004
chard+0.004
ually+0.004
apped+0.004
rac+0.004
Neg logit contributions
 adult-0.002
 ambulance-0.002
co-0.002
 stret-0.002
 sheep-0.002
Token she
Token ID673
Feature activation-1.85e-01

Pos logit contributions
 Sorry+0.027
chard+0.026
ually+0.024
apped+0.024
rac+0.023
Neg logit contributions
 adult-0.025
 ambulance-0.021
 wolf-0.021
 sheep-0.021
<-0.020
Token saw
Token ID2497
Feature activation+1.79e-01

Pos logit contributions
 adult+0.007
co+0.006
<+0.006
 sheep+0.006
coon+0.006
Neg logit contributions
chard-0.008
 Sorry-0.008
nam-0.007
ually-0.006
ishment-0.006
Token how
Token ID703
Feature activation-7.67e-01

Pos logit contributions
 Sorry+0.006
chard+0.006
ually+0.005
apped+0.005
ap+0.005
Neg logit contributions
 adult-0.008
 ambulance-0.007
<-0.007
co-0.007
 stret-0.007
Token much
Token ID881
Feature activation+4.18e+00

Pos logit contributions
 adult+0.027
<+0.024
 ambulance+0.023
 sheep+0.023
Sweet+0.023
Neg logit contributions
chard-0.034
 Sorry-0.033
ually-0.030
apped-0.029
ishment-0.029
Token fun
Token ID1257
Feature activation+3.01e+00

Pos logit contributions
 Sorry+0.156
chard+0.154
ually+0.138
apped+0.136
ap+0.131
Neg logit contributions
 adult-0.177
 ambulance-0.157
co-0.153
 stret-0.153
<-0.152
Token it
Token ID340
Feature activation+8.91e-01

Pos logit contributions
 Sorry+0.096
chard+0.093
ually+0.084
apped+0.081
ap+0.078
Neg logit contributions
 adult-0.144
 ambulance-0.128
<-0.126
 stret-0.126
coon-0.124
Token was
Token ID373
Feature activation+1.57e-01

Pos logit contributions
 Sorry+0.041
chard+0.040
ually+0.037
apped+0.037
ap+0.036
Neg logit contributions
 adult-0.030
 ambulance-0.026
co-0.026
<-0.025
coon-0.025
Token.
Token ID13
Feature activation+3.45e-01

Pos logit contributions
 Sorry+0.007
chard+0.006
ually+0.006
apped+0.006
rac+0.006
Neg logit contributions
 adult-0.006
 ambulance-0.005
co-0.005
<-0.005
 stret-0.005
Token
Token ID220
Feature activation+2.30e-01

Pos logit contributions
 Sorry+0.014
chard+0.014
apped+0.013
ually+0.013
ap+0.012
Neg logit contributions
 adult-0.014
 ambulance-0.012
 stret-0.012
co-0.012
coon-0.012
Token
Token ID198
Feature activation-2.59e-01

Pos logit contributions
 Sorry+0.009
chard+0.009
apped+0.009
ually+0.008
ap+0.008
Neg logit contributions
 adult-0.009
 ambulance-0.008
 wolf-0.008
co-0.008
coon-0.008
Token night
Token ID1755
Feature activation+6.27e-01

Pos logit contributions
 Sorry+0.009
chard+0.009
ually+0.008
apped+0.008
ap+0.008
Neg logit contributions
 adult-0.010
 ambulance-0.008
co-0.008
<-0.008
 stret-0.008
Token,
Token ID11
Feature activation+1.99e-01

Pos logit contributions
chard+0.026
 Sorry+0.026
nam+0.024
ually+0.023
rac+0.023
Neg logit contributions
 adult-0.025
 wolf-0.022
 ambulance-0.022
 sheep-0.021
 stret-0.020
Token Post
Token ID2947
Feature activation+1.35e+00

Pos logit contributions
 Sorry+0.009
chard+0.009
apped+0.008
ually+0.008
ap+0.008
Neg logit contributions
 adult-0.007
 wolf-0.006
 sheep-0.006
 ambulance-0.006
 warning-0.006
Token remembered
Token ID12086
Feature activation-4.88e-01

Pos logit contributions
chard+0.060
 Sorry+0.059
nam+0.051
ually+0.051
apped+0.050
Neg logit contributions
 adult-0.051
 ambulance-0.045
co-0.044
coon-0.043
 wolf-0.042
Token how
Token ID703
Feature activation-1.41e+00

Pos logit contributions
 adult+0.019
 ambulance+0.016
 wolf+0.016
 warning+0.016
<+0.016
Neg logit contributions
 Sorry-0.021
chard-0.020
ually-0.019
apped-0.019
rac-0.018
Token nice
Token ID3621
Feature activation+2.85e+00

Pos logit contributions
 adult+0.058
<+0.050
 ambulance+0.049
 wolf+0.047
 warning+0.047
Neg logit contributions
 Sorry-0.056
chard-0.055
ually-0.050
apped-0.050
rac-0.048
Token it
Token ID340
Feature activation+9.90e-01

Pos logit contributions
 Sorry+0.110
chard+0.109
ually+0.101
apped+0.100
ap+0.094
Neg logit contributions
 adult-0.122
 ambulance-0.107
 wolf-0.104
 sheep-0.103
 stret-0.099
Token felt
Token ID2936
Feature activation-1.29e-01

Pos logit contributions
 Sorry+0.041
chard+0.040
ually+0.037
apped+0.036
ap+0.035
Neg logit contributions
 adult-0.039
 ambulance-0.034
co-0.032
<-0.032
 stret-0.032
Token to
Token ID284
Feature activation+2.36e-01

Pos logit contributions
 adult+0.005
 ambulance+0.005
<+0.004
co+0.004
 stret+0.004
Neg logit contributions
 Sorry-0.005
chard-0.005
ually-0.005
apped-0.005
rac-0.004
Token take
Token ID1011
Feature activation-1.64e-01

Pos logit contributions
 Sorry+0.010
chard+0.010
ually+0.009
apped+0.009
rac+0.008
Neg logit contributions
 adult-0.009
 ambulance-0.008
<-0.008
co-0.008
 stret-0.008
Token a
Token ID257
Feature activation+3.77e-01

Pos logit contributions
 adult+0.007
 ambulance+0.006
 stret+0.006
 wolf+0.006
<+0.006
Neg logit contributions
chard-0.007
 Sorry-0.007
ually-0.006
arium-0.006
apped-0.006
Token its
Token ID663
Feature activation+6.15e-01

Pos logit contributions
 Sorry+0.067
chard+0.066
ually+0.060
apped+0.059
ap+0.056
Neg logit contributions
 adult-0.077
 ambulance-0.067
co-0.065
<-0.064
coon-0.063
Token own
Token ID898
Feature activation-5.82e-01

Pos logit contributions
chard+0.025
 Sorry+0.025
ually+0.023
ap+0.023
apped+0.022
Neg logit contributions
 adult-0.022
coon-0.021
co-0.020
 stret-0.020
 ambulance-0.020
Token success
Token ID1943
Feature activation+3.66e-01

Pos logit contributions
 adult+0.021
 ambulance+0.018
<+0.018
co+0.018
 stret+0.018
Neg logit contributions
 Sorry-0.025
chard-0.024
ually-0.023
apped-0.022
ap-0.022
Token.
Token ID13
Feature activation+4.69e-01

Pos logit contributions
 Sorry+0.017
chard+0.016
apped+0.015
ually+0.015
rac+0.014
Neg logit contributions
 adult-0.013
co-0.011
 ambulance-0.011
 sheep-0.010
coon-0.010
Token Every
Token ID3887
Feature activation-3.55e-01

Pos logit contributions
 Sorry+0.018
chard+0.017
ually+0.016
apped+0.016
ap+0.015
Neg logit contributions
 adult-0.020
 ambulance-0.018
co-0.017
 stret-0.017
<-0.017
Token time
Token ID640
Feature activation+3.57e+00

Pos logit contributions
 adult+0.012
 ambulance+0.010
<+0.010
 wolf+0.009
co+0.009
Neg logit contributions
 Sorry-0.017
chard-0.017
ually-0.015
apped-0.015
nam-0.015
Token it
Token ID340
Feature activation+9.79e-01

Pos logit contributions
chard+0.139
 Sorry+0.133
ually+0.128
ishment+0.124
apped+0.119
Neg logit contributions
 adult-0.148
 ambulance-0.133
 sheep-0.130
 wolf-0.129
 ponies-0.123
Token did
Token ID750
Feature activation+4.99e-01

Pos logit contributions
 Sorry+0.040
chard+0.040
ually+0.035
apped+0.035
nam+0.034
Neg logit contributions
 adult-0.039
 ambulance-0.034
co-0.033
 wolf-0.032
<-0.032
Token something
Token ID1223
Feature activation+1.20e-01

Pos logit contributions
 Sorry+0.016
chard+0.016
ually+0.014
apped+0.014
ap+0.013
Neg logit contributions
 adult-0.024
 ambulance-0.022
co-0.021
<-0.021
 stret-0.021
Token,
Token ID11
Feature activation+8.08e-01

Pos logit contributions
 Sorry+0.005
chard+0.005
ually+0.005
apped+0.005
ap+0.005
Neg logit contributions
 adult-0.004
 ambulance-0.004
co-0.004
<-0.004
 stret-0.004
Token it
Token ID340
Feature activation+6.15e-01

Pos logit contributions
 Sorry+0.029
chard+0.029
ually+0.026
apped+0.026
rac+0.025
Neg logit contributions
 adult-0.036
 ambulance-0.031
 sheep-0.031
 wolf-0.030
 warning-0.029
Token learned
Token ID4499
Feature activation-1.62e-01

Pos logit contributions
chard+0.016
 Sorry+0.015
ually+0.015
apped+0.014
rac+0.014
Neg logit contributions
 adult-0.020
 ambulance-0.019
 stret-0.019
<-0.018
co-0.018
Token from
Token ID422
Feature activation+5.58e-01

Pos logit contributions
 adult+0.008
 ambulance+0.007
 stret+0.007
co+0.007
<+0.007
Neg logit contributions
chard-0.005
 Sorry-0.005
ually-0.004
apped-0.004
ap-0.004
Token his
Token ID465
Feature activation-5.27e-02

Pos logit contributions
chard+0.020
 Sorry+0.018
ually+0.017
ap+0.016
ishment+0.016
Neg logit contributions
 adult-0.024
 stret-0.022
co-0.022
 ambulance-0.022
<-0.022
Token wise
Token ID10787
Feature activation-8.49e-01

Pos logit contributions
co+0.002
 adult+0.002
coon+0.002
 stret+0.002
Sweet+0.002
Neg logit contributions
 Sorry-0.002
chard-0.002
ap-0.002
ually-0.002
kins-0.002
Token old
Token ID1468
Feature activation-1.05e+00

Pos logit contributions
Sweet+0.027
 stret+0.027
co+0.027
 adult+0.027
<+0.026
Neg logit contributions
chard-0.034
 Sorry-0.034
ap-0.032
ually-0.032
apped-0.031
Token grandfather
Token ID17695
Feature activation+4.25e-01

Pos logit contributions
 adult+0.032
Sweet+0.031
<+0.031
co+0.031
coon+0.030
Neg logit contributions
chard-0.046
 Sorry-0.045
ap-0.041
ually-0.041
arium-0.040
Token.
Token ID13
Feature activation-1.05e-01

Pos logit contributions
chard+0.017
 Sorry+0.016
ually+0.015
apped+0.014
ap+0.014
Neg logit contributions
 adult-0.017
 ambulance-0.015
<-0.015
 stret-0.015
co-0.015
Token He
Token ID679
Feature activation+5.32e-01

Pos logit contributions
 adult+0.004
 ambulance+0.003
<+0.003
co+0.003
 stret+0.003
Neg logit contributions
 Sorry-0.005
chard-0.005
ually-0.004
apped-0.004
ap-0.004
Token hopped
Token ID45230
Feature activation+4.69e-01

Pos logit contributions
 Sorry+0.019
chard+0.018
ually+0.017
apped+0.016
ap+0.016
Neg logit contributions
 adult-0.024
 ambulance-0.021
co-0.020
<-0.020
 stret-0.020
Token over
Token ID625
Feature activation-1.30e+00

Pos logit contributions
chard+0.016
 Sorry+0.015
apped+0.014
ually+0.014
ap+0.013
Neg logit contributions
 adult-0.021
 ambulance-0.019
 stret-0.019
ousy-0.019
co-0.018
Token to
Token ID284
Feature activation-3.78e-01

Pos logit contributions
 adult+0.055
 ambulance+0.050
 stret+0.050
<+0.049
ousy+0.049
Neg logit contributions
chard-0.046
 Sorry-0.044
apped-0.041
ually-0.040
arium-0.040
Token bounced
Token ID31318
Feature activation+7.60e-01

Pos logit contributions
 adult+0.017
 ambulance+0.015
co+0.015
<+0.014
 sheep+0.014
Neg logit contributions
 Sorry-0.020
chard-0.019
ually-0.017
apped-0.017
rac-0.016
Token so
Token ID523
Feature activation+7.60e-01

Pos logit contributions
chard+0.026
 Sorry+0.025
apped+0.023
ually+0.022
ap+0.022
Neg logit contributions
 adult-0.035
 ambulance-0.031
ousy-0.030
coon-0.030
 stret-0.029
Token high
Token ID1029
Feature activation+1.69e+00

Pos logit contributions
 Sorry+0.029
chard+0.029
ually+0.025
apped+0.025
arium+0.024
Neg logit contributions
 adult-0.032
<-0.029
co-0.028
 ambulance-0.027
 wolf-0.026
Token that
Token ID326
Feature activation+2.23e-01

Pos logit contributions
 Sorry+0.060
chard+0.059
ually+0.054
apped+0.052
ap+0.050
Neg logit contributions
 adult-0.075
 ambulance-0.066
<-0.064
co-0.064
 stret-0.064
Token she
Token ID673
Feature activation+2.39e-01

Pos logit contributions
 Sorry+0.010
chard+0.010
ually+0.009
apped+0.009
arium+0.008
Neg logit contributions
 adult-0.008
 ambulance-0.007
 wolf-0.007
 sheep-0.007
co-0.007
Token ended
Token ID4444
Feature activation+6.37e-01

Pos logit contributions
 Sorry+0.010
chard+0.009
ually+0.008
apped+0.008
ishment+0.008
Neg logit contributions
 adult-0.009
co-0.008
<-0.008
 ambulance-0.008
 sheep-0.008
Token up
Token ID510
Feature activation-2.05e+00

Pos logit contributions
 Sorry+0.027
chard+0.026
ually+0.024
apped+0.024
rac+0.023
Neg logit contributions
 adult-0.025
 ambulance-0.022
co-0.021
<-0.021
 stret-0.021
Token in
Token ID287
Feature activation-2.12e-01

Pos logit contributions
 adult+0.077
 ambulance+0.068
co+0.066
<+0.064
 stret+0.064
Neg logit contributions
 Sorry-0.088
chard-0.085
ually-0.079
apped-0.077
rac-0.076
Token the
Token ID262
Feature activation-5.72e-01

Pos logit contributions
 adult+0.010
 ambulance+0.009
co+0.008
<+0.008
coon+0.008
Neg logit contributions
 Sorry-0.007
chard-0.007
ually-0.006
apped-0.006
ap-0.006
Token air
Token ID1633
Feature activation-6.59e-01

Pos logit contributions
 adult+0.021
coon+0.018
co+0.018
 ambulance+0.018
 stret+0.018
Neg logit contributions
chard-0.024
 Sorry-0.023
ually-0.021
ap-0.020
 league-0.020
Token!
Token ID0
Feature activation+1.43e-01

Pos logit contributions
 adult+0.021
 ambulance+0.019
co+0.018
 stret+0.018
 sheep+0.017
Neg logit contributions
 Sorry-0.032
chard-0.030
ually-0.028
apped-0.027
rac-0.027
Token to
Token ID284
Feature activation-6.14e-01

Pos logit contributions
 Sorry+0.006
chard+0.006
ually+0.005
apped+0.005
ap+0.005
Neg logit contributions
 adult-0.006
 ambulance-0.005
co-0.005
 stret-0.005
<-0.005
Token reach
Token ID3151
Feature activation+1.18e-01

Pos logit contributions
 adult+0.027
 ambulance+0.023
 wolf+0.022
<+0.022
 stret+0.022
Neg logit contributions
 Sorry-0.024
chard-0.023
apped-0.022
ams-0.021
kins-0.021
Token out
Token ID503
Feature activation+7.14e-01

Pos logit contributions
 Sorry+0.004
chard+0.004
ually+0.004
apped+0.004
rac+0.004
Neg logit contributions
 adult-0.005
 ambulance-0.005
 wolf-0.004
<-0.004
 stret-0.004
Token and
Token ID290
Feature activation+1.05e-01

Pos logit contributions
 Sorry+0.031
chard+0.031
ually+0.028
apped+0.028
rac+0.027
Neg logit contributions
 adult-0.026
 ambulance-0.023
co-0.022
<-0.021
 stret-0.021
Token grab
Token ID5552
Feature activation-7.40e-02

Pos logit contributions
 Sorry+0.004
apped+0.004
ually+0.004
ap+0.004
rac+0.004
Neg logit contributions
 adult-0.004
 wolf-0.004
 ambulance-0.004
 warning-0.003
 sheep-0.003
Token a
Token ID257
Feature activation+9.46e-01

Pos logit contributions
 adult+0.003
 ambulance+0.003
 wolf+0.003
 sheep+0.003
co+0.003
Neg logit contributions
 Sorry-0.003
chard-0.003
ually-0.003
apped-0.002
arium-0.002
Token big
Token ID1263
Feature activation+1.35e+00

Pos logit contributions
chard+0.035
 Sorry+0.033
ually+0.028
arium+0.027
ap+0.027
Neg logit contributions
 adult-0.041
co-0.038
 ambulance-0.037
<-0.036
coon-0.035
Token branch
Token ID8478
Feature activation+8.78e-02

Pos logit contributions
 Sorry+0.058
chard+0.057
ually+0.053
apped+0.052
rac+0.050
Neg logit contributions
 adult-0.051
 ambulance-0.044
 stret-0.042
co-0.042
<-0.042
Token from
Token ID422
Feature activation+1.28e-01

Pos logit contributions
 Sorry+0.004
chard+0.004
ually+0.003
apped+0.003
ap+0.003
Neg logit contributions
 adult-0.003
 ambulance-0.003
co-0.003
<-0.003
 stret-0.003
Token the
Token ID262
Feature activation-8.60e-01

Pos logit contributions
 Sorry+0.005
chard+0.004
apped+0.004
ually+0.004
ap+0.004
Neg logit contributions
 adult-0.006
 ambulance-0.005
 stret-0.005
<-0.005
 wolf-0.005
Token tree
Token ID5509
Feature activation-7.91e-01

Pos logit contributions
 adult+0.034
co+0.030
<+0.030
 tele+0.030
coon+0.030
Neg logit contributions
chard-0.036
 Sorry-0.030
ually-0.027
nam-0.027
-0.027
Token
Token ID198
Feature activation-2.56e-01

Pos logit contributions
 adult+0.016
 ambulance+0.014
co+0.014
 wolf+0.013
coon+0.013
Neg logit contributions
 Sorry-0.013
chard-0.012
apped-0.012
ap-0.012
ually-0.011
TokenL
Token ID43
Feature activation+9.64e-01

Pos logit contributions
 adult+0.010
 ambulance+0.009
 wolf+0.008
 stret+0.008
co+0.008
Neg logit contributions
 Sorry-0.011
chard-0.010
ually-0.010
apped-0.009
ap-0.009
Tokenily
Token ID813
Feature activation-5.90e-01

Pos logit contributions
apped+0.050
 Sorry+0.050
chard+0.049
ap+0.048
kin+0.047
Neg logit contributions
 adult-0.024
 stret-0.021
 wolf-0.020
 ambulance-0.020
ousy-0.019
Token and
Token ID290
Feature activation-4.13e-01

Pos logit contributions
 adult+0.017
co+0.015
 ambulance+0.014
<+0.014
 sheep+0.013
Neg logit contributions
 Sorry-0.031
chard-0.030
ually-0.027
nam-0.027
apped-0.026
Token Ben
Token ID3932
Feature activation+9.84e-01

Pos logit contributions
 adult+0.013
 sheep+0.012
 ambulance+0.012
 ponies+0.011
 wolf+0.011
Neg logit contributions
ually-0.020
 Sorry-0.019
boat-0.018
chard-0.018
bel-0.018
Token thanked
Token ID26280
Feature activation+5.39e-01

Pos logit contributions
chard+0.039
 Sorry+0.038
ually+0.033
ishment+0.032
 league+0.031
Neg logit contributions
 adult-0.037
co-0.036
 sheep-0.034
<-0.034
 ambulance-0.034
Token the
Token ID262
Feature activation-5.08e-01

Pos logit contributions
 Sorry+0.019
chard+0.018
ually+0.017
apped+0.016
rac+0.015
Neg logit contributions
 adult-0.025
 ambulance-0.022
co-0.021
 sheep-0.021
<-0.021
Token fairy
Token ID25607
Feature activation+4.99e-01

Pos logit contributions
 adult+0.018
 stret+0.017
co+0.017
 ambulance+0.016
<+0.016
Neg logit contributions
 Sorry-0.021
chard-0.021
apped-0.019
arium-0.019
ually-0.019
Token and
Token ID290
Feature activation+4.55e-02

Pos logit contributions
chard+0.023
 Sorry+0.023
ually+0.021
apped+0.021
ap+0.020
Neg logit contributions
 adult-0.017
 ambulance-0.014
<-0.014
 stret-0.014
co-0.014
Token hugged
Token ID41130
Feature activation+4.79e-01

Pos logit contributions
 Sorry+0.002
chard+0.002
ually+0.002
icker+0.002
 league+0.002
Neg logit contributions
 adult-0.002
 sheep-0.002
 wolf-0.001
 pony-0.001
 ambulance-0.001
Token her
Token ID607
Feature activation-3.35e-01

Pos logit contributions
 Sorry+0.020
chard+0.020
ually+0.018
apped+0.018
ap+0.017
Neg logit contributions
 adult-0.019
 ambulance-0.016
<-0.016
co-0.016
 stret-0.015
Token and
Token ID290
Feature activation-5.33e-01

Pos logit contributions
 Sorry+0.023
chard+0.022
apped+0.021
ap+0.021
ually+0.021
Neg logit contributions
 adult-0.025
 stret-0.023
co-0.023
coon-0.022
 ambulance-0.022
Token didn
Token ID1422
Feature activation-1.07e+00

Pos logit contributions
 adult+0.018
 ambulance+0.017
 wolf+0.017
 warning+0.016
<+0.016
Neg logit contributions
chard-0.022
ually-0.022
 Sorry-0.022
rac-0.022
apped-0.022
Token't
Token ID470
Feature activation-8.59e-01

Pos logit contributions
 adult+0.041
co+0.037
 sheep+0.036
 ambulance+0.036
 stret+0.036
Neg logit contributions
 Sorry-0.044
chard-0.041
ap-0.038
ually-0.038
rac-0.037
Token understand
Token ID1833
Feature activation-3.32e-01

Pos logit contributions
 adult+0.039
co+0.037
 stret+0.037
ousy+0.036
 ambulance+0.036
Neg logit contributions
 Sorry-0.027
chard-0.026
ap-0.023
ually-0.022
apped-0.021
Token why
Token ID1521
Feature activation-3.27e-01

Pos logit contributions
 adult+0.013
<+0.012
 ambulance+0.012
co+0.012
 stret+0.012
Neg logit contributions
chard-0.013
 Sorry-0.012
ually-0.011
ap-0.011
nam-0.011
Token his
Token ID465
Feature activation+6.38e-01

Pos logit contributions
 adult+0.014
 ambulance+0.013
co+0.012
<+0.012
 stret+0.012
Neg logit contributions
 Sorry-0.012
chard-0.012
ually-0.011
apped-0.011
rac-0.010
Token friends
Token ID2460
Feature activation-5.48e-01

Pos logit contributions
 Sorry+0.022
chard+0.020
ap+0.020
apped+0.020
kins+0.020
Neg logit contributions
coon-0.025
 stret-0.025
co-0.025
 adult-0.025
Sweet-0.024
Token were
Token ID547
Feature activation-2.29e-01

Pos logit contributions
 adult+0.023
 ambulance+0.020
co+0.019
<+0.019
 stret+0.019
Neg logit contributions
 Sorry-0.021
chard-0.021
ually-0.019
apped-0.019
rac-0.018
Token being
Token ID852
Feature activation-1.64e+00

Pos logit contributions
 adult+0.009
<+0.008
ousy+0.008
Sweet+0.008
co+0.008
Neg logit contributions
chard-0.009
 Sorry-0.008
apped-0.007
nam-0.007
ap-0.007
Token mean
Token ID1612
Feature activation-2.65e-01

Pos logit contributions
 adult+0.060
 ambulance+0.052
co+0.049
<+0.048
 stret+0.048
Neg logit contributions
 Sorry-0.075
chard-0.071
ually-0.068
apped-0.065
arium-0.064
Token.
Token ID13
Feature activation+8.85e-02

Pos logit contributions
 adult+0.011
 stret+0.010
<+0.010
 ambulance+0.010
ousy+0.010
Neg logit contributions
 Sorry-0.009
chard-0.009
apped-0.009
ually-0.008
ap-0.008
TokenHelp
Token ID22087
Feature activation-1.82e-02

Pos logit contributions
 adult+0.011
 ambulance+0.010
 stret+0.009
ousy+0.009
<+0.009
Neg logit contributions
 Sorry-0.012
chard-0.011
kin-0.010
apped-0.010
rac-0.010
Token,
Token ID11
Feature activation-1.17e+00

Pos logit contributions
 adult+0.001
<+0.001
 ambulance+0.001
 stret+0.001
co+0.001
Neg logit contributions
 Sorry-0.001
chard-0.000
ap-0.000
apped-0.000
ually-0.000
Token help
Token ID1037
Feature activation-9.52e-01

Pos logit contributions
 adult+0.037
 ambulance+0.031
co+0.030
<+0.030
 stret+0.030
Neg logit contributions
 Sorry-0.057
chard-0.057
ually-0.053
apped-0.052
ap-0.050
Token!"
Token ID2474
Feature activation-3.61e-01

Pos logit contributions
 adult+0.042
 ambulance+0.037
co+0.036
<+0.036
 stret+0.036
Neg logit contributions
 Sorry-0.034
chard-0.034
ually-0.031
apped-0.030
ap-0.029
Token Lily
Token ID20037
Feature activation+4.30e-02

Pos logit contributions
 adult+0.012
 ambulance+0.011
<+0.011
 sheep+0.010
 wolf+0.010
Neg logit contributions
chard-0.016
ually-0.016
 Sorry-0.016
boat-0.015
apped-0.015
Token cries
Token ID24691
Feature activation-9.01e-01

Pos logit contributions
 Sorry+0.002
chard+0.002
ap+0.001
apped+0.001
rac+0.001
Neg logit contributions
 adult-0.002
 stret-0.001
coon-0.001
co-0.001
<-0.001
Token.
Token ID13
Feature activation-3.52e-01

Pos logit contributions
 adult+0.045
coon+0.040
ousy+0.040
<+0.040
 stret+0.040
Neg logit contributions
apped-0.024
ap-0.024
 Sorry-0.024
chard-0.023
icker-0.020
Token "
Token ID366
Feature activation-3.53e-02

Pos logit contributions
 adult+0.014
 stret+0.012
 ambulance+0.012
co+0.012
coon+0.012
Neg logit contributions
 Sorry-0.015
chard-0.014
apped-0.014
rac-0.013
ually-0.013
TokenI
Token ID40
Feature activation-2.09e-03

Pos logit contributions
 adult+0.001
 ambulance+0.001
co+0.001
 stret+0.001
<+0.001
Neg logit contributions
 Sorry-0.002
chard-0.002
ually-0.001
rac-0.001
apped-0.001
Token fell
Token ID3214
Feature activation+7.95e-01

Pos logit contributions
 adult+0.000
 ambulance+0.000
co+0.000
 stret+0.000
coon+0.000
Neg logit contributions
 Sorry-0.000
chard-0.000
ually-0.000
apped-0.000
ap-0.000
Token in
Token ID287
Feature activation-5.34e-01

Pos logit contributions
 Sorry+0.024
chard+0.022
apped+0.020
ap+0.020
boat+0.019
Neg logit contributions
 adult-0.039
 stret-0.035
<-0.034
co-0.034
coon-0.033
Token the
Token ID262
Feature activation+5.68e-03

Pos logit contributions
chard+0.000
 Sorry+0.000
ually+0.000
apped+0.000
arium+0.000
Neg logit contributions
 adult-0.000
 ambulance-0.000
co-0.000
<-0.000
 stret-0.000
Token paintings
Token ID21641
Feature activation-3.42e-01

Pos logit contributions
chard+0.000
 Sorry+0.000
ually+0.000
apped+0.000
kins+0.000
Neg logit contributions
 adult-0.000
ousy-0.000
<-0.000
co-0.000
coon-0.000
Token.
Token ID13
Feature activation+9.24e-02

Pos logit contributions
 adult+0.008
 ambulance+0.007
co+0.006
 sheep+0.006
 wolf+0.006
Neg logit contributions
 Sorry-0.020
chard-0.019
ually-0.018
apped-0.018
rac-0.017
Token He
Token ID679
Feature activation+8.14e-01

Pos logit contributions
chard+0.004
 Sorry+0.004
ually+0.004
ap+0.004
apped+0.004
Neg logit contributions
 adult-0.003
 ambulance-0.003
<-0.002
 stret-0.002
 wolf-0.002
Token told
Token ID1297
Feature activation-5.57e-01

Pos logit contributions
 Sorry+0.033
chard+0.032
ually+0.030
apped+0.028
ap+0.028
Neg logit contributions
 adult-0.033
 ambulance-0.029
co-0.028
<-0.028
 sheep-0.027
Token them
Token ID606
Feature activation-8.65e-01

Pos logit contributions
 adult+0.018
<+0.017
 stret+0.017
co+0.017
ousy+0.017
Neg logit contributions
 Sorry-0.023
chard-0.023
apped-0.021
rac-0.020
ap-0.020
Token stories
Token ID3923
Feature activation+9.73e-01

Pos logit contributions
 adult+0.036
 ambulance+0.032
co+0.030
<+0.030
 stret+0.030
Neg logit contributions
 Sorry-0.033
chard-0.033
ually-0.030
apped-0.029
ap-0.028
Token about
Token ID546
Feature activation+9.47e-02

Pos logit contributions
 Sorry+0.047
chard+0.047
ually+0.044
apped+0.043
ap+0.042
Neg logit contributions
 adult-0.030
 ambulance-0.025
<-0.024
 stret-0.024
co-0.024
Token his
Token ID465
Feature activation+3.05e-01

Pos logit contributions
 Sorry+0.004
chard+0.004
ually+0.003
apped+0.003
ap+0.003
Neg logit contributions
 adult-0.004
 ambulance-0.003
co-0.003
<-0.003
 stret-0.003
Token life
Token ID1204
Feature activation+9.84e-01

Pos logit contributions
 Sorry+0.012
chard+0.012
ually+0.011
apped+0.011
ap+0.011
Neg logit contributions
 adult-0.012
 ambulance-0.010
co-0.010
<-0.010
 stret-0.010
Token and
Token ID290
Feature activation+1.72e-02

Pos logit contributions
 Sorry+0.052
chard+0.050
ually+0.047
apped+0.047
rac+0.045
Neg logit contributions
 adult-0.028
 ambulance-0.024
co-0.023
 sheep-0.022
 wolf-0.021
Token good
Token ID922
Feature activation-6.52e-01

Pos logit contributions
 adult+0.010
 baby+0.010
 other+0.010
 babies+0.009
 warning+0.009
Neg logit contributions
chard-0.032
 league-0.032
 Sorry-0.032
arium-0.031
ishment-0.031
Token friends
Token ID2460
Feature activation-1.03e+00

Pos logit contributions
 adult+0.019
 ambulance+0.016
co+0.015
 sheep+0.015
 wolf+0.015
Neg logit contributions
chard-0.034
 Sorry-0.034
ually-0.031
rac-0.030
nam-0.029
Token.
Token ID13
Feature activation-1.58e-01

Pos logit contributions
 adult+0.033
co+0.029
 ambulance+0.029
<+0.028
 wolf+0.028
Neg logit contributions
 Sorry-0.048
chard-0.048
ually-0.043
apped-0.042
rac-0.042
Token
Token ID198
Feature activation-6.48e-01

Pos logit contributions
 adult+0.006
 ambulance+0.005
<+0.005
co+0.005
 stret+0.005
Neg logit contributions
chard-0.007
 Sorry-0.007
ually-0.006
apped-0.006
ap-0.006
Token
Token ID198
Feature activation-4.31e-01

Pos logit contributions
 adult+0.026
 ambulance+0.024
 stret+0.022
co+0.022
 kittens+0.022
Neg logit contributions
chard-0.024
ually-0.024
 Sorry-0.023
apped-0.023
arium-0.023
TokenThe
Token ID464
Feature activation-9.92e-02

Pos logit contributions
 adult+0.019
 ambulance+0.017
 stret+0.017
 wolf+0.016
co+0.016
Neg logit contributions
 Sorry-0.015
chard-0.015
ually-0.014
ap-0.013
apped-0.013
Token bird
Token ID6512
Feature activation-1.28e-02

Pos logit contributions
 adult+0.003
 stret+0.003
co+0.003
coon+0.003
ula+0.003
Neg logit contributions
chard-0.004
 Sorry-0.004
ap-0.004
ually-0.004
apped-0.004
Token lived
Token ID5615
Feature activation+1.20e+00

Pos logit contributions
 adult+0.000
co+0.000
 ambulance+0.000
 sheep+0.000
<+0.000
Neg logit contributions
 Sorry-0.001
chard-0.001
nam-0.001
ishment-0.001
apped-0.001
Token in
Token ID287
Feature activation-2.11e-01

Pos logit contributions
 Sorry+0.060
chard+0.051
ishment+0.050
ually+0.049
rac+0.049
Neg logit contributions
 adult-0.037
<-0.034
 living-0.033
co-0.033
 warning-0.032
Token the
Token ID262
Feature activation-8.12e-01

Pos logit contributions
 adult+0.006
 living+0.006
 other+0.006
 baby+0.005
 wolf+0.005
Neg logit contributions
 Sorry-0.011
chard-0.010
apped-0.010
ams-0.010
ouses-0.010
Token tree
Token ID5509
Feature activation-9.66e-01

Pos logit contributions
 adult+0.030
 ambulance+0.026
co+0.026
<+0.025
 stret+0.025
Neg logit contributions
 Sorry-0.035
chard-0.034
ually-0.032
apped-0.031
ap-0.030
Token
Token ID198
Feature activation-3.80e-01

Pos logit contributions
chard+0.002
 Sorry+0.002
ually+0.002
kins+0.002
apped+0.002
Neg logit contributions
 adult-0.001
<-0.001
 ambulance-0.001
co-0.001
Sweet-0.001
Token
Token ID198
Feature activation-2.14e-01

Pos logit contributions
 adult+0.018
 ambulance+0.017
 wolf+0.017
 stret+0.017
 kittens+0.016
Neg logit contributions
 Sorry-0.010
apped-0.010
chard-0.010
arium-0.010
rac-0.010
TokenJ
Token ID41
Feature activation+7.59e-01

Pos logit contributions
 adult+0.008
 ambulance+0.007
co+0.007
 stret+0.007
<+0.007
Neg logit contributions
 Sorry-0.009
chard-0.009
ually-0.008
apped-0.008
ap-0.008
Tokenenny
Token ID11870
Feature activation-8.35e-01

Pos logit contributions
 Sorry+0.026
chard+0.026
apped+0.024
ually+0.023
ap+0.022
Neg logit contributions
 adult-0.034
 ambulance-0.030
 stret-0.030
<-0.030
co-0.029
Token went
Token ID1816
Feature activation+5.87e-01

Pos logit contributions
 adult+0.033
co+0.030
 ambulance+0.029
 sheep+0.028
<+0.027
Neg logit contributions
 Sorry-0.035
chard-0.035
ually-0.030
nam-0.029
apped-0.029
Token to
Token ID284
Feature activation-1.02e+00

Pos logit contributions
chard+0.023
 Sorry+0.023
apped+0.021
ually+0.021
ap+0.020
Neg logit contributions
 adult-0.023
ousy-0.020
 ambulance-0.020
coon-0.020
 stret-0.020
Token the
Token ID262
Feature activation-4.69e-01

Pos logit contributions
 adult+0.046
 ambulance+0.041
<+0.040
co+0.039
 stret+0.039
Neg logit contributions
 Sorry-0.036
chard-0.035
ually-0.032
apped-0.031
ap-0.030
Token dance
Token ID9280
Feature activation+1.74e-01

Pos logit contributions
 adult+0.018
 ambulance+0.016
co+0.015
<+0.015
 stret+0.015
Neg logit contributions
 Sorry-0.019
chard-0.019
ually-0.018
apped-0.017
rac-0.017
Token rehears
Token ID28779
Feature activation+8.45e-01

Pos logit contributions
 Sorry+0.008
chard+0.008
ually+0.007
nam+0.007
rac+0.006
Neg logit contributions
 adult-0.006
 ambulance-0.006
co-0.006
 sheep-0.006
coon-0.005
Tokenals
Token ID874
Feature activation-1.17e+00

Pos logit contributions
chard+0.034
 Sorry+0.033
ually+0.029
nam+0.029
ishment+0.027
Neg logit contributions
 adult-0.032
co-0.031
 ambulance-0.031
 sheep-0.029
 living-0.028
Token,
Token ID11
Feature activation-3.08e-01

Pos logit contributions
 adult+0.031
 ambulance+0.027
co+0.026
 sheep+0.025
coon+0.023
Neg logit contributions
 Sorry-0.064
chard-0.060
ually-0.057
ishment-0.055
nam-0.054
Token!
Token ID0
Feature activation-1.10e+00

Pos logit contributions
chard+0.001
 Sorry+0.001
ually+0.001
apped+0.001
rac+0.001
Neg logit contributions
 adult-0.001
 ambulance-0.001
co-0.001
 wolf-0.001
 stret-0.001
Token You
Token ID921
Feature activation-4.53e-01

Pos logit contributions
 adult+0.038
 ambulance+0.032
<+0.031
coon+0.030
 wolf+0.030
Neg logit contributions
chard-0.051
 Sorry-0.047
ually-0.045
kins-0.044
arium-0.044
Token're
Token ID821
Feature activation-1.20e+00

Pos logit contributions
 adult+0.014
<+0.012
coon+0.012
 ambulance+0.012
 truth+0.011
Neg logit contributions
chard-0.023
 Sorry-0.021
ishment-0.020
ually-0.020
arium-0.019
Token the
Token ID262
Feature activation-1.37e+00

Pos logit contributions
 adult+0.043
 ambulance+0.038
<+0.036
coon+0.036
 wolf+0.036
Neg logit contributions
 Sorry-0.053
chard-0.051
ually-0.049
apped-0.048
arium-0.047
Token best
Token ID1266
Feature activation-4.74e-01

Pos logit contributions
 adult+0.051
co+0.048
<+0.048
 stret+0.048
coon+0.045
Neg logit contributions
 Sorry-0.055
chard-0.052
ap-0.048
 league-0.046
nam-0.044
Token sister
Token ID6621
Feature activation-1.14e+00

Pos logit contributions
 adult+0.019
co+0.019
<+0.018
 stret+0.018
coon+0.017
Neg logit contributions
 Sorry-0.018
apped-0.015
ap-0.015
chard-0.015
ually-0.014
Token ever
Token ID1683
Feature activation-3.79e-01

Pos logit contributions
 adult+0.040
 ambulance+0.036
co+0.035
coon+0.034
 wolf+0.034
Neg logit contributions
chard-0.051
 Sorry-0.049
ually-0.045
ishment-0.044
apped-0.044
Token!"
Token ID2474
Feature activation-5.25e-02

Pos logit contributions
 adult+0.011
co+0.010
coon+0.010
 ambulance+0.010
 wolf+0.009
Neg logit contributions
chard-0.019
 Sorry-0.018
ually-0.017
icker-0.017
apped-0.017
Token he
Token ID339
Feature activation+4.21e-01

Pos logit contributions
 adult+0.002
<+0.002
 ambulance+0.002
 stret+0.002
co+0.002
Neg logit contributions
chard-0.002
 Sorry-0.002
ually-0.002
ap-0.002
apped-0.002
Token said
Token ID531
Feature activation-2.82e-01

Pos logit contributions
chard+0.020
 Sorry+0.019
ually+0.018
ap+0.017
icker+0.017
Neg logit contributions
 adult-0.014
<-0.012
 ambulance-0.012
 sheep-0.012
co-0.012
Token,
Token ID11
Feature activation-4.37e-02

Pos logit contributions
 adult+0.008
 ambulance+0.007
 warning+0.007
 stret+0.007
<+0.006
Neg logit contributions
chard-0.014
 Sorry-0.014
ually-0.013
arium-0.013
ishment-0.013
Token
Token ID241
Feature activation-1.40e+00

Pos logit contributions
 Sorry+0.029
chard+0.028
ually+0.027
apped+0.026
rac+0.026
Neg logit contributions
 adult-0.024
 ambulance-0.020
coon-0.019
 wolf-0.019
ousy-0.019
TokenYou
Token ID1639
Feature activation-1.22e+00

Pos logit contributions
 adult+0.055
 ambulance+0.050
 stret+0.049
ousy+0.048
coon+0.048
Neg logit contributions
 Sorry-0.057
chard-0.054
ually-0.051
apped-0.050
ap-0.047
Tokenâ
Token ID22940
Feature activation-2.45e-01

Pos logit contributions
 adult+0.040
 ambulance+0.033
<+0.032
co+0.032
 sheep+0.032
Neg logit contributions
chard-0.059
 Sorry-0.056
apped-0.051
ishment-0.051
ually-0.050
Token
Token ID26391
Feature activation-8.43e-01

Pos logit contributions
 adult+0.008
co+0.008
 ambulance+0.007
coon+0.007
<+0.007
Neg logit contributions
chard-0.012
 Sorry-0.011
nam-0.011
apped-0.010
arium-0.010
Token
Token ID8151
Feature activation-9.08e-02

Pos logit contributions
 adult+0.028
co+0.028
 ambulance+0.026
<+0.026
 stret+0.025
Neg logit contributions
chard-0.038
 Sorry-0.033
ishment-0.033
nam-0.032
ually-0.032
Tokenve
Token ID303
Feature activation-3.52e+00

Pos logit contributions
 adult+0.004
 ambulance+0.003
co+0.003
<+0.003
 stret+0.003
Neg logit contributions
 Sorry-0.004
chard-0.004
ually-0.003
apped-0.003
ap-0.003
Token really
Token ID1107
Feature activation-5.82e-01

Pos logit contributions
 adult+0.142
co+0.124
 ambulance+0.120
coon+0.120
<+0.118
Neg logit contributions
 Sorry-0.143
chard-0.141
apped-0.124
ually-0.122
ap-0.121
Token improved
Token ID6596
Feature activation+1.85e-01

Pos logit contributions
 adult+0.017
<+0.015
 warning+0.014
co+0.014
 living+0.013
Neg logit contributions
chard-0.030
 Sorry-0.027
ishment-0.027
arium-0.026
apped-0.025
Token.
Token ID13
Feature activation+4.12e-01

Pos logit contributions
 Sorry+0.007
chard+0.007
ually+0.006
apped+0.006
ap+0.006
Neg logit contributions
 adult-0.008
 ambulance-0.007
co-0.007
 stret-0.007
<-0.007
Tokenâ
Token ID22940
Feature activation+8.71e-01

Pos logit contributions
 Sorry+0.017
chard+0.016
ually+0.015
apped+0.015
ap+0.014
Neg logit contributions
 adult-0.016
 ambulance-0.014
 stret-0.014
co-0.014
coon-0.014
Token
Token ID26391
Feature activation+1.74e-01

Pos logit contributions
chard+0.043
 Sorry+0.040
nam+0.037
apped+0.037
arium+0.036
Neg logit contributions
 adult-0.028
co-0.027
 ambulance-0.025
<-0.024
coon-0.023
Tokenily
Token ID813
Feature activation-3.27e-01

Pos logit contributions
apped+0.069
kin+0.068
kins+0.065
arium+0.063
ap+0.063
Neg logit contributions
 wolf-0.048
 stret-0.045
 adult-0.044
 sheep-0.043
 reins-0.043
Token went
Token ID1816
Feature activation+7.55e-01

Pos logit contributions
 adult+0.013
 ambulance+0.011
co+0.011
<+0.011
 stret+0.011
Neg logit contributions
 Sorry-0.013
chard-0.013
ually-0.012
apped-0.012
ap-0.011
Token to
Token ID284
Feature activation-1.03e+00

Pos logit contributions
chard+0.027
arium+0.022
apped+0.021
ually+0.020
 Sorry+0.020
Neg logit contributions
 adult-0.033
ousy-0.032
coon-0.031
 stret-0.029
 wolf-0.029
Token the
Token ID262
Feature activation-3.40e-01

Pos logit contributions
 adult+0.052
co+0.048
 ambulance+0.048
coon+0.047
<+0.046
Neg logit contributions
chard-0.030
 Sorry-0.029
ually-0.025
apped-0.024
ap-0.023
Token books
Token ID3835
Feature activation+6.05e-01

Pos logit contributions
 adult+0.011
 stret+0.010
<+0.010
co+0.010
 ambulance+0.010
Neg logit contributions
chard-0.016
 Sorry-0.015
nam-0.013
ually-0.013
rac-0.013
Tokenhe
Token ID258
Feature activation-2.92e+00

Pos logit contributions
 Sorry+0.032
chard+0.028
ually+0.027
ishment+0.026
nam+0.026
Neg logit contributions
 adult-0.018
 ambulance-0.017
co-0.016
 stret-0.015
<-0.014
Tokenlf
Token ID1652
Feature activation+5.66e-02

Pos logit contributions
 adult+0.133
 ambulance+0.118
<+0.116
co+0.115
 stret+0.115
Neg logit contributions
 Sorry-0.100
chard-0.099
ually-0.089
apped-0.087
ap-0.083
Token and
Token ID290
Feature activation-1.68e-01

Pos logit contributions
 Sorry+0.003
chard+0.003
ually+0.003
rac+0.003
apped+0.003
Neg logit contributions
 adult-0.002
 ambulance-0.001
co-0.001
 stret-0.001
 sheep-0.001
Token found
Token ID1043
Feature activation-2.50e-01

Pos logit contributions
 adult+0.006
 sheep+0.005
 ambulance+0.005
co+0.005
 tele+0.005
Neg logit contributions
 Sorry-0.007
ap-0.007
apped-0.006
ually-0.006
boat-0.006
Token an
Token ID281
Feature activation-2.31e-02

Pos logit contributions
 adult+0.012
 ambulance+0.010
co+0.010
<+0.010
 stret+0.010
Neg logit contributions
chard-0.008
 Sorry-0.008
ually-0.007
apped-0.007
rac-0.007
Token ancient
Token ID6156
Feature activation-1.16e-01

Pos logit contributions
Sweet+0.001
 stret+0.001
 adult+0.001
<+0.001
co+0.001
Neg logit contributions
chard-0.001
nam-0.001
 Sorry-0.001
ually-0.001
rac-0.001
Token It
Token ID632
Feature activation+1.54e-01

Pos logit contributions
 adult+0.011
 ambulance+0.009
<+0.009
coon+0.008
co+0.008
Neg logit contributions
chard-0.018
ually-0.017
 Sorry-0.017
kins-0.017
arium-0.016
Token is
Token ID318
Feature activation+2.08e-01

Pos logit contributions
 Sorry+0.007
chard+0.006
ually+0.006
apped+0.006
ap+0.006
Neg logit contributions
 adult-0.006
 ambulance-0.005
 stret-0.005
<-0.005
co-0.005
Token a
Token ID257
Feature activation+1.41e-01

Pos logit contributions
 Sorry+0.009
chard+0.009
ually+0.008
apped+0.008
ap+0.008
Neg logit contributions
 adult-0.008
 ambulance-0.006
<-0.006
co-0.006
 stret-0.006
Token stupid
Token ID8531
Feature activation-8.67e-01

Pos logit contributions
 Sorry+0.006
chard+0.006
ually+0.005
 league+0.005
ap+0.005
Neg logit contributions
 adult-0.005
co-0.004
 ambulance-0.004
<-0.004
ousy-0.004
Token robot
Token ID9379
Feature activation+4.08e-01

Pos logit contributions
 adult+0.032
co+0.028
<+0.028
ousy+0.028
 stret+0.028
Neg logit contributions
 Sorry-0.036
chard-0.034
apped-0.032
ap-0.031
ually-0.031
Token anyway
Token ID6949
Feature activation-3.39e+00

Pos logit contributions
 Sorry+0.017
chard+0.017
ually+0.015
apped+0.015
rac+0.015
Neg logit contributions
 adult-0.016
 ambulance-0.014
co-0.013
 stret-0.013
coon-0.013
Token.
Token ID13
Feature activation-3.53e-01

Pos logit contributions
 adult+0.146
 ambulance+0.130
 stret+0.127
<+0.127
co+0.126
Neg logit contributions
 Sorry-0.123
chard-0.121
ually-0.110
apped-0.108
ap-0.104
Token Watch
Token ID6305
Feature activation+4.31e-01

Pos logit contributions
 adult+0.011
 ambulance+0.010
<+0.009
coon+0.009
co+0.009
Neg logit contributions
chard-0.017
 Sorry-0.016
ually-0.016
kins-0.015
arium-0.015
Token this
Token ID428
Feature activation-1.42e+00

Pos logit contributions
 Sorry+0.015
chard+0.014
ap+0.013
rac+0.013
ually+0.013
Neg logit contributions
 adult-0.018
Sweet-0.017
ousy-0.016
<-0.016
 ambulance-0.016
Token!"
Token ID2474
Feature activation+2.25e-01

Pos logit contributions
 adult+0.059
 ambulance+0.052
<+0.051
co+0.051
 stret+0.050
Neg logit contributions
 Sorry-0.054
chard-0.053
ually-0.048
apped-0.047
ap-0.046
Token He
Token ID679
Feature activation+9.70e-01

Pos logit contributions
chard+0.012
 Sorry+0.012
ually+0.012
kins+0.011
kin+0.011
Neg logit contributions
 adult-0.005
<-0.005
 ambulance-0.004
 stret-0.004
Sweet-0.004
Token got
Token ID1392
Feature activation+4.02e-01

Pos logit contributions
 adult+0.012
co+0.012
coon+0.011
 ambulance+0.011
 wolf+0.011
Neg logit contributions
 Sorry-0.017
chard-0.017
ishment-0.016
ap-0.016
ually-0.016
Token their
Token ID511
Feature activation-3.03e-01

Pos logit contributions
 Sorry+0.018
chard+0.018
ually+0.017
arium+0.016
apped+0.016
Neg logit contributions
 adult-0.014
 ambulance-0.012
co-0.012
<-0.012
 stret-0.012
Token own
Token ID898
Feature activation-2.07e+00

Pos logit contributions
 adult+0.010
coon+0.010
 stret+0.009
 ambulance+0.009
 wolf+0.009
Neg logit contributions
 Sorry-0.012
ually-0.012
ap-0.012
apped-0.012
 collection-0.012
Token z
Token ID1976
Feature activation-2.57e+00

Pos logit contributions
 adult+0.067
coon+0.065
Sweet+0.061
 stret+0.061
ousy+0.060
Neg logit contributions
ually-0.083
ap-0.081
boat-0.080
 Sorry-0.080
apped-0.080
Tokenig
Token ID328
Feature activation-2.72e+00

Pos logit contributions
 adult+0.133
 ambulance+0.119
 stret+0.118
 wolf+0.117
<+0.116
Neg logit contributions
 Sorry-0.067
apped-0.064
ap-0.064
chard-0.063
ually-0.062
Tokenzag
Token ID50183
Feature activation-3.65e+00

Pos logit contributions
 adult+0.120
 ambulance+0.104
 wolf+0.099
 stret+0.099
<+0.099
Neg logit contributions
 Sorry-0.094
chard-0.094
ually-0.091
apped-0.090
ap-0.088
Token.
Token ID13
Feature activation-5.95e-02

Pos logit contributions
 adult+0.140
 ambulance+0.123
co+0.119
<+0.118
 stret+0.118
Neg logit contributions
 Sorry-0.152
chard-0.150
ually-0.137
apped-0.135
ap-0.129
Token It
Token ID632
Feature activation+5.82e-02

Pos logit contributions
 adult+0.003
 ambulance+0.003
co+0.003
 stret+0.003
coon+0.003
Neg logit contributions
 Sorry-0.002
chard-0.002
ually-0.002
apped-0.002
ap-0.002
Token looked
Token ID3114
Feature activation+3.69e-01

Pos logit contributions
 Sorry+0.002
chard+0.002
ually+0.002
apped+0.002
arium+0.002
Neg logit contributions
 adult-0.002
 ambulance-0.002
 stret-0.002
<-0.002
co-0.002
Token like
Token ID588
Feature activation+2.06e-01

Pos logit contributions
 Sorry+0.019
chard+0.018
ually+0.018
kins+0.017
apped+0.017
Neg logit contributions
 adult-0.010
 ambulance-0.009
<-0.008
 wolf-0.008
 sheep-0.008
Token they
Token ID484
Feature activation+5.94e-01

Pos logit contributions
 Sorry+0.007
chard+0.007
ually+0.006
apped+0.006
ap+0.006
Neg logit contributions
 adult-0.009
 ambulance-0.008
co-0.008
<-0.008
 stret-0.008
Token to
Token ID284
Feature activation+2.10e-01

Pos logit contributions
 adult+0.027
 stret+0.024
 ambulance+0.024
co+0.023
coon+0.023
Neg logit contributions
chard-0.026
 Sorry-0.024
ually-0.021
apped-0.021
rac-0.021
Token go
Token ID467
Feature activation+1.06e+00

Pos logit contributions
 Sorry+0.009
chard+0.009
apped+0.008
ually+0.008
ishment+0.008
Neg logit contributions
 adult-0.008
 ambulance-0.007
<-0.007
 stret-0.007
co-0.007
Token play
Token ID711
Feature activation-6.08e-01

Pos logit contributions
chard+0.040
 Sorry+0.038
ually+0.035
ap+0.035
apped+0.034
Neg logit contributions
 adult-0.045
coon-0.038
 stret-0.038
 ambulance-0.038
co-0.037
Token in
Token ID287
Feature activation+1.40e-01

Pos logit contributions
 adult+0.029
 ambulance+0.026
co+0.025
<+0.025
 stret+0.025
Neg logit contributions
 Sorry-0.020
chard-0.020
ually-0.017
apped-0.017
ap-0.016
Token the
Token ID262
Feature activation-7.35e-01

Pos logit contributions
 Sorry+0.005
chard+0.005
ually+0.004
apped+0.004
ap+0.004
Neg logit contributions
 adult-0.007
 ambulance-0.006
<-0.006
co-0.006
 stret-0.006
Token nearest
Token ID16936
Feature activation-3.03e+00

Pos logit contributions
 adult+0.026
 ambulance+0.021
 stret+0.021
co+0.021
coon+0.021
Neg logit contributions
 Sorry-0.033
chard-0.033
ually-0.030
apped-0.030
ap-0.029
Token tree
Token ID5509
Feature activation-1.41e-01

Pos logit contributions
 adult+0.094
 ambulance+0.085
 wolf+0.075
 sheep+0.074
 stret+0.072
Neg logit contributions
 Sorry-0.154
chard-0.146
ually-0.146
apped-0.146
kins-0.139
Token.
Token ID13
Feature activation-1.19e-01

Pos logit contributions
 adult+0.004
 ambulance+0.003
 living+0.003
 sheep+0.003
co+0.003
Neg logit contributions
 Sorry-0.008
apped-0.007
ually-0.007
rac-0.007
chard-0.007
Token He
Token ID679
Feature activation+3.95e-01

Pos logit contributions
 adult+0.005
 ambulance+0.004
co+0.004
<+0.004
 stret+0.004
Neg logit contributions
 Sorry-0.005
chard-0.005
ually-0.005
apped-0.004
ap-0.004
Token thought
Token ID1807
Feature activation-1.48e+00

Pos logit contributions
chard+0.013
 Sorry+0.012
ually+0.011
apped+0.010
rac+0.010
Neg logit contributions
 adult-0.019
 ambulance-0.017
 stret-0.017
coon-0.017
<-0.016
Token he
Token ID339
Feature activation+1.43e+00

Pos logit contributions
 adult+0.074
co+0.068
 stret+0.068
 ambulance+0.068
coon+0.066
Neg logit contributions
chard-0.041
 Sorry-0.039
apped-0.035
rac-0.034
ually-0.034