TOP ACTIVATIONS
MAX = 4.891

and Jimmy 's face sc run ched up as he ate
the park . Julia sc run ched up her face and
her candy , she sc run ched her eyes tight .
the dirt . He sc run ched up his face as
M am ma sc run ched up all of the
stick out . He sc run ched up his face and
was missing . She sc run ched up her face and
big arm chair in a small room . It was an
. He tried to out run the train but he was
the thunder bo omed high above her . She wanted it
play a game called " run and catch ". Ben was
her family lived in a small town . One day ,
her toy phone in her little purse . Lily 's favorite
little mouse in her cozy little hole in the wall of
do it . She sc run ched up her face .
while Molly wasn ’ t looking .
his family lived in a small hut in the forest .
The little girl was in a hurry . She wanted to
friends who lived in a small village . They liked to
hated it . He sc run ched his face up as

MIN ACTIVATIONS
MIN = -4.640

web . The cob web was so strong that
monkey and elephant played all sorts of fun games . They
ak . The cob web sh ivered and then
pretty v ase . The v ase was blue and had
. She would design all sorts of things like flowers ,
a small house . The v ase was very pretty and
, Tim my . The v ase must remain on the
. They would see all sorts of animals like lions ,
things . The v ase belonged to a little
chased after it . Finally after a lot of running ,
in the park . The v ase was covered in dirt
backyard . He saw all sorts of interesting things , like
a cob web . The cob web was very thin and
stream . He met all sorts of animals and sw am
He did all sorts of things with the axe
His mom explained that the cob web was just a spider
said and decided that maybe cob we bs weren 't so
the v ase . The v ase fell and break .
was too strong . The v ase broke into lots of
. They were playing all sorts of games . Suddenly ,

INTERVAL 2.508 - 4.891
CONTAINS 0.252%

mass aging her legs and arms . When it was finished
Tom . Tom lived in a bedroom and he loved to
and his dad were in a tent . They were camping
with her duck in her arms . The end . <|endoftext|>
. " Don 't be scared ", Bob said . "

INTERVAL 0.125 - 2.508
CONTAINS 43.605%

mom said that it was the most beautiful costume she had
. The little boy was so excited that he wanted to
J enny said , " I have some news .
a time there was a reliable car that liked to drive
came out of the house and saw the wet floor ,

INTERVAL -2.257 - 0.125
CONTAINS 55.509%

<|endoftext|> Once upon a time , there lived a thin cat
started crying and the man felt very sad . <|endoftext|> John
will lead you to a secret place ." S
a happy family . <|endoftext|> Anna and Ben like to play
girl near the ball . He asked her , " Can

INTERVAL -4.640 - -2.257
CONTAINS 0.634%

boy who always talked about lots of fun things . Every
hanging !" After playing , Lily was thirsty
. The moral of the story is that it 's good
lion !" Anna says . She shows Ben her lion .
giant and ran off . From that day on , the
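
NOTE ON THE INTERVALS
The four INTERVAL blocks above appear to be equal-width bins spanning MIN to MAX, and their CONTAINS percentages appear to add up to 100%. That reading is inferred from the printed numbers, not stated by the dashboard. The short Python sketch below checks it using only values copied from this page. The variable names are illustrative and not part of any dashboard API.

```python
# Values copied from the dashboard text above (rounded to 3 decimals there).
act_min, act_max = -4.640, 4.891
reported_edges = [-4.640, -2.257, 0.125, 2.508, 4.891]
reported_pct = [0.634, 55.509, 43.605, 0.252]  # lowest interval first

# Assumption: the dashboard splits [MIN, MAX] into equal-width bins.
n_bins = len(reported_pct)
width = (act_max - act_min) / n_bins
edges = [act_min + i * width for i in range(n_bins + 1)]

# Largest gap between a recomputed edge and the printed one.
max_edge_error = max(abs(e - r) for e, r in zip(edges, reported_edges))
total_pct = sum(reported_pct)

print("bin width:", round(width, 5))
print("max edge error:", round(max_edge_error, 5))
print("total percent:", round(total_pct, 3))
```

If the assumption holds, the edge error stays within the rounding of the printed values. About 99% of activations fall in the two middle intervals, so the sampled lines in the outer intervals are rare cases.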
Token and
Token ID290
Feature activation+1.40e+00

Pos logit contributions
ate+0.036
ires+0.029
ia+0.028
mill+0.027
urse+0.026
Neg logit contributions
arse-0.034
 -0.033
gency-0.032
 pleasure-0.031
 might-0.030
Token Jimmy
Token ID12963
Feature activation+2.38e+00

Pos logit contributions
arse+0.033
gency+0.031
 +0.030
 pleasure+0.028
 plunge+0.027
Neg logit contributions
ate-0.040
ires-0.034
mill-0.033
ia-0.033
der-0.032
Token's
Token ID338
Feature activation+1.35e+00

Pos logit contributions
arse+0.071
gency+0.068
 +0.067
 pleasure+0.062
 plunge+0.061
Neg logit contributions
ate-0.052
ires-0.042
mill-0.040
ia-0.040
der-0.040
Token face
Token ID1986
Feature activation+1.36e+00

Pos logit contributions
arse+0.034
gency+0.033
 +0.029
 plunge+0.028
riger+0.025
Neg logit contributions
ate-0.034
ia-0.031
ires-0.031
der-0.030
 from-0.029
Token sc
Token ID629
Feature activation+1.50e+00

Pos logit contributions
 +0.032
arse+0.032
gency+0.029
 might+0.029
're+0.026
Neg logit contributions
ate-0.037
ires-0.033
ia-0.030
mill-0.029
iring-0.029
Tokenrun
Token ID5143
Feature activation+4.89e+00

Pos logit contributions
arse+0.045
gency+0.043
 +0.042
 pleasure+0.039
 plunge+0.039
Neg logit contributions
ate-0.032
ires-0.027
ia-0.025
der-0.025
mill-0.025
Tokenched
Token ID1740
Feature activation+1.34e+00

Pos logit contributions
arse+0.137
gency+0.130
 pleasure+0.119
 +0.119
 plunge+0.116
Neg logit contributions
ate-0.122
mill-0.096
ia-0.094
ires-0.094
 from-0.092
Token up
Token ID510
Feature activation+1.60e-01

Pos logit contributions
arse+0.039
gency+0.038
 +0.034
 pleasure+0.034
 plunge+0.034
Neg logit contributions
ate-0.030
 from-0.026
der-0.025
 about-0.024
ires-0.024
Token as
Token ID355
Feature activation+5.98e-01

Pos logit contributions
arse+0.004
 +0.004
gency+0.004
 pleasure+0.004
 plunge+0.004
Neg logit contributions
ate-0.004
ires-0.003
mill-0.003
ia-0.003
der-0.003
Token he
Token ID339
Feature activation+6.93e-01

Pos logit contributions
 +0.013
arse+0.013
gency+0.012
 pleasure+0.012
 plunge+0.011
Neg logit contributions
ate-0.017
ires-0.014
mill-0.014
ia-0.013
iring-0.013
Token ate
Token ID15063
Feature activation+1.06e-01

Pos logit contributions
 +0.017
gency+0.016
arse+0.016
 might+0.015
 pleasure+0.014
Neg logit contributions
ate-0.018
ires-0.016
mill-0.015
iring-0.015
der-0.014
Token the
Token ID262
Feature activation+5.04e-01

Pos logit contributions
arse+0.047
gency+0.044
 plunge+0.040
 pleasure+0.039
use+0.038
Neg logit contributions
ate-0.026
 from-0.021
mill-0.018
ia-0.017
 and-0.017
Token park
Token ID3952
Feature activation+6.31e-01

Pos logit contributions
arse+0.014
gency+0.013
 +0.012
 pleasure+0.012
 plunge+0.012
Neg logit contributions
ate-0.012
ires-0.010
ia-0.010
mill-0.010
der-0.009
Token.
Token ID13
Feature activation-1.48e-02

Pos logit contributions
arse+0.016
 +0.016
gency+0.015
 pleasure+0.014
 plunge+0.014
Neg logit contributions
ate-0.016
ires-0.014
ia-0.013
mill-0.013
der-0.013
Token Julia
Token ID22300
Feature activation-2.02e-01

Pos logit contributions
ate+0.000
ires+0.000
ia+0.000
der+0.000
mill+0.000
Neg logit contributions
arse-0.000
gency-0.000
 -0.000
 plunge-0.000
 pleasure-0.000
Token sc
Token ID629
Feature activation+1.19e+00

Pos logit contributions
ate+0.005
ires+0.005
iring+0.004
mill+0.004
der+0.004
Neg logit contributions
 -0.005
 might-0.005
arse-0.005
gency-0.005
 pleasure-0.004
Tokenrun
Token ID5143
Feature activation+4.57e+00

Pos logit contributions
arse+0.030
gency+0.028
 +0.028
 pleasure+0.026
 plunge+0.025
Neg logit contributions
ate-0.032
ires-0.026
mill-0.026
der-0.025
ia-0.025
Tokenched
Token ID1740
Feature activation+9.03e-01

Pos logit contributions
arse+0.126
gency+0.118
 pleasure+0.109
 +0.107
 plunge+0.106
Neg logit contributions
ate-0.118
ia-0.093
mill-0.092
ires-0.090
 from-0.088
Token up
Token ID510
Feature activation-5.07e-01

Pos logit contributions
arse+0.030
gency+0.029
 pleasure+0.025
 plunge+0.025
use+0.025
Neg logit contributions
ate-0.018
 from-0.015
ia-0.013
ires-0.013
 about-0.013
Token her
Token ID607
Feature activation-4.30e-01

Pos logit contributions
ate+0.011
ires+0.009
mill+0.008
ia+0.008
der+0.008
Neg logit contributions
arse-0.015
 -0.015
gency-0.015
 pleasure-0.014
 plunge-0.013
Token face
Token ID1986
Feature activation+9.30e-01

Pos logit contributions
ate+0.012
ires+0.010
mill+0.010
ia+0.010
der+0.009
Neg logit contributions
arse-0.011
gency-0.010
 -0.010
 pleasure-0.009
 plunge-0.009
Token and
Token ID290
Feature activation+1.20e+00

Pos logit contributions
 +0.028
arse+0.028
gency+0.026
 pleasure+0.025
 might+0.025
Neg logit contributions
ate-0.019
ires-0.016
ia-0.014
mill-0.013
iring-0.013
Token her
Token ID607
Feature activation-9.57e-01

Pos logit contributions
ate+0.046
ires+0.041
ia+0.040
der+0.039
iring+0.038
Neg logit contributions
 -0.037
arse-0.036
gency-0.035
 pleasure-0.031
 suitcase-0.030
Token candy
Token ID18550
Feature activation-1.10e+00

Pos logit contributions
ate+0.028
ires+0.024
ia+0.023
mill+0.023
der+0.022
Neg logit contributions
arse-0.022
gency-0.021
 -0.020
 plunge-0.018
 pleasure-0.018
Token,
Token ID11
Feature activation-3.17e-02

Pos logit contributions
ate+0.029
ires+0.026
ia+0.024
mill+0.023
der+0.023
Neg logit contributions
 -0.027
arse-0.027
gency-0.025
 suitcase-0.023
 might-0.023
Token she
Token ID673
Feature activation-7.54e-01

Pos logit contributions
ate+0.001
ires+0.001
ia+0.001
mill+0.001
der+0.001
Neg logit contributions
arse-0.001
 -0.001
gency-0.001
 pleasure-0.001
 plunge-0.001
Token sc
Token ID629
Feature activation+1.17e+00

Pos logit contributions
ires+0.023
ate+0.021
iring+0.021
iders+0.020
mill+0.019
Neg logit contributions
 -0.017
 might-0.014
 suitcase-0.014
gency-0.013
arse-0.013
Tokenrun
Token ID5143
Feature activation+4.36e+00

Pos logit contributions
arse+0.032
 +0.031
gency+0.031
 pleasure+0.029
 plunge+0.027
Neg logit contributions
ate-0.029
mill-0.023
ires-0.023
der-0.023
ia-0.022
Tokenched
Token ID1740
Feature activation+4.52e-01

Pos logit contributions
arse+0.117
gency+0.109
 +0.103
 pleasure+0.101
 plunge+0.098
Neg logit contributions
ate-0.114
ia-0.090
mill-0.090
ires-0.089
 from-0.087
Token her
Token ID607
Feature activation+1.09e-01

Pos logit contributions
arse+0.014
gency+0.013
 plunge+0.011
 pleasure+0.011
 +0.011
Neg logit contributions
ate-0.011
 from-0.009
ires-0.008
 about-0.008
der-0.008
Token eyes
Token ID2951
Feature activation+1.31e+00

Pos logit contributions
arse+0.003
gency+0.003
 +0.002
 plunge+0.002
 pleasure+0.002
Neg logit contributions
ate-0.003
 from-0.002
ires-0.002
der-0.002
 about-0.002
Token tight
Token ID5381
Feature activation+1.26e+00

Pos logit contributions
arse+0.042
 +0.040
gency+0.039
 pleasure+0.036
 plunge+0.036
Neg logit contributions
ate-0.026
ires-0.021
ia-0.020
mill-0.020
iring-0.018
Token.
Token ID13
Feature activation+7.06e-01

Pos logit contributions
arse+0.040
gency+0.038
 +0.038
 pleasure+0.036
 plunge+0.035
Neg logit contributions
ate-0.025
ires-0.020
mill-0.019
ia-0.019
der-0.018
Token the
Token ID262
Feature activation-9.03e-03

Pos logit contributions
 +0.003
gency+0.002
 might+0.002
 pleasure+0.002
arse+0.002
Neg logit contributions
ate-0.003
ires-0.003
iring-0.003
ia-0.003
resp-0.003
Token dirt
Token ID13647
Feature activation+5.23e-01

Pos logit contributions
ate+0.000
ires+0.000
ia+0.000
der+0.000
mill+0.000
Neg logit contributions
arse-0.000
gency-0.000
 -0.000
 pleasure-0.000
 plunge-0.000
Token.
Token ID13
Feature activation-2.19e-02

Pos logit contributions
 +0.012
arse+0.011
 might+0.011
 plunge+0.010
gency+0.010
Neg logit contributions
ate-0.015
ires-0.013
ia-0.012
ena-0.012
urse-0.012
Token He
Token ID679
Feature activation+2.22e-01

Pos logit contributions
ate+0.001
mill+0.000
ires+0.000
ia+0.000
iring+0.000
Neg logit contributions
 -0.001
arse-0.001
gency-0.001
 pleasure-0.001
 plunge-0.000
Token sc
Token ID629
Feature activation+8.08e-01

Pos logit contributions
 +0.005
 might+0.005
gency+0.004
arse+0.004
 pleasure+0.004
Neg logit contributions
ate-0.006
ires-0.006
iring-0.006
mill-0.005
resp-0.005
Tokenrun
Token ID5143
Feature activation+4.33e+00

Pos logit contributions
arse+0.021
 +0.021
gency+0.020
 pleasure+0.020
 plunge+0.019
Neg logit contributions
ate-0.020
mill-0.017
ires-0.016
der-0.016
ia-0.015
Tokenched
Token ID1740
Feature activation+6.89e-01

Pos logit contributions
arse+0.112
gency+0.106
 +0.104
 pleasure+0.096
 plunge+0.094
Neg logit contributions
ate-0.112
ires-0.093
mill-0.091
ia-0.090
der-0.090
Token up
Token ID510
Feature activation-4.50e-01

Pos logit contributions
arse+0.019
gency+0.018
 +0.017
 pleasure+0.016
 plunge+0.016
Neg logit contributions
ate-0.017
ires-0.014
mill-0.013
ia-0.013
der-0.013
Token his
Token ID465
Feature activation+5.54e-01

Pos logit contributions
ate+0.010
ires+0.008
ia+0.007
mill+0.007
iring+0.007
Neg logit contributions
 -0.013
arse-0.013
gency-0.012
 pleasure-0.012
 might-0.011
Token face
Token ID1986
Feature activation+6.92e-01

Pos logit contributions
arse+0.012
gency+0.012
 +0.011
 pleasure+0.009
 plunge+0.009
Neg logit contributions
ate-0.016
ires-0.014
ia-0.014
 from-0.014
der-0.014
Token as
Token ID355
Feature activation+6.18e-01

Pos logit contributions
 +0.020
arse+0.020
gency+0.019
 might+0.018
 pleasure+0.018
Neg logit contributions
ate-0.015
ires-0.012
ia-0.011
mill-0.011
iring-0.010
Token
Token ID198
Feature activation+1.12e-01

Pos logit contributions
arse+0.006
gency+0.006
 pleasure+0.005
 plunge+0.005
 +0.005
Neg logit contributions
ate-0.005
ia-0.004
ires-0.004
 about-0.004
 from-0.004
TokenM
Token ID44
Feature activation-5.11e-01

Pos logit contributions
arse+0.003
 +0.003
gency+0.003
 plunge+0.003
 pleasure+0.003
Neg logit contributions
ate-0.003
ires-0.002
mill-0.002
der-0.002
ia-0.002
Tokenam
Token ID321
Feature activation+3.94e-01

Pos logit contributions
ate+0.013
mill+0.011
der+0.011
ires+0.011
 from+0.010
Neg logit contributions
arse-0.012
 -0.012
gency-0.012
 pleasure-0.011
 suitcase-0.010
Tokenma
Token ID2611
Feature activation+1.14e+00

Pos logit contributions
 +0.009
arse+0.009
gency+0.008
 plunge+0.008
 might+0.007
Neg logit contributions
ate-0.012
ires-0.010
der-0.010
mill-0.009
iring-0.009
Token sc
Token ID629
Feature activation+1.03e+00

Pos logit contributions
arse+0.029
gency+0.027
 +0.026
 pleasure+0.026
 plunge+0.024
Neg logit contributions
ate-0.030
mill-0.025
ia-0.024
 from-0.024
ires-0.024
Tokenrun
Token ID5143
Feature activation+4.31e+00

Pos logit contributions
arse+0.034
gency+0.033
 +0.032
 pleasure+0.030
 plunge+0.030
Neg logit contributions
ate-0.020
ires-0.015
mill-0.015
ia-0.014
der-0.014
Tokenched
Token ID1740
Feature activation+1.61e+00

Pos logit contributions
arse+0.131
gency+0.120
 pleasure+0.116
 +0.110
 plunge+0.109
Neg logit contributions
ate-0.101
ia-0.078
 from-0.077
mill-0.077
ires-0.075
Token up
Token ID510
Feature activation-5.36e-01

Pos logit contributions
arse+0.050
gency+0.046
 pleasure+0.043
 +0.040
 plunge+0.040
Neg logit contributions
ate-0.036
 from-0.031
ia-0.029
ires-0.028
 about-0.027
Token all
Token ID477
Feature activation+6.96e-01

Pos logit contributions
ate+0.014
ires+0.012
mill+0.011
ia+0.011
der+0.011
Neg logit contributions
 -0.014
arse-0.013
gency-0.013
 pleasure-0.012
 plunge-0.012
Token of
Token ID286
Feature activation-5.60e-03

Pos logit contributions
arse+0.020
 +0.020
gency+0.019
 pleasure+0.018
 plunge+0.018
Neg logit contributions
ate-0.015
ires-0.012
der-0.012
ia-0.012
mill-0.012
Token the
Token ID262
Feature activation-1.06e-01

Pos logit contributions
ate+0.000
ires+0.000
mill+0.000
ia+0.000
der+0.000
Neg logit contributions
arse-0.000
gency-0.000
 -0.000
 pleasure-0.000
 plunge-0.000
Token stick
Token ID4859
Feature activation+1.51e+00

Pos logit contributions
arse+0.015
 +0.014
gency+0.014
 plunge+0.013
 pleasure+0.012
Neg logit contributions
ate-0.015
ires-0.013
mill-0.012
ia-0.012
der-0.012
Token out
Token ID503
Feature activation+5.02e-01

Pos logit contributions
arse+0.038
 +0.035
gency+0.034
 plunge+0.031
 pleasure+0.031
Neg logit contributions
ate-0.040
ires-0.035
ia-0.033
mill-0.033
der-0.033
Token.
Token ID13
Feature activation+7.26e-01

Pos logit contributions
arse+0.014
 +0.013
gency+0.013
 pleasure+0.012
 plunge+0.012
Neg logit contributions
ate-0.012
ires-0.009
mill-0.009
ia-0.009
der-0.009
Token He
Token ID679
Feature activation-2.44e-01

Pos logit contributions
arse+0.018
 +0.018
gency+0.017
 pleasure+0.016
 plunge+0.015
Neg logit contributions
ate-0.019
ires-0.016
mill-0.016
ia-0.016
der-0.015
Token sc
Token ID629
Feature activation+8.68e-01

Pos logit contributions
ate+0.007
ires+0.007
iring+0.006
mill+0.006
der+0.006
Neg logit contributions
 -0.006
gency-0.005
 might-0.005
arse-0.005
 plunge-0.004
Tokenrun
Token ID5143
Feature activation+4.30e+00

Pos logit contributions
arse+0.024
gency+0.023
 +0.023
 pleasure+0.021
 plunge+0.020
Neg logit contributions
ate-0.021
ires-0.017
mill-0.017
der-0.017
ia-0.016
Tokenched
Token ID1740
Feature activation+9.78e-01

Pos logit contributions
arse+0.120
gency+0.112
 +0.106
 pleasure+0.104
 plunge+0.101
Neg logit contributions
ate-0.107
ia-0.085
ires-0.084
mill-0.084
der-0.081
Token up
Token ID510
Feature activation-6.75e-01

Pos logit contributions
arse+0.031
gency+0.029
 pleasure+0.027
 plunge+0.026
 +0.026
Neg logit contributions
ate-0.020
 from-0.017
 about-0.016
ia-0.016
ires-0.016
Token his
Token ID465
Feature activation+2.76e-01

Pos logit contributions
ate+0.016
ires+0.014
mill+0.012
iring+0.012
ia+0.012
Neg logit contributions
 -0.019
arse-0.018
gency-0.018
 pleasure-0.016
 suitcase-0.016
Token face
Token ID1986
Feature activation+4.91e-01

Pos logit contributions
arse+0.006
gency+0.006
 +0.006
 pleasure+0.005
 plunge+0.005
Neg logit contributions
ate-0.008
ires-0.007
ia-0.007
mill-0.007
der-0.007
Token and
Token ID290
Feature activation+1.37e+00

Pos logit contributions
 +0.015
arse+0.014
gency+0.014
 might+0.013
 suitcase+0.013
Neg logit contributions
ate-0.011
ires-0.009
ia-0.008
mill-0.007
emo-0.007
Token was
Token ID373
Feature activation+8.13e-01

Pos logit contributions
arse+0.011
 +0.010
gency+0.010
 plunge+0.009
 might+0.009
Neg logit contributions
ate-0.011
ires-0.010
ia-0.009
der-0.009
iring-0.009
Token missing
Token ID4814
Feature activation+2.99e-01

Pos logit contributions
arse+0.020
 +0.019
gency+0.018
 pleasure+0.017
 plunge+0.016
Neg logit contributions
ate-0.022
ires-0.019
ia-0.018
der-0.017
resp-0.017
Token.
Token ID13
Feature activation+7.46e-01

Pos logit contributions
arse+0.008
gency+0.007
 +0.007
 pleasure+0.007
 plunge+0.007
Neg logit contributions
ate-0.008
ires-0.006
mill-0.006
ia-0.006
der-0.006
Token She
Token ID1375
Feature activation-5.05e-01

Pos logit contributions
arse+0.018
 +0.018
gency+0.017
 pleasure+0.016
 plunge+0.015
Neg logit contributions
ate-0.020
ires-0.017
mill-0.017
ia-0.016
der-0.016
Token sc
Token ID629
Feature activation+8.62e-01

Pos logit contributions
ate+0.015
ires+0.014
mill+0.013
iring+0.013
urse+0.012
Neg logit contributions
 -0.012
 might-0.010
gency-0.010
arse-0.010
 brow-0.009
Tokenrun
Token ID5143
Feature activation+4.27e+00

Pos logit contributions
arse+0.025
gency+0.024
 +0.024
 pleasure+0.022
 plunge+0.022
Neg logit contributions
ate-0.020
mill-0.016
ires-0.015
der-0.015
ia-0.015
Tokenched
Token ID1740
Feature activation+1.40e+00

Pos logit contributions
arse+0.113
gency+0.107
 +0.105
 pleasure+0.098
 plunge+0.095
Neg logit contributions
ate-0.108
ires-0.089
mill-0.087
ia-0.086
der-0.085
Token up
Token ID510
Feature activation-1.18e-01

Pos logit contributions
arse+0.039
gency+0.037
 pleasure+0.033
 +0.032
 plunge+0.032
Neg logit contributions
ate-0.035
 from-0.029
ires-0.028
ia-0.028
 about-0.027
Token her
Token ID607
Feature activation-6.05e-01

Pos logit contributions
ate+0.003
ires+0.002
mill+0.002
ia+0.002
der+0.002
Neg logit contributions
arse-0.003
 -0.003
gency-0.003
 pleasure-0.003
 plunge-0.003
Token face
Token ID1986
Feature activation+9.71e-01

Pos logit contributions
ate+0.016
ires+0.014
ia+0.013
mill+0.013
der+0.013
Neg logit contributions
arse-0.015
gency-0.014
 -0.014
 pleasure-0.013
 plunge-0.012
Token and
Token ID290
Feature activation+1.62e+00

Pos logit contributions
arse+0.031
 +0.031
gency+0.029
 might+0.027
 pleasure+0.027
Neg logit contributions
ate-0.019
ires-0.016
ia-0.014
mill-0.013
iring-0.013
Token big
Token ID1263
Feature activation+1.37e+00

Pos logit contributions
arse+0.032
gency+0.031
 +0.030
 pleasure+0.028
 plunge+0.027
Neg logit contributions
ate-0.030
ires-0.025
mill-0.024
ia-0.024
der-0.024
Token arm
Token ID3211
Feature activation-5.14e-01

Pos logit contributions
arse+0.033
gency+0.031
 +0.031
 pleasure+0.029
 plunge+0.028
Neg logit contributions
ate-0.037
ires-0.031
der-0.030
mill-0.030
ia-0.030
Tokenchair
Token ID16337
Feature activation-2.20e-01

Pos logit contributions
ate+0.016
ires+0.014
ia+0.013
mill+0.013
iring+0.013
Neg logit contributions
 -0.011
arse-0.010
gency-0.010
 suitcase-0.009
 might-0.009
Token in
Token ID287
Feature activation+1.96e+00

Pos logit contributions
ate+0.005
ires+0.004
ia+0.004
iring+0.004
der+0.004
Neg logit contributions
arse-0.006
 -0.006
gency-0.006
 pleasure-0.005
 plunge-0.005
Token a
Token ID257
Feature activation+3.67e+00

Pos logit contributions
arse+0.063
gency+0.060
 +0.054
 pleasure+0.054
 plunge+0.054
Neg logit contributions
ate-0.043
ia-0.032
ires-0.032
 from-0.030
mill-0.030
Token small
Token ID1402
Feature activation+4.21e+00

Pos logit contributions
arse+0.137
gency+0.133
 plunge+0.127
 pleasure+0.127
're+0.117
Neg logit contributions
ate-0.054
ia-0.040
 also-0.038
her-0.038
 lady-0.036
Token room
Token ID2119
Feature activation-5.05e-01

Pos logit contributions
arse+0.141
gency+0.138
 pleasure+0.130
 plunge+0.129
 +0.120
Neg logit contributions
ate-0.075
 from-0.063
 and-0.056
mill-0.055
er-0.053
Token.
Token ID13
Feature activation+6.82e-01

Pos logit contributions
ate+0.014
ires+0.013
ia+0.012
der+0.012
iring+0.012
Neg logit contributions
 -0.011
gency-0.011
arse-0.010
 might-0.009
 suitcase-0.009
Token It
Token ID632
Feature activation+4.01e-01

Pos logit contributions
arse+0.015
gency+0.014
 +0.014
 pleasure+0.013
 plunge+0.012
Neg logit contributions
ate-0.020
ires-0.017
mill-0.017
ia-0.017
der-0.017
Token was
Token ID373
Feature activation+4.19e-01

Pos logit contributions
 +0.012
arse+0.011
gency+0.011
 might+0.010
 pleasure+0.010
Neg logit contributions
ate-0.009
ires-0.007
mill-0.007
der-0.007
iring-0.007
Token an
Token ID281
Feature activation+5.14e-01

Pos logit contributions
 +0.009
arse+0.008
gency+0.008
 pleasure+0.007
 might+0.007
Neg logit contributions
ate-0.013
der-0.011
ires-0.011
mill-0.011
ia-0.011
Token.
Token ID13
Feature activation+2.08e-01

Pos logit contributions
arse+0.017
 +0.017
gency+0.016
 pleasure+0.015
 plunge+0.015
Neg logit contributions
ate-0.016
ires-0.013
mill-0.013
der-0.012
ia-0.012
Token He
Token ID679
Feature activation-1.58e-01

Pos logit contributions
 +0.005
arse+0.005
gency+0.005
 pleasure+0.004
 plunge+0.004
Neg logit contributions
ate-0.006
mill-0.005
ires-0.005
ia-0.005
der-0.005
Token tried
Token ID3088
Feature activation-2.84e-01

Pos logit contributions
ate+0.004
ires+0.003
mill+0.003
der+0.003
ia+0.003
Neg logit contributions
arse-0.004
 -0.004
gency-0.004
 pleasure-0.004
 plunge-0.004
Token to
Token ID284
Feature activation-1.19e+00

Pos logit contributions
ate+0.008
ires+0.007
ia+0.006
mill+0.006
der+0.006
Neg logit contributions
arse-0.007
 -0.007
gency-0.007
 pleasure-0.006
 plunge-0.006
Token out
Token ID503
Feature activation+9.12e-02

Pos logit contributions
ate+0.034
ires+0.030
der+0.029
mill+0.028
ia+0.028
Neg logit contributions
arse-0.029
gency-0.026
 -0.026
 plunge-0.024
 pleasure-0.024
Tokenrun
Token ID5143
Feature activation+4.18e+00

Pos logit contributions
arse+0.002
 +0.002
gency+0.002
 pleasure+0.002
 plunge+0.002
Neg logit contributions
ate-0.002
ires-0.002
mill-0.002
ia-0.002
der-0.002
Token the
Token ID262
Feature activation-5.50e-01

Pos logit contributions
arse+0.115
gency+0.105
 pleasure+0.099
 suitcase+0.097
 +0.095
Neg logit contributions
ate-0.108
 from-0.086
ia-0.084
 about-0.082
ires-0.081
Token train
Token ID4512
Feature activation+5.27e-01

Pos logit contributions
ate+0.011
 from+0.009
ia+0.009
ires+0.009
iring+0.009
Neg logit contributions
gency-0.016
arse-0.016
 -0.015
 pleasure-0.014
eeee-0.013
Token but
Token ID475
Feature activation+9.18e-01

Pos logit contributions
arse+0.014
 +0.013
gency+0.013
 pleasure+0.012
 plunge+0.012
Neg logit contributions
ate-0.013
ires-0.011
ia-0.011
mill-0.010
der-0.010
Token he
Token ID339
Feature activation+9.82e-01

Pos logit contributions
arse+0.023
gency+0.021
 +0.020
 pleasure+0.019
 plunge+0.018
Neg logit contributions
ate-0.025
ires-0.021
mill-0.021
ia-0.021
der-0.020
Token was
Token ID373
Feature activation+3.34e-01

Pos logit contributions
arse+0.021
 +0.021
gency+0.020
 might+0.018
 pleasure+0.018
Neg logit contributions
ate-0.030
ires-0.026
mill-0.025
der-0.024
ia-0.024
Token the
Token ID262
Feature activation+5.27e-01

Pos logit contributions
arse+0.023
 +0.022
gency+0.021
 pleasure+0.020
 plunge+0.019
Neg logit contributions
ate-0.023
ires-0.019
mill-0.019
der-0.019
ia-0.018
Token thunder
Token ID18355
Feature activation+9.82e-01

Pos logit contributions
arse+0.015
gency+0.014
 +0.013
 pleasure+0.013
 plunge+0.012
Neg logit contributions
ate-0.012
ires-0.010
mill-0.010
ia-0.010
der-0.010
Token bo
Token ID1489
Feature activation+1.16e+00

Pos logit contributions
 +0.024
arse+0.024
gency+0.022
 might+0.022
 plunge+0.021
Neg logit contributions
ate-0.028
ires-0.024
ia-0.021
iring-0.020
urse-0.020
Tokenomed
Token ID12657
Feature activation+1.85e+00

Pos logit contributions
arse+0.026
gency+0.024
 +0.023
 plunge+0.021
 might+0.019
Neg logit contributions
ate-0.035
ires-0.030
der-0.028
mill-0.028
ia-0.028
Token high
Token ID1029
Feature activation+3.83e-01

Pos logit contributions
arse+0.049
 +0.045
gency+0.041
 plunge+0.040
 pleasure+0.039
Neg logit contributions
ate-0.048
ires-0.041
ia-0.039
mill-0.038
der-0.037
Token above
Token ID2029
Feature activation+4.14e+00

Pos logit contributions
arse+0.009
 +0.009
gency+0.009
 plunge+0.008
 might+0.008
Neg logit contributions
ate-0.010
ires-0.008
ia-0.008
mill-0.008
urse-0.008
Token her
Token ID607
Feature activation+4.69e-01

Pos logit contributions
arse+0.118
gency+0.115
 +0.103
 pleasure+0.102
 plunge+0.099
Neg logit contributions
ate-0.098
ia-0.081
ires-0.079
 from-0.079
 about-0.078
Token.
Token ID13
Feature activation+1.53e+00

Pos logit contributions
arse+0.012
 +0.012
gency+0.012
 pleasure+0.011
 plunge+0.010
Neg logit contributions
ate-0.012
ires-0.010
mill-0.010
ia-0.010
der-0.009
Token She
Token ID1375
Feature activation-3.97e-01

Pos logit contributions
arse+0.036
gency+0.033
 +0.033
 pleasure+0.030
 plunge+0.029
Neg logit contributions
ate-0.043
ires-0.036
mill-0.036
ia-0.035
der-0.035
Token wanted
Token ID2227
Feature activation+1.22e+00

Pos logit contributions
ate+0.012
ires+0.011
mill+0.010
iring+0.010
resp+0.010
Neg logit contributions
 -0.009
arse-0.008
gency-0.008
 might-0.008
 suitcase-0.007
Token it
Token ID340
Feature activation+8.04e-01

Pos logit contributions
 +0.028
arse+0.027
gency+0.027
 plunge+0.025
 pleasure+0.025
Neg logit contributions
ate-0.035
ires-0.030
ia-0.029
der-0.028
mill-0.028
Token play
Token ID711
Feature activation-1.37e+00

Pos logit contributions
ate+0.027
der+0.026
ires+0.026
mill+0.025
iring+0.024
Neg logit contributions
 -0.020
arse-0.020
gency-0.018
 plunge-0.018
 pleasure-0.018
Token a
Token ID257
Feature activation+4.29e-01

Pos logit contributions
ate+0.034
ires+0.029
mill+0.028
der+0.028
ia+0.028
Neg logit contributions
arse-0.035
 -0.034
gency-0.033
 plunge-0.031
 pleasure-0.031
Token game
Token ID983
Feature activation-1.93e+00

Pos logit contributions
arse+0.012
gency+0.012
 +0.011
 pleasure+0.010
 plunge+0.010
Neg logit contributions
ate-0.010
ia-0.009
ires-0.008
mill-0.008
der-0.008
Token called
Token ID1444
Feature activation+3.61e-01

Pos logit contributions
ate+0.054
ires+0.047
iring+0.046
der+0.046
mill+0.045
Neg logit contributions
 -0.040
arse-0.037
gency-0.036
 might-0.035
 plunge-0.034
Token "
Token ID366
Feature activation+1.73e+00

Pos logit contributions
arse+0.009
 +0.009
gency+0.009
 pleasure+0.008
 plunge+0.008
Neg logit contributions
ate-0.009
ires-0.008
mill-0.008
der-0.008
iring-0.008
Tokenrun
Token ID5143
Feature activation+4.13e+00

Pos logit contributions
arse+0.038
gency+0.036
 +0.035
 pleasure+0.031
 plunge+0.031
Neg logit contributions
ate-0.051
ires-0.044
ia-0.043
der-0.043
mill-0.042
Token and
Token ID290
Feature activation-2.34e-01

Pos logit contributions
arse+0.121
gency+0.108
 pleasure+0.105
 suitcase+0.101
 plunge+0.097
Neg logit contributions
ate-0.105
ia-0.082
 from-0.077
 about-0.076
ires-0.075
Token catch
Token ID4929
Feature activation+2.31e-01

Pos logit contributions
ate+0.007
ires+0.006
iring+0.006
der+0.006
mill+0.006
Neg logit contributions
 -0.005
gency-0.005
arse-0.005
 pleasure-0.005
 plunge-0.005
Token".
Token ID1911
Feature activation-3.48e-01

Pos logit contributions
arse+0.006
gency+0.006
 +0.006
 pleasure+0.005
 plunge+0.005
Neg logit contributions
ate-0.006
ires-0.005
ia-0.005
mill-0.005
der-0.004
Token Ben
Token ID3932
Feature activation-4.88e-01

Pos logit contributions
ate+0.009
mill+0.007
ires+0.007
ia+0.007
der+0.007
Neg logit contributions
arse-0.009
 -0.009
gency-0.009
 pleasure-0.008
 plunge-0.008
Token was
Token ID373
Feature activation+1.58e-01

Pos logit contributions
ate+0.012
ires+0.012
iring+0.011
mill+0.010
der+0.010
Neg logit contributions
 -0.013
 might-0.012
gency-0.011
arse-0.011
 pleasure-0.010
Token her
Token ID607
Feature activation-2.70e-01

Pos logit contributions
ate+0.009
ires+0.008
der+0.007
ia+0.007
iring+0.007
Neg logit contributions
 -0.006
arse-0.005
gency-0.005
 pleasure-0.005
 plunge-0.005
Token family
Token ID1641
Feature activation-3.66e-01

Pos logit contributions
ate+0.005
ia+0.004
ires+0.004
 from+0.004
mill+0.004
Neg logit contributions
arse-0.009
gency-0.008
 -0.008
 plunge-0.008
 pleasure-0.007
Token lived
Token ID5615
Feature activation-2.94e-01

Pos logit contributions
ate+0.009
ires+0.008
der+0.008
ia+0.008
iders+0.007
Neg logit contributions
 -0.010
arse-0.009
gency-0.009
 pleasure-0.008
 Bobby-0.008
Token in
Token ID287
Feature activation+1.26e+00

Pos logit contributions
ate+0.007
ires+0.005
der+0.005
ia+0.005
mill+0.005
Neg logit contributions
arse-0.008
 -0.008
gency-0.008
 pleasure-0.007
 plunge-0.007
Token a
Token ID257
Feature activation+3.61e+00

Pos logit contributions
arse+0.043
gency+0.040
 plunge+0.036
 +0.036
 pleasure+0.035
Neg logit contributions
ate-0.024
 from-0.019
mill-0.018
 about-0.018
ires-0.017
Token small
Token ID1402
Feature activation+4.06e+00

Pos logit contributions
arse+0.118
gency+0.117
 plunge+0.102
 pleasure+0.097
 +0.094
Neg logit contributions
ate-0.074
ia-0.056
ires-0.054
her-0.052
 from-0.052
Token town
Token ID3240
Feature activation+3.79e-01

Pos logit contributions
arse+0.117
gency+0.114
 +0.101
 plunge+0.100
 pleasure+0.099
Neg logit contributions
ate-0.097
 from-0.076
ires-0.075
mill-0.075
ia-0.074
Token.
Token ID13
Feature activation+3.51e-01

Pos logit contributions
 +0.009
arse+0.009
gency+0.008
 pleasure+0.008
 plunge+0.008
Neg logit contributions
ate-0.010
ires-0.008
ia-0.008
der-0.008
mill-0.008
Token One
Token ID1881
Feature activation-1.60e-01

Pos logit contributions
arse+0.009
 +0.009
gency+0.009
 pleasure+0.008
 plunge+0.008
Neg logit contributions
ate-0.009
ires-0.007
mill-0.007
der-0.007
ia-0.007
Token day
Token ID1110
Feature activation-8.33e-02

Pos logit contributions
ate+0.007
der+0.006
ires+0.006
mill+0.006
resp+0.006
Neg logit contributions
 -0.002
 pleasure-0.001
arse-0.001
 plunge-0.001
 might-0.001
Token,
Token ID11
Feature activation-4.56e-01

Pos logit contributions
ate+0.002
ires+0.002
mill+0.002
der+0.002
ia+0.002
Neg logit contributions
arse-0.002
 -0.002
gency-0.002
 pleasure-0.002
 plunge-0.002
Token her
Token ID607
Feature activation-1.98e-01

Pos logit contributions
ate+0.004
ires+0.003
ia+0.003
mill+0.003
der+0.003
Neg logit contributions
arse-0.003
 -0.003
gency-0.003
 pleasure-0.003
 plunge-0.003
Token toy
Token ID13373
Feature activation+6.76e-01

Pos logit contributions
ate+0.006
ires+0.005
mill+0.005
ia+0.005
der+0.005
Neg logit contributions
arse-0.005
gency-0.004
 -0.004
 pleasure-0.004
 plunge-0.004
Token phone
Token ID3072
Feature activation+1.08e+00

Pos logit contributions
arse+0.016
gency+0.015
 +0.015
 pleasure+0.014
 plunge+0.013
Neg logit contributions
ate-0.019
ires-0.016
mill-0.015
ia-0.015
der-0.015
Token in
Token ID287
Feature activation+2.15e+00

Pos logit contributions
arse+0.026
 +0.025
gency+0.025
 suitcase+0.022
 pleasure+0.022
Neg logit contributions
ate-0.029
ires-0.025
ia-0.024
mill-0.023
iring-0.023
Token her
Token ID607
Feature activation+1.78e+00

Pos logit contributions
arse+0.071
gency+0.065
 plunge+0.061
ksh+0.057
 pleasure+0.057
Neg logit contributions
ate-0.046
 from-0.037
 about-0.034
ia-0.032
ires-0.032
Token little
Token ID1310
Feature activation+4.05e+00

Pos logit contributions
arse+0.056
gency+0.052
 plunge+0.048
 +0.046
 pleasure+0.044
Neg logit contributions
ate-0.039
ires-0.031
ia-0.030
 from-0.029
 about-0.029
Token purse
Token ID27517
Feature activation+6.53e-01

Pos logit contributions
arse+0.116
gency+0.114
 plunge+0.101
 +0.100
 pleasure+0.097
Neg logit contributions
ate-0.091
mill-0.074
 from-0.073
er-0.070
ires-0.066
Token.
Token ID13
Feature activation+4.59e-01

Pos logit contributions
arse+0.017
gency+0.016
 +0.016
 pleasure+0.014
 plunge+0.014
Neg logit contributions
ate-0.017
ires-0.014
ia-0.013
mill-0.013
der-0.013
Token Lily
Token ID20037
Feature activation-5.27e-01

Pos logit contributions
arse+0.012
gency+0.011
 +0.010
 plunge+0.010
 pleasure+0.010
Neg logit contributions
ate-0.012
ires-0.010
ia-0.010
mill-0.009
der-0.009
Token's
Token ID338
Feature activation+1.10e-01

Pos logit contributions
ate+0.012
ires+0.011
iring+0.010
ia+0.010
der+0.010
Neg logit contributions
 -0.015
arse-0.014
gency-0.014
 might-0.013
 pleasure-0.013
Token favorite
Token ID4004
Feature activation+1.40e+00

Pos logit contributions
arse+0.003
gency+0.003
 +0.003
 pleasure+0.003
 plunge+0.002
Neg logit contributions
ate-0.003
ires-0.002
mill-0.002
ia-0.002
der-0.002
Token little
Token ID1310
Feature activation+1.67e+00

Pos logit contributions
arse+0.020
gency+0.019
 +0.018
 plunge+0.017
 pleasure+0.017
Neg logit contributions
ate-0.016
ia-0.013
ires-0.013
mill-0.012
der-0.012
Token mouse
Token ID10211
Feature activation-1.77e+00

Pos logit contributions
arse+0.049
gency+0.047
 +0.045
 pleasure+0.042
 plunge+0.041
Neg logit contributions
ate-0.038
mill-0.031
ia-0.031
ires-0.031
 from-0.029
Token in
Token ID287
Feature activation+1.66e+00

Pos logit contributions
ate+0.049
ires+0.042
iring+0.039
mill+0.038
der+0.038
Neg logit contributions
 -0.042
gency-0.038
arse-0.037
 might-0.035
 Bobby-0.033
Token her
Token ID607
Feature activation+1.85e+00

Pos logit contributions
arse+0.064
gency+0.060
 plunge+0.056
 pleasure+0.054
 suitcase+0.054
Neg logit contributions
ate-0.023
 from-0.018
 about-0.016
 not-0.016
ia-0.016
Token cozy
Token ID37438
Feature activation+3.50e+00

Pos logit contributions
arse+0.054
gency+0.053
 plunge+0.048
 +0.046
 pleasure+0.045
Neg logit contributions
ate-0.044
ia-0.035
ires-0.034
 from-0.032
mill-0.032
Token little
Token ID1310
Feature activation+3.99e+00

Pos logit contributions
arse+0.080
gency+0.076
 +0.069
 pleasure+0.066
 plunge+0.065
Neg logit contributions
ate-0.104
ires-0.086
ia-0.085
mill-0.085
 from-0.085
Token hole
Token ID7604
Feature activation-3.96e-01

Pos logit contributions
gency+0.127
arse+0.125
 pleasure+0.115
 plunge+0.114
 +0.108
Neg logit contributions
ate-0.079
 from-0.064
mill-0.059
iring-0.057
ia-0.057
Token in
Token ID287
Feature activation+1.45e+00

Pos logit contributions
ate+0.011
ires+0.009
mill+0.008
ia+0.008
der+0.008
Neg logit contributions
 -0.009
arse-0.009
gency-0.009
 might-0.008
 plunge-0.008
Token the
Token ID262
Feature activation+1.62e+00

Pos logit contributions
arse+0.048
gency+0.044
 plunge+0.040
 pleasure+0.040
 +0.039
Neg logit contributions
ate-0.029
 from-0.023
 about-0.023
ires-0.023
ia-0.022
Token wall
Token ID3355
Feature activation-3.98e-02

Pos logit contributions
arse+0.049
gency+0.046
 +0.042
 pleasure+0.041
 plunge+0.041
Neg logit contributions
ate-0.036
ires-0.029
 from-0.028
iring-0.028
der-0.028
Token of
Token ID286
Feature activation+9.33e-02

Pos logit contributions
ate+0.001
ires+0.001
mill+0.001
ia+0.001
der+0.001
Neg logit contributions
 -0.001
arse-0.001
gency-0.001
 pleasure-0.001
 plunge-0.001
Token do
Token ID466
Feature activation-6.54e-01

Pos logit contributions
ate+0.000
ires+0.000
iring+0.000
mill+0.000
ia+0.000
Neg logit contributions
 -0.000
arse-0.000
gency-0.000
 plunge-0.000
 might-0.000
Token it
Token ID340
Feature activation-6.38e-02

Pos logit contributions
ate+0.018
ires+0.016
mill+0.015
ia+0.015
der+0.015
Neg logit contributions
arse-0.015
gency-0.014
 -0.014
 pleasure-0.013
 plunge-0.013
Token.
Token ID13
Feature activation+9.32e-01

Pos logit contributions
ate+0.002
ires+0.001
ia+0.001
mill+0.001
der+0.001
Neg logit contributions
arse-0.002
 -0.002
gency-0.002
 pleasure-0.001
 plunge-0.001
Token She
Token ID1375
Feature activation-1.28e-01

Pos logit contributions
arse+0.029
gency+0.027
 plunge+0.025
 pleasure+0.024
 +0.023
Neg logit contributions
ate-0.019
ires-0.015
der-0.015
 about-0.015
iring-0.015
Token sc
Token ID629
Feature activation+8.70e-01

Pos logit contributions
ate+0.003
ires+0.003
mill+0.003
ia+0.002
der+0.002
Neg logit contributions
arse-0.003
 -0.003
gency-0.003
 pleasure-0.003
 plunge-0.003
Tokenrun
Token ID5143
Feature activation+3.99e+00

Pos logit contributions
arse+0.022
gency+0.020
 +0.020
 pleasure+0.019
 plunge+0.018
Neg logit contributions
ate-0.024
ires-0.020
mill-0.019
ia-0.019
der-0.019
Tokenched
Token ID1740
Feature activation+1.10e+00

Pos logit contributions
arse+0.105
gency+0.098
 +0.094
 pleasure+0.090
 plunge+0.088
Neg logit contributions
ate-0.104
ires-0.084
mill-0.083
ia-0.083
der-0.080
Token up
Token ID510
Feature activation-3.67e-01

Pos logit contributions
arse+0.034
gency+0.031
 pleasure+0.030
 lifetime+0.028
 +0.028
Neg logit contributions
ate-0.023
 from-0.020
 about-0.017
ires-0.017
mill-0.017
Token her
Token ID607
Feature activation-5.48e-01

Pos logit contributions
ate+0.008
ires+0.007
mill+0.006
ia+0.006
der+0.006
Neg logit contributions
arse-0.011
 -0.010
gency-0.010
 pleasure-0.009
 plunge-0.009
Token face
Token ID1986
Feature activation+8.93e-01

Pos logit contributions
ate+0.016
ires+0.014
ia+0.014
mill+0.013
der+0.013
Neg logit contributions
arse-0.012
gency-0.012
 -0.011
 pleasure-0.010
 plunge-0.010
Token.
Token ID13
Feature activation+8.06e-01

Pos logit contributions
arse+0.028
 +0.028
gency+0.027
 might+0.025
 pleasure+0.024
Neg logit contributions
ate-0.018
ires-0.014
ia-0.013
mill-0.012
iring-0.012
Token while
Token ID981
Feature activation+1.28e+00

Pos logit contributions
arse+0.029
gency+0.028
 +0.028
 pleasure+0.025
 plunge+0.025
Neg logit contributions
ate-0.027
ires-0.022
ia-0.021
mill-0.021
der-0.021
Token Molly
Token ID30236
Feature activation+8.02e-01

Pos logit contributions
arse+0.032
gency+0.030
 +0.030
 pleasure+0.028
 plunge+0.027
Neg logit contributions
ate-0.034
ires-0.028
mill-0.027
ia-0.027
der-0.027
Token wasn
Token ID2492
Feature activation+2.56e+00

Pos logit contributions
arse+0.021
 +0.021
gency+0.020
 pleasure+0.018
 might+0.018
Neg logit contributions
ate-0.020
ires-0.017
iring-0.016
mill-0.016
der-0.016
Tokenâ
Token ID22940
Feature activation+2.44e+00

Pos logit contributions
arse+0.064
gency+0.061
 +0.057
 pleasure+0.055
 plunge+0.052
Neg logit contributions
ate-0.067
ires-0.056
ia-0.055
mill-0.055
 from-0.054
Token
Token ID26391
Feature activation+3.97e-01

Pos logit contributions
arse+0.042
gency+0.038
 +0.037
 pleasure+0.033
 plunge+0.031
Neg logit contributions
ate-0.085
ires-0.074
mill-0.073
ia-0.073
der-0.071
Token
Token ID8151
Feature activation+3.97e+00

Pos logit contributions
arse+0.011
gency+0.010
 plunge+0.010
 pleasure+0.010
 +0.010
Neg logit contributions
ate-0.010
ia-0.008
ires-0.007
mill-0.007
 about-0.007
Tokent
Token ID83
Feature activation+8.12e-01

Pos logit contributions
gency+0.114
arse+0.109
 pleasure+0.106
 suitcase+0.101
 plunge+0.099
Neg logit contributions
ate-0.078
 from-0.072
 about-0.071
ia-0.070
mill-0.067
Token looking
Token ID2045
Feature activation+1.30e+00

Pos logit contributions
arse+0.020
gency+0.019
 +0.019
 pleasure+0.017
 plunge+0.017
Neg logit contributions
ate-0.022
ires-0.019
ia-0.018
mill-0.018
der-0.018
Token.
Token ID13
Feature activation+9.94e-01

Pos logit contributions
arse+0.040
gency+0.038
 +0.035
 plunge+0.034
 pleasure+0.033
Neg logit contributions
ate-0.027
 from-0.023
 about-0.023
ires-0.023
der-0.022
Token
Token ID198
Feature activation+4.29e-01

Pos logit contributions
arse+0.024
gency+0.021
 plunge+0.019
 suitcase+0.018
charged+0.017
Neg logit contributions
ate-0.028
ires-0.024
 about-0.024
ia-0.024
der-0.023
Token
Token ID198
Feature activation+1.92e-01

Pos logit contributions
arse+0.012
gency+0.012
 pleasure+0.010
 +0.010
 plunge+0.010
Neg logit contributions
ate-0.010
ia-0.008
ires-0.008
 about-0.008
mill-0.008
Token his
Token ID465
Feature activation+7.45e-01

Pos logit contributions
ate+0.001
ires+0.001
der+0.001
ia+0.001
iring+0.001
Neg logit contributions
 -0.001
arse-0.000
gency-0.000
 pleasure-0.000
 Bobby-0.000
Token family
Token ID1641
Feature activation-2.36e-01

Pos logit contributions
arse+0.024
gency+0.023
 +0.021
 plunge+0.020
icer+0.019
Neg logit contributions
ate-0.014
ires-0.012
ia-0.012
 from-0.011
der-0.011
Token lived
Token ID5615
Feature activation-3.90e-01

Pos logit contributions
ate+0.006
ires+0.006
ia+0.005
der+0.005
iders+0.005
Neg logit contributions
 -0.006
arse-0.005
gency-0.005
 pleasure-0.005
 Bobby-0.005
Token in
Token ID287
Feature activation+1.33e+00

Pos logit contributions
ate+0.009
ires+0.008
der+0.007
ia+0.007
mill+0.007
Neg logit contributions
arse-0.011
 -0.010
gency-0.010
 pleasure-0.009
 plunge-0.009
Token a
Token ID257
Feature activation+3.53e+00

Pos logit contributions
arse+0.044
gency+0.041
 +0.037
 plunge+0.037
 pleasure+0.036
Neg logit contributions
ate-0.027
 from-0.020
mill-0.020
ires-0.019
 about-0.019
Token small
Token ID1402
Feature activation+3.96e+00

Pos logit contributions
arse+0.109
gency+0.108
 plunge+0.094
 pleasure+0.090
 +0.090
Neg logit contributions
ate-0.078
ia-0.060
ires-0.059
 from-0.056
mill-0.055
Token hut
Token ID40812
Feature activation+8.24e-02

Pos logit contributions
arse+0.108
gency+0.104
 +0.095
 plunge+0.092
 pleasure+0.091
Neg logit contributions
ate-0.100
ires-0.080
mill-0.079
ia-0.078
 from-0.077
Token in
Token ID287
Feature activation+1.89e+00

Pos logit contributions
arse+0.002
 +0.002
gency+0.002
 pleasure+0.002
 plunge+0.002
Neg logit contributions
ate-0.002
ires-0.002
ia-0.002
der-0.002
mill-0.002
Token the
Token ID262
Feature activation+2.03e+00

Pos logit contributions
arse+0.060
gency+0.056
 plunge+0.050
 +0.050
 pleasure+0.049
Neg logit contributions
ate-0.042
ires-0.031
 from-0.031
mill-0.030
ia-0.029
Token forest
Token ID8222
Feature activation+3.58e-01

Pos logit contributions
arse+0.054
gency+0.051
 +0.049
 pleasure+0.046
 plunge+0.045
Neg logit contributions
ate-0.052
ires-0.043
mill-0.041
ia-0.041
der-0.040
Token.
Token ID13
Feature activation+3.96e-01

Pos logit contributions
 +0.008
arse+0.008
gency+0.007
 pleasure+0.006
 might+0.006
Neg logit contributions
ate-0.010
ires-0.009
der-0.009
mill-0.009
ia-0.008
TokenThe
Token ID464
Feature activation+1.31e-01

Pos logit contributions
arse+0.027
gency+0.025
 pleasure+0.023
 plunge+0.023
 suitcase+0.022
Neg logit contributions
ate-0.017
ia-0.012
 from-0.012
 about-0.012
ires-0.011
Token little
Token ID1310
Feature activation+2.33e+00

Pos logit contributions
arse+0.004
gency+0.003
 +0.003
 pleasure+0.003
 plunge+0.003
Neg logit contributions
ate-0.003
ires-0.003
mill-0.003
ia-0.003
der-0.003
Token girl
Token ID2576
Feature activation-6.92e-01

Pos logit contributions
arse+0.072
gency+0.071
 +0.064
 plunge+0.060
 pleasure+0.060
Neg logit contributions
ate-0.050
mill-0.042
ia-0.039
ires-0.039
 from-0.038
Token was
Token ID373
Feature activation+9.81e-01

Pos logit contributions
ate+0.017
ires+0.015
iring+0.014
der+0.014
mill+0.014
Neg logit contributions
 -0.018
gency-0.017
arse-0.017
 might-0.016
 pleasure-0.015
Token in
Token ID287
Feature activation+1.81e+00

Pos logit contributions
arse+0.022
 +0.021
gency+0.020
 pleasure+0.019
 plunge+0.018
Neg logit contributions
ate-0.029
ires-0.025
der-0.024
ia-0.024
mill-0.024
Token a
Token ID257
Feature activation+3.96e+00

Pos logit contributions
arse+0.052
gency+0.049
 +0.046
 pleasure+0.044
 plunge+0.044
Neg logit contributions
ate-0.042
ires-0.034
mill-0.033
 from-0.033
ia-0.032
Token hurry
Token ID23290
Feature activation+3.56e-01

Pos logit contributions
gency+0.140
arse+0.139
 plunge+0.123
rified+0.116
 pleasure+0.116
Neg logit contributions
ate-0.067
 also-0.048
ia-0.048
her-0.045
 from-0.045
Token.
Token ID13
Feature activation+7.04e-01

Pos logit contributions
arse+0.009
 +0.008
gency+0.008
 plunge+0.007
 pleasure+0.007
Neg logit contributions
ate-0.009
ires-0.008
ia-0.008
der-0.008
mill-0.008
Token She
Token ID1375
Feature activation-3.95e-01

Pos logit contributions
arse+0.019
gency+0.018
 +0.018
 pleasure+0.017
 plunge+0.016
Neg logit contributions
ate-0.017
ires-0.014
mill-0.014
ia-0.014
der-0.014
Token wanted
Token ID2227
Feature activation+1.16e+00

Pos logit contributions
ate+0.010
ires+0.009
mill+0.008
der+0.008
iring+0.008
Neg logit contributions
 -0.010
gency-0.010
arse-0.010
 might-0.009
 pleasure-0.009
Token to
Token ID284
Feature activation-3.83e-01

Pos logit contributions
 +0.024
gency+0.023
arse+0.022
 pleasure+0.021
 plunge+0.021
Neg logit contributions
ate-0.035
ires-0.031
ia-0.030
mill-0.029
der-0.029
Token friends
Token ID2460
Feature activation-2.18e-01

Pos logit contributions
arse+0.006
gency+0.005
 +0.005
 plunge+0.005
 pleasure+0.005
Neg logit contributions
ate-0.006
ires-0.005
 about-0.005
ia-0.005
mill-0.005
Token who
Token ID508
Feature activation+1.61e+00

Pos logit contributions
ate+0.004
ires+0.003
mill+0.003
 about+0.003
der+0.003
Neg logit contributions
arse-0.007
gency-0.007
 -0.006
 plunge-0.006
 pleasure-0.006
Token lived
Token ID5615
Feature activation-1.25e-01

Pos logit contributions
 +0.040
 might+0.036
 pleasure+0.033
're+0.033
gency+0.032
Neg logit contributions
ate-0.042
ires-0.038
iring-0.037
der-0.037
ble-0.036
Token in
Token ID287
Feature activation+2.01e+00

Pos logit contributions
ate+0.003
der+0.002
ires+0.002
ia+0.002
mill+0.002
Neg logit contributions
arse-0.004
 -0.003
gency-0.003
 pleasure-0.003
 plunge-0.003
Token a
Token ID257
Feature activation+3.56e+00

Pos logit contributions
arse+0.064
gency+0.061
 +0.056
 plunge+0.055
 pleasure+0.053
Neg logit contributions
ate-0.043
ires-0.033
mill-0.031
 from-0.031
ia-0.031
Token small
Token ID1402
Feature activation+3.94e+00

Pos logit contributions
arse+0.117
gency+0.115
 plunge+0.101
 pleasure+0.099
 +0.095
Neg logit contributions
ate-0.076
ia-0.056
ires-0.056
 from-0.051
mill-0.051
Token village
Token ID7404
Feature activation+5.90e-01

Pos logit contributions
arse+0.109
gency+0.104
 +0.096
 pleasure+0.092
 plunge+0.092
Neg logit contributions
ate-0.099
ires-0.079
mill-0.078
ia-0.077
 from-0.077
Token.
Token ID13
Feature activation-3.97e-01

Pos logit contributions
arse+0.015
 +0.014
gency+0.014
 pleasure+0.012
 plunge+0.012
Neg logit contributions
ate-0.015
ires-0.013
ia-0.013
der-0.012
mill-0.012
Token They
Token ID1119
Feature activation+2.45e-01

Pos logit contributions
ate+0.013
ires+0.012
mill+0.012
der+0.011
iring+0.011
Neg logit contributions
arse-0.007
 -0.007
gency-0.006
 pleasure-0.006
 plunge-0.005
Token liked
Token ID8288
Feature activation-1.95e-01

Pos logit contributions
 +0.007
arse+0.006
gency+0.006
 might+0.006
 pleasure+0.006
Neg logit contributions
ate-0.006
ires-0.005
iring-0.005
der-0.005
ia-0.005
Token to
Token ID284
Feature activation-6.93e-02

Pos logit contributions
ate+0.006
ia+0.005
ires+0.005
der+0.005
mill+0.005
Neg logit contributions
 -0.004
gency-0.004
arse-0.004
 pleasure-0.004
 plunge-0.004
Token hated
Token ID16563
Feature activation-7.17e-01

Pos logit contributions
ate+0.009
ires+0.008
mill+0.007
iring+0.007
der+0.007
Neg logit contributions
 -0.009
arse-0.009
gency-0.009
 might-0.008
 pleasure-0.008
Token it
Token ID340
Feature activation+5.50e-01

Pos logit contributions
ate+0.021
ires+0.017
mill+0.017
ia+0.017
iring+0.016
Neg logit contributions
arse-0.016
 -0.016
gency-0.016
 pleasure-0.014
 plunge-0.014
Token.
Token ID13
Feature activation+2.18e-01

Pos logit contributions
arse+0.015
gency+0.014
 +0.014
 pleasure+0.012
 plunge+0.012
Neg logit contributions
ate-0.014
ires-0.012
ia-0.011
mill-0.011
der-0.011
Token He
Token ID679
Feature activation-2.74e-01

Pos logit contributions
arse+0.006
gency+0.006
 plunge+0.005
 pleasure+0.005
 suitcase+0.005
Neg logit contributions
ate-0.005
ires-0.004
ia-0.004
 about-0.004
der-0.004
Token sc
Token ID629
Feature activation+3.62e-01

Pos logit contributions
ate+0.007
ires+0.006
mill+0.006
iring+0.005
ia+0.005
Neg logit contributions
arse-0.007
gency-0.007
 -0.007
 might-0.006
 pleasure-0.006
Tokenrun
Token ID5143
Feature activation+3.93e+00

Pos logit contributions
arse+0.011
gency+0.011
 +0.011
 pleasure+0.010
 plunge+0.010
Neg logit contributions
ate-0.007
mill-0.006
ires-0.006
der-0.005
ia-0.005
Tokenched
Token ID1740
Feature activation+3.56e-01

Pos logit contributions
arse+0.104
gency+0.097
 +0.094
 pleasure+0.089
 plunge+0.087
Neg logit contributions
ate-0.102
ires-0.083
mill-0.082
ia-0.081
der-0.079
Token his
Token ID465
Feature activation+5.94e-01

Pos logit contributions
arse+0.011
gency+0.010
 pleasure+0.009
 +0.009
 plunge+0.009
Neg logit contributions
ate-0.008
 from-0.006
ires-0.006
ia-0.006
 about-0.006
Token face
Token ID1986
Feature activation+8.03e-01

Pos logit contributions
arse+0.015
gency+0.013
 +0.013
 pleasure+0.012
 plunge+0.012
Neg logit contributions
ate-0.016
ia-0.014
ires-0.014
mill-0.013
der-0.013
Token up
Token ID510
Feature activation+3.35e-02

Pos logit contributions
arse+0.025
 +0.025
gency+0.024
 might+0.023
 suitcase+0.021
Neg logit contributions
ate-0.016
ires-0.013
ia-0.012
mill-0.011
iring-0.011
Token as
Token ID355
Feature activation+1.25e-01

Pos logit contributions
arse+0.001
 +0.001
gency+0.001
 pleasure+0.001
 forever+0.001
Neg logit contributions
ate-0.001
ires-0.001
ia-0.001
mill-0.000
resp-0.000
Tokenweb
Token ID12384
Feature activation+4.52e-01

Pos logit contributions
ate+0.050
ires+0.038
mill+0.033
der+0.032
oni+0.032
Neg logit contributions
gency-0.082
arse-0.082
 -0.081
 might-0.079
 plunge-0.076
Token.
Token ID13
Feature activation+5.47e-01

Pos logit contributions
arse+0.013
gency+0.012
 +0.012
 pleasure+0.011
 plunge+0.011
Neg logit contributions
ate-0.010
ires-0.008
mill-0.008
ia-0.008
der-0.008
Token
Token ID198
Feature activation-5.49e-02

Pos logit contributions
arse+0.016
gency+0.014
 suitcase+0.013
 plunge+0.013
 unnecessary+0.012
Neg logit contributions
ate-0.014
ia-0.012
 about-0.012
ires-0.011
der-0.010
Token
Token ID198
Feature activation-2.22e-01

Pos logit contributions
ate+0.001
ia+0.001
ires+0.001
 about+0.001
 from+0.001
Neg logit contributions
arse-0.002
gency-0.002
 pleasure-0.002
 suitcase-0.002
 plunge-0.002
TokenThe
Token ID464
Feature activation-1.29e+00

Pos logit contributions
ate+0.006
ires+0.005
mill+0.005
ia+0.005
der+0.005
Neg logit contributions
arse-0.005
gency-0.005
 -0.005
 pleasure-0.005
 plunge-0.004
Token cob
Token ID22843
Feature activation-4.64e+00

Pos logit contributions
ate+0.033
ires+0.027
mill+0.027
der+0.026
ia+0.025
Neg logit contributions
arse-0.033
 -0.031
gency-0.031
 pleasure-0.029
 plunge-0.029
Tokenweb
Token ID12384
Feature activation-4.68e-01

Pos logit contributions
ate+0.099
oni+0.083
resp+0.081
ires+0.078
aff+0.077
Neg logit contributions
 might-0.104
 -0.100
gency-0.099
 minute-0.098
robat-0.095
Token was
Token ID373
Feature activation+5.71e-01

Pos logit contributions
ate+0.012
ires+0.011
iring+0.010
mill+0.009
der+0.009
Neg logit contributions
 -0.012
arse-0.011
 might-0.011
gency-0.011
 plunge-0.010
Token so
Token ID523
Feature activation+6.31e-01

Pos logit contributions
 +0.012
arse+0.012
 pleasure+0.011
gency+0.011
 might+0.010
Neg logit contributions
ate-0.017
ires-0.015
der-0.015
ia-0.014
resp-0.014
Token strong
Token ID1913
Feature activation-1.60e+00

Pos logit contributions
 +0.012
 pleasure+0.010
arse+0.010
 might+0.010
 Bobby+0.009
Neg logit contributions
ate-0.020
der-0.018
resp-0.018
mill-0.018
iders-0.017
Token that
Token ID326
Feature activation+1.23e+00

Pos logit contributions
ate+0.037
ia+0.029
ires+0.029
urse+0.028
mill+0.026
Neg logit contributions
 -0.044
arse-0.043
 might-0.040
gency-0.040
 Bobby-0.040
Token monkey
Token ID21657
Feature activation-5.99e-01

Pos logit contributions
ate+0.019
ires+0.016
der+0.015
mill+0.015
ia+0.014
Neg logit contributions
arse-0.021
 -0.021
gency-0.020
 pleasure-0.019
 plunge-0.018
Token and
Token ID290
Feature activation+2.55e-01

Pos logit contributions
ires+0.017
ate+0.016
iring+0.015
iders+0.013
ble+0.013
Neg logit contributions
 -0.016
 might-0.013
arse-0.012
gency-0.011
 forever-0.011
Token elephant
Token ID20950
Feature activation-4.67e-01

Pos logit contributions
arse+0.007
gency+0.006
 +0.006
 pleasure+0.006
 plunge+0.006
Neg logit contributions
ate-0.006
ires-0.005
mill-0.005
ia-0.005
der-0.005
Token played
Token ID2826
Feature activation-1.97e+00

Pos logit contributions
ate+0.012
ires+0.011
iring+0.010
der+0.010
mill+0.010
Neg logit contributions
arse-0.012
 -0.011
gency-0.011
 pleasure-0.010
 plunge-0.010
Token all
Token ID477
Feature activation-8.73e-01

Pos logit contributions
ate+0.054
ires+0.046
mill+0.045
ia+0.044
der+0.044
Neg logit contributions
arse-0.047
 -0.045
gency-0.045
 pleasure-0.041
 plunge-0.040
Token sorts
Token ID10524
Feature activation-4.63e+00

Pos logit contributions
ate+0.030
ires+0.026
der+0.026
mill+0.025
ia+0.025
Neg logit contributions
arse-0.015
 -0.014
gency-0.014
 pleasure-0.012
 plunge-0.012
Token of
Token ID286
Feature activation-1.96e+00

Pos logit contributions
ate+0.093
mill+0.071
urse+0.065
ia+0.065
ires+0.064
Neg logit contributions
 -0.135
arse-0.132
 pleasure-0.127
 dearly-0.122
 plunge-0.121
Token fun
Token ID1257
Feature activation-2.33e+00

Pos logit contributions
ate+0.048
ires+0.039
ia+0.038
mill+0.037
der+0.037
Neg logit contributions
arse-0.056
gency-0.053
 -0.051
 pleasure-0.047
 plunge-0.046
Token games
Token ID1830
Feature activation-2.02e+00

Pos logit contributions
ate+0.059
ires+0.049
mill+0.048
ia+0.047
der+0.047
Neg logit contributions
arse-0.060
gency-0.057
 -0.057
 pleasure-0.052
 plunge-0.051
Token.
Token ID13
Feature activation-8.68e-01

Pos logit contributions
ate+0.054
ires+0.046
mill+0.044
ia+0.043
iring+0.043
Neg logit contributions
arse-0.049
 -0.047
gency-0.046
 pleasure-0.043
 plunge-0.041
Token They
Token ID1119
Feature activation-6.40e-01

Pos logit contributions
ate+0.022
mill+0.018
ia+0.018
ires+0.018
ble+0.018
Neg logit contributions
 -0.023
arse-0.022
gency-0.021
 pleasure-0.020
 might-0.018
Tokenak
Token ID461
Feature activation+1.37e+00

Pos logit contributions
arse+0.004
gency+0.004
 +0.004
 pleasure+0.004
 plunge+0.004
Neg logit contributions
ate-0.004
ires-0.003
der-0.003
mill-0.003
ia-0.003
Token.
Token ID13
Feature activation+6.00e-01

Pos logit contributions
arse+0.034
 +0.032
gency+0.032
 pleasure+0.029
 plunge+0.028
Neg logit contributions
ate-0.037
ires-0.031
ia-0.030
mill-0.029
der-0.028
Token
Token ID198
Feature activation+1.02e-01

Pos logit contributions
arse+0.015
gency+0.014
 +0.014
 pleasure+0.013
 plunge+0.012
Neg logit contributions
ate-0.016
ires-0.014
mill-0.013
ia-0.013
der-0.013
Token
Token ID198
Feature activation-1.49e-01

Pos logit contributions
arse+0.003
gency+0.003
 pleasure+0.003
 +0.003
 plunge+0.002
Neg logit contributions
ate-0.002
ia-0.002
ires-0.002
mill-0.002
 about-0.002
TokenThe
Token ID464
Feature activation-1.16e+00

Pos logit contributions
ate+0.004
ires+0.003
der+0.003
mill+0.003
ia+0.003
Neg logit contributions
 -0.004
arse-0.004
gency-0.004
 plunge-0.003
 pleasure-0.003
Token cob
Token ID22843
Feature activation-4.43e+00

Pos logit contributions
ate+0.031
ires+0.026
mill+0.025
der+0.025
ia+0.025
Neg logit contributions
arse-0.029
gency-0.028
 -0.027
 pleasure-0.025
 plunge-0.025
Tokenweb
Token ID12384
Feature activation-2.52e-02

Pos logit contributions
ate+0.098
oni+0.086
resp+0.082
emo+0.081
aff+0.080
Neg logit contributions
 might-0.093
 minute-0.089
 -0.087
gency-0.086
robat-0.084
Token sh
Token ID427
Feature activation-2.12e-01

Pos logit contributions
ate+0.001
ires+0.001
iring+0.001
mill+0.001
urse+0.001
Neg logit contributions
 -0.001
arse-0.001
 might-0.001
gency-0.001
 plunge-0.001
Tokenivered
Token ID6396
Feature activation+5.58e-02

Pos logit contributions
ate+0.005
 from+0.004
der+0.004
mill+0.004
ires+0.004
Neg logit contributions
arse-0.006
gency-0.005
 -0.005
 pleasure-0.005
charged-0.005
Token and
Token ID290
Feature activation+1.47e+00

Pos logit contributions
arse+0.002
 +0.002
gency+0.001
 pleasure+0.001
 Bobby+0.001
Neg logit contributions
ate-0.001
ires-0.001
ia-0.001
mill-0.001
ups-0.001
Token then
Token ID788
Feature activation+6.23e-01

Pos logit contributions
arse+0.040
gency+0.038
 +0.037
 pleasure+0.034
 plunge+0.034
Neg logit contributions
ate-0.037
ires-0.030
mill-0.029
ia-0.029
der-0.029
Token pretty
Token ID2495
Feature activation+4.03e-01

Pos logit contributions
arse+0.027
gency+0.025
 +0.025
 pleasure+0.023
 plunge+0.023
Neg logit contributions
ate-0.026
ires-0.022
mill-0.021
ia-0.021
der-0.021
Token v
Token ID410
Feature activation-2.52e+00

Pos logit contributions
 +0.008
arse+0.008
 suitcase+0.007
gency+0.007
 pleasure+0.007
Neg logit contributions
ate-0.012
ires-0.011
ups-0.010
der-0.010
ia-0.010
Tokenase
Token ID589
Feature activation-2.43e+00

Pos logit contributions
ate+0.064
ires+0.058
der+0.056
 about+0.054
mill+0.054
Neg logit contributions
arse-0.053
gency-0.052
 -0.050
icer-0.044
eeee-0.043
Token.
Token ID13
Feature activation-8.57e-01

Pos logit contributions
ate+0.068
ires+0.059
ia+0.058
der+0.056
emo+0.054
Neg logit contributions
arse-0.053
 -0.053
gency-0.049
 plunge-0.044
 suitcase-0.043
Token The
Token ID383
Feature activation-8.16e-01

Pos logit contributions
ate+0.025
mill+0.021
der+0.021
ires+0.021
iring+0.020
Neg logit contributions
 -0.021
gency-0.017
 might-0.016
 pleasure-0.016
arse-0.016
Token v
Token ID410
Feature activation-4.42e+00

Pos logit contributions
ate+0.022
ires+0.018
ia+0.018
mill+0.017
 from+0.017
Neg logit contributions
arse-0.022
gency-0.020
 -0.019
 pleasure-0.018
 plunge-0.018
Tokenase
Token ID589
Feature activation-1.95e+00

Pos logit contributions
ate+0.121
 from+0.117
 about+0.116
der+0.116
ires+0.116
Neg logit contributions
 -0.066
icer-0.065
gency-0.063
-0.052
robat-0.052
Token was
Token ID373
Feature activation-1.12e-01

Pos logit contributions
ate+0.059
ires+0.054
der+0.054
resp+0.053
aze+0.051
Neg logit contributions
 -0.042
 might-0.037
 may-0.028
gency-0.028
arse-0.028
Token blue
Token ID4171
Feature activation-1.01e-01

Pos logit contributions
ate+0.004
der+0.004
iders+0.004
ires+0.003
ia+0.003
Neg logit contributions
 -0.002
 pleasure-0.001
arse-0.001
gency-0.001
 might-0.001
Token and
Token ID290
Feature activation+3.98e-01

Pos logit contributions
ate+0.002
ires+0.002
ia+0.001
aze+0.001
mill+0.001
Neg logit contributions
 -0.003
arse-0.003
gency-0.003
 suitcase-0.003
 might-0.003
Token had
Token ID550
Feature activation-2.91e-01

Pos logit contributions
 +0.009
arse+0.008
gency+0.008
 pleasure+0.008
 might+0.008
Neg logit contributions
ate-0.011
ires-0.010
der-0.010
mill-0.009
ia-0.009
Token.
Token ID13
Feature activation+1.64e-01

Pos logit contributions
ate+0.054
mill+0.048
ires+0.047
der+0.047
ia+0.046
Neg logit contributions
 -0.067
arse-0.061
gency-0.058
 suitcase-0.054
 might-0.054
Token She
Token ID1375
Feature activation-1.03e+00

Pos logit contributions
arse+0.005
gency+0.004
 plunge+0.004
 +0.004
 pleasure+0.004
Neg logit contributions
ate-0.004
ires-0.003
ia-0.003
 about-0.003
der-0.003
Token would
Token ID561
Feature activation+6.20e-01

Pos logit contributions
ate+0.026
ires+0.022
mill+0.021
der+0.021
ia+0.020
Neg logit contributions
arse-0.027
 -0.026
gency-0.026
 pleasure-0.024
 plunge-0.023
Token design
Token ID1486
Feature activation-1.95e+00

Pos logit contributions
arse+0.015
 +0.015
gency+0.015
 pleasure+0.013
 plunge+0.013
Neg logit contributions
ate-0.016
ires-0.014
iring-0.013
der-0.013
mill-0.013
Token all
Token ID477
Feature activation-6.61e-01

Pos logit contributions
ate+0.050
mill+0.046
der+0.045
ires+0.043
ia+0.042
Neg logit contributions
 -0.046
arse-0.044
gency-0.042
 pleasure-0.039
 might-0.038
Token sorts
Token ID10524
Feature activation-4.42e+00

Pos logit contributions
ate+0.019
ires+0.016
der+0.015
mill+0.015
ia+0.015
Neg logit contributions
arse-0.015
 -0.014
gency-0.014
 pleasure-0.013
 plunge-0.013
Token of
Token ID286
Feature activation-1.65e+00

Pos logit contributions
ate+0.100
mill+0.085
urse+0.083
ia+0.077
der+0.076
Neg logit contributions
 -0.110
 pleasure-0.104
 dearly-0.101
 plunge-0.098
 Bobby-0.096
Token things
Token ID1243
Feature activation-2.50e+00

Pos logit contributions
ate+0.041
ires+0.034
mill+0.033
ia+0.033
der+0.033
Neg logit contributions
arse-0.044
gency-0.042
 -0.041
 pleasure-0.038
 plunge-0.037
Token like
Token ID588
Feature activation-3.87e-01

Pos logit contributions
ate+0.066
ia+0.058
urse+0.058
ross+0.056
ires+0.055
Neg logit contributions
 -0.057
 might-0.053
arse-0.049
 pleasure-0.047
 forever-0.047
Token flowers
Token ID12734
Feature activation-1.67e+00

Pos logit contributions
ate+0.011
ires+0.009
mill+0.009
der+0.009
iring+0.009
Neg logit contributions
 -0.009
gency-0.008
arse-0.008
 pleasure-0.008
 might-0.007
Token,
Token ID11
Feature activation-1.02e+00

Pos logit contributions
ate+0.046
ires+0.041
urse+0.038
mill+0.038
ross+0.038
Neg logit contributions
 -0.039
 might-0.036
arse-0.031
 forever-0.031
're-0.031
Token a
Token ID257
Feature activation+3.58e+00

Pos logit contributions
arse+0.054
gency+0.050
 +0.047
 pleasure+0.046
 plunge+0.045
Neg logit contributions
ate-0.040
ires-0.031
ia-0.030
mill-0.029
 from-0.029
Token small
Token ID1402
Feature activation+3.64e+00

Pos logit contributions
arse+0.130
gency+0.123
 pleasure+0.115
 plunge+0.115
 ease+0.106
Neg logit contributions
ate-0.065
her-0.046
ia-0.044
 lady-0.041
Some-0.038
Token house
Token ID2156
Feature activation-4.10e-01

Pos logit contributions
arse+0.121
gency+0.116
 pleasure+0.108
 plunge+0.106
 +0.101
Neg logit contributions
ate-0.074
 from-0.058
mill-0.053
er-0.052
ia-0.049
Token.
Token ID13
Feature activation+6.33e-01

Pos logit contributions
ate+0.011
ires+0.010
ia+0.010
der+0.009
resp+0.009
Neg logit contributions
 -0.009
gency-0.009
arse-0.009
 might-0.008
 plunge-0.007
Token The
Token ID383
Feature activation-6.79e-01

Pos logit contributions
arse+0.015
gency+0.014
 +0.013
 pleasure+0.012
 plunge+0.012
Neg logit contributions
ate-0.018
ires-0.016
mill-0.015
ia-0.015
der-0.015
Token v
Token ID410
Feature activation-4.41e+00

Pos logit contributions
ate+0.018
ia+0.014
ires+0.014
 from+0.014
mill+0.014
Neg logit contributions
arse-0.019
gency-0.017
 -0.016
 pleasure-0.015
 plunge-0.015
Tokenase
Token ID589
Feature activation-1.25e+00

Pos logit contributions
ate+0.122
ires+0.115
 about+0.115
der+0.114
 from+0.114
Neg logit contributions
icer-0.072
 -0.070
gency-0.064
arse-0.057
eeee-0.055
Token was
Token ID373
Feature activation+8.28e-01

Pos logit contributions
ate+0.034
ires+0.032
der+0.030
iring+0.029
resp+0.029
Neg logit contributions
 -0.030
 might-0.026
arse-0.023
gency-0.023
 suitcase-0.022
Token very
Token ID845
Feature activation+1.02e+00

Pos logit contributions
 +0.014
arse+0.013
gency+0.012
 pleasure+0.011
 might+0.010
Neg logit contributions
ate-0.028
der-0.026
ires-0.025
ia-0.025
mill-0.024
Token pretty
Token ID2495
Feature activation-5.86e-01

Pos logit contributions
arse+0.023
 +0.022
gency+0.021
 pleasure+0.020
 plunge+0.020
Neg logit contributions
ate-0.029
mill-0.024
der-0.024
ires-0.024
ia-0.023
Token and
Token ID290
Feature activation+1.01e+00

Pos logit contributions
ate+0.012
ires+0.010
ia+0.009
mill+0.009
der+0.008
Neg logit contributions
arse-0.017
 -0.017
gency-0.017
 pleasure-0.016
 might-0.015
Token,
Token ID11
Feature activation+1.14e-01

Pos logit contributions
arse+0.066
gency+0.063
ifying+0.060
 suitcase+0.059
 pleasure+0.059
Neg logit contributions
 about-0.023
ia-0.023
er-0.021
ate-0.020
:-0.020
Token Tim
Token ID5045
Feature activation+1.02e+00

Pos logit contributions
arse+0.003
gency+0.003
 +0.002
 pleasure+0.002
 plunge+0.002
Neg logit contributions
ate-0.003
ires-0.003
mill-0.003
ia-0.003
der-0.003
Tokenmy
Token ID1820
Feature activation+8.55e-02

Pos logit contributions
arse+0.027
gency+0.027
 pleasure+0.026
 plunge+0.026
 +0.025
Neg logit contributions
ate-0.025
ia-0.022
mill-0.021
 about-0.020
 from-0.020
Token.
Token ID13
Feature activation+5.03e-01

Pos logit contributions
arse+0.003
gency+0.002
 +0.002
 pleasure+0.002
 plunge+0.002
Neg logit contributions
ate-0.002
ia-0.001
 from-0.001
ires-0.001
 about-0.001
Token The
Token ID383
Feature activation-1.35e+00

Pos logit contributions
arse+0.012
gency+0.011
 +0.010
 plunge+0.009
 pleasure+0.009
Neg logit contributions
ate-0.015
ires-0.013
ia-0.012
mill-0.012
der-0.012
Token v
Token ID410
Feature activation-4.37e+00

Pos logit contributions
ate+0.039
ires+0.033
ia+0.032
mill+0.032
der+0.032
Neg logit contributions
arse-0.032
gency-0.030
 -0.029
 pleasure-0.026
 plunge-0.026
Tokenase
Token ID589
Feature activation-1.10e+00

Pos logit contributions
ate+0.116
 about+0.114
 from+0.113
ires+0.111
der+0.108
Neg logit contributions
gency-0.072
icer-0.072
 -0.067
arse-0.060
-0.058
Token must
Token ID1276
Feature activation-2.82e-01

Pos logit contributions
ate+0.031
ires+0.027
der+0.025
mill+0.025
iring+0.025
Neg logit contributions
arse-0.024
 might-0.024
 -0.024
gency-0.023
 plunge-0.021
Token remain
Token ID3520
Feature activation+3.82e-01

Pos logit contributions
ires+0.007
der+0.007
ate+0.007
mill+0.007
iring+0.007
Neg logit contributions
 -0.007
arse-0.006
 might-0.006
gency-0.006
 plunge-0.006
Token on
Token ID319
Feature activation+9.33e-01

Pos logit contributions
arse+0.010
gency+0.009
 +0.009
 pleasure+0.008
 plunge+0.008
Neg logit contributions
ate-0.010
ires-0.009
mill-0.008
ia-0.008
der-0.008
Token the
Token ID262
Feature activation-1.00e-01

Pos logit contributions
arse+0.025
gency+0.024
 +0.023
 pleasure+0.022
 plunge+0.021
Neg logit contributions
ate-0.024
ires-0.019
mill-0.019
ia-0.019
 from-0.019
Token.
Token ID13
Feature activation+7.23e-05

Pos logit contributions
ate+0.005
 from+0.004
mill+0.004
 about+0.004
ires+0.004
Neg logit contributions
arse-0.007
gency-0.007
 -0.006
 plunge-0.006
 pleasure-0.006
Token They
Token ID1119
Feature activation-2.51e-02

Pos logit contributions
arse+0.000
gency+0.000
 +0.000
 pleasure+0.000
 plunge+0.000
Neg logit contributions
ate-0.000
ires-0.000
mill-0.000
ia-0.000
der-0.000
Token would
Token ID561
Feature activation+5.66e-01

Pos logit contributions
ate+0.001
mill+0.000
 from+0.000
ia+0.000
 about+0.000
Neg logit contributions
arse-0.001
gency-0.001
 pleasure-0.001
 -0.001
 plunge-0.001
Token see
Token ID766
Feature activation-1.57e+00

Pos logit contributions
arse+0.015
gency+0.014
 +0.014
 pleasure+0.013
 plunge+0.012
Neg logit contributions
ate-0.015
ires-0.012
mill-0.012
ia-0.012
der-0.012
Token all
Token ID477
Feature activation-4.52e-01

Pos logit contributions
ate+0.040
ires+0.034
mill+0.033
ia+0.033
der+0.032
Neg logit contributions
arse-0.041
gency-0.039
 -0.038
 pleasure-0.035
 plunge-0.034
Token sorts
Token ID10524
Feature activation-4.36e+00

Pos logit contributions
ate+0.014
der+0.013
ires+0.012
ia+0.012
mill+0.012
Neg logit contributions
 -0.009
arse-0.009
gency-0.008
 pleasure-0.007
 plunge-0.007
Token of
Token ID286
Feature activation-1.61e+00

Pos logit contributions
ate+0.105
urse+0.088
mill+0.081
ia+0.081
der+0.076
Neg logit contributions
 -0.104
 Bobby-0.101
 pleasure-0.099
 plunge-0.098
 dearly-0.098
Token animals
Token ID4695
Feature activation-1.86e+00

Pos logit contributions
ate+0.030
ires+0.023
mill+0.022
ia+0.022
der+0.021
Neg logit contributions
arse-0.054
gency-0.051
 -0.051
 pleasure-0.048
 plunge-0.047
Token like
Token ID588
Feature activation-7.90e-02

Pos logit contributions
ate+0.046
ia+0.038
ires+0.038
urse+0.037
der+0.037
Neg logit contributions
 -0.050
arse-0.046
 might-0.046
 plunge-0.043
gency-0.043
Token lions
Token ID31420
Feature activation-6.97e-01

Pos logit contributions
ate+0.002
ires+0.001
mill+0.001
ia+0.001
der+0.001
Neg logit contributions
arse-0.003
gency-0.002
 -0.002
 pleasure-0.002
 plunge-0.002
Token,
Token ID11
Feature activation-4.50e-01

Pos logit contributions
ate+0.019
ires+0.016
mill+0.015
der+0.015
urse+0.015
Neg logit contributions
 -0.017
arse-0.016
 might-0.015
gency-0.015
 plunge-0.014
Token things
Token ID1243
Feature activation-4.69e-01

Pos logit contributions
ate+0.018
ires+0.015
der+0.014
mill+0.014
 from+0.014
Neg logit contributions
arse-0.017
gency-0.016
 -0.016
 pleasure-0.014
 plunge-0.014
Token.
Token ID13
Feature activation+2.91e-01

Pos logit contributions
ate+0.010
 from+0.008
ires+0.008
mill+0.008
 about+0.008
Neg logit contributions
arse-0.014
gency-0.014
 -0.013
 pleasure-0.013
 plunge-0.012
Token
Token ID198
Feature activation+4.24e-01

Pos logit contributions
arse+0.008
gency+0.008
 plunge+0.007
 unnecessary+0.006
charged+0.006
Neg logit contributions
ate-0.008
 about-0.006
ires-0.006
ia-0.006
der-0.006
Token
Token ID198
Feature activation-1.68e-01

Pos logit contributions
arse+0.015
gency+0.014
 pleasure+0.013
 suitcase+0.013
 plunge+0.013
Neg logit contributions
ate-0.008
ia-0.006
 about-0.006
ires-0.005
 from-0.005
TokenThe
Token ID464
Feature activation-8.74e-01

Pos logit contributions
ate+0.004
ires+0.004
ia+0.004
mill+0.004
 about+0.003
Neg logit contributions
arse-0.004
gency-0.004
 -0.004
 pleasure-0.004
 plunge-0.004
Token v
Token ID410
Feature activation-4.35e+00

Pos logit contributions
ate+0.023
ires+0.019
ia+0.018
mill+0.018
 from+0.018
Neg logit contributions
arse-0.023
gency-0.022
 -0.020
 pleasure-0.019
 plunge-0.019
Tokenase
Token ID589
Feature activation-1.50e+00

Pos logit contributions
ate+0.118
 about+0.115
ires+0.114
 from+0.111
der+0.110
Neg logit contributions
icer-0.074
 -0.071
gency-0.067
arse-0.063
day-0.058
Token belonged
Token ID19611
Feature activation-8.43e-01

Pos logit contributions
ate+0.040
ires+0.039
iring+0.036
der+0.035
resp+0.035
Neg logit contributions
 -0.036
 might-0.032
arse-0.030
gency-0.029
 suitcase-0.028
Token to
Token ID284
Feature activation-9.74e-01

Pos logit contributions
ate+0.022
der+0.019
ires+0.019
ia+0.019
mill+0.018
Neg logit contributions
arse-0.020
 -0.019
gency-0.019
 pleasure-0.018
 plunge-0.017
Token a
Token ID257
Feature activation+1.54e+00

Pos logit contributions
ate+0.025
ires+0.021
mill+0.021
ia+0.021
der+0.021
Neg logit contributions
arse-0.024
 -0.024
gency-0.023
 pleasure-0.022
 plunge-0.020
Token little
Token ID1310
Feature activation+1.52e+00

Pos logit contributions
arse+0.046
gency+0.043
 +0.041
 plunge+0.039
 pleasure+0.039
Neg logit contributions
ate-0.034
ires-0.027
mill-0.027
ia-0.027
 from-0.026
Token chased
Token ID26172
Feature activation+1.36e+00

Pos logit contributions
 +0.012
 Bobby+0.010
gency+0.009
arse+0.009
 might+0.009
Neg logit contributions
ate-0.014
ires-0.013
iders-0.011
iring-0.011
ia-0.011
Token after
Token ID706
Feature activation-6.39e-02

Pos logit contributions
arse+0.044
gency+0.040
 pleasure+0.037
 suitcase+0.037
icer+0.035
Neg logit contributions
 from-0.029
ate-0.026
 and-0.023
 with-0.023
 about-0.022
Token it
Token ID340
Feature activation-2.65e-01

Pos logit contributions
ate+0.002
 from+0.001
ires+0.001
der+0.001
 about+0.001
Neg logit contributions
arse-0.002
gency-0.002
 pleasure-0.001
 -0.001
 suitcase-0.001
Token.
Token ID13
Feature activation+5.17e-01

Pos logit contributions
ate+0.006
 from+0.005
ia+0.005
 about+0.005
ires+0.005
Neg logit contributions
arse-0.008
gency-0.008
 -0.007
 pleasure-0.007
 plunge-0.007
Token Finally
Token ID9461
Feature activation-1.17e+00

Pos logit contributions
arse+0.014
gency+0.013
 plunge+0.011
 pleasure+0.011
 +0.011
Neg logit contributions
ate-0.013
ires-0.011
der-0.011
 about-0.011
 from-0.011
Token after
Token ID706
Feature activation-4.34e+00

Pos logit contributions
ate+0.025
ia+0.020
 from+0.020
ires+0.019
der+0.019
Neg logit contributions
arse-0.036
gency-0.034
 -0.032
 pleasure-0.031
 plunge-0.031
Token a
Token ID257
Feature activation+4.12e-01

Pos logit contributions
ate+0.105
ires+0.084
mill+0.082
ia+0.078
iring+0.077
Neg logit contributions
 -0.121
arse-0.117
gency-0.114
 plunge-0.105
 Bobby-0.105
Token lot
Token ID1256
Feature activation-1.89e+00

Pos logit contributions
gency+0.010
arse+0.010
icer+0.009
 pleasure+0.009
charged+0.009
Neg logit contributions
 from-0.009
ia-0.009
ate-0.008
 ranger-0.008
 about-0.008
Token of
Token ID286
Feature activation-2.24e+00

Pos logit contributions
ate+0.048
mill+0.038
ires+0.037
ateful+0.035
ble+0.034
Neg logit contributions
 -0.049
 Bobby-0.044
 plunge-0.044
arse-0.044
 pleasure-0.042
Token running
Token ID2491
Feature activation-1.44e+00

Pos logit contributions
ate+0.065
ires+0.055
mill+0.054
ia+0.053
der+0.053
Neg logit contributions
arse-0.052
gency-0.049
 -0.048
 pleasure-0.044
 plunge-0.042
Token,
Token ID11
Feature activation-6.26e-01

Pos logit contributions
ate+0.041
ires+0.034
ia+0.033
mill+0.032
der+0.032
Neg logit contributions
arse-0.034
 -0.034
gency-0.032
 plunge-0.030
 pleasure-0.029
Token in
Token ID287
Feature activation+1.44e+00

Pos logit contributions
ate+0.051
ires+0.042
ia+0.041
der+0.041
mill+0.039
Neg logit contributions
arse-0.056
 -0.054
gency-0.053
 plunge-0.047
 pleasure-0.046
Token the
Token ID262
Feature activation+1.74e+00

Pos logit contributions
arse+0.046
gency+0.043
 pleasure+0.040
 +0.040
 plunge+0.040
Neg logit contributions
ate-0.032
ires-0.023
ia-0.023
mill-0.023
 from-0.023
Token park
Token ID3952
Feature activation-6.54e-01

Pos logit contributions
arse+0.052
gency+0.049
 +0.045
 pleasure+0.045
 plunge+0.044
Neg logit contributions
ate-0.041
ires-0.031
mill-0.031
ia-0.031
 from-0.030
Token.
Token ID13
Feature activation-3.58e-01

Pos logit contributions
ate+0.018
ires+0.015
der+0.015
mill+0.015
ia+0.014
Neg logit contributions
 -0.016
arse-0.015
gency-0.015
 plunge-0.013
 pleasure-0.013
Token The
Token ID383
Feature activation-7.95e-01

Pos logit contributions
ate+0.010
ires+0.008
der+0.008
mill+0.008
ia+0.008
Neg logit contributions
 -0.009
arse-0.008
gency-0.008
 pleasure-0.007
 plunge-0.007
Token v
Token ID410
Feature activation-4.34e+00

Pos logit contributions
ate+0.022
ires+0.018
ia+0.018
mill+0.017
 from+0.017
Neg logit contributions
arse-0.021
gency-0.019
 -0.018
 pleasure-0.017
 plunge-0.017
Tokenase
Token ID589
Feature activation-1.56e+00

Pos logit contributions
ate+0.117
 from+0.112
der+0.111
 about+0.111
ires+0.110
Neg logit contributions
icer-0.069
 -0.067
gency-0.061
arse-0.053
eeee-0.052
Token was
Token ID373
Feature activation+2.96e-01

Pos logit contributions
ate+0.043
ires+0.038
der+0.037
mill+0.036
resp+0.035
Neg logit contributions
 -0.037
 might-0.032
arse-0.030
gency-0.030
 hands-0.027
Token covered
Token ID5017
Feature activation-1.19e+00

Pos logit contributions
 +0.006
arse+0.005
gency+0.005
 might+0.004
 pleasure+0.004
Neg logit contributions
ate-0.010
der-0.008
ires-0.008
mill-0.008
ia-0.008
Token in
Token ID287
Feature activation+1.12e+00

Pos logit contributions
ate+0.024
ia+0.021
der+0.021
ires+0.020
resp+0.019
Neg logit contributions
 -0.033
arse-0.033
gency-0.030
 plunge-0.029
 Bobby-0.028
Token dirt
Token ID13647
Feature activation+1.79e-01

Pos logit contributions
arse+0.029
gency+0.027
 +0.027
 pleasure+0.025
 plunge+0.024
Neg logit contributions
ate-0.030
ires-0.024
mill-0.024
ia-0.024
der-0.023
Token backyard
Token ID24296
Feature activation+5.11e-01

Pos logit contributions
arse+0.035
 +0.034
gency+0.033
 pleasure+0.031
 plunge+0.030
Neg logit contributions
ate-0.037
ires-0.032
der-0.031
ia-0.031
mill-0.031
Token.
Token ID13
Feature activation-6.62e-01

Pos logit contributions
 +0.012
arse+0.011
gency+0.010
 might+0.010
 pleasure+0.010
Neg logit contributions
ate-0.014
ia-0.012
ires-0.012
resp-0.012
der-0.012
Token He
Token ID679
Feature activation-3.13e-01

Pos logit contributions
ate+0.019
ires+0.016
ia+0.016
mill+0.016
der+0.016
Neg logit contributions
arse-0.015
gency-0.014
 -0.014
 pleasure-0.013
 plunge-0.012
Token saw
Token ID2497
Feature activation-2.41e+00

Pos logit contributions
ate+0.008
ires+0.007
der+0.006
mill+0.006
iring+0.006
Neg logit contributions
 -0.008
arse-0.008
gency-0.008
 might-0.007
 pleasure-0.007
Token all
Token ID477
Feature activation-8.01e-01

Pos logit contributions
ate+0.070
ires+0.061
ia+0.060
mill+0.058
der+0.058
Neg logit contributions
 -0.052
arse-0.047
gency-0.047
 plunge-0.045
 pleasure-0.043
Token sorts
Token ID10524
Feature activation-4.31e+00

Pos logit contributions
ate+0.024
der+0.022
ia+0.021
ires+0.021
resp+0.021
Neg logit contributions
 -0.017
arse-0.015
 plunge-0.013
gency-0.013
 pleasure-0.013
Token of
Token ID286
Feature activation-2.05e+00

Pos logit contributions
ate+0.102
urse+0.085
mill+0.080
ia+0.078
ena+0.075
Neg logit contributions
 -0.108
 Bobby-0.101
 plunge-0.101
 pleasure-0.100
 dearly-0.100
Token interesting
Token ID3499
Feature activation-2.62e+00

Pos logit contributions
ate+0.053
mill+0.045
der+0.044
ires+0.044
ia+0.044
Neg logit contributions
arse-0.050
 -0.049
gency-0.047
 plunge-0.044
 pleasure-0.044
Token things
Token ID1243
Feature activation-2.41e+00

Pos logit contributions
ate+0.081
mill+0.072
der+0.071
ires+0.070
ia+0.070
Neg logit contributions
 -0.046
 might-0.042
arse-0.042
gency-0.041
 plunge-0.040
Token,
Token ID11
Feature activation-1.09e+00

Pos logit contributions
ate+0.070
ia+0.061
ires+0.061
urse+0.060
ross+0.058
Neg logit contributions
 -0.053
 might-0.050
arse-0.046
gency-0.045
 plunge-0.045
Token like
Token ID588
Feature activation-5.81e-01

Pos logit contributions
ate+0.031
ires+0.026
ia+0.026
mill+0.026
der+0.025
Neg logit contributions
 -0.025
arse-0.023
gency-0.023
 plunge-0.022
 pleasure-0.022
Token a
Token ID257
Feature activation+1.83e+00

Pos logit contributions
arse+0.032
gency+0.028
 +0.026
 pleasure+0.026
 plunge+0.026
Neg logit contributions
ate-0.023
 from-0.020
 about-0.019
ires-0.019
ia-0.019
Token cob
Token ID22843
Feature activation-2.97e+00

Pos logit contributions
arse+0.052
gency+0.048
 +0.046
 pleasure+0.042
 plunge+0.042
Neg logit contributions
ate-0.042
ia-0.035
mill-0.035
 from-0.034
der-0.034
Tokenweb
Token ID12384
Feature activation-2.69e-01

Pos logit contributions
ate+0.058
ires+0.043
mill+0.039
iring+0.036
ia+0.036
Neg logit contributions
gency-0.089
arse-0.085
 -0.082
 might-0.080
 plunge-0.079
Token.
Token ID13
Feature activation+3.65e-02

Pos logit contributions
ate+0.007
ires+0.006
ia+0.006
mill+0.006
iring+0.006
Neg logit contributions
arse-0.007
gency-0.006
 -0.006
 plunge-0.006
 pleasure-0.006
Token The
Token ID383
Feature activation-6.26e-01

Pos logit contributions
arse+0.001
gency+0.001
 plunge+0.001
 pleasure+0.001
 suitcase+0.001
Neg logit contributions
ate-0.001
 about-0.001
ires-0.001
ia-0.001
der-0.001
Token cob
Token ID22843
Feature activation-4.29e+00

Pos logit contributions
ate+0.016
ia+0.013
ires+0.013
der+0.013
mill+0.013
Neg logit contributions
arse-0.017
gency-0.016
 -0.016
 pleasure-0.014
 plunge-0.014
Tokenweb
Token ID12384
Feature activation-2.87e-01

Pos logit contributions
ate+0.089
aze+0.070
oni+0.068
resp+0.067
ires+0.067
Neg logit contributions
gency-0.105
 might-0.104
 -0.099
icer-0.098
lers-0.095
Token was
Token ID373
Feature activation+7.28e-01

Pos logit contributions
ate+0.008
ires+0.007
mill+0.007
iring+0.007
urse+0.006
Neg logit contributions
 -0.007
 might-0.007
arse-0.006
gency-0.006
 may-0.006
Token very
Token ID845
Feature activation+7.11e-01

Pos logit contributions
 +0.013
arse+0.013
gency+0.012
 pleasure+0.011
 might+0.011
Neg logit contributions
ate-0.024
ires-0.021
der-0.020
mill-0.020
ia-0.020
Token thin
Token ID7888
Feature activation-9.59e-01

Pos logit contributions
 +0.014
arse+0.014
gency+0.012
 pleasure+0.012
 might+0.012
Neg logit contributions
ate-0.022
mill-0.019
der-0.019
ires-0.018
ia-0.017
Token and
Token ID290
Feature activation+7.57e-01

Pos logit contributions
ate+0.018
ires+0.013
ia+0.012
mill+0.012
urse+0.011
Neg logit contributions
arse-0.031
 -0.031
gency-0.030
 pleasure-0.029
 might-0.029
Token stream
Token ID4269
Feature activation-1.07e+00

Pos logit contributions
ate+0.029
ires+0.024
mill+0.024
der+0.024
ia+0.023
Neg logit contributions
arse-0.025
 -0.024
gency-0.023
 plunge-0.021
 pleasure-0.021
Token.
Token ID13
Feature activation-9.31e-03

Pos logit contributions
ate+0.028
ires+0.025
mill+0.023
iring+0.022
der+0.022
Neg logit contributions
 -0.027
arse-0.025
gency-0.024
 plunge-0.023
 might-0.023
Token He
Token ID679
Feature activation-1.86e-01

Pos logit contributions
ate+0.000
ires+0.000
ia+0.000
der+0.000
mill+0.000
Neg logit contributions
arse-0.000
gency-0.000
 -0.000
 pleasure-0.000
 plunge-0.000
Token met
Token ID1138
Feature activation-7.42e-01

Pos logit contributions
ate+0.005
ires+0.005
iring+0.004
der+0.004
mill+0.004
Neg logit contributions
 -0.004
arse-0.004
gency-0.004
 might-0.004
 plunge-0.004
Token all
Token ID477
Feature activation-3.92e-01

Pos logit contributions
ate+0.020
ires+0.018
mill+0.017
der+0.017
ia+0.017
Neg logit contributions
 -0.017
arse-0.016
gency-0.015
 pleasure-0.015
 plunge-0.015
Token sorts
Token ID10524
Feature activation-4.28e+00

Pos logit contributions
ate+0.011
der+0.010
ires+0.009
ia+0.009
resp+0.009
Neg logit contributions
 -0.009
arse-0.008
 plunge-0.008
 pleasure-0.007
gency-0.007
Token of
Token ID286
Feature activation-2.14e+00

Pos logit contributions
ate+0.104
urse+0.083
mill+0.082
ena+0.074
iders+0.072
Neg logit contributions
 -0.106
 plunge-0.100
 Bobby-0.098
 dearly-0.097
 pleasure-0.097
Token animals
Token ID4695
Feature activation-1.97e+00

Pos logit contributions
ate+0.054
mill+0.046
der+0.045
ires+0.044
ia+0.043
Neg logit contributions
arse-0.053
 -0.052
gency-0.050
 plunge-0.049
 pleasure-0.048
Token and
Token ID290
Feature activation-7.20e-01

Pos logit contributions
ate+0.050
ires+0.040
ia+0.038
mill+0.038
urse+0.038
Neg logit contributions
 -0.052
arse-0.049
 might-0.046
gency-0.046
 plunge-0.046
Token sw
Token ID1509
Feature activation-2.82e-01

Pos logit contributions
ate+0.019
ires+0.016
mill+0.015
ia+0.014
der+0.014
Neg logit contributions
 -0.019
gency-0.017
arse-0.017
 might-0.017
 plunge-0.016
Tokenam
Token ID321
Feature activation+6.48e-01

Pos logit contributions
ate+0.005
mill+0.004
der+0.004
ires+0.004
 from+0.004
Neg logit contributions
arse-0.009
gency-0.009
 -0.009
 pleasure-0.008
 plunge-0.008
Token
Token ID198
Feature activation-2.08e-01

Pos logit contributions
ate+0.001
ires+0.001
der+0.001
mill+0.001
 about+0.001
Neg logit contributions
arse-0.001
gency-0.001
 plunge-0.001
 -0.001
 pleasure-0.001
Token
Token ID198
Feature activation-5.97e-01

Pos logit contributions
ate+0.004
ia+0.003
 about+0.003
ires+0.003
mill+0.003
Neg logit contributions
arse-0.007
gency-0.006
 pleasure-0.006
 plunge-0.006
 suitcase-0.006
TokenHe
Token ID1544
Feature activation-5.77e-01

Pos logit contributions
ate+0.018
ires+0.015
mill+0.015
ia+0.014
der+0.014
Neg logit contributions
arse-0.014
gency-0.013
 -0.013
 pleasure-0.011
 plunge-0.011
Token did
Token ID750
Feature activation-1.36e+00

Pos logit contributions
ate+0.016
ires+0.016
iring+0.015
iders+0.014
der+0.014
Neg logit contributions
 -0.013
gency-0.012
 might-0.012
arse-0.011
 suitcase-0.011
Token all
Token ID477
Feature activation-9.41e-01

Pos logit contributions
ate+0.024
mill+0.019
ires+0.019
iring+0.017
der+0.017
Neg logit contributions
 -0.047
gency-0.044
arse-0.044
 might-0.041
 pleasure-0.040
Token sorts
Token ID10524
Feature activation-4.27e+00

Pos logit contributions
ate+0.026
ires+0.024
der+0.023
ia+0.022
iring+0.022
Neg logit contributions
 -0.022
arse-0.021
gency-0.020
 pleasure-0.018
 plunge-0.018
Token of
Token ID286
Feature activation-1.75e+00

Pos logit contributions
ate+0.097
urse+0.080
mill+0.079
ia+0.072
castle+0.071
Neg logit contributions
 -0.113
 pleasure-0.107
arse-0.105
 plunge-0.103
 dearly-0.103
Token things
Token ID1243
Feature activation-2.27e+00

Pos logit contributions
ate+0.042
ires+0.034
 about+0.033
mill+0.032
ia+0.032
Neg logit contributions
arse-0.050
gency-0.048
 -0.045
 pleasure-0.043
 plunge-0.042
Token with
Token ID351
Feature activation+1.31e-01

Pos logit contributions
ate+0.063
ia+0.056
ires+0.054
urse+0.054
der+0.049
Neg logit contributions
 -0.051
arse-0.049
 might-0.049
gency-0.046
 plunge-0.045
Token the
Token ID262
Feature activation+1.74e-01

Pos logit contributions
arse+0.004
gency+0.003
 +0.003
 pleasure+0.003
 plunge+0.003
Neg logit contributions
ate-0.003
ires-0.003
mill-0.002
ia-0.002
der-0.002
Token axe
Token ID23272
Feature activation+5.07e-01

Pos logit contributions
arse+0.004
gency+0.004
 +0.004
 plunge+0.004
 pleasure+0.004
Neg logit contributions
ate-0.005
ires-0.004
iring-0.004
mill-0.004
der-0.004
Token His
Token ID2399
Feature activation+3.50e-01

Pos logit contributions
arse+0.015
 +0.015
gency+0.014
 pleasure+0.013
 plunge+0.012
Neg logit contributions
ate-0.016
ires-0.013
mill-0.013
ia-0.013
der-0.013
Token mom
Token ID1995
Feature activation+1.21e+00

Pos logit contributions
arse+0.011
gency+0.010
 +0.010
 plunge+0.009
 pleasure+0.009
Neg logit contributions
ate-0.007
ia-0.006
ires-0.006
mill-0.006
der-0.006
Token explained
Token ID4893
Feature activation+5.45e-01

Pos logit contributions
 +0.030
arse+0.028
gency+0.026
 might+0.024
 Bobby+0.023
Neg logit contributions
ate-0.034
ires-0.031
der-0.027
iring-0.027
resp-0.026
Token that
Token ID326
Feature activation+5.46e-01

Pos logit contributions
 +0.016
arse+0.016
gency+0.015
 might+0.014
 plunge+0.014
Neg logit contributions
ate-0.012
ires-0.010
ia-0.009
mill-0.009
iring-0.009
Token the
Token ID262
Feature activation-9.99e-01

Pos logit contributions
 +0.012
arse+0.011
gency+0.011
 pleasure+0.010
 plunge+0.010
Neg logit contributions
ate-0.016
ires-0.014
ia-0.014
mill-0.013
der-0.013
Token cob
Token ID22843
Feature activation-4.26e+00

Pos logit contributions
ate+0.026
ires+0.021
ia+0.021
mill+0.021
der+0.021
Neg logit contributions
arse-0.026
gency-0.025
 -0.024
 pleasure-0.022
 plunge-0.021
Tokenweb
Token ID12384
Feature activation-7.18e-01

Pos logit contributions
ate+0.089
aze+0.068
uted+0.067
oni+0.067
ires+0.066
Neg logit contributions
 might-0.110
lers-0.098
 -0.097
gency-0.096
 minute-0.094
Token was
Token ID373
Feature activation+2.40e-01

Pos logit contributions
ate+0.019
ires+0.016
iring+0.015
mill+0.015
ia+0.014
Neg logit contributions
 -0.018
 might-0.018
arse-0.017
gency-0.016
 may-0.015
Token just
Token ID655
Feature activation+2.72e-03

Pos logit contributions
arse+0.006
 +0.006
gency+0.005
 pleasure+0.005
 might+0.005
Neg logit contributions
ate-0.006
ires-0.005
ia-0.005
mill-0.005
der-0.005
Token a
Token ID257
Feature activation+5.69e-01

Pos logit contributions
arse+0.000
 +0.000
gency+0.000
 pleasure+0.000
 plunge+0.000
Neg logit contributions
ate-0.000
ia-0.000
ires-0.000
mill-0.000
der-0.000
Token spider
Token ID19230
Feature activation-6.94e-01

Pos logit contributions
arse+0.014
gency+0.013
 +0.012
 plunge+0.011
 pleasure+0.011
Neg logit contributions
ate-0.015
 from-0.013
der-0.013
mill-0.013
ia-0.013
Token said
Token ID531
Feature activation-1.37e-01

Pos logit contributions
 +0.011
arse+0.011
gency+0.011
 might+0.010
 pleasure+0.010
Neg logit contributions
ate-0.011
ires-0.010
mill-0.009
der-0.009
ia-0.009
Token and
Token ID290
Feature activation-5.68e-01

Pos logit contributions
ate+0.002
ires+0.002
mill+0.002
der+0.002
ia+0.002
Neg logit contributions
arse-0.005
gency-0.005
 -0.004
 pleasure-0.004
 plunge-0.004
Token decided
Token ID3066
Feature activation-7.74e-01

Pos logit contributions
ate+0.016
ires+0.015
iring+0.014
iders+0.014
resp+0.014
Neg logit contributions
 -0.014
gency-0.011
 might-0.011
 pleasure-0.010
 Bobby-0.010
Token that
Token ID326
Feature activation+3.40e-01

Pos logit contributions
ate+0.025
ires+0.023
iring+0.021
ia+0.021
mill+0.020
Neg logit contributions
 -0.015
gency-0.014
arse-0.013
 might-0.012
 plunge-0.011
Token maybe
Token ID3863
Feature activation-7.78e-01

Pos logit contributions
 +0.008
arse+0.007
gency+0.007
 pleasure+0.007
 might+0.007
Neg logit contributions
ate-0.009
ires-0.008
ia-0.008
iring-0.008
der-0.007
Token cob
Token ID22843
Feature activation-4.25e+00

Pos logit contributions
ate+0.022
ires+0.020
iring+0.019
uted+0.019
mill+0.019
Neg logit contributions
 -0.016
gency-0.015
 might-0.015
arse-0.014
 plunge-0.014
Tokenwe
Token ID732
Feature activation+8.71e-01

Pos logit contributions
ate+0.114
 parked+0.102
From+0.102
aff+0.100
aze+0.099
Neg logit contributions
 might-0.063
lers-0.057
 plunge-0.054
gency-0.052
 breaths-0.050
Tokenbs
Token ID1443
Feature activation+3.74e-02

Pos logit contributions
arse+0.019
gency+0.017
 +0.017
 pleasure+0.015
 plunge+0.015
Neg logit contributions
ate-0.026
ires-0.022
mill-0.022
ia-0.022
der-0.022
Token weren
Token ID6304
Feature activation+4.40e-01

Pos logit contributions
 +0.001
arse+0.001
 might+0.001
gency+0.001
 plunge+0.001
Neg logit contributions
ate-0.001
ires-0.001
iring-0.001
ia-0.001
der-0.001
Token't
Token ID470
Feature activation-4.47e-01

Pos logit contributions
arse+0.008
 +0.007
gency+0.006
 plunge+0.006
 might+0.006
Neg logit contributions
ate-0.015
ires-0.013
ia-0.013
der-0.013
mill-0.013
Token so
Token ID523
Feature activation+3.41e-01

Pos logit contributions
ate+0.013
ires+0.011
ia+0.010
der+0.010
mill+0.010
Neg logit contributions
arse-0.010
 -0.010
gency-0.009
 pleasure-0.009
 plunge-0.009
Token the
Token ID262
Feature activation-1.07e+00

Pos logit contributions
arse+0.004
 +0.004
gency+0.004
 pleasure+0.004
 plunge+0.004
Neg logit contributions
ate-0.004
ires-0.004
ia-0.004
mill-0.004
der-0.004
Token v
Token ID410
Feature activation-3.06e+00

Pos logit contributions
ate+0.031
ires+0.026
mill+0.026
ia+0.025
der+0.025
Neg logit contributions
arse-0.025
gency-0.023
 -0.023
 pleasure-0.021
 plunge-0.020
Tokenase
Token ID589
Feature activation-1.21e+00

Pos logit contributions
ate+0.077
ires+0.070
der+0.065
mill+0.065
 about+0.064
Neg logit contributions
gency-0.068
 -0.067
arse-0.067
icer-0.059
eeee-0.053
Token.
Token ID13
Feature activation+2.08e-01

Pos logit contributions
ate+0.029
ires+0.024
mill+0.022
ia+0.022
der+0.022
Neg logit contributions
arse-0.034
 -0.032
gency-0.031
 plunge-0.028
 pleasure-0.028
Token The
Token ID383
Feature activation-7.11e-01

Pos logit contributions
 +0.005
arse+0.005
gency+0.005
 pleasure+0.004
 plunge+0.004
Neg logit contributions
ate-0.006
ires-0.005
mill-0.005
ia-0.005
der-0.005
Token v
Token ID410
Feature activation-4.24e+00

Pos logit contributions
ate+0.020
ires+0.017
ia+0.017
mill+0.017
der+0.016
Neg logit contributions
arse-0.017
gency-0.016
 -0.016
 pleasure-0.015
 plunge-0.014
Tokenase
Token ID589
Feature activation-9.48e-01

Pos logit contributions
ate+0.114
ires+0.108
 from+0.107
 about+0.105
der+0.102
Neg logit contributions
gency-0.072
icer-0.071
 -0.069
arse-0.065
eeee-0.058
Token fell
Token ID3214
Feature activation+1.51e+00

Pos logit contributions
ate+0.027
ires+0.023
mill+0.021
iring+0.021
ia+0.021
Neg logit contributions
arse-0.022
 -0.022
gency-0.021
 might-0.020
 plunge-0.018
Token and
Token ID290
Feature activation+9.09e-01

Pos logit contributions
arse+0.038
gency+0.036
 +0.035
 pleasure+0.033
 plunge+0.032
Neg logit contributions
ate-0.040
ires-0.034
mill-0.033
ia-0.033
der-0.032
Token break
Token ID2270
Feature activation+1.26e+00

Pos logit contributions
arse+0.027
gency+0.026
 pleasure+0.025
 +0.025
 plunge+0.023
Neg logit contributions
ate-0.020
mill-0.016
ires-0.016
der-0.016
 from-0.016
Token.
Token ID13
Feature activation+2.41e-01

Pos logit contributions
arse+0.037
gency+0.037
 pleasure+0.035
 +0.034
 plunge+0.034
Neg logit contributions
ate-0.029
 from-0.023
der-0.023
mill-0.022
ires-0.022
Token was
Token ID373
Feature activation-1.91e-01

Pos logit contributions
 +0.009
arse+0.009
gency+0.009
 might+0.008
 plunge+0.007
Neg logit contributions
ate-0.013
ires-0.012
mill-0.011
der-0.011
iring-0.010
Token too
Token ID1165
Feature activation+2.80e-01

Pos logit contributions
ate+0.006
ires+0.005
ia+0.005
mill+0.005
resp+0.005
Neg logit contributions
arse-0.004
 -0.004
gency-0.004
 pleasure-0.003
 might-0.003
Token strong
Token ID1913
Feature activation-9.54e-01

Pos logit contributions
arse+0.005
 +0.005
gency+0.005
 might+0.004
 pleasure+0.004
Neg logit contributions
ate-0.009
der-0.008
mill-0.008
ires-0.008
resp-0.007
Token.
Token ID13
Feature activation+9.51e-01

Pos logit contributions
ate+0.024
ires+0.020
ia+0.020
mill+0.019
der+0.019
Neg logit contributions
arse-0.025
gency-0.024
 -0.023
 pleasure-0.022
 plunge-0.021
Token The
Token ID383
Feature activation-9.16e-01

Pos logit contributions
arse+0.032
gency+0.030
 plunge+0.027
 pleasure+0.027
 +0.026
Neg logit contributions
ate-0.018
ires-0.014
 about-0.014
der-0.014
 from-0.013
Token v
Token ID410
Feature activation-4.24e+00

Pos logit contributions
ate+0.024
ia+0.019
ires+0.019
 from+0.019
mill+0.019
Neg logit contributions
arse-0.025
gency-0.023
 pleasure-0.021
 -0.021
 plunge-0.020
Tokenase
Token ID589
Feature activation-9.19e-01

Pos logit contributions
ate+0.116
 about+0.113
ires+0.112
 from+0.112
der+0.107
Neg logit contributions
icer-0.067
gency-0.063
 -0.063
arse-0.058
-0.053
Token broke
Token ID6265
Feature activation+5.10e-01

Pos logit contributions
ate+0.026
ires+0.023
iring+0.020
mill+0.020
der+0.019
Neg logit contributions
 -0.022
arse-0.021
 might-0.021
gency-0.019
 plunge-0.018
Token into
Token ID656
Feature activation+9.88e-02

Pos logit contributions
arse+0.016
 +0.015
gency+0.015
 pleasure+0.013
 plunge+0.013
Neg logit contributions
ate-0.011
ires-0.009
ia-0.008
mill-0.008
der-0.008
Token lots
Token ID6041
Feature activation-7.27e-01

Pos logit contributions
arse+0.003
gency+0.002
 +0.002
 pleasure+0.002
 plunge+0.002
Neg logit contributions
ate-0.003
ires-0.002
ia-0.002
der-0.002
mill-0.002
Token of
Token ID286
Feature activation-1.21e+00

Pos logit contributions
ate+0.015
ires+0.011
mill+0.010
ia+0.010
ble+0.009
Neg logit contributions
 -0.022
arse-0.022
gency-0.020
 might-0.020
 suitcase-0.020
Token.
Token ID13
Feature activation-4.63e-01

Pos logit contributions
ate+0.018
ires+0.015
ia+0.015
der+0.015
mill+0.014
Neg logit contributions
arse-0.018
 -0.018
gency-0.017
 pleasure-0.015
 plunge-0.015
Token They
Token ID1119
Feature activation+1.45e-01

Pos logit contributions
ate+0.013
ires+0.011
ia+0.010
mill+0.010
der+0.010
Neg logit contributions
 -0.011
arse-0.011
gency-0.011
 pleasure-0.010
 suitcase-0.009
Token were
Token ID547
Feature activation-5.78e-03

Pos logit contributions
arse+0.004
gency+0.004
 +0.004
 pleasure+0.004
 plunge+0.003
Neg logit contributions
ate-0.003
ires-0.003
mill-0.003
ia-0.003
der-0.003
Token playing
Token ID2712
Feature activation-1.77e+00

Pos logit contributions
ate+0.000
ires+0.000
mill+0.000
der+0.000
ia+0.000
Neg logit contributions
arse-0.000
gency-0.000
 -0.000
 pleasure-0.000
 plunge-0.000
Token all
Token ID477
Feature activation-5.35e-01

Pos logit contributions
ate+0.039
ires+0.031
mill+0.030
ia+0.030
der+0.030
Neg logit contributions
arse-0.053
gency-0.050
 -0.050
 pleasure-0.046
 plunge-0.046
Token sorts
Token ID10524
Feature activation-4.24e+00

Pos logit contributions
ate+0.019
der+0.017
ires+0.017
ia+0.016
resp+0.016
Neg logit contributions
 -0.008
arse-0.008
gency-0.007
 plunge-0.006
 pleasure-0.006
Token of
Token ID286
Feature activation-2.00e+00

Pos logit contributions
ate+0.092
mill+0.068
urse+0.068
ia+0.065
der+0.063
Neg logit contributions
 -0.116
arse-0.112
 plunge-0.107
 pleasure-0.106
 dearly-0.104
Token games
Token ID1830
Feature activation-1.95e+00

Pos logit contributions
ate+0.055
ires+0.046
der+0.046
mill+0.046
ia+0.045
Neg logit contributions
arse-0.047
 -0.045
gency-0.044
 plunge-0.040
 pleasure-0.040
Token.
Token ID13
Feature activation-4.93e-01

Pos logit contributions
ate+0.050
ires+0.043
iring+0.040
ia+0.040
der+0.039
Neg logit contributions
 -0.048
arse-0.045
gency-0.045
 pleasure-0.042
 might-0.042
Token Suddenly
Token ID24975
Feature activation+1.25e-01

Pos logit contributions
ate+0.014
ires+0.012
mill+0.012
ia+0.012
der+0.012
Neg logit contributions
arse-0.011
gency-0.011
 -0.011
 pleasure-0.010
 plunge-0.009
Token,
Token ID11
Feature activation-1.29e+00

Pos logit contributions
arse+0.003
 +0.003
gency+0.003
 pleasure+0.002
 plunge+0.002
Neg logit contributions
ate-0.004
ires-0.003
mill-0.003
der-0.003
ia-0.003
Token mass
Token ID2347
Feature activation+2.41e-01

Pos logit contributions
arse+0.009
 +0.009
gency+0.008
 pleasure+0.007
 plunge+0.007
Neg logit contributions
ate-0.010
ires-0.009
ia-0.009
mill-0.008
iring-0.008
Tokenaging
Token ID3039
Feature activation+1.54e+00

Pos logit contributions
arse+0.007
 +0.007
gency+0.006
 pleasure+0.006
 plunge+0.006
Neg logit contributions
ate-0.006
ires-0.005
ia-0.004
mill-0.004
der-0.004
Token her
Token ID607
Feature activation+1.49e-01

Pos logit contributions
arse+0.042
gency+0.039
 +0.038
 pleasure+0.036
 plunge+0.035
Neg logit contributions
ate-0.039
ires-0.032
ia-0.031
mill-0.031
der-0.030
Token legs
Token ID7405
Feature activation+2.20e+00

Pos logit contributions
arse+0.003
gency+0.003
 +0.003
 pleasure+0.003
 plunge+0.003
Neg logit contributions
ate-0.004
ires-0.004
der-0.004
ia-0.004
mill-0.004
Token and
Token ID290
Feature activation+1.14e+00

Pos logit contributions
arse+0.071
 +0.068
gency+0.066
 pleasure+0.062
 plunge+0.061
Neg logit contributions
ate-0.044
ires-0.036
ia-0.034
mill-0.032
der-0.031
Token arms
Token ID5101
Feature activation+3.30e+00

Pos logit contributions
 +0.029
arse+0.028
gency+0.027
 pleasure+0.025
 plunge+0.025
Neg logit contributions
ate-0.031
ires-0.026
ia-0.024
mill-0.024
der-0.024
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
arse+0.088
 +0.082
gency+0.082
 pleasure+0.075
 plunge+0.074
Neg logit contributions
ate-0.084
ires-0.069
ia-0.067
mill-0.066
der-0.065
Token When
Token ID1649
Feature activation-1.66e-01

Pos logit contributions
arse+0.029
gency+0.027
 +0.025
 plunge+0.024
 pleasure+0.023
Neg logit contributions
ate-0.032
ires-0.027
ia-0.027
mill-0.026
der-0.026
Token it
Token ID340
Feature activation-1.66e-01

Pos logit contributions
ate+0.004
ires+0.003
ia+0.003
mill+0.003
der+0.003
Neg logit contributions
arse-0.005
gency-0.005
 -0.004
 pleasure-0.004
 plunge-0.004
Token was
Token ID373
Feature activation+2.69e-01

Pos logit contributions
ate+0.004
ires+0.004
mill+0.004
der+0.004
iring+0.004
Neg logit contributions
 -0.004
arse-0.004
 might-0.004
gency-0.004
 plunge-0.004
Token finished
Token ID5201
Feature activation-1.26e+00

Pos logit contributions
arse+0.006
 +0.006
 pleasure+0.005
gency+0.005
 plunge+0.005
Neg logit contributions
ate-0.008
ires-0.007
ia-0.007
der-0.006
mill-0.006
Token Tom
Token ID4186
Feature activation-3.23e-01

Pos logit contributions
arse+0.032
gency+0.028
 +0.027
 plunge+0.026
 pleasure+0.026
Neg logit contributions
ate-0.024
ia-0.020
 about-0.020
 from-0.020
ires-0.019
Token.
Token ID13
Feature activation-4.86e-01

Pos logit contributions
ate+0.007
mill+0.006
ia+0.006
ires+0.006
 about+0.006
Neg logit contributions
arse-0.010
gency-0.009
 -0.009
 pleasure-0.009
 plunge-0.009
Token Tom
Token ID4186
Feature activation-1.22e+00

Pos logit contributions
ate+0.010
ires+0.008
ia+0.008
mill+0.008
der+0.008
Neg logit contributions
arse-0.015
gency-0.014
 -0.014
 pleasure-0.013
 plunge-0.013
Token lived
Token ID5615
Feature activation-1.17e+00

Pos logit contributions
ate+0.033
ires+0.032
iring+0.030
der+0.030
mill+0.028
Neg logit contributions
 -0.029
 might-0.027
gency-0.024
arse-0.023
 Bobby-0.022
Token in
Token ID287
Feature activation+9.86e-01

Pos logit contributions
ate+0.028
der+0.026
ia+0.025
ires+0.024
mill+0.024
Neg logit contributions
arse-0.029
 -0.028
gency-0.027
 plunge-0.024
 handful-0.024
Token a
Token ID257
Feature activation+2.90e+00

Pos logit contributions
arse+0.032
gency+0.030
 +0.028
 plunge+0.027
 pleasure+0.027
Neg logit contributions
ate-0.021
ires-0.016
ia-0.015
mill-0.015
 from-0.015
Token bedroom
Token ID14043
Feature activation-4.89e-01

Pos logit contributions
arse+0.100
gency+0.095
 plunge+0.089
 pleasure+0.089
 ease+0.082
Neg logit contributions
ate-0.053
ia-0.042
her-0.039
urse-0.038
 lady-0.036
Token and
Token ID290
Feature activation-7.21e-01

Pos logit contributions
ate+0.012
ires+0.010
resp+0.010
mill+0.010
ia+0.009
Neg logit contributions
 -0.012
gency-0.012
arse-0.011
 suitcase-0.011
 might-0.010
Token he
Token ID339
Feature activation+3.28e-02

Pos logit contributions
ate+0.019
ires+0.017
iring+0.016
mill+0.016
resp+0.015
Neg logit contributions
 -0.019
gency-0.017
 might-0.016
 pleasure-0.015
arse-0.014
Token loved
Token ID6151
Feature activation+1.10e-01

Pos logit contributions
 +0.001
 might+0.001
gency+0.001
arse+0.001
 dearly+0.001
Neg logit contributions
ate-0.001
ires-0.001
iring-0.001
mill-0.001
der-0.001
Token to
Token ID284
Feature activation+3.14e-01

Pos logit contributions
arse+0.003
gency+0.003
 +0.003
 pleasure+0.003
 plunge+0.003
Neg logit contributions
ate-0.003
ires-0.002
mill-0.002
ia-0.002
der-0.002
Token and
Token ID290
Feature activation+2.09e-01

Pos logit contributions
ate+0.018
ires+0.016
iring+0.016
der+0.015
mill+0.015
Neg logit contributions
 -0.019
 might-0.017
gency-0.016
arse-0.016
 pleasure-0.016
Token his
Token ID465
Feature activation+2.15e-01

Pos logit contributions
arse+0.006
gency+0.006
 +0.006
 pleasure+0.005
 plunge+0.005
Neg logit contributions
ate-0.005
ia-0.004
ires-0.004
mill-0.004
der-0.004
Token dad
Token ID9955
Feature activation+7.76e-01

Pos logit contributions
arse+0.006
gency+0.006
 +0.006
 pleasure+0.005
 plunge+0.005
Neg logit contributions
ate-0.005
ires-0.004
ia-0.004
mill-0.004
der-0.004
Token were
Token ID547
Feature activation+2.22e-02

Pos logit contributions
arse+0.021
 +0.020
gency+0.019
 pleasure+0.018
 plunge+0.018
Neg logit contributions
ate-0.019
ires-0.017
ia-0.016
der-0.015
mill-0.015
Token in
Token ID287
Feature activation+1.20e+00

Pos logit contributions
arse+0.001
 +0.000
gency+0.000
 pleasure+0.000
 plunge+0.000
Neg logit contributions
ate-0.001
ires-0.001
ia-0.001
der-0.001
mill-0.001
Token a
Token ID257
Feature activation+3.23e+00

Pos logit contributions
arse+0.041
gency+0.039
 +0.035
 plunge+0.035
 pleasure+0.033
Neg logit contributions
ate-0.023
 from-0.018
 about-0.017
ires-0.017
mill-0.016
Token tent
Token ID11105
Feature activation+2.56e-01

Pos logit contributions
arse+0.107
gency+0.106
 pleasure+0.093
rified+0.090
 plunge+0.089
Neg logit contributions
ate-0.058
ia-0.045
 also-0.043
her-0.043
 from-0.041
Token.
Token ID13
Feature activation-5.95e-01

Pos logit contributions
 +0.006
arse+0.006
gency+0.005
 plunge+0.005
 pleasure+0.005
Neg logit contributions
ate-0.007
ires-0.006
ia-0.006
der-0.006
mill-0.006
Token They
Token ID1119
Feature activation+5.04e-01

Pos logit contributions
ate+0.013
ires+0.010
ia+0.010
der+0.010
 about+0.009
Neg logit contributions
arse-0.019
gency-0.018
 -0.016
 pleasure-0.016
 plunge-0.016
Token were
Token ID547
Feature activation+8.90e-02

Pos logit contributions
arse+0.013
 +0.013
gency+0.012
 pleasure+0.011
 plunge+0.011
Neg logit contributions
ate-0.013
ires-0.011
der-0.010
mill-0.010
ia-0.010
Token camping
Token ID22498
Feature activation+2.20e-01

Pos logit contributions
arse+0.002
gency+0.002
 +0.002
 pleasure+0.002
 plunge+0.002
Neg logit contributions
ate-0.003
ires-0.002
mill-0.002
der-0.002
ia-0.002
Token with
Token ID351
Feature activation+8.51e-01

Pos logit contributions
arse+0.001
gency+0.001
 +0.001
 pleasure+0.001
 plunge+0.001
Neg logit contributions
ate-0.001
ires-0.000
mill-0.000
der-0.000
ia-0.000
Token her
Token ID607
Feature activation-1.41e-01

Pos logit contributions
arse+0.031
gency+0.030
 plunge+0.026
use+0.025
riger+0.025
Neg logit contributions
ate-0.014
 from-0.012
 about-0.012
 not-0.010
der-0.010
Token duck
Token ID22045
Feature activation+1.75e+00

Pos logit contributions
ate+0.003
ires+0.002
 from+0.002
mill+0.002
ia+0.002
Neg logit contributions
arse-0.004
gency-0.004
 -0.004
 plunge-0.004
 pleasure-0.004
Token in
Token ID287
Feature activation+1.48e+00

Pos logit contributions
gency+0.052
arse+0.052
 +0.048
 pleasure+0.047
 plunge+0.046
Neg logit contributions
ate-0.036
mill-0.030
 from-0.030
der-0.028
 about-0.028
Token her
Token ID607
Feature activation+1.10e+00

Pos logit contributions
arse+0.052
gency+0.049
 plunge+0.046
ksh+0.043
cient+0.042
Neg logit contributions
ate-0.027
 from-0.024
 about-0.022
ia-0.019
 not-0.018
Token arms
Token ID5101
Feature activation+2.80e+00

Pos logit contributions
gency+0.034
arse+0.033
 plunge+0.029
rified+0.026
use+0.026
Neg logit contributions
ate-0.024
ia-0.021
 from-0.020
 about-0.019
 and-0.019
Token.
Token ID13
Feature activation+7.14e-01

Pos logit contributions
arse+0.081
gency+0.080
 +0.074
 pleasure+0.072
 plunge+0.071
Neg logit contributions
ate-0.062
der-0.050
mill-0.049
ires-0.049
 about-0.049
Token The
Token ID383
Feature activation-6.77e-01

Pos logit contributions
arse+0.022
gency+0.020
 plunge+0.019
 suitcase+0.017
 pleasure+0.017
Neg logit contributions
ate-0.016
 about-0.014
ires-0.014
 from-0.013
ia-0.013
Token end
Token ID886
Feature activation-1.61e+00

Pos logit contributions
ate+0.014
ires+0.011
mill+0.011
ia+0.011
der+0.011
Neg logit contributions
arse-0.021
gency-0.020
 -0.019
 pleasure-0.018
 plunge-0.018
Token.
Token ID13
Feature activation+8.46e-01

Pos logit contributions
ate+0.039
ires+0.032
mill+0.031
ia+0.031
der+0.031
Neg logit contributions
arse-0.045
gency-0.042
 -0.041
 pleasure-0.039
 plunge-0.038
Token<|endoftext|>
Token ID50256
Feature activation+8.72e-01

Pos logit contributions
arse+0.029
gency+0.027
 plunge+0.026
 suitcase+0.024
 pleasure+0.023
Neg logit contributions
ate-0.016
 about-0.013
 from-0.013
ires-0.012
ia-0.012
Token.
Token ID13
Feature activation-3.65e-02

Pos logit contributions
ate+0.026
ires+0.021
ia+0.021
iders+0.020
mill+0.020
Neg logit contributions
 -0.034
arse-0.034
 pleasure-0.031
gency-0.030
 suitcase-0.029
Token "
Token ID366
Feature activation+1.01e+00

Pos logit contributions
ate+0.001
mill+0.001
ires+0.001
ia+0.001
der+0.001
Neg logit contributions
 -0.001
arse-0.001
gency-0.001
 pleasure-0.001
 plunge-0.001
TokenDon
Token ID3987
Feature activation+1.49e-01

Pos logit contributions
arse+0.025
 +0.024
gency+0.023
 pleasure+0.021
 plunge+0.021
Neg logit contributions
ate-0.027
ires-0.023
ia-0.022
der-0.022
mill-0.022
Token't
Token ID470
Feature activation+6.57e-01

Pos logit contributions
arse+0.003
 +0.003
gency+0.003
 plunge+0.003
 pleasure+0.003
Neg logit contributions
ate-0.005
ires-0.004
mill-0.004
der-0.004
ia-0.004
Token be
Token ID307
Feature activation-3.44e-01

Pos logit contributions
arse+0.013
 +0.012
gency+0.012
 pleasure+0.010
 plunge+0.010
Neg logit contributions
ate-0.022
ires-0.019
mill-0.018
der-0.018
ia-0.018
Token scared
Token ID12008
Feature activation+2.58e+00

Pos logit contributions
ate+0.010
ires+0.008
der+0.008
ia+0.008
mill+0.008
Neg logit contributions
arse-0.008
 -0.008
gency-0.007
 pleasure-0.007
 plunge-0.007
Token",
Token ID1600
Feature activation-1.29e-01

Pos logit contributions
arse+0.069
gency+0.067
 +0.064
 pleasure+0.060
 plunge+0.057
Neg logit contributions
ate-0.061
 from-0.053
 about-0.052
ires-0.051
ia-0.051
Token Bob
Token ID5811
Feature activation+1.52e+00

Pos logit contributions
ate+0.004
ires+0.003
mill+0.003
ia+0.003
der+0.003
Neg logit contributions
 -0.003
arse-0.003
gency-0.003
 plunge-0.003
 pleasure-0.003
Token said
Token ID531
Feature activation+5.06e-01

Pos logit contributions
arse+0.040
 +0.040
gency+0.037
 plunge+0.034
 pleasure+0.033
Neg logit contributions
ate-0.039
ires-0.034
mill-0.032
der-0.031
ia-0.030
Token.
Token ID13
Feature activation+3.00e-01

Pos logit contributions
arse+0.014
gency+0.013
 +0.013
 pleasure+0.012
 plunge+0.012
Neg logit contributions
ate-0.012
ires-0.010
mill-0.010
ia-0.010
der-0.010
Token "
Token ID366
Feature activation+9.71e-01

Pos logit contributions
arse+0.007
 +0.007
gency+0.007
 pleasure+0.006
 plunge+0.006
Neg logit contributions
ate-0.008
ires-0.007
mill-0.007
ia-0.007
der-0.007
Token mom
Token ID1995
Feature activation+9.03e-01

Pos logit contributions
ate+0.031
ires+0.025
mill+0.024
ia+0.024
der+0.024
Neg logit contributions
arse-0.035
gency-0.033
 -0.032
 pleasure-0.030
 plunge-0.029
Token said
Token ID531
Feature activation+7.48e-01

Pos logit contributions
 +0.021
arse+0.017
 might+0.016
gency+0.015
 Bobby+0.014
Neg logit contributions
ate-0.027
ires-0.026
der-0.024
resp-0.024
iring-0.024
Token that
Token ID326
Feature activation+8.90e-01

Pos logit contributions
arse+0.018
 +0.018
gency+0.018
 pleasure+0.016
 plunge+0.016
Neg logit contributions
ate-0.020
ires-0.017
mill-0.016
ia-0.016
der-0.016
Token it
Token ID340
Feature activation+5.57e-01

Pos logit contributions
 +0.018
gency+0.016
arse+0.015
 pleasure+0.015
 might+0.015
Neg logit contributions
ate-0.027
ires-0.024
der-0.023
iring-0.023
mill-0.023
Token was
Token ID373
Feature activation+4.07e-01

Pos logit contributions
 +0.015
 might+0.014
arse+0.014
gency+0.013
 may+0.012
Neg logit contributions
ate-0.014
ires-0.013
der-0.012
iring-0.012
ia-0.011
Token the
Token ID262
Feature activation+3.31e-01

Pos logit contributions
 +0.009
arse+0.009
gency+0.009
 pleasure+0.008
 might+0.008
Neg logit contributions
ate-0.011
ires-0.010
der-0.010
mill-0.010
ia-0.009
Token most
Token ID749
Feature activation-1.42e+00

Pos logit contributions
arse+0.008
gency+0.007
 +0.007
 pleasure+0.006
 plunge+0.006
Neg logit contributions
ate-0.010
ires-0.008
mill-0.008
ia-0.008
 from-0.008
Token beautiful
Token ID4950
Feature activation-1.40e+00

Pos logit contributions
ate+0.043
mill+0.039
der+0.039
ia+0.038
urse+0.038
Neg logit contributions
 pleasure-0.026
 -0.023
arse-0.022
 proper-0.021
 plunge-0.021
Token costume
Token ID16569
Feature activation-1.34e+00

Pos logit contributions
ate+0.044
ires+0.041
der+0.039
iring+0.039
mill+0.039
Neg logit contributions
 -0.025
arse-0.023
gency-0.022
 pleasure-0.022
 suitcase-0.021
Token she
Token ID673
Feature activation-1.69e+00

Pos logit contributions
ate+0.035
ires+0.030
mill+0.028
der+0.028
ia+0.028
Neg logit contributions
 -0.032
arse-0.031
gency-0.030
 pleasure-0.030
 might-0.029
Token had
Token ID550
Feature activation-1.24e+00

Pos logit contributions
ate+0.049
der+0.044
ires+0.043
mill+0.043
iring+0.041
Neg logit contributions
 -0.035
gency-0.033
 might-0.032
 pleasure-0.032
arse-0.031
Token.
Token ID13
Feature activation-7.54e-02

Pos logit contributions
 +0.003
arse+0.003
gency+0.003
 might+0.003
 plunge+0.002
Neg logit contributions
ate-0.004
ires-0.004
resp-0.003
ia-0.003
iring-0.003
Token The
Token ID383
Feature activation-1.14e+00

Pos logit contributions
ate+0.002
ires+0.001
mill+0.001
 about+0.001
der+0.001
Neg logit contributions
arse-0.002
gency-0.002
 plunge-0.002
 pleasure-0.002
 -0.002
Token little
Token ID1310
Feature activation+9.38e-01

Pos logit contributions
ate+0.030
ires+0.025
ia+0.024
mill+0.024
der+0.024
Neg logit contributions
arse-0.029
 -0.029
gency-0.027
 plunge-0.026
 pleasure-0.025
Token boy
Token ID2933
Feature activation+7.24e-02

Pos logit contributions
arse+0.027
gency+0.025
 +0.025
 pleasure+0.023
 plunge+0.023
Neg logit contributions
ate-0.022
ires-0.018
mill-0.017
ia-0.017
der-0.017
Token was
Token ID373
Feature activation+3.14e-01

Pos logit contributions
 +0.002
arse+0.002
gency+0.002
 might+0.002
 Bobby+0.002
Neg logit contributions
ate-0.002
ires-0.002
mill-0.002
der-0.002
iring-0.002
Token so
Token ID523
Feature activation+3.10e-01

Pos logit contributions
 +0.006
arse+0.005
gency+0.005
 pleasure+0.004
 plunge+0.004
Neg logit contributions
ate-0.011
ires-0.010
der-0.010
ia-0.009
resp-0.009
Token excited
Token ID6568
Feature activation-1.01e+00

Pos logit contributions
 +0.006
arse+0.005
 pleasure+0.004
 might+0.004
 Bobby+0.004
Neg logit contributions
ate-0.010
der-0.009
mill-0.009
resp-0.009
er-0.009
Token that
Token ID326
Feature activation+5.78e-01

Pos logit contributions
ate+0.028
ires+0.023
mill+0.022
ia+0.022
urse+0.021
Neg logit contributions
 -0.024
arse-0.022
gency-0.022
 might-0.021
 Bobby-0.021
Token he
Token ID339
Feature activation-3.19e-01

Pos logit contributions
arse+0.016
 +0.016
gency+0.015
 pleasure+0.014
 plunge+0.014
Neg logit contributions
ate-0.014
ires-0.011
ia-0.011
mill-0.011
der-0.011
Token wanted
Token ID2227
Feature activation+5.44e-01

Pos logit contributions
ate+0.009
ires+0.008
mill+0.008
ia+0.007
der+0.007
Neg logit contributions
 -0.007
arse-0.007
gency-0.007
 might-0.007
 Bobby-0.006
Token to
Token ID284
Feature activation-7.61e-01

Pos logit contributions
arse+0.017
gency+0.016
 +0.015
 pleasure+0.014
 plunge+0.014
Neg logit contributions
ate-0.012
 about-0.009
ires-0.009
 from-0.009
mill-0.009
Token\n
Token ID198
Feature activation+4.82e-01

Pos logit contributions
arse+0.017
 +0.016
gency+0.016
 pleasure+0.014
 plunge+0.014
Neg logit contributions
ate-0.018
ires-0.015
mill-0.015
der-0.015
ia-0.014
TokenJ
Token ID41
Feature activation+7.64e-01

Pos logit contributions
 +0.012
arse+0.012
gency+0.011
 pleasure+0.009
 plunge+0.009
Neg logit contributions
ate-0.012
der-0.011
ires-0.011
mill-0.011
iring-0.010
Tokenenny
Token ID11870
Feature activation+3.05e-01

Pos logit contributions
arse+0.017
gency+0.015
 +0.015
 pleasure+0.013
 plunge+0.013
Neg logit contributions
ate-0.023
ires-0.020
mill-0.019
der-0.019
iring-0.018
Token said
Token ID531
Feature activation+5.92e-01

Pos logit contributions
 +0.007
arse+0.006
gency+0.006
 might+0.006
 Bobby+0.005
Neg logit contributions
ate-0.009
ires-0.008
iring-0.008
mill-0.008
der-0.007
Token,
Token ID11
Feature activation+6.97e-01

Pos logit contributions
arse+0.016
gency+0.015
 +0.015
 pleasure+0.014
 plunge+0.014
Neg logit contributions
ate-0.014
ires-0.012
mill-0.012
ia-0.011
der-0.011
Token "
Token ID366
Feature activation+9.70e-01

Pos logit contributions
 +0.019
arse+0.018
gency+0.017
 pleasure+0.015
 plunge+0.015
Neg logit contributions
ate-0.018
ires-0.015
ia-0.015
iring-0.014
mill-0.014
TokenI
Token ID40
Feature activation+1.53e-01

Pos logit contributions
 +0.023
arse+0.023
gency+0.022
 pleasure+0.019
 plunge+0.019
Neg logit contributions
ate-0.026
ires-0.023
der-0.023
ia-0.022
 about-0.022
Token have
Token ID423
Feature activation+4.21e-01

Pos logit contributions
arse+0.003
 +0.003
gency+0.003
 pleasure+0.003
 might+0.002
Neg logit contributions
ate-0.005
ires-0.004
der-0.004
mill-0.004
iring-0.004
Token some
Token ID617
Feature activation-6.12e-01

Pos logit contributions
arse+0.011
gency+0.011
 +0.010
 pleasure+0.010
 plunge+0.010
Neg logit contributions
ate-0.011
ires-0.009
mill-0.008
ia-0.008
der-0.008
Token news
Token ID1705
Feature activation-2.25e-01

Pos logit contributions
ate+0.018
mill+0.015
ires+0.015
der+0.015
ia+0.015
Neg logit contributions
arse-0.013
 -0.013
gency-0.012
 pleasure-0.012
 plunge-0.012
Token.
Token ID13
Feature activation+7.60e-01

Pos logit contributions
ate+0.006
ires+0.005
ia+0.005
der+0.005
iring+0.005
Neg logit contributions
 -0.005
gency-0.005
arse-0.005
 pleasure-0.005
 plunge-0.005
Token a
Token ID257
Feature activation+1.24e+00

Pos logit contributions
ate+0.005
ires+0.004
ia+0.004
mill+0.004
der+0.004
Neg logit contributions
arse-0.006
gency-0.006
 -0.006
 pleasure-0.005
 plunge-0.005
Token time
Token ID640
Feature activation-4.79e-01

Pos logit contributions
gency+0.034
arse+0.033
 plunge+0.030
 +0.028
 suitcase+0.028
Neg logit contributions
ate-0.029
ia-0.028
 about-0.026
ires-0.025
her-0.024
Token there
Token ID612
Feature activation+8.78e-01

Pos logit contributions
ate+0.010
ia+0.008
ires+0.008
 about+0.008
 from+0.008
Neg logit contributions
arse-0.015
gency-0.014
 pleasure-0.013
 -0.013
 plunge-0.013
Token was
Token ID373
Feature activation-4.05e-01

Pos logit contributions
arse+0.032
gency+0.031
 +0.029
 pleasure+0.029
 plunge+0.028
Neg logit contributions
ate-0.014
ia-0.010
ires-0.010
mill-0.009
 about-0.009
Token a
Token ID257
Feature activation+9.69e-01

Pos logit contributions
ate+0.008
ires+0.006
ia+0.006
 about+0.006
mill+0.006
Neg logit contributions
arse-0.014
gency-0.013
 -0.012
 plunge-0.012
 pleasure-0.011
Token reliable
Token ID9314
Feature activation+2.29e-01

Pos logit contributions
arse+0.029
gency+0.027
 +0.026
 pleasure+0.025
 plunge+0.024
Neg logit contributions
ate-0.022
ia-0.017
ires-0.017
mill-0.017
der-0.016
Token car
Token ID1097
Feature activation-4.27e-01

Pos logit contributions
arse+0.005
 +0.005
gency+0.005
 pleasure+0.005
 suitcase+0.004
Neg logit contributions
ate-0.007
ires-0.006
der-0.005
mill-0.005
iring-0.005
Token that
Token ID326
Feature activation+6.97e-01

Pos logit contributions
ate+0.012
ires+0.011
ia+0.010
iring+0.010
der+0.010
Neg logit contributions
 -0.010
gency-0.009
arse-0.009
 might-0.008
 suitcase-0.008
Token liked
Token ID8288
Feature activation+7.06e-01

Pos logit contributions
 +0.020
arse+0.020
gency+0.019
 pleasure+0.018
 plunge+0.017
Neg logit contributions
ate-0.016
ires-0.013
ia-0.013
der-0.013
mill-0.012
Token to
Token ID284
Feature activation+7.28e-01

Pos logit contributions
arse+0.021
gency+0.020
 +0.019
 pleasure+0.018
 plunge+0.018
Neg logit contributions
ate-0.016
mill-0.013
ires-0.013
der-0.012
 about-0.012
Token drive
Token ID3708
Feature activation+1.74e+00

Pos logit contributions
arse+0.017
 +0.016
gency+0.016
 plunge+0.015
 pleasure+0.014
Neg logit contributions
ate-0.020
ires-0.018
der-0.017
iring-0.017
ia-0.017
Token came
Token ID1625
Feature activation+1.32e+00

Pos logit contributions
arse+0.024
 +0.024
gency+0.023
 might+0.020
 plunge+0.020
Neg logit contributions
ate-0.025
ires-0.021
der-0.019
mill-0.019
iring-0.019
Token out
Token ID503
Feature activation-4.14e-01

Pos logit contributions
arse+0.042
 +0.040
gency+0.040
 pleasure+0.037
 plunge+0.036
Neg logit contributions
ate-0.027
ires-0.021
der-0.020
ia-0.020
mill-0.020
Token of
Token ID286
Feature activation-2.62e-01

Pos logit contributions
ate+0.009
ires+0.007
mill+0.006
der+0.006
ia+0.006
Neg logit contributions
 -0.013
arse-0.013
gency-0.011
 Bobby-0.011
 hands-0.011
Token the
Token ID262
Feature activation-1.05e+00

Pos logit contributions
ate+0.006
 from+0.005
ires+0.005
mill+0.005
ia+0.004
Neg logit contributions
arse-0.008
gency-0.007
 -0.007
 pleasure-0.007
 plunge-0.007
Token house
Token ID2156
Feature activation-4.65e-01

Pos logit contributions
ate+0.032
ires+0.028
der+0.028
mill+0.027
ia+0.027
Neg logit contributions
arse-0.021
 -0.021
gency-0.020
 suitcase-0.018
 plunge-0.018
Token and
Token ID290
Feature activation+3.85e-01

Pos logit contributions
ate+0.009
ires+0.007
emo+0.007
ia+0.007
resp+0.007
Neg logit contributions
 -0.015
arse-0.014
gency-0.014
 might-0.013
 suitcase-0.013
Token saw
Token ID2497
Feature activation-1.35e+00

Pos logit contributions
 +0.008
gency+0.007
arse+0.007
 might+0.006
 suitcase+0.006
Neg logit contributions
ate-0.013
ires-0.012
der-0.011
ia-0.011
iring-0.011
Token the
Token ID262
Feature activation-1.55e+00

Pos logit contributions
ate+0.039
ires+0.032
mill+0.031
ia+0.031
der+0.031
Neg logit contributions
arse-0.030
 -0.030
gency-0.028
 plunge-0.025
 might-0.025
Token wet
Token ID9583
Feature activation-6.78e-01

Pos logit contributions
ate+0.041
ires+0.034
ia+0.032
mill+0.032
der+0.032
Neg logit contributions
arse-0.040
gency-0.037
 -0.037
 plunge-0.034
 pleasure-0.034
Token floor
Token ID4314
Feature activation-1.08e+00

Pos logit contributions
ate+0.020
ires+0.018
aze+0.016
ia+0.016
iring+0.016
Neg logit contributions
 -0.014
arse-0.014
gency-0.013
 plunge-0.013
 suitcase-0.012
Token,
Token ID11
Feature activation+2.53e-01

Pos logit contributions
ate+0.033
ires+0.028
emo+0.027
resp+0.026
iring+0.025
Neg logit contributions
 -0.024
arse-0.022
 might-0.020
gency-0.020
 hands-0.018
Token<|endoftext|>
Token ID50256
Feature activation+7.11e-01

Pos logit contributions
arse+0.023
gency+0.021
 plunge+0.020
 suitcase+0.019
 pleasure+0.018
Neg logit contributions
ate-0.015
ia-0.012
ires-0.012
 about-0.012
mill-0.011
TokenOnce
Token ID7454
Feature activation+4.23e-01

Pos logit contributions
arse+0.022
gency+0.020
 pleasure+0.019
 plunge+0.018
 +0.017
Neg logit contributions
ate-0.017
ia-0.013
ires-0.012
 from-0.012
 about-0.012
Token upon
Token ID2402
Feature activation-4.24e-01

Pos logit contributions
gency+0.012
arse+0.012
 plunge+0.010
 pleasure+0.010
 +0.010
Neg logit contributions
ate-0.010
ia-0.009
 about-0.009
ires-0.009
 from-0.008
Token a
Token ID257
Feature activation+1.06e+00

Pos logit contributions
ate+0.010
ires+0.008
ia+0.007
mill+0.007
der+0.007
Neg logit contributions
arse-0.012
gency-0.012
 -0.011
 pleasure-0.011
 plunge-0.011
Token time
Token ID640
Feature activation-3.71e-01

Pos logit contributions
gency+0.028
arse+0.028
 plunge+0.025
 +0.024
 suitcase+0.023
Neg logit contributions
ate-0.026
ia-0.025
 about-0.023
ires-0.022
her-0.021
Token,
Token ID11
Feature activation-5.63e-01

Pos logit contributions
ate+0.007
ia+0.006
 about+0.006
 from+0.006
ires+0.006
Neg logit contributions
arse-0.012
gency-0.012
 pleasure-0.010
 -0.010
 plunge-0.010
Token there
Token ID612
Feature activation+6.82e-01

Pos logit contributions
ate+0.013
ires+0.011
ia+0.011
mill+0.010
 about+0.010
Neg logit contributions
arse-0.016
gency-0.016
 -0.014
 pleasure-0.014
 plunge-0.014
Token lived
Token ID5615
Feature activation-7.18e-01

Pos logit contributions
arse+0.025
gency+0.024
 +0.022
 pleasure+0.022
 plunge+0.022
Neg logit contributions
ate-0.011
ia-0.008
ires-0.007
mill-0.007
 about-0.007
Token a
Token ID257
Feature activation+8.78e-01

Pos logit contributions
ate+0.017
ires+0.014
mill+0.014
ia+0.014
der+0.014
Neg logit contributions
arse-0.020
gency-0.019
 -0.019
 pleasure-0.017
 plunge-0.017
Token thin
Token ID7888
Feature activation+1.30e+00

Pos logit contributions
arse+0.025
gency+0.024
 +0.023
 pleasure+0.022
 plunge+0.021
Neg logit contributions
ate-0.020
ires-0.017
ia-0.016
mill-0.016
der-0.016
Token cat
Token ID3797
Feature activation-1.13e+00

Pos logit contributions
arse+0.028
 +0.028
gency+0.028
 pleasure+0.026
 suitcase+0.025
Neg logit contributions
ate-0.038
ires-0.032
iring-0.030
mill-0.029
ups-0.029
Token started
Token ID2067
Feature activation+6.17e-02

Pos logit contributions
arse+0.032
 +0.031
gency+0.030
 pleasure+0.028
 plunge+0.027
Neg logit contributions
ate-0.026
ires-0.022
mill-0.020
ia-0.019
der-0.019
Token crying
Token ID13774
Feature activation-1.15e-01

Pos logit contributions
arse+0.002
 +0.002
gency+0.002
 pleasure+0.001
 plunge+0.001
Neg logit contributions
ate-0.002
ires-0.001
ia-0.001
mill-0.001
der-0.001
Token and
Token ID290
Feature activation+1.21e+00

Pos logit contributions
ate+0.002
ires+0.002
mill+0.001
ia+0.001
iring+0.001
Neg logit contributions
arse-0.004
 -0.004
gency-0.004
 pleasure-0.003
 suitcase-0.003
Token the
Token ID262
Feature activation-4.51e-01

Pos logit contributions
arse+0.033
gency+0.031
 +0.030
 pleasure+0.028
 plunge+0.028
Neg logit contributions
ate-0.030
ires-0.024
mill-0.024
ia-0.024
der-0.023
Token man
Token ID582
Feature activation+2.34e+00

Pos logit contributions
ate+0.011
ires+0.010
mill+0.009
ia+0.009
der+0.009
Neg logit contributions
arse-0.012
gency-0.011
 -0.011
 pleasure-0.010
 plunge-0.010
Token felt
Token ID2936
Feature activation-3.12e-01

Pos logit contributions
arse+0.070
gency+0.066
 +0.066
 pleasure+0.061
 plunge+0.060
Neg logit contributions
ate-0.052
ires-0.042
mill-0.040
ia-0.040
der-0.039
Token very
Token ID845
Feature activation-2.20e-01

Pos logit contributions
ate+0.009
ires+0.008
mill+0.008
ia+0.008
der+0.007
Neg logit contributions
arse-0.007
gency-0.007
 -0.007
 pleasure-0.006
 plunge-0.006
Token sad
Token ID6507
Feature activation-1.09e+00

Pos logit contributions
ate+0.007
mill+0.006
ires+0.006
ia+0.006
der+0.006
Neg logit contributions
arse-0.005
 -0.004
gency-0.004
 pleasure-0.004
 plunge-0.004
Token.
Token ID13
Feature activation+6.69e-01

Pos logit contributions
ate+0.030
ires+0.024
ia+0.023
mill+0.023
urse+0.021
Neg logit contributions
arse-0.027
 -0.027
gency-0.025
 pleasure-0.024
 might-0.023
Token<|endoftext|>
Token ID50256
Feature activation+9.93e-01

Pos logit contributions
arse+0.018
gency+0.017
 plunge+0.016
 +0.015
 pleasure+0.015
Neg logit contributions
ate-0.017
ires-0.014
ia-0.014
der-0.014
mill-0.013
TokenJohn
Token ID7554
Feature activation+7.15e-01

Pos logit contributions
arse+0.033
gency+0.030
 pleasure+0.028
 plunge+0.027
 proper+0.026
Neg logit contributions
ate-0.021
 from-0.016
ia-0.016
 about-0.016
ires-0.015
Token will
Token ID481
Feature activation-6.42e-01

Pos logit contributions
ate+0.009
ires+0.008
der+0.008
mill+0.008
iring+0.008
Neg logit contributions
arse-0.005
 -0.005
gency-0.005
 pleasure-0.004
 plunge-0.004
Token lead
Token ID1085
Feature activation+1.63e+00

Pos logit contributions
ate+0.018
ires+0.016
mill+0.015
der+0.015
ia+0.015
Neg logit contributions
arse-0.015
 -0.014
gency-0.014
 pleasure-0.013
 plunge-0.013
Token you
Token ID345
Feature activation+2.12e-02

Pos logit contributions
arse+0.040
gency+0.037
 +0.035
 pleasure+0.034
 plunge+0.032
Neg logit contributions
ate-0.046
ires-0.038
 from-0.037
ia-0.037
mill-0.037
Token to
Token ID284
Feature activation-7.50e-01

Pos logit contributions
arse+0.001
gency+0.001
 +0.001
 pleasure+0.000
 plunge+0.000
Neg logit contributions
ate-0.001
ires-0.000
ia-0.000
mill-0.000
der-0.000
Token a
Token ID257
Feature activation+1.40e+00

Pos logit contributions
ate+0.017
 from+0.014
 about+0.013
ires+0.013
ia+0.013
Neg logit contributions
arse-0.022
gency-0.020
 -0.020
 pleasure-0.018
 plunge-0.018
Token secret
Token ID3200
Feature activation-1.13e+00

Pos logit contributions
arse+0.043
gency+0.040
 +0.037
 pleasure+0.035
rified+0.035
Neg logit contributions
ate-0.028
ia-0.022
 from-0.022
mill-0.021
 ranger-0.021
Token place
Token ID1295
Feature activation+3.03e-01

Pos logit contributions
ate+0.030
ires+0.027
mill+0.026
der+0.026
ia+0.025
Neg logit contributions
arse-0.025
 -0.024
gency-0.024
 plunge-0.023
 pleasure-0.022
Token."
Token ID526
Feature activation-9.39e-01

Pos logit contributions
arse+0.007
 +0.006
gency+0.006
 pleasure+0.006
 plunge+0.006
Neg logit contributions
ate-0.009
ires-0.007
der-0.007
ia-0.007
mill-0.007
Token\n
Token ID198
Feature activation+1.27e-01

Pos logit contributions
ate+0.027
mill+0.024
ires+0.023
der+0.022
iring+0.022
Neg logit contributions
 -0.022
arse-0.019
gency-0.019
 pleasure-0.018
 plunge-0.017
Token\n
Token ID198
Feature activation-3.29e-01

Pos logit contributions
 +0.003
arse+0.002
gency+0.002
 plunge+0.002
 pleasure+0.002
Neg logit contributions
ate-0.004
mill-0.004
der-0.003
ires-0.003
iring-0.003
TokenS
Token ID50
Feature activation+8.28e-02

Pos logit contributions
ate+0.007
ires+0.007
der+0.006
mill+0.006
 about+0.006
Neg logit contributions
 -0.009
arse-0.008
gency-0.008
 plunge-0.008
 handful-0.007
Token a
Token ID257
Feature activation+4.85e-01

Pos logit contributions
ate+0.033
der+0.029
ires+0.028
mill+0.028
resp+0.027
Neg logit contributions
 -0.022
arse-0.021
 pleasure-0.019
gency-0.017
 hands-0.016
Token happy
Token ID3772
Feature activation-7.04e-01

Pos logit contributions
arse+0.013
gency+0.013
 +0.012
 plunge+0.011
 pleasure+0.011
Neg logit contributions
ate-0.012
mill-0.010
ia-0.010
ires-0.010
 from-0.009
Token family
Token ID1641
Feature activation-2.47e-01

Pos logit contributions
ate+0.021
ires+0.018
der+0.016
mill+0.016
iring+0.016
Neg logit contributions
arse-0.015
 -0.015
gency-0.014
 pleasure-0.014
 suitcase-0.013
Token.
Token ID13
Feature activation-8.61e-01

Pos logit contributions
ate+0.005
 from+0.004
mill+0.004
 about+0.004
ia+0.004
Neg logit contributions
arse-0.008
gency-0.008
 -0.007
 plunge-0.007
 pleasure-0.007
Token<|endoftext|>
Token ID50256
Feature activation-4.15e-01

Pos logit contributions
ate+0.020
ires+0.016
mill+0.016
ia+0.016
der+0.015
Neg logit contributions
 -0.024
arse-0.024
gency-0.023
 pleasure-0.022
 plunge-0.020
TokenAnna
Token ID31160
Feature activation-1.88e+00

Pos logit contributions
ate+0.011
ires+0.009
ia+0.009
mill+0.009
 from+0.009
Neg logit contributions
arse-0.011
gency-0.010
 -0.009
 pleasure-0.009
 plunge-0.009
Token and
Token ID290
Feature activation+1.71e-01

Pos logit contributions
iring+0.038
ires+0.038
ate+0.036
der+0.036
ble+0.035
Neg logit contributions
 -0.053
 might-0.049
use-0.046
 pleasure-0.046
gency-0.045
Token Ben
Token ID3932
Feature activation-6.19e-01

Pos logit contributions
arse+0.005
gency+0.005
 +0.004
 pleasure+0.004
 plunge+0.004
Neg logit contributions
ate-0.004
ires-0.003
ia-0.003
mill-0.003
der-0.003
Token like
Token ID588
Feature activation-1.10e+00

Pos logit contributions
ate+0.018
ires+0.018
iring+0.017
der+0.017
mill+0.017
Neg logit contributions
 -0.013
arse-0.011
gency-0.011
 pleasure-0.011
're-0.010
Token to
Token ID284
Feature activation+2.26e-02

Pos logit contributions
ate+0.032
ires+0.028
der+0.027
ia+0.027
iring+0.026
Neg logit contributions
 -0.023
gency-0.022
arse-0.022
 pleasure-0.021
 plunge-0.020
Token play
Token ID711
Feature activation-1.30e+00

Pos logit contributions
 +0.000
arse+0.000
gency+0.000
 pleasure+0.000
 plunge+0.000
Neg logit contributions
ate-0.001
ires-0.001
der-0.001
iring-0.001
mill-0.001
Token girl
Token ID2576
Feature activation-1.32e+00

Pos logit contributions
arse+0.016
gency+0.015
 +0.015
 pleasure+0.014
 plunge+0.013
Neg logit contributions
ate-0.014
ires-0.011
mill-0.011
ia-0.011
der-0.011
Token near
Token ID1474
Feature activation+1.10e+00

Pos logit contributions
ate+0.030
ires+0.024
der+0.024
ia+0.023
mill+0.023
Neg logit contributions
arse-0.038
 -0.037
gency-0.036
 pleasure-0.033
 plunge-0.033
Token the
Token ID262
Feature activation-5.35e-01

Pos logit contributions
arse+0.032
gency+0.030
 +0.028
 pleasure+0.027
 plunge+0.027
Neg logit contributions
ate-0.026
ires-0.021
ia-0.021
mill-0.020
 from-0.020
Token ball
Token ID2613
Feature activation-5.53e-01

Pos logit contributions
ate+0.014
ires+0.012
mill+0.012
ia+0.012
der+0.012
Neg logit contributions
arse-0.013
 -0.013
gency-0.013
 pleasure-0.011
 plunge-0.011
Token.
Token ID13
Feature activation-3.06e-01

Pos logit contributions
ate+0.013
ires+0.011
mill+0.011
ia+0.010
der+0.010
Neg logit contributions
arse-0.015
gency-0.015
 -0.014
 pleasure-0.013
 plunge-0.013
Token He
Token ID679
Feature activation-7.48e-01

Pos logit contributions
ate+0.009
ires+0.008
mill+0.007
der+0.007
iring+0.007
Neg logit contributions
 -0.007
gency-0.006
arse-0.006
 pleasure-0.006
 might-0.006
Token asked
Token ID1965
Feature activation+3.93e-01

Pos logit contributions
ate+0.024
ires+0.023
iring+0.021
der+0.021
mill+0.020
Neg logit contributions
 -0.017
 might-0.015
gency-0.013
 Bobby-0.012
arse-0.011
Token her
Token ID607
Feature activation-2.01e+00

Pos logit contributions
arse+0.011
gency+0.011
 +0.010
 pleasure+0.010
 plunge+0.010
Neg logit contributions
ate-0.009
ires-0.007
ia-0.007
 from-0.007
mill-0.007
Token,
Token ID11
Feature activation-2.90e-01

Pos logit contributions
ate+0.061
mill+0.053
ires+0.053
der+0.052
her+0.050
Neg logit contributions
 -0.044
arse-0.037
 might-0.035
gency-0.034
 pleasure-0.033
Token "
Token ID366
Feature activation+4.81e-01

Pos logit contributions
ate+0.006
ires+0.005
mill+0.005
ia+0.005
der+0.005
Neg logit contributions
arse-0.009
gency-0.008
 -0.008
 pleasure-0.008
 plunge-0.008
TokenCan
Token ID6090
Feature activation+2.62e-01

Pos logit contributions
arse+0.014
gency+0.014
 +0.013
 pleasure+0.013
 plunge+0.012
Neg logit contributions
ate-0.011
ires-0.009
mill-0.008
ia-0.008
der-0.008
Token boy
Token ID2933
Feature activation+6.01e-01

Pos logit contributions
arse+0.000
 pleasure+0.000
 +0.000
gency+0.000
 suitcase+0.000
Neg logit contributions
ate-0.000
ires-0.000
iring-0.000
ups-0.000
uted-0.000
Token who
Token ID508
Feature activation+1.36e+00

Pos logit contributions
arse+0.021
gency+0.020
 +0.020
 pleasure+0.019
 plunge+0.018
Neg logit contributions
ate-0.010
ires-0.008
mill-0.007
ia-0.007
der-0.007
Token always
Token ID1464
Feature activation+4.24e-01

Pos logit contributions
 +0.033
 might+0.030
gency+0.028
 Bobby+0.028
arse+0.027
Neg logit contributions
ate-0.034
ires-0.032
der-0.031
iring-0.030
ia-0.030
Token talked
Token ID6619
Feature activation-5.07e-01

Pos logit contributions
arse+0.011
 +0.011
gency+0.011
 pleasure+0.010
 plunge+0.010
Neg logit contributions
ate-0.010
ires-0.009
ia-0.008
der-0.008
mill-0.008
Token about
Token ID546
Feature activation-1.14e+00

Pos logit contributions
ate+0.011
ires+0.009
mill+0.009
ia+0.009
der+0.008
Neg logit contributions
arse-0.014
 -0.014
gency-0.014
 pleasure-0.013
 plunge-0.013
Token lots
Token ID6041
Feature activation-2.96e+00

Pos logit contributions
ate+0.027
ires+0.022
mill+0.021
ia+0.021
der+0.021
Neg logit contributions
arse-0.033
gency-0.031
 -0.030
 pleasure-0.028
 plunge-0.028
Token of
Token ID286
Feature activation-1.71e+00

Pos logit contributions
ate+0.074
ble+0.059
mill+0.059
urse+0.053
ia+0.052
Neg logit contributions
 -0.069
 might-0.067
fy-0.065
 dearly-0.063
 proper-0.062
Token fun
Token ID1257
Feature activation-1.63e+00

Pos logit contributions
ate+0.035
ires+0.028
ia+0.026
mill+0.026
der+0.025
Neg logit contributions
arse-0.055
gency-0.052
 -0.050
 pleasure-0.048
 plunge-0.047
Token things
Token ID1243
Feature activation-2.09e+00

Pos logit contributions
ate+0.047
mill+0.040
ia+0.039
ires+0.039
der+0.039
Neg logit contributions
 -0.033
arse-0.032
gency-0.031
 might-0.028
 plunge-0.027
Token.
Token ID13
Feature activation+5.34e-01

Pos logit contributions
ate+0.057
ia+0.048
ires+0.048
mill+0.046
der+0.044
Neg logit contributions
 -0.048
arse-0.048
gency-0.045
 pleasure-0.042
 might-0.042
Token Every
Token ID3887
Feature activation+1.74e+00

Pos logit contributions
arse+0.015
gency+0.014
 plunge+0.014
 unnecessary+0.012
 risk+0.012
Neg logit contributions
ate-0.013
 about-0.012
ia-0.011
ires-0.011
mill-0.010
Token hanging
Token ID10938
Feature activation-1.44e+00

Pos logit contributions
arse+0.008
gency+0.008
 +0.008
 pleasure+0.007
 plunge+0.007
Neg logit contributions
ate-0.009
ires-0.007
mill-0.007
ia-0.007
der-0.007
Token!"
Token ID2474
Feature activation+2.22e-01

Pos logit contributions
ate+0.032
ires+0.027
ia+0.026
urse+0.025
mill+0.025
Neg logit contributions
 -0.039
arse-0.038
 plunge-0.036
gency-0.036
 handful-0.033
Token
Token ID220
Feature activation-1.80e-01

Pos logit contributions
arse+0.006
gency+0.006
 +0.006
 pleasure+0.006
 plunge+0.005
Neg logit contributions
ate-0.005
ires-0.004
ia-0.004
mill-0.004
der-0.004
Token
Token ID198
Feature activation+2.18e-01

Pos logit contributions
ate+0.005
ires+0.004
ia+0.004
mill+0.004
der+0.004
Neg logit contributions
arse-0.005
gency-0.005
 -0.004
 pleasure-0.004
 plunge-0.004
Token
Token ID198
Feature activation-5.40e-01

Pos logit contributions
arse+0.006
gency+0.006
 pleasure+0.005
 +0.005
 plunge+0.005
Neg logit contributions
ate-0.005
ires-0.004
ia-0.004
mill-0.004
 about-0.004
TokenAfter
Token ID3260
Feature activation-2.54e+00

Pos logit contributions
ate+0.013
der+0.012
mill+0.012
ires+0.011
oni+0.011
Neg logit contributions
 -0.014
arse-0.014
gency-0.012
 proper-0.011
 plunge-0.011
Token playing
Token ID2712
Feature activation-1.88e+00

Pos logit contributions
ate+0.059
ires+0.048
mill+0.047
ia+0.047
der+0.046
Neg logit contributions
arse-0.074
gency-0.069
 -0.067
 pleasure-0.064
 plunge-0.062
Token,
Token ID11
Feature activation-8.70e-01

Pos logit contributions
ate+0.051
ires+0.043
mill+0.042
der+0.042
oni+0.040
Neg logit contributions
 -0.045
arse-0.042
gency-0.040
 plunge-0.039
 pleasure-0.038
Token Lily
Token ID20037
Feature activation-1.56e+00

Pos logit contributions
ate+0.020
ires+0.016
ia+0.016
mill+0.016
der+0.015
Neg logit contributions
arse-0.025
gency-0.024
 -0.023
 pleasure-0.022
 plunge-0.022
Token was
Token ID373
Feature activation-1.12e+00

Pos logit contributions
ate+0.046
ires+0.046
iring+0.041
mill+0.041
der+0.040
Neg logit contributions
 -0.035
arse-0.032
gency-0.030
 might-0.028
 suitcase-0.026
Token thirsty
Token ID47124
Feature activation-1.15e+00

Pos logit contributions
ate+0.035
ires+0.031
der+0.030
ia+0.029
mill+0.029
Neg logit contributions
arse-0.022
 -0.022
gency-0.020
 pleasure-0.019
 plunge-0.018
Token.
Token ID13
Feature activation-3.62e-01

Pos logit contributions
ate+0.004
ires+0.003
mill+0.003
ia+0.003
der+0.003
Neg logit contributions
arse-0.004
 -0.004
gency-0.004
 pleasure-0.004
 plunge-0.003
Token The
Token ID383
Feature activation-1.76e+00

Pos logit contributions
ate+0.010
mill+0.009
ires+0.008
ble+0.008
der+0.008
Neg logit contributions
 -0.009
 pleasure-0.008
 Bobby-0.007
gency-0.007
arse-0.007
Token moral
Token ID6573
Feature activation-1.21e+00

Pos logit contributions
ate+0.041
der+0.033
mill+0.033
ires+0.033
iders+0.032
Neg logit contributions
 -0.048
arse-0.047
 plunge-0.046
 pleasure-0.045
 risk-0.043
Token of
Token ID286
Feature activation-9.92e-01

Pos logit contributions
mill+0.027
ate+0.027
der+0.026
castle+0.025
uted+0.024
Neg logit contributions
 -0.031
 might-0.029
 risk-0.027
 pleasure-0.027
 plunge-0.026
Token the
Token ID262
Feature activation-2.00e+00

Pos logit contributions
ate+0.026
ires+0.022
der+0.022
mill+0.021
ia+0.021
Neg logit contributions
arse-0.025
 -0.024
gency-0.023
 pleasure-0.021
 plunge-0.021
Token story
Token ID1621
Feature activation-2.98e+00

Pos logit contributions
ate+0.057
ires+0.049
mill+0.048
der+0.048
ia+0.047
Neg logit contributions
arse-0.045
 -0.044
gency-0.043
 plunge-0.039
 pleasure-0.039
Token is
Token ID318
Feature activation-1.32e+00

Pos logit contributions
ate+0.087
aver+0.083
mill+0.082
iring+0.081
ires+0.079
Neg logit contributions
 might-0.056
 -0.055
 may-0.049
gency-0.044
 forever-0.043
Token that
Token ID326
Feature activation+7.19e-01

Pos logit contributions
ate+0.032
mill+0.029
der+0.028
uted+0.027
ia+0.027
Neg logit contributions
 -0.032
gency-0.030
arse-0.030
 pleasure-0.029
 might-0.029
Token it
Token ID340
Feature activation-1.14e+00

Pos logit contributions
arse+0.018
 +0.017
gency+0.017
 pleasure+0.016
 plunge+0.016
Neg logit contributions
ate-0.019
ires-0.016
ia-0.015
mill-0.015
der-0.015
Token's
Token ID338
Feature activation-8.37e-01

Pos logit contributions
ate+0.036
mill+0.033
ires+0.032
ia+0.032
der+0.031
Neg logit contributions
 -0.020
arse-0.020
 might-0.019
 may-0.017
gency-0.016
Token good
Token ID922
Feature activation-1.66e+00

Pos logit contributions
ate+0.022
ia+0.020
der+0.020
ires+0.020
mill+0.019
Neg logit contributions
arse-0.019
 -0.019
 pleasure-0.018
gency-0.017
 might-0.016
Token lion
Token ID18744
Feature activation+4.53e-01

Pos logit contributions
ate+0.023
ires+0.021
iring+0.020
ia+0.019
urse+0.018
Neg logit contributions
arse-0.011
 -0.011
gency-0.010
 plunge-0.009
 handful-0.009
Token!"
Token ID2474
Feature activation+6.38e-02

Pos logit contributions
arse+0.007
 +0.007
gency+0.007
 plunge+0.006
 pleasure+0.006
Neg logit contributions
ate-0.016
ires-0.014
iring-0.014
ia-0.013
der-0.013
Token Anna
Token ID11735
Feature activation-1.12e+00

Pos logit contributions
 +0.002
arse+0.002
gency+0.001
 pleasure+0.001
 plunge+0.001
Neg logit contributions
ate-0.002
ires-0.001
mill-0.001
der-0.001
iring-0.001
Token says
Token ID1139
Feature activation+2.22e-01

Pos logit contributions
ate+0.030
mill+0.026
ires+0.026
iring+0.024
ross+0.023
Neg logit contributions
 -0.029
arse-0.027
gency-0.025
 might-0.024
 pleasure-0.023
Token.
Token ID13
Feature activation-1.48e+00

Pos logit contributions
arse+0.007
gency+0.007
 pleasure+0.006
 +0.006
 plunge+0.006
Neg logit contributions
ate-0.005
 from-0.004
 about-0.004
der-0.004
ires-0.004
Token She
Token ID1375
Feature activation-2.65e+00

Pos logit contributions
ate+0.040
ires+0.033
mill+0.033
ia+0.032
der+0.031
Neg logit contributions
 -0.037
arse-0.036
gency-0.034
 pleasure-0.031
 plunge-0.030
Token shows
Token ID2523
Feature activation-5.50e-01

Pos logit contributions
iring+0.075
ate+0.071
ross+0.070
mill+0.068
ires+0.066
Neg logit contributions
 -0.054
 might-0.048
 may-0.045
 hands-0.044
arse-0.043
Token Ben
Token ID3932
Feature activation+2.78e-01

Pos logit contributions
ate+0.014
ires+0.012
ia+0.012
der+0.011
 from+0.011
Neg logit contributions
arse-0.015
gency-0.014
 -0.013
 pleasure-0.013
 plunge-0.012
Token her
Token ID607
Feature activation-4.88e-01

Pos logit contributions
 +0.007
arse+0.007
gency+0.006
 handful+0.006
 might+0.006
Neg logit contributions
ate-0.007
ires-0.006
der-0.006
iring-0.006
ia-0.005
Token lion
Token ID18744
Feature activation-1.12e-01

Pos logit contributions
ate+0.013
ires+0.011
mill+0.011
ia+0.011
der+0.011
Neg logit contributions
arse-0.012
gency-0.011
 -0.011
 pleasure-0.010
 plunge-0.010
Token.
Token ID13
Feature activation-8.70e-01

Pos logit contributions
ate+0.003
ires+0.003
iring+0.003
der+0.003
mill+0.002
Neg logit contributions
 -0.003
gency-0.002
arse-0.002
 might-0.002
 handful-0.002
Token giant
Token ID6175
Feature activation+1.43e-01

Pos logit contributions
ate+0.012
ires+0.010
mill+0.009
der+0.009
ia+0.009
Neg logit contributions
arse-0.012
 -0.012
gency-0.011
 pleasure-0.011
 plunge-0.011
Token and
Token ID290
Feature activation+3.68e-01

Pos logit contributions
arse+0.005
 +0.004
gency+0.004
 pleasure+0.004
 plunge+0.004
Neg logit contributions
ate-0.003
ires-0.002
mill-0.002
ia-0.002
der-0.002
Token ran
Token ID4966
Feature activation-3.29e-01

Pos logit contributions
 +0.009
 Bobby+0.008
gency+0.008
 pleasure+0.007
arse+0.007
Neg logit contributions
ate-0.010
ires-0.010
iders-0.009
iring-0.009
ia-0.008
Token off
Token ID572
Feature activation+9.81e-02

Pos logit contributions
ate+0.010
ires+0.008
mill+0.008
ia+0.008
 from+0.008
Neg logit contributions
arse-0.008
gency-0.007
 -0.007
 pleasure-0.006
 plunge-0.006
Token.
Token ID13
Feature activation+1.19e-01

Pos logit contributions
arse+0.003
gency+0.003
 +0.003
 pleasure+0.003
 plunge+0.002
Neg logit contributions
ate-0.002
ires-0.002
mill-0.002
 from-0.002
ia-0.002
Token From
Token ID3574
Feature activation-2.27e+00

Pos logit contributions
 +0.003
arse+0.003
gency+0.003
 pleasure+0.003
 plunge+0.002
Neg logit contributions
ate-0.003
ires-0.003
mill-0.003
ia-0.003
der-0.002
Token that
Token ID326
Feature activation+3.01e-01

Pos logit contributions
ate+0.050
der+0.044
resp+0.044
urse+0.044
ires+0.041
Neg logit contributions
 -0.061
arse-0.058
gency-0.057
 Bobby-0.055
 plunge-0.055
Token day
Token ID1110
Feature activation+4.71e-01

Pos logit contributions
arse+0.009
gency+0.008
 plunge+0.007
 unnecessary+0.007
 pleasure+0.007
Neg logit contributions
ate-0.007
 from-0.006
 about-0.006
mill-0.006
ia-0.006
Token on
Token ID319
Feature activation-7.00e-01

Pos logit contributions
arse+0.013
gency+0.013
 pleasure+0.013
 proper+0.012
 +0.012
Neg logit contributions
ate-0.010
 from-0.009
ires-0.008
 about-0.008
ia-0.008
Token,
Token ID11
Feature activation-2.42e-01

Pos logit contributions
ate+0.017
ires+0.014
mill+0.014
ia+0.014
der+0.013
Neg logit contributions
arse-0.019
gency-0.018
 -0.018
 pleasure-0.017
 plunge-0.016
Token the
Token ID262
Feature activation-5.28e-01

Pos logit contributions
ate+0.006
ires+0.005
ia+0.005
mill+0.005
der+0.005
Neg logit contributions
arse-0.006
 -0.006
gency-0.006
 pleasure-0.006
 plunge-0.005