TOP ACTIVATIONS
MAX = 11.436

my were on a play date . Alice was playing with
friends over for a play date . She showed them her
some snow for a play date with his friends . He
own toys to the play date . The first one brought
came over for a play date . Her friend asked for
my house for a play date . I invite you ."
! She finished her play date and went home , with
would have a play - date and wear makeup .
could have her play - date after all . <|endoftext|> One
and had a fun play date . <|endoftext|> Once upon a
for using the past - date cheese . Sam learned an
can have your play - date with R uf us now
almost ready for the play date ! Billy opened
Alex had used past - date cheese ! Sam yelled ,
let 's have a play date !" The steak was
the wall . But , oh no ! The stack fell
Uh oh , what should I do
" Uh oh !" said Liz .
fox said , " Uh oh , I shouldn 't stay
" Uh oh , we have a problem

MIN ACTIVATIONS
MIN = -14.192

began screaming again . Sc utt les was so scared ,
came from it . Sc utt les was so frightened ,
too . Sc utt les was scared but he
, but especially for Sc utt les . <|endoftext|> B oots
his little nose and sc utt led away . From that
. He screamed and sc utt led away .
tall tree . He sc utt led over to it and
some small colourful creatures sc utt ling around . He was
pillar happily complied and sc utt led closer to Rose .
crab named Charlie , sc utt ling towards him . Charlie
the dry grass and sc utt led away with its friends
learned a hard lesson . Taking things that are not yours
happier . Everyone agreed . Taking the medicine felt nice and
be healthy and happy . Taking your pill is good for
oral of the story : Taking regular n aps can help
's best for you . Taking a bath is good for
always have its rewards . Taking a break is important ,
share with your sister . Taking her dolls was very bad
that 's not nice . Taking things that aren 't yours
learned an important lesson . Taking things without asking was wrong

INTERVAL 5.029 - 11.436
CONTAINS 8.037%

to a little island with a tree . When the potato
Then suddenly , the line started to tug ! He yelled
loud noise . It was a grow l , like an
that Ben was a true friend . He thought of her
and his mom spent the afternoon playing and having fun .

INTERVAL -1.378 - 5.029
CONTAINS 56.893%

Later that day , Benny saw his friend , the squirrel
was ok . <|endoftext|> Once there was a playful puppy .
Sammy was hurt and she wanted to help . She brought
Let 's draw a face on it ," Sue said .
massage , Lily asked if she could go outside to play

INTERVAL -7.785 - -1.378
CONTAINS 29.137%

the hose finally stopped . At the end of
go on an adventure . He looked to the left and
He waited with her until they came . The
my , if you had cleaned up your toys like I
Mom , can I have some water ?" Sara asked .

INTERVAL -14.192 - -7.785
CONTAINS 5.932%

He had a mild heart and was always very kind to
likes the animals too . She thanks the farmer for inviting
cars . He looked around and noticed lots of different buildings
. She felt silly . She was not brave like Ben
to the living room . She saw Ben sitting on the
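
NOTE ON THE INTERVAL BOUNDARIES

The four INTERVAL blocks above appear to split the range from MIN = -14.192 to MAX = 11.436 into equal-width bins of about 6.407 each. The sketch below reproduces those boundaries under that assumption. The helper name equal_width_edges and the rounding to three decimals are illustrative choices, not something stated in the dump, and the tool that produced it may compute its bins differently.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 equally spaced bin edges from lo to hi.

    Hypothetical helper: assumes the dashboard uses equal-width bins,
    which matches the printed INTERVAL values but is not confirmed.
    """
    width = (hi - lo) / n_bins
    # Round to three decimals to match the precision printed in the dump.
    return [round(lo + i * width, 3) for i in range(n_bins + 1)]


# MIN and MAX are copied from the dump; four bins matches the four INTERVAL blocks.
edges = equal_width_edges(-14.192, 11.436, 4)

# Pair consecutive edges into (low, high) intervals, highest interval first,
# which is the order the dump lists them in.
intervals = list(zip(edges[:-1], edges[1:]))[::-1]

for low, high in intervals:
    print(f"INTERVAL {low:.3f} - {high:.3f}")
```

The CONTAINS percentages (8.037, 56.893, 29.137, 5.932) sum to 99.999, which is consistent with each activation falling into exactly one of these bins, up to rounding.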
Tokenmy
Token ID1820
Feature activation+3.08e+00

Pos logit contributions
 ha+0.240
 babys+0.236
 Lee+0.231
 happened+0.230
 principal+0.227
Neg logit contributions
From-0.167
Years-0.162
 refreshing-0.161
lessly-0.157
utt-0.145
Token were
Token ID547
Feature activation+3.01e+00

Pos logit contributions
 ha+0.082
 happened+0.079
 babys+0.078
 client+0.078
 classmate+0.077
Neg logit contributions
From-0.070
 refreshing-0.070
Years-0.068
lessly-0.065
asting-0.065
Token on
Token ID319
Feature activation-7.25e-01

Pos logit contributions
 client+0.084
 grade+0.083
 classmate+0.083
 principal+0.082
 ha+0.081
Neg logit contributions
From-0.065
 refreshing-0.063
 Every-0.063
 often-0.061
asting-0.060
Token a
Token ID257
Feature activation+5.78e+00

Pos logit contributions
 Every+0.011
From+0.011
 refreshing+0.011
 all+0.010
 tight+0.010
Neg logit contributions
 classmate-0.025
 client-0.025
 principal-0.024
 ha-0.024
 grade-0.024
Token play
Token ID711
Feature activation-1.41e+00

Pos logit contributions
 client+0.175
 classmate+0.174
 principal+0.173
 grade+0.172
 ha+0.170
Neg logit contributions
 refreshing-0.112
From-0.112
 Every-0.107
 often-0.102
Years-0.102
Tokendate
Token ID4475
Feature activation+1.14e+01

Pos logit contributions
From+0.025
 refreshing+0.024
 Every+0.023
asting+0.022
 often+0.022
Neg logit contributions
 client-0.045
 classmate-0.045
 principal-0.045
 grade-0.045
 ha-0.044
Token.
Token ID13
Feature activation+2.48e+00

Pos logit contributions
 client+0.421
 principal+0.413
 ha+0.407
 liar+0.407
 classmate+0.406
Neg logit contributions
 all-0.153
 Every-0.149
 often-0.147
 around-0.146
 every-0.143
Token Alice
Token ID14862
Feature activation+2.32e+00

Pos logit contributions
 classmate+0.080
 client+0.080
 grade+0.079
 principal+0.079
 ha+0.079
Neg logit contributions
From-0.043
 refreshing-0.041
 Every-0.040
asting-0.038
Years-0.038
Token was
Token ID373
Feature activation+4.52e+00

Pos logit contributions
 classmate+0.065
 ha+0.065
 client+0.064
 principal+0.064
 grade+0.063
Neg logit contributions
From-0.050
 refreshing-0.049
 Every-0.047
asting-0.046
Years-0.046
Token playing
Token ID2712
Feature activation-2.71e+00

Pos logit contributions
 client+0.143
 grade+0.139
 classmate+0.137
 principal+0.135
 ha+0.132
Neg logit contributions
 refreshing-0.082
 often-0.080
 Every-0.080
From-0.079
 all-0.079
Token with
Token ID351
Feature activation-4.02e+00

Pos logit contributions
From+0.045
 refreshing+0.043
 Every+0.042
asting+0.039
Years+0.039
Neg logit contributions
 classmate-0.090
 client-0.090
 grade-0.089
 principal-0.089
 ha-0.089
Token friends
Token ID2460
Feature activation+3.42e+00

Pos logit contributions
 grade+0.152
 liar+0.141
 principal+0.140
fer+0.138
 client+0.137
Neg logit contributions
 all-0.107
 Every-0.101
 every-0.100
 tight-0.100
 refreshing-0.098
Token over
Token ID625
Feature activation+1.17e+00

Pos logit contributions
 classmate+0.101
 client+0.099
 principal+0.099
 ha+0.099
 grade+0.099
Neg logit contributions
From-0.071
 refreshing-0.069
 Every-0.065
Years-0.063
asting-0.063
Token for
Token ID329
Feature activation-2.98e-01

Pos logit contributions
 client+0.038
 classmate+0.038
 ha+0.038
 principal+0.038
 grade+0.037
Neg logit contributions
From-0.020
 Every-0.019
 refreshing-0.018
 often-0.018
 tight-0.018
Token a
Token ID257
Feature activation+5.55e+00

Pos logit contributions
From+0.005
 refreshing+0.005
 Every+0.005
asting+0.005
Years+0.005
Neg logit contributions
 classmate-0.009
 client-0.009
 ha-0.009
 principal-0.009
 grade-0.009
Token play
Token ID711
Feature activation-1.45e+00

Pos logit contributions
 client+0.161
 principal+0.161
 grade+0.156
 classmate+0.156
 ha+0.156
Neg logit contributions
 refreshing-0.114
From-0.108
 tight-0.107
 often-0.107
 Every-0.104
Tokendate
Token ID4475
Feature activation+1.14e+01

Pos logit contributions
 Every+0.041
From+0.041
 all+0.040
 often+0.039
 refreshing+0.038
Neg logit contributions
 client-0.031
 grade-0.031
 principal-0.030
 classmate-0.030
 babys-0.029
Token.
Token ID13
Feature activation+1.83e+00

Pos logit contributions
 client+0.418
 principal+0.406
 classmate+0.403
 grade+0.402
 ha+0.401
Neg logit contributions
 all-0.153
 every-0.146
 often-0.145
 Every-0.144
 around-0.144
Token She
Token ID1375
Feature activation-7.76e+00

Pos logit contributions
 client+0.075
 ha+0.075
 grade+0.075
 classmate+0.075
 principal+0.074
Neg logit contributions
 Every-0.016
From-0.015
 refreshing-0.013
 often-0.012
Years-0.011
Token showed
Token ID3751
Feature activation+5.89e-01

Pos logit contributions
From+0.151
 refreshing+0.144
 Every+0.143
 often+0.141
Years+0.134
Neg logit contributions
 grade-0.235
 client-0.234
 classmate-0.231
 principal-0.230
 ha-0.228
Token them
Token ID606
Feature activation+1.23e+00

Pos logit contributions
 classmate+0.023
 client+0.023
 principal+0.023
 grade+0.023
 ha+0.023
Neg logit contributions
From-0.006
 refreshing-0.006
 Every-0.005
asting-0.005
Years-0.005
Token her
Token ID607
Feature activation+5.05e+00

Pos logit contributions
 classmate+0.040
 client+0.039
 principal+0.039
 ha+0.039
 babys+0.038
Neg logit contributions
From-0.022
 refreshing-0.021
asting-0.019
Years-0.019
lessly-0.019
Token some
Token ID617
Feature activation-1.46e+00

Pos logit contributions
From+0.079
Years+0.076
asting+0.074
 refreshing+0.073
lessly+0.069
Neg logit contributions
 ha-0.085
 client-0.079
 haircut-0.079
 grade-0.077
 classmate-0.077
Token snow
Token ID6729
Feature activation-3.38e+00

Pos logit contributions
From+0.033
Years+0.029
 refreshing+0.028
Even+0.027
asting+0.027
Neg logit contributions
 ha-0.042
 grade-0.041
 classmate-0.040
 client-0.040
 babys-0.040
Token for
Token ID329
Feature activation+7.08e-02

Pos logit contributions
From+0.070
 refreshing+0.067
 Every+0.066
Years+0.063
asting+0.063
Neg logit contributions
 classmate-0.098
 client-0.098
 principal-0.097
 grade-0.097
 ha-0.096
Token a
Token ID257
Feature activation+5.38e+00

Pos logit contributions
 ha+0.002
 client+0.002
 principal+0.002
 grade+0.002
 classmate+0.002
Neg logit contributions
From-0.001
 refreshing-0.001
 Every-0.001
Years-0.001
asting-0.001
Token play
Token ID711
Feature activation-1.33e+00

Pos logit contributions
 ha+0.145
 classmate+0.145
 client+0.144
 grade+0.143
 babys+0.141
Neg logit contributions
From-0.124
 refreshing-0.117
 Every-0.117
asting-0.111
Years-0.109
Tokendate
Token ID4475
Feature activation+1.14e+01

Pos logit contributions
From+0.029
 refreshing+0.028
 Every+0.026
Years+0.026
 From+0.025
Neg logit contributions
 ha-0.038
 classmate-0.037
 grade-0.037
 client-0.037
 principal-0.036
Token with
Token ID351
Feature activation-4.02e+00

Pos logit contributions
 client+0.386
 classmate+0.381
 principal+0.379
 ha+0.378
 grade+0.377
Neg logit contributions
 Every-0.170
 often-0.166
 all-0.164
From-0.162
 refreshing-0.158
Token his
Token ID465
Feature activation+3.92e+00

Pos logit contributions
From+0.082
 refreshing+0.074
Years+0.072
utt+0.070
asting+0.069
Neg logit contributions
 ha-0.119
 classmate-0.115
 principal-0.114
 babys-0.114
 client-0.111
Token friends
Token ID2460
Feature activation+3.47e+00

Pos logit contributions
 client+0.093
 classmate+0.093
 grade+0.092
 principal+0.092
 ha+0.091
Neg logit contributions
From-0.101
 refreshing-0.099
 Every-0.097
asting-0.093
Years-0.093
Token.
Token ID13
Feature activation+2.39e+00

Pos logit contributions
 classmate+0.088
 ha+0.087
 client+0.086
 principal+0.085
 Lee+0.085
Neg logit contributions
From-0.089
 refreshing-0.085
Years-0.081
asting-0.078
 Every-0.077
Token He
Token ID679
Feature activation-6.76e+00

Pos logit contributions
 Lee+0.073
 classmate+0.071
 principal+0.071
 ha+0.071
 client+0.070
Neg logit contributions
asting-0.049
 refreshing-0.049
From-0.044
Years-0.044
rant-0.042
Token own
Token ID898
Feature activation-1.60e-01

Pos logit contributions
 principal+0.068
 grade+0.068
 client+0.068
 classmate+0.068
 ha+0.066
Neg logit contributions
 refreshing-0.061
From-0.059
 Every-0.059
 tight-0.056
 often-0.056
Token toys
Token ID14958
Feature activation+1.19e+00

Pos logit contributions
 refreshing+0.004
 Every+0.004
From+0.004
 tight+0.004
 all+0.003
Neg logit contributions
 principal-0.004
 client-0.004
 grade-0.004
 classmate-0.004
 ha-0.004
Token to
Token ID284
Feature activation+4.77e+00

Pos logit contributions
 classmate+0.031
 grade+0.030
 babys+0.030
 principal+0.030
 client+0.030
Neg logit contributions
From-0.030
 refreshing-0.027
 Every-0.026
Years-0.026
asting-0.025
Token the
Token ID262
Feature activation-1.18e+00

Pos logit contributions
 classmate+0.148
 client+0.148
 grade+0.146
 principal+0.146
 ha+0.146
Neg logit contributions
From-0.090
 refreshing-0.086
 Every-0.084
asting-0.080
Years-0.080
Token play
Token ID711
Feature activation-1.48e+00

Pos logit contributions
 refreshing+0.025
From+0.024
 Every+0.024
 often+0.023
 all+0.023
Neg logit contributions
 grade-0.033
 classmate-0.032
 client-0.032
 principal-0.031
ird-0.031
Tokendate
Token ID4475
Feature activation+1.13e+01

Pos logit contributions
From+0.043
 Every+0.042
 refreshing+0.041
 often+0.041
asting+0.040
Neg logit contributions
 client-0.031
 classmate-0.030
 grade-0.030
 principal-0.029
 ha-0.029
Token.
Token ID13
Feature activation+1.95e+00

Pos logit contributions
 client+0.370
 ha+0.363
 grade+0.362
 classmate+0.362
 principal+0.360
Neg logit contributions
 Every-0.185
 often-0.178
 all-0.176
From-0.175
 refreshing-0.173
Token The
Token ID383
Feature activation-7.74e+00

Pos logit contributions
 client+0.065
 Lee+0.065
 classmate+0.065
 principal+0.064
 babys+0.063
Neg logit contributions
asting-0.034
 refreshing-0.033
From-0.031
Years-0.029
 tighter-0.028
Token first
Token ID717
Feature activation+7.37e+00

Pos logit contributions
 refreshing+0.184
From+0.184
 Every+0.180
Years+0.174
 often+0.171
Neg logit contributions
 grade-0.196
 client-0.196
 classmate-0.191
 principal-0.189
 ha-0.187
Token one
Token ID530
Feature activation+2.37e+00

Pos logit contributions
 principal+0.192
 client+0.188
 grade+0.184
 babys+0.179
 classmate+0.176
Neg logit contributions
 Every-0.174
 often-0.172
 refreshing-0.166
From-0.165
 all-0.164
Token brought
Token ID3181
Feature activation-2.26e+00

Pos logit contributions
 classmate+0.068
 client+0.068
 grade+0.067
 principal+0.067
 ha+0.067
Neg logit contributions
From-0.050
 refreshing-0.049
 Every-0.047
asting-0.045
Years-0.045
Token came
Token ID1625
Feature activation+3.87e+00

Pos logit contributions
 grade+0.246
 client+0.241
 laptop+0.229
 principal+0.227
 liar+0.226
Neg logit contributions
 Every-0.103
 all-0.102
 Nem-0.101
 Fin-0.096
 over-0.095
Token over
Token ID625
Feature activation+8.56e-01

Pos logit contributions
 client+0.154
 classmate+0.153
 grade+0.153
 principal+0.153
 ha+0.153
Neg logit contributions
From-0.038
 Every-0.036
 refreshing-0.035
asting-0.031
 often-0.031
Token for
Token ID329
Feature activation-1.92e-01

Pos logit contributions
 client+0.032
 classmate+0.031
 ha+0.031
ird+0.031
 principal+0.031
Neg logit contributions
 Every-0.011
 every-0.011
 all-0.011
From-0.011
 often-0.010
Token a
Token ID257
Feature activation+5.14e+00

Pos logit contributions
 refreshing+0.004
From+0.004
 Every+0.003
 all+0.003
asting+0.003
Neg logit contributions
 client-0.006
 classmate-0.006
 principal-0.006
 grade-0.006
 ha-0.006
Token play
Token ID711
Feature activation-1.49e+00

Pos logit contributions
 client+0.162
 principal+0.157
 classmate+0.151
 Lee+0.148
 clients+0.147
Neg logit contributions
 refreshing-0.104
 tight-0.095
 fresh-0.091
Years-0.088
asting-0.087
Tokendate
Token ID4475
Feature activation+1.13e+01

Pos logit contributions
 all+0.037
 refreshing+0.037
 Every+0.036
 often+0.036
From+0.036
Neg logit contributions
 client-0.037
 principal-0.034
 grade-0.034
 classmate-0.034
 babys-0.032
Token.
Token ID13
Feature activation+2.00e+00

Pos logit contributions
 client+0.417
 principal+0.399
 classmate+0.399
 ha+0.394
 grade+0.394
Neg logit contributions
 all-0.146
 around-0.143
 Every-0.139
 every-0.137
 often-0.135
Token Her
Token ID2332
Feature activation-1.43e+00

Pos logit contributions
 client+0.075
 ha+0.074
 classmate+0.074
 grade+0.073
 principal+0.073
Neg logit contributions
 Every-0.026
From-0.024
 refreshing-0.023
 From-0.021
 tight-0.020
Token friend
Token ID1545
Feature activation+7.00e+00

Pos logit contributions
 tight+0.030
 refreshing+0.029
 Every+0.028
 all+0.026
 tightly+0.026
Neg logit contributions
 grade-0.042
 principal-0.041
ird-0.040
fer-0.040
 client-0.040
Token asked
Token ID1965
Feature activation+3.37e+00

Pos logit contributions
 grade+0.208
 client+0.206
 principal+0.199
 classmate+0.197
 babys+0.192
Neg logit contributions
From-0.141
 Every-0.140
 refreshing-0.136
 often-0.136
 all-0.131
Token for
Token ID329
Feature activation-2.63e-01

Pos logit contributions
 client+0.099
 classmate+0.099
 grade+0.098
 principal+0.097
 ha+0.096
Neg logit contributions
From-0.069
 Every-0.067
 often-0.064
 refreshing-0.063
 all-0.062
Token my
Token ID616
Feature activation+1.96e+00

Pos logit contributions
 ha+0.153
 client+0.153
 principal+0.152
 classmate+0.151
 grade+0.150
Neg logit contributions
From-0.076
 refreshing-0.074
 Every-0.069
asting-0.069
Years-0.066
Token house
Token ID2156
Feature activation+2.69e+00

Pos logit contributions
 grade+0.055
 happened+0.051
 principal+0.050
 client+0.050
 classmate+0.050
Neg logit contributions
 refreshing-0.044
 tight-0.044
 Every-0.043
 hay-0.043
 all-0.042
Token for
Token ID329
Feature activation-2.68e-01

Pos logit contributions
 ha+0.075
 principal+0.074
 classmate+0.073
 grade+0.072
 client+0.072
Neg logit contributions
From-0.061
 refreshing-0.058
Years-0.055
 Every-0.055
asting-0.053
Token a
Token ID257
Feature activation+5.18e+00

Pos logit contributions
From+0.005
 refreshing+0.005
 Every+0.004
asting+0.004
Years+0.004
Neg logit contributions
 classmate-0.009
 client-0.009
 grade-0.008
 principal-0.008
 ha-0.008
Token play
Token ID711
Feature activation-1.64e+00

Pos logit contributions
 client+0.152
 principal+0.151
 classmate+0.151
 grade+0.149
 ha+0.146
Neg logit contributions
 refreshing-0.104
From-0.100
 Every-0.096
Years-0.095
 tight-0.095
Tokendate
Token ID4475
Feature activation+1.12e+01

Pos logit contributions
From+0.043
 Every+0.042
 refreshing+0.042
 often+0.041
asting+0.040
Neg logit contributions
 client-0.038
 classmate-0.038
 grade-0.038
 principal-0.037
 babys-0.036
Token.
Token ID13
Feature activation+2.04e+00

Pos logit contributions
 client+0.368
 classmate+0.364
 grade+0.360
 principal+0.357
 babys+0.353
Neg logit contributions
 all-0.184
 Every-0.183
 often-0.182
 around-0.173
 every-0.170
Token I
Token ID314
Feature activation-1.96e+00

Pos logit contributions
 Lee+0.051
 ha+0.050
 client+0.049
 babys+0.047
 principal+0.047
Neg logit contributions
 refreshing-0.054
asting-0.052
Years-0.052
From-0.048
lated-0.047
Token invite
Token ID14037
Feature activation-2.64e-01

Pos logit contributions
From+0.044
 refreshing+0.042
 Every+0.041
Years+0.039
asting+0.039
Neg logit contributions
 ha-0.054
 grade-0.054
 classmate-0.054
 client-0.053
 principal-0.052
Token you
Token ID345
Feature activation+1.64e+00

Pos logit contributions
 refreshing+0.006
From+0.006
asting+0.005
Years+0.005
lessly+0.005
Neg logit contributions
 ha-0.008
 Lee-0.008
 client-0.007
 principal-0.007
 classmate-0.007
Token."
Token ID526
Feature activation+3.87e+00

Pos logit contributions
 ha+0.043
 classmate+0.043
 principal+0.042
 client+0.042
 grade+0.041
Neg logit contributions
From-0.040
 refreshing-0.038
Years-0.035
 Every-0.035
asting-0.034
Token!
Token ID0
Feature activation+1.28e+00

Pos logit contributions
 classmate+0.064
 client+0.064
 ha+0.064
 principal+0.063
 grade+0.063
Neg logit contributions
From-0.048
 refreshing-0.046
 Every-0.045
asting-0.043
Years-0.043
Token She
Token ID1375
Feature activation-7.66e+00

Pos logit contributions
 classmate+0.035
 client+0.035
 principal+0.035
 grade+0.035
 Lee+0.035
Neg logit contributions
asting-0.029
 refreshing-0.029
Years-0.026
From-0.026
 often-0.025
Token finished
Token ID5201
Feature activation+2.09e+00

Pos logit contributions
orting+0.186
 Every+0.181
asting+0.179
ales+0.176
ores+0.173
Neg logit contributions
 ha-0.200
 classmate-0.191
 babys-0.177
 principal-0.176
ird-0.172
Token her
Token ID607
Feature activation+4.87e+00

Pos logit contributions
 ha+0.053
 classmate+0.052
 grade+0.052
 babys+0.051
 principal+0.051
Neg logit contributions
From-0.050
 refreshing-0.049
 Every-0.048
asting-0.047
Years-0.047
Token play
Token ID711
Feature activation-1.76e+00

Pos logit contributions
 client+0.127
 principal+0.125
 classmate+0.125
 grade+0.123
 babys+0.121
Neg logit contributions
 refreshing-0.116
From-0.114
 Every-0.108
Years-0.106
 tight-0.106
Tokendate
Token ID4475
Feature activation+1.12e+01

Pos logit contributions
From+0.041
 refreshing+0.040
 Every+0.040
asting+0.038
Years+0.037
Neg logit contributions
 ha-0.047
 classmate-0.046
 grade-0.046
 principal-0.045
 client-0.045
Token and
Token ID290
Feature activation-8.14e+00

Pos logit contributions
 classmate+0.392
 client+0.391
 grade+0.389
 principal+0.388
 ha+0.388
Neg logit contributions
From-0.164
 refreshing-0.156
 Every-0.150
asting-0.140
Years-0.140
Token went
Token ID1816
Feature activation+1.44e+00

Pos logit contributions
From+0.165
 Every+0.159
 refreshing+0.159
Years+0.153
asting+0.153
Neg logit contributions
 classmate-0.237
 ha-0.237
 client-0.233
 principal-0.232
 grade-0.232
Token home
Token ID1363
Feature activation+7.41e+00

Pos logit contributions
 ha+0.040
 grade+0.040
 classmate+0.040
 principal+0.040
 babys+0.039
Neg logit contributions
From-0.032
 refreshing-0.030
 Every-0.030
Years-0.028
asting-0.028
Token,
Token ID11
Feature activation+3.93e+00

Pos logit contributions
 client+0.245
 classmate+0.241
 grade+0.237
 principal+0.237
 ha+0.235
Neg logit contributions
From-0.122
 refreshing-0.117
 tight-0.116
 often-0.115
 all-0.114
Token with
Token ID351
Feature activation-4.42e+00

Pos logit contributions
 babys+0.087
 classmate+0.083
 ha+0.083
 haircut+0.080
 desk+0.080
Neg logit contributions
ales-0.113
rant-0.110
igators-0.110
reci-0.109
ion-0.108
Token would
Token ID561
Feature activation+3.92e+00

Pos logit contributions
From+0.028
 refreshing+0.027
 Every+0.026
 often+0.025
asting+0.025
Neg logit contributions
 client-0.049
 classmate-0.049
 principal-0.049
 grade-0.048
 ha-0.048
Token have
Token ID423
Feature activation-4.35e+00

Pos logit contributions
 client+0.102
 ha+0.102
 grade+0.101
 classmate+0.098
 principal+0.097
Neg logit contributions
From-0.095
 refreshing-0.094
lessly-0.087
Years-0.085
 Every-0.084
Token a
Token ID257
Feature activation+5.48e+00

Pos logit contributions
From+0.083
 refreshing+0.081
 Every+0.079
asting+0.075
Years+0.074
Neg logit contributions
 client-0.133
 classmate-0.132
 grade-0.131
 principal-0.131
 ha-0.131
Token play
Token ID711
Feature activation-1.45e+00

Pos logit contributions
 principal+0.166
 client+0.165
 Lee+0.164
 classmate+0.164
 ha+0.162
Neg logit contributions
 refreshing-0.105
Years-0.100
 tight-0.098
 often-0.096
 all-0.093
Token-
Token ID12
Feature activation-1.29e+00

Pos logit contributions
 all+0.040
 often+0.039
From+0.039
 Every+0.039
 refreshing+0.038
Neg logit contributions
 client-0.032
 classmate-0.030
 babys-0.030
 principal-0.030
 grade-0.030
Tokendate
Token ID4475
Feature activation+1.12e+01

Pos logit contributions
From+0.031
 refreshing+0.029
 Every+0.029
asting+0.028
Years+0.028
Neg logit contributions
 client-0.033
 classmate-0.033
 ha-0.033
 principal-0.033
 grade-0.033
Token and
Token ID290
Feature activation-7.98e+00

Pos logit contributions
 client+0.421
 ha+0.411
 classmate+0.409
 principal+0.408
 babys+0.405
Neg logit contributions
 all-0.144
 often-0.134
 every-0.131
 around-0.130
 Every-0.124
Token wear
Token ID5806
Feature activation-8.62e+00

Pos logit contributions
 refreshing+0.141
From+0.140
 Every+0.137
 often+0.134
 all+0.130
Neg logit contributions
 classmate-0.248
 client-0.247
 grade-0.245
 principal-0.244
 ha-0.242
Token makeup
Token ID16029
Feature activation-9.41e-01

Pos logit contributions
From+0.213
 Every+0.197
 refreshing+0.195
asting+0.189
Years+0.187
Neg logit contributions
 classmate-0.222
 grade-0.216
 ha-0.216
 client-0.216
 principal-0.213
Token.
Token ID13
Feature activation+2.16e+00

Pos logit contributions
 refreshing+0.014
 all+0.014
From+0.014
 Every+0.014
 often+0.014
Neg logit contributions
 client-0.032
 classmate-0.032
 principal-0.031
 ha-0.031
 grade-0.031
Token
Token ID198
Feature activation-2.39e+00

Pos logit contributions
 classmate+0.074
 babys+0.072
 client+0.072
 principal+0.071
 ha+0.071
Neg logit contributions
From-0.034
 refreshing-0.033
asting-0.032
Years-0.031
 often-0.029
Token could
Token ID714
Feature activation+4.26e+00

Pos logit contributions
From+0.170
 refreshing+0.164
 Every+0.161
asting+0.154
Years+0.154
Neg logit contributions
 classmate-0.241
 client-0.241
 ha-0.239
 principal-0.238
 grade-0.238
Token have
Token ID423
Feature activation-4.22e+00

Pos logit contributions
 ha+0.121
 grade+0.119
 client+0.118
 classmate+0.118
 principal+0.117
Neg logit contributions
From-0.092
 refreshing-0.091
asting-0.086
 Every-0.084
Years-0.084
Token her
Token ID607
Feature activation+4.98e+00

Pos logit contributions
From+0.074
 refreshing+0.072
 Every+0.069
asting+0.065
Years+0.065
Neg logit contributions
 client-0.138
 classmate-0.137
 grade-0.135
 principal-0.135
 ha-0.133
Token play
Token ID711
Feature activation-1.62e+00

Pos logit contributions
 client+0.144
 classmate+0.143
 principal+0.142
 grade+0.141
 ha+0.140
Neg logit contributions
From-0.102
 refreshing-0.101
 Every-0.098
Years-0.093
asting-0.092
Token-
Token ID12
Feature activation-1.51e+00

Pos logit contributions
From+0.031
 refreshing+0.029
 Every+0.029
 often+0.028
Years+0.028
Neg logit contributions
 client-0.050
 classmate-0.049
 principal-0.049
 grade-0.049
 ha-0.048
Tokendate
Token ID4475
Feature activation+1.11e+01

Pos logit contributions
From+0.038
 refreshing+0.037
 Every+0.036
Years+0.035
asting+0.034
Neg logit contributions
 classmate-0.037
 client-0.037
 grade-0.036
 ha-0.036
 principal-0.036
Token after
Token ID706
Feature activation+3.97e+00

Pos logit contributions
 client+0.384
 classmate+0.374
 babys+0.371
 principal+0.370
 grade+0.368
Neg logit contributions
 refreshing-0.162
From-0.162
 often-0.161
 all-0.158
 Every-0.157
Token all
Token ID477
Feature activation-5.85e+00

Pos logit contributions
 client+0.145
 classmate+0.145
 grade+0.143
 principal+0.142
 ha+0.139
Neg logit contributions
From-0.054
 Every-0.051
 refreshing-0.050
 all-0.049
 often-0.048
Token.
Token ID13
Feature activation+1.73e+00

Pos logit contributions
From+0.110
 refreshing+0.105
Years+0.101
 Every+0.100
asting+0.096
Neg logit contributions
 ha-0.180
 classmate-0.180
 grade-0.177
 client-0.177
 principal-0.176
Token<|endoftext|>
Token ID50256
Feature activation-2.22e-01

Pos logit contributions
 ha+0.051
 Lee+0.051
 classmate+0.051
 client+0.051
 babys+0.049
Neg logit contributions
 refreshing-0.034
asting-0.034
Years-0.032
From-0.031
orting-0.029
TokenOne
Token ID3198
Feature activation-3.99e+00

Pos logit contributions
From+0.005
 refreshing+0.005
Years+0.005
 Every+0.005
asting+0.005
Neg logit contributions
 classmate-0.006
 ha-0.006
 client-0.006
 principal-0.006
 grade-0.006
Token and
Token ID290
Feature activation-8.28e+00

Pos logit contributions
From+0.012
 refreshing+0.012
 Every+0.011
utt+0.011
Years+0.011
Neg logit contributions
 classmate-0.020
 ha-0.019
 grade-0.019
 principal-0.019
 client-0.019
Token had
Token ID550
Feature activation-1.21e-01

Pos logit contributions
From+0.160
 Every+0.159
Years+0.155
asting+0.152
 refreshing+0.151
Neg logit contributions
 classmate-0.246
 ha-0.239
 client-0.236
 grade-0.236
 principal-0.236
Token a
Token ID257
Feature activation+5.07e+00

Pos logit contributions
From+0.003
lessly+0.003
orting+0.003
utt+0.003
rant+0.003
Neg logit contributions
 happened-0.003
 classmate-0.003
 ha-0.003
 grade-0.003
 principal-0.003
Token fun
Token ID1257
Feature activation+1.01e+00

Pos logit contributions
 classmate+0.145
 ha+0.144
 grade+0.144
 client+0.142
 principal+0.141
Neg logit contributions
From-0.106
 Every-0.102
 refreshing-0.099
asting-0.097
Years-0.093
Token play
Token ID711
Feature activation-1.53e+00

Pos logit contributions
 grade+0.029
 classmate+0.029
 ha+0.028
 client+0.028
 babys+0.027
Neg logit contributions
From-0.021
 refreshing-0.020
 Every-0.020
asting-0.019
lessly-0.018
Tokendate
Token ID4475
Feature activation+1.11e+01

Pos logit contributions
From+0.031
 refreshing+0.030
 Every+0.029
Years+0.028
asting+0.028
Neg logit contributions
 client-0.045
 classmate-0.044
 principal-0.044
 grade-0.044
 ha-0.043
Token.
Token ID13
Feature activation+1.77e+00

Pos logit contributions
 client+0.412
 classmate+0.403
 babys+0.401
 principal+0.401
 ha+0.396
Neg logit contributions
 often-0.135
From-0.133
 all-0.133
 refreshing-0.130
 Every-0.127
Token<|endoftext|>
Token ID50256
Feature activation+2.12e-01

Pos logit contributions
 classmate+0.047
 principal+0.044
 grade+0.043
 Lee+0.043
 babys+0.042
Neg logit contributions
asting-0.041
 refreshing-0.037
Years-0.036
ales-0.035
 often-0.035
TokenOnce
Token ID7454
Feature activation+4.10e+00

Pos logit contributions
 classmate+0.007
 client+0.007
 ha+0.007
 principal+0.007
 grade+0.007
Neg logit contributions
From-0.003
 refreshing-0.003
Years-0.003
 Every-0.003
asting-0.003
Token upon
Token ID2402
Feature activation-2.66e+00

Pos logit contributions
 client+0.125
 classmate+0.124
 principal+0.124
 ha+0.124
 grade+0.123
Neg logit contributions
From-0.078
 Every-0.075
 refreshing-0.075
 often-0.070
asting-0.069
Token a
Token ID257
Feature activation+5.82e+00

Pos logit contributions
From+0.046
 Every+0.043
 refreshing+0.042
asting+0.040
 often+0.039
Neg logit contributions
 client-0.089
 classmate-0.089
 principal-0.088
 ha-0.087
 grade-0.087
Token for
Token ID329
Feature activation-3.19e-01

Pos logit contributions
 classmate+0.054
 happened+0.054
 grade+0.053
 client+0.053
 Lee+0.052
Neg logit contributions
From-0.038
 refreshing-0.037
asting-0.036
Years-0.034
rant-0.033
Token using
Token ID1262
Feature activation-7.82e+00

Pos logit contributions
From+0.007
 Every+0.007
 refreshing+0.007
Years+0.006
 From+0.006
Neg logit contributions
 ha-0.009
 principal-0.009
 client-0.009
 classmate-0.009
 grade-0.009
Token the
Token ID262
Feature activation-1.31e+00

Pos logit contributions
From+0.163
Years+0.144
 refreshing+0.135
utt+0.133
 Every+0.131
Neg logit contributions
 ha-0.236
 classmate-0.233
 grade-0.233
 client-0.230
 happened-0.230
Token past
Token ID1613
Feature activation-2.21e+00

Pos logit contributions
 refreshing+0.031
From+0.031
 Every+0.030
asting+0.029
Years+0.028
Neg logit contributions
 client-0.034
 classmate-0.034
 principal-0.033
 grade-0.033
 ha-0.033
Token-
Token ID12
Feature activation-1.41e+00

Pos logit contributions
 Every+0.044
From+0.041
 refreshing+0.041
 all+0.039
 often+0.039
Neg logit contributions
 principal-0.066
 client-0.066
 classmate-0.065
 babys-0.065
 grade-0.065
Tokendate
Token ID4475
Feature activation+1.11e+01

Pos logit contributions
 refreshing+0.032
From+0.032
 Every+0.030
Years+0.029
 often+0.028
Neg logit contributions
 classmate-0.037
 client-0.037
 ha-0.037
 grade-0.037
 principal-0.036
Token cheese
Token ID9891
Feature activation-3.57e+00

Pos logit contributions
 client+0.349
 classmate+0.344
 grade+0.344
 ha+0.341
 principal+0.340
Neg logit contributions
 refreshing-0.188
 Every-0.187
From-0.185
 often-0.182
 all-0.177
Token.
Token ID13
Feature activation+1.92e+00

Pos logit contributions
From+0.076
 refreshing+0.071
 Every+0.070
Years+0.067
asting+0.066
Neg logit contributions
 classmate-0.103
 client-0.102
 ha-0.101
 principal-0.101
 grade-0.101
Token Sam
Token ID3409
Feature activation+2.20e+00

Pos logit contributions
 classmate+0.062
 principal+0.061
 client+0.060
 Lee+0.059
 ha+0.058
Neg logit contributions
asting-0.036
 refreshing-0.035
From-0.031
Years-0.031
 fresh-0.030
Token learned
Token ID4499
Feature activation+6.18e-01

Pos logit contributions
 ha+0.055
 classmate+0.053
 babys+0.051
 principal+0.051
 client+0.050
Neg logit contributions
 refreshing-0.055
asting-0.054
Years-0.054
 Every-0.054
From-0.054
Token an
Token ID281
Feature activation+2.85e+00

Pos logit contributions
 client+0.021
 principal+0.021
 classmate+0.021
 grade+0.021
 ha+0.021
Neg logit contributions
 refreshing-0.009
From-0.008
asting-0.008
 Every-0.008
rant-0.008
Token can
Token ID460
Feature activation+4.03e+00

Pos logit contributions
From+0.075
 refreshing+0.074
Years+0.071
 Every+0.068
orting+0.067
Neg logit contributions
 ha-0.079
 grade-0.075
 client-0.074
ird-0.073
 classmate-0.073
Token have
Token ID423
Feature activation-4.26e+00

Pos logit contributions
 ha+0.105
 grade+0.100
 client+0.099
 principal+0.094
 classmate+0.094
Neg logit contributions
 refreshing-0.102
From-0.100
Years-0.094
lessly-0.092
asting-0.092
Token your
Token ID534
Feature activation+4.89e-01

Pos logit contributions
From+0.076
 refreshing+0.071
 Every+0.069
Years+0.068
asting+0.068
Neg logit contributions
 ha-0.138
 client-0.134
 classmate-0.134
 grade-0.133
 principal-0.132
Token play
Token ID711
Feature activation-1.65e+00

Pos logit contributions
 classmate+0.013
 client+0.013
 principal+0.013
 grade+0.013
 ha+0.012
Neg logit contributions
 refreshing-0.011
From-0.011
 Every-0.011
 often-0.010
Years-0.010
Token-
Token ID12
Feature activation-1.47e+00

Pos logit contributions
From+0.038
 refreshing+0.036
 Every+0.036
 often+0.036
 all+0.035
Neg logit contributions
 client-0.044
 classmate-0.043
 principal-0.042
 babys-0.042
 grade-0.042
Tokendate
Token ID4475
Feature activation+1.11e+01

Pos logit contributions
From+0.036
 refreshing+0.035
 Every+0.034
Years+0.032
 often+0.032
Neg logit contributions
 ha-0.038
 grade-0.037
 classmate-0.037
 client-0.037
 principal-0.036
Token with
Token ID351
Feature activation-4.37e+00

Pos logit contributions
 client+0.344
 classmate+0.341
 principal+0.336
 grade+0.335
 babys+0.335
Neg logit contributions
From-0.202
 refreshing-0.198
 Every-0.194
 often-0.190
asting-0.184
Token R
Token ID371
Feature activation-2.78e+00

Pos logit contributions
From+0.084
 refreshing+0.079
 Every+0.077
Years+0.076
asting+0.074
Neg logit contributions
 ha-0.139
 client-0.133
 classmate-0.132
 principal-0.131
 grade-0.131
Tokenuf
Token ID3046
Feature activation-4.09e+00

Pos logit contributions
From+0.080
 refreshing+0.078
 Every+0.076
Years+0.073
asting+0.072
Neg logit contributions
 client-0.061
 classmate-0.059
 grade-0.059
 ha-0.059
 principal-0.059
Tokenus
Token ID385
Feature activation+2.40e+00

Pos logit contributions
 refreshing+0.125
From+0.125
 Every+0.121
Years+0.117
asting+0.111
Neg logit contributions
 client-0.082
 principal-0.080
 ha-0.080
 classmate-0.079
 grade-0.076
Token now
Token ID783
Feature activation+3.43e+00

Pos logit contributions
 client+0.061
 classmate+0.061
 ha+0.061
 principal+0.060
 grade+0.060
Neg logit contributions
From-0.058
 refreshing-0.056
 Every-0.055
Years-0.053
asting-0.052
Token almost
Token ID2048
Feature activation+1.67e+00

Pos logit contributions
 classmate+0.134
 ha+0.133
 principal+0.132
 client+0.130
 grade+0.130
Neg logit contributions
From-0.082
 refreshing-0.076
 Every-0.075
Years-0.073
orting-0.070
Token ready
Token ID3492
Feature activation+6.90e+00

Pos logit contributions
 client+0.046
 classmate+0.046
 principal+0.045
 grade+0.045
 ha+0.045
Neg logit contributions
From-0.037
 refreshing-0.036
 Every-0.035
asting-0.034
Years-0.034
Token for
Token ID329
Feature activation-2.99e-01

Pos logit contributions
 grade+0.204
 classmate+0.202
 ha+0.201
 principal+0.200
 client+0.200
Neg logit contributions
From-0.140
 refreshing-0.132
 Every-0.129
Years-0.125
asting-0.122
Token the
Token ID262
Feature activation-1.31e+00

Pos logit contributions
From+0.006
 refreshing+0.006
 Every+0.006
Years+0.005
asting+0.005
Neg logit contributions
 ha-0.009
 principal-0.009
 grade-0.009
 classmate-0.009
 client-0.009
Token play
Token ID711
Feature activation-1.72e+00

Pos logit contributions
From+0.031
 refreshing+0.031
 Every+0.029
asting+0.029
Years+0.028
Neg logit contributions
 client-0.034
 classmate-0.034
 principal-0.033
 grade-0.033
 ha-0.033
Tokendate
Token ID4475
Feature activation+1.11e+01

Pos logit contributions
 often+0.043
From+0.043
 refreshing+0.043
asting+0.043
 Every+0.043
Neg logit contributions
 client-0.041
 classmate-0.039
 principal-0.038
 babys-0.038
 grade-0.038
Token!
Token ID0
Feature activation+1.42e+00

Pos logit contributions
 client+0.367
 classmate+0.356
 principal+0.352
 ha+0.352
 grade+0.352
Neg logit contributions
 often-0.172
 Every-0.169
 all-0.169
 refreshing-0.166
From-0.162
Token\n
Token ID198
Feature activation-2.50e+00

Pos logit contributions
 classmate+0.044
 client+0.043
 babys+0.043
 principal+0.043
 Lee+0.042
Neg logit contributions
 refreshing-0.027
asting-0.026
From-0.025
Years-0.025
 often-0.024
Token\n
Token ID198
Feature activation-2.55e+00

Pos logit contributions
 refreshing+0.049
rant+0.043
 often+0.042
 tighter+0.041
From+0.041
Neg logit contributions
 classmate-0.074
 principal-0.074
 babys-0.074
 ha-0.072
 client-0.072
TokenBilly
Token ID49640
Feature activation-5.67e-01

Pos logit contributions
 refreshing+0.052
rant+0.047
 Every+0.047
utt+0.043
 crunch+0.043
Neg logit contributions
 classmate-0.075
 ha-0.074
 client-0.074
 principal-0.074
 babys-0.072
Token opened
Token ID4721
Feature activation-4.03e+00

Pos logit contributions
 refreshing+0.013
From+0.013
Years+0.013
 Every+0.013
orting+0.013
Neg logit contributions
 ha-0.015
 classmate-0.014
 principal-0.014
 babys-0.013
 client-0.013
Token Alex
Token ID4422
Feature activation+1.60e+00

Pos logit contributions
 classmate+0.075
 client+0.074
 ha+0.074
 principal+0.074
 grade+0.073
Neg logit contributions
 refreshing-0.039
From-0.038
asting-0.036
Years-0.036
 Every-0.034
Token had
Token ID550
Feature activation-9.97e-02

Pos logit contributions
 ha+0.042
 classmate+0.040
 client+0.039
 principal+0.039
 grade+0.039
Neg logit contributions
 refreshing-0.037
From-0.037
Years-0.037
asting-0.035
 Every-0.035
Token used
Token ID973
Feature activation-4.54e+00

Pos logit contributions
From+0.002
 refreshing+0.002
 Every+0.002
asting+0.002
Years+0.002
Neg logit contributions
 classmate-0.003
 ha-0.003
 client-0.003
 principal-0.003
 grade-0.003
Token past
Token ID1613
Feature activation-2.09e+00

Pos logit contributions
From+0.106
orting+0.091
 refreshing+0.090
asting+0.089
Years+0.087
Neg logit contributions
 classmate-0.126
 ha-0.124
 happened-0.123
 grade-0.122
 client-0.120
Token-
Token ID12
Feature activation-1.35e+00

Pos logit contributions
From+0.044
 refreshing+0.041
asting+0.039
Years+0.039
orting+0.036
Neg logit contributions
 classmate-0.061
 ha-0.059
 Lee-0.059
 client-0.059
 happened-0.058
Tokendate
Token ID4475
Feature activation+1.10e+01

Pos logit contributions
 refreshing+0.031
From+0.030
 Every+0.028
asting+0.026
 tight+0.026
Neg logit contributions
 grade-0.036
 classmate-0.036
 ha-0.036
 client-0.036
 principal-0.035
Token cheese
Token ID9891
Feature activation-3.69e+00

Pos logit contributions
 client+0.341
 classmate+0.340
 grade+0.337
 principal+0.336
 ha+0.336
Neg logit contributions
From-0.202
 refreshing-0.195
 Every-0.191
asting-0.181
Years-0.180
Token!
Token ID0
Feature activation+1.90e+00

Pos logit contributions
From+0.072
 refreshing+0.067
 Every+0.065
Years+0.062
asting+0.060
Neg logit contributions
 classmate-0.114
 client-0.111
 ha-0.111
 principal-0.111
 grade-0.110
Token Sam
Token ID3409
Feature activation+2.23e+00

Pos logit contributions
 classmate+0.064
 client+0.063
 principal+0.063
 ha+0.062
 grade+0.062
Neg logit contributions
 refreshing-0.030
From-0.029
asting-0.029
Years-0.027
 often-0.025
Token yelled
Token ID22760
Feature activation+3.59e+00

Pos logit contributions
 ha+0.054
 classmate+0.054
 happened+0.050
 principal+0.050
 client+0.050
Neg logit contributions
 refreshing-0.054
From-0.053
Years-0.053
orting-0.052
asting-0.052
Token,
Token ID11
Feature activation+4.04e+00

Pos logit contributions
 classmate+0.107
 client+0.106
 grade+0.105
 ha+0.105
 principal+0.105
Neg logit contributions
From-0.072
 refreshing-0.070
 Every-0.067
asting-0.065
Years-0.065
Token let
Token ID1309
Feature activation-2.44e+00

Pos logit contributions
 ha+0.119
 principal+0.116
 classmate+0.116
 client+0.116
 grade+0.116
Neg logit contributions
From-0.088
 refreshing-0.083
 Every-0.083
Years-0.082
asting-0.079
Token's
Token ID338
Feature activation+3.70e+00

Pos logit contributions
From+0.021
 Every+0.021
 refreshing+0.019
 all+0.019
asting+0.017
Neg logit contributions
 client-0.101
 grade-0.101
 classmate-0.100
 principal-0.099
 babys-0.098
Token have
Token ID423
Feature activation-4.19e+00

Pos logit contributions
 classmate+0.113
 client+0.112
 principal+0.111
 grade+0.111
 ha+0.110
Neg logit contributions
From-0.071
 refreshing-0.069
 Every-0.067
asting-0.064
 often-0.063
Token a
Token ID257
Feature activation+5.08e+00

Pos logit contributions
From+0.069
 refreshing+0.065
 Every+0.063
Years+0.060
asting+0.060
Neg logit contributions
 ha-0.139
 classmate-0.138
 client-0.138
 grade-0.137
 principal-0.137
Token play
Token ID711
Feature activation-1.62e+00

Pos logit contributions
 client+0.152
 classmate+0.150
 principal+0.150
 ha+0.146
 grade+0.146
Neg logit contributions
 refreshing-0.100
From-0.096
 Every-0.093
 tight-0.091
Years-0.091
Tokendate
Token ID4475
Feature activation+1.09e+01

Pos logit contributions
From+0.028
 refreshing+0.027
 Every+0.026
 often+0.025
asting+0.025
Neg logit contributions
 client-0.053
 classmate-0.052
 principal-0.052
 grade-0.051
 babys-0.051
Token!"
Token ID2474
Feature activation+4.72e+00

Pos logit contributions
 client+0.382
 classmate+0.371
 babys+0.363
 principal+0.363
 grade+0.362
Neg logit contributions
 all-0.164
 often-0.155
 Every-0.153
 around-0.150
 every-0.150
Token\n
Token ID198
Feature activation-2.08e+00

Pos logit contributions
 client+0.160
 classmate+0.160
 grade+0.159
 principal+0.158
 ha+0.158
Neg logit contributions
From-0.074
 Every-0.069
 refreshing-0.069
Years-0.063
 often-0.062
TokenThe
Token ID464
Feature activation-7.54e+00

Pos logit contributions
 refreshing+0.035
From+0.033
 Every+0.032
asting+0.031
 often+0.030
Neg logit contributions
 classmate-0.068
 client-0.068
 principal-0.068
 ha-0.068
 grade-0.067
Token steak
Token ID26320
Feature activation-1.32e+00

Pos logit contributions
From+0.175
 refreshing+0.174
 Every+0.167
 tight+0.163
Years+0.162
Neg logit contributions
 grade-0.192
 client-0.191
 classmate-0.190
 principal-0.186
 ha-0.185
Token was
Token ID373
Feature activation+4.50e+00

Pos logit contributions
From+0.031
 Every+0.031
 refreshing+0.030
Years+0.029
asting+0.029
Neg logit contributions
 classmate-0.034
 ha-0.034
 grade-0.033
 principal-0.033
 client-0.033
Token the
Token ID262
Feature activation-1.05e+00

Pos logit contributions
 classmate+0.024
 client+0.024
 grade+0.024
 principal+0.024
 ha+0.023
Neg logit contributions
From-0.012
 refreshing-0.011
 Every-0.011
Years-0.011
asting-0.010
Token wall
Token ID3355
Feature activation+2.67e-01

Pos logit contributions
From+0.028
 refreshing+0.027
 Every+0.027
asting+0.026
Years+0.026
Neg logit contributions
 client-0.024
 classmate-0.024
 principal-0.024
 ha-0.024
 grade-0.024
Token.
Token ID13
Feature activation+1.98e+00

Pos logit contributions
 grade+0.006
 classmate+0.006
 principal+0.006
 ha+0.006
 client+0.006
Neg logit contributions
From-0.007
 refreshing-0.007
asting-0.006
Years-0.006
 Nem-0.006
Token But
Token ID887
Feature activation-2.40e+00

Pos logit contributions
 classmate+0.066
 client+0.065
 grade+0.065
 ha+0.065
 principal+0.064
Neg logit contributions
From-0.032
 refreshing-0.032
asting-0.030
Years-0.029
 Every-0.028
Token,
Token ID11
Feature activation+4.08e+00

Pos logit contributions
From+0.036
 refreshing+0.034
 Every+0.033
 often+0.031
Years+0.031
Neg logit contributions
 client-0.084
 classmate-0.084
 principal-0.083
 grade-0.083
 ha-0.081
Token oh
Token ID11752
Feature activation+1.07e+01

Pos logit contributions
 client+0.122
 classmate+0.120
 principal+0.120
 grade+0.118
 babys+0.115
Neg logit contributions
From-0.083
 refreshing-0.080
 Every-0.077
 often-0.076
 all-0.075
Token no
Token ID645
Feature activation+2.87e+00

Pos logit contributions
 principal+0.408
 client+0.389
 classmate+0.385
 grade+0.377
 babys+0.376
Neg logit contributions
 all-0.143
 tight-0.142
 tightly-0.137
 Every-0.133
 every-0.133
Token!
Token ID0
Feature activation+1.76e+00

Pos logit contributions
 classmate+0.088
 principal+0.087
 client+0.086
 babys+0.083
 grade+0.082
Neg logit contributions
From-0.057
 Every-0.056
 refreshing-0.055
asting-0.054
 tight-0.053
Token The
Token ID383
Feature activation-7.71e+00

Pos logit contributions
 client+0.058
 classmate+0.058
 grade+0.057
 principal+0.057
 ha+0.057
Neg logit contributions
From-0.029
 refreshing-0.028
 Every-0.028
Years-0.025
asting-0.025
Token stack
Token ID8931
Feature activation-3.35e+00

Pos logit contributions
From+0.163
 Every+0.162
 refreshing+0.156
 all+0.155
 tight+0.154
Neg logit contributions
 client-0.216
 classmate-0.215
 grade-0.210
 ha-0.208
 principal-0.208
Token fell
Token ID3214
Feature activation-8.84e-01

Pos logit contributions
From+0.072
 refreshing+0.070
 Every+0.069
 often+0.065
asting+0.065
Neg logit contributions
 client-0.094
 classmate-0.094
 principal-0.093
 grade-0.093
 ha-0.092
Token
Token ID95
Feature activation-7.15e+00

Pos logit contributions
From+0.070
 refreshing+0.068
 Every+0.067
asting+0.064
Years+0.064
Neg logit contributions
 client-0.075
 classmate-0.074
 ha-0.074
 grade-0.074
 principal-0.074
Token
Token ID26391
Feature activation+8.71e+00

Pos logit contributions
 refreshing+0.171
From+0.170
 Every+0.165
 often+0.159
Years+0.157
Neg logit contributions
 client-0.183
 principal-0.183
 grade-0.182
 ha-0.181
 classmate-0.180
Token
Token ID129
Feature activation-8.44e+00

Pos logit contributions
 classmate+0.211
 grade+0.208
 client+0.207
 babys+0.204
 principal+0.204
Neg logit contributions
From-0.220
 refreshing-0.209
 Every-0.207
 often-0.198
 tight-0.197
Token
Token ID241
Feature activation+1.63e+00

Pos logit contributions
 refreshing+0.190
From+0.171
 Every+0.171
asting+0.167
 sunlight+0.164
Neg logit contributions
 client-0.230
 principal-0.216
 classmate-0.215
 ha-0.213
 grade-0.208
TokenUh
Token ID34653
Feature activation+4.04e+00

Pos logit contributions
 classmate+0.059
 client+0.058
 principal+0.058
 ha+0.057
 babys+0.057
Neg logit contributions
From-0.024
 Every-0.022
 refreshing-0.021
asting-0.020
 tight-0.019
Token oh
Token ID11752
Feature activation+1.07e+01

Pos logit contributions
 classmate+0.092
 client+0.092
 principal+0.091
 grade+0.090
 ha+0.088
Neg logit contributions
From-0.110
 refreshing-0.106
 Every-0.104
asting-0.102
Years-0.102
Token,
Token ID11
Feature activation+4.02e+00

Pos logit contributions
 principal+0.389
 client+0.369
 classmate+0.369
 grade+0.363
 babys+0.352
Neg logit contributions
 tight-0.163
 tightly-0.154
 Every-0.148
 all-0.148
From-0.146
Token what
Token ID644
Feature activation+6.10e-01

Pos logit contributions
 ha+0.125
 client+0.118
 grade+0.117
 classmate+0.117
 babys+0.116
Neg logit contributions
Years-0.077
From-0.076
 Every-0.075
 refreshing-0.075
asting-0.074
Token should
Token ID815
Feature activation+1.71e+00

Pos logit contributions
 babys+0.017
 principal+0.017
 classmate+0.017
 client+0.017
 grade+0.016
Neg logit contributions
From-0.013
 all-0.013
 often-0.012
 Every-0.012
 sometimes-0.012
Token I
Token ID314
Feature activation-1.72e+00

Pos logit contributions
 classmate+0.059
 principal+0.059
 client+0.059
 liar+0.058
 grade+0.057
Neg logit contributions
 Every-0.025
From-0.025
 refreshing-0.024
 all-0.021
 often-0.021
Token do
Token ID466
Feature activation+1.88e+00

Pos logit contributions
From+0.039
 refreshing+0.039
 Every+0.039
 often+0.037
 tight+0.037
Neg logit contributions
 client-0.048
 principal-0.047
 classmate-0.046
 Lee-0.045
 babys-0.045
Token
Token ID220
Feature activation+2.35e+00

Pos logit contributions
 client+0.063
 classmate+0.062
 principal+0.061
 ha+0.061
 grade+0.061
Neg logit contributions
 Every-0.024
From-0.024
 refreshing-0.021
 From-0.020
 tight-0.019
Token\n
Token ID198
Feature activation-2.79e+00

Pos logit contributions
 classmate+0.078
 client+0.078
 principal+0.077
 grade+0.077
 ha+0.077
Neg logit contributions
 refreshing-0.037
From-0.037
 Every-0.033
asting-0.033
Years-0.033
Token\n
Token ID198
Feature activation-2.80e+00

Pos logit contributions
From+0.037
 Every+0.034
 refreshing+0.033
Years+0.031
asting+0.030
Neg logit contributions
 client-0.103
 classmate-0.102
 grade-0.102
 principal-0.101
 ha-0.101
Token"
Token ID1
Feature activation+6.17e-01

Pos logit contributions
From+0.051
 refreshing+0.048
 Every+0.047
Years+0.045
asting+0.044
Neg logit contributions
 client-0.089
 classmate-0.089
 grade-0.089
 principal-0.088
 ha-0.088
TokenUh
Token ID34653
Feature activation+4.04e+00

Pos logit contributions
 client+0.019
 classmate+0.018
 grade+0.018
 principal+0.018
 ha+0.018
Neg logit contributions
From-0.013
 refreshing-0.012
 Every-0.012
Years-0.011
asting-0.011
Token oh
Token ID11752
Feature activation+1.06e+01

Pos logit contributions
 principal+0.115
 client+0.113
 classmate+0.111
 grade+0.110
 babys+0.108
Neg logit contributions
 all-0.088
 often-0.087
From-0.087
 how-0.083
asting-0.083
Token!"
Token ID2474
Feature activation+4.98e+00

Pos logit contributions
 principal+0.416
 classmate+0.392
 client+0.391
 laptop+0.386
 classroom+0.377
Neg logit contributions
 all-0.149
 for-0.133
 how-0.129
 tight-0.127
 as-0.124
Token said
Token ID531
Feature activation+7.44e+00

Pos logit contributions
 client+0.155
 principal+0.153
 liar+0.151
 classmate+0.149
 grade+0.149
Neg logit contributions
 Every-0.104
From-0.104
 all-0.096
 tight-0.095
 From-0.094
Token Liz
Token ID28313
Feature activation+1.26e+00

Pos logit contributions
 liar+0.259
 client+0.257
 principal+0.255
fer+0.254
 grade+0.246
Neg logit contributions
 all-0.130
 Every-0.120
 tight-0.115
 around-0.114
 every-0.111
Token.
Token ID13
Feature activation+1.95e+00

Pos logit contributions
 client+0.044
 grade+0.044
 principal+0.043
 liar+0.043
 classmate+0.042
Neg logit contributions
 tight-0.018
 tightly-0.017
 around-0.017
 all-0.017
 often-0.017
Token\n
Token ID198
Feature activation-2.87e+00

Pos logit contributions
 client+0.068
 classmate+0.068
 grade+0.068
 principal+0.067
 ha+0.067
Neg logit contributions
 Every-0.029
From-0.029
 refreshing-0.025
 tight-0.025
 From-0.025
Token fox
Token ID21831
Feature activation+1.39e+00

Pos logit contributions
From+0.194
 refreshing+0.189
 Every+0.187
asting+0.180
lessly+0.180
Neg logit contributions
 ha-0.180
 client-0.178
 classmate-0.171
 principal-0.167
 babys-0.164
Token said
Token ID531
Feature activation+7.44e+00

Pos logit contributions
 ha+0.031
 classmate+0.029
 happened+0.028
 client+0.027
 babys+0.027
Neg logit contributions
 refreshing-0.039
 Every-0.038
asting-0.038
ores-0.037
From-0.037
Token,
Token ID11
Feature activation+3.97e+00

Pos logit contributions
 client+0.226
 principal+0.223
 classmate+0.222
 grade+0.221
 babys+0.216
Neg logit contributions
From-0.149
 Every-0.139
 often-0.137
 tight-0.137
 all-0.137
Token "
Token ID366
Feature activation-2.73e+00

Pos logit contributions
 Lee+0.089
 ha+0.082
 babys+0.081
 classmate+0.080
 principal+0.078
Neg logit contributions
 refreshing-0.115
asting-0.112
Years-0.104
From-0.103
ales-0.102
TokenUh
Token ID34653
Feature activation+3.97e+00

Pos logit contributions
 refreshing+0.072
 all+0.071
 deeply+0.071
utt+0.071
 often+0.071
Neg logit contributions
 ha-0.062
 classmate-0.055
 client-0.054
fer-0.053
 principal-0.053
Token oh
Token ID11752
Feature activation+1.06e+01

Pos logit contributions
 principal+0.077
 grade+0.077
 classmate+0.076
 client+0.075
 babys+0.072
Neg logit contributions
From-0.121
 often-0.118
 refreshing-0.115
asting-0.114
Years-0.114
Token,
Token ID11
Feature activation+4.02e+00

Pos logit contributions
 principal+0.379
 classmate+0.362
 client+0.361
 grade+0.355
 laptop+0.350
Neg logit contributions
 all-0.164
 tight-0.161
 Every-0.152
 tightly-0.149
 often-0.144
Token I
Token ID314
Feature activation-1.73e+00

Pos logit contributions
 ha+0.112
 client+0.098
 Lee+0.096
 liar+0.096
 happened+0.095
Neg logit contributions
Years-0.101
From-0.095
rant-0.094
aling-0.093
 Later-0.093
Token shouldn
Token ID6584
Feature activation+1.60e+00

Pos logit contributions
From+0.038
 refreshing+0.037
 Every+0.037
 often+0.035
asting+0.035
Neg logit contributions
 classmate-0.048
 client-0.047
 principal-0.047
 grade-0.047
 ha-0.046
Token't
Token ID470
Feature activation+6.64e-02

Pos logit contributions
 classmate+0.046
 grade+0.044
 client+0.044
 principal+0.044
 liar+0.042
Neg logit contributions
From-0.037
 Every-0.035
 all-0.032
 refreshing-0.032
 often-0.031
Token stay
Token ID2652
Feature activation-3.47e+00

Pos logit contributions
 classmate+0.002
 principal+0.002
 client+0.002
 grade+0.002
 ha+0.002
Neg logit contributions
From-0.001
 refreshing-0.001
 Every-0.001
asting-0.001
 often-0.001
Token
Token ID220
Feature activation+2.44e+00

Pos logit contributions
 client+0.081
 classmate+0.080
 grade+0.080
 principal+0.080
 ha+0.079
Neg logit contributions
From-0.030
 Every-0.030
 refreshing-0.028
 From-0.025
 tight-0.024
Token\n
Token ID198
Feature activation-2.53e+00

Pos logit contributions
 classmate+0.082
 client+0.081
 principal+0.081
 grade+0.081
 ha+0.080
Neg logit contributions
From-0.038
 refreshing-0.038
 Every-0.034
asting-0.033
Years-0.033
Token\n
Token ID198
Feature activation-2.57e+00

Pos logit contributions
From+0.034
 Every+0.032
 refreshing+0.031
asting+0.028
Years+0.028
Neg logit contributions
 client-0.092
 classmate-0.092
 grade-0.091
 principal-0.091
 ha-0.091
Token"
Token ID1
Feature activation+6.12e-01

Pos logit contributions
 refreshing+0.057
 Every+0.053
From+0.053
asting+0.052
utt+0.052
Neg logit contributions
 ha-0.069
 classmate-0.068
 client-0.067
 babys-0.067
 principal-0.066
TokenUh
Token ID34653
Feature activation+4.16e+00

Pos logit contributions
 client+0.019
 classmate+0.019
 grade+0.019
 principal+0.019
 ha+0.018
Neg logit contributions
From-0.012
 refreshing-0.011
 Every-0.011
Years-0.011
asting-0.011
Token oh
Token ID11752
Feature activation+1.06e+01

Pos logit contributions
 principal+0.108
 client+0.106
 classmate+0.105
 grade+0.104
 laptop+0.097
Neg logit contributions
From-0.101
 all-0.100
 often-0.099
 tight-0.096
asting-0.096
Token,
Token ID11
Feature activation+4.05e+00

Pos logit contributions
 principal+0.445
 client+0.420
 classmate+0.415
 laptop+0.409
 classroom+0.405
Neg logit contributions
 all-0.118
 for-0.109
 how-0.108
 too-0.108
 that-0.107
Token we
Token ID356
Feature activation-1.01e+00

Pos logit contributions
 ha+0.126
 classmate+0.125
 client+0.124
 principal+0.123
 grade+0.123
Neg logit contributions
From-0.076
 Every-0.072
 refreshing-0.071
Years-0.069
asting-0.069
Token have
Token ID423
Feature activation-4.24e+00

Pos logit contributions
From+0.021
 refreshing+0.021
 Every+0.020
asting+0.020
Years+0.019
Neg logit contributions
 classmate-0.029
 client-0.028
 principal-0.028
 grade-0.028
 ha-0.028
Token a
Token ID257
Feature activation+5.52e+00

Pos logit contributions
 refreshing+0.076
From+0.074
 Every+0.073
 all+0.069
 often+0.069
Neg logit contributions
 classmate-0.134
 client-0.133
 principal-0.131
 grade-0.131
 ha-0.127
Token problem
Token ID1917
Feature activation+6.84e+00

Pos logit contributions
fer+0.149
 principal+0.148
 Lee+0.147
 classmate+0.146
 client+0.145
Neg logit contributions
 refreshing-0.125
 tight-0.125
 fresh-0.115
 all-0.113
Years-0.112
Token began
Token ID2540
Feature activation+5.10e+00

Pos logit contributions
 Every+0.200
From+0.194
asting+0.191
Years+0.189
reci+0.181
Neg logit contributions
 ha-0.214
 happened-0.200
 classmate-0.183
 Lee-0.182
 babys-0.182
Token screaming
Token ID14788
Feature activation-2.01e+00

Pos logit contributions
 ha+0.112
 babys+0.108
 classmate+0.108
 grade+0.108
 client+0.105
Neg logit contributions
From-0.143
 refreshing-0.136
Years-0.131
asting-0.131
 Every-0.131
Token again
Token ID757
Feature activation+9.71e-01

Pos logit contributions
From+0.047
 refreshing+0.045
Years+0.043
 Every+0.043
asting+0.042
Neg logit contributions
 classmate-0.052
 ha-0.051
 grade-0.051
 client-0.050
 principal-0.049
Token.
Token ID13
Feature activation+1.53e+00

Pos logit contributions
 ha+0.024
 classmate+0.024
 client+0.024
 grade+0.024
 principal+0.023
Neg logit contributions
From-0.024
 refreshing-0.023
Years-0.022
 Every-0.022
asting-0.021
Token Sc
Token ID1446
Feature activation-3.69e+00

Pos logit contributions
 client+0.045
 Lee+0.045
 principal+0.045
 ha+0.044
 classmate+0.044
Neg logit contributions
asting-0.031
 refreshing-0.030
rant-0.028
From-0.027
Years-0.027
Tokenutt
Token ID15318
Feature activation-1.42e+01

Pos logit contributions
 refreshing+0.098
 fresh+0.090
 crunch+0.090
 all+0.087
 hay+0.086
Neg logit contributions
 client-0.089
ird-0.085
 classmate-0.083
 grade-0.082
acher-0.082
Tokenles
Token ID829
Feature activation-3.46e+00

Pos logit contributions
 Every+0.319
 refreshing+0.311
From+0.307
Years+0.299
 crunch+0.279
Neg logit contributions
 ha-0.383
 happened-0.365
 classmate-0.363
 client-0.346
fer-0.341
Token was
Token ID373
Feature activation+4.37e+00

Pos logit contributions
From+0.073
 refreshing+0.073
 Every+0.071
Years+0.070
asting+0.064
Neg logit contributions
 ha-0.098
 classmate-0.095
 happened-0.092
 grade-0.091
 babys-0.089
Token so
Token ID523
Feature activation+5.79e+00

Pos logit contributions
 ha+0.118
 happened+0.114
 classmate+0.113
 babys+0.111
 principal+0.107
Neg logit contributions
From-0.096
 Every-0.089
reci-0.088
orting-0.087
Years-0.085
Token scared
Token ID12008
Feature activation+3.57e-01

Pos logit contributions
 happened+0.091
 ha+0.090
 classmate+0.082
 liar+0.080
 client+0.076
Neg logit contributions
ales-0.185
Years-0.180
rant-0.179
orting-0.179
asting-0.177
Token,
Token ID11
Feature activation+3.82e+00

Pos logit contributions
 happened+0.010
 ha+0.009
 Lee+0.009
 grade+0.009
 classmate+0.009
Neg logit contributions
From-0.008
Years-0.008
 refreshing-0.008
asting-0.007
 Every-0.007
Token came
Token ID1625
Feature activation+4.14e+00

Pos logit contributions
 happened+0.013
 classmate+0.013
 ha+0.013
 grade+0.013
 client+0.013
Neg logit contributions
From-0.012
 refreshing-0.011
 Every-0.011
Years-0.011
 From-0.010
Token from
Token ID422
Feature activation-6.83e-01

Pos logit contributions
 classmate+0.115
 client+0.114
 ha+0.113
 babys+0.112
 grade+0.111
Neg logit contributions
 refreshing-0.088
From-0.087
Years-0.081
 Every-0.080
asting-0.076
Token it
Token ID340
Feature activation+6.56e-01

Pos logit contributions
From+0.013
 refreshing+0.012
 Every+0.012
Years+0.012
asting+0.012
Neg logit contributions
 ha-0.021
 client-0.021
 grade-0.021
 principal-0.020
 classmate-0.020
Token.
Token ID13
Feature activation+1.62e+00

Pos logit contributions
 classmate+0.013
 happened+0.013
 ha+0.013
 client+0.013
 grade+0.013
Neg logit contributions
From-0.019
 refreshing-0.018
Years-0.018
asting-0.017
 Every-0.016
Token Sc
Token ID1446
Feature activation-3.60e+00

Pos logit contributions
 Lee+0.052
 classmate+0.051
 principal+0.051
 babys+0.050
 ha+0.050
Neg logit contributions
asting-0.031
Years-0.028
 refreshing-0.028
From-0.026
 often-0.026
Tokenutt
Token ID15318
Feature activation-1.42e+01

Pos logit contributions
 refreshing+0.094
 fresh+0.086
 crunch+0.085
From+0.083
 autumn+0.082
Neg logit contributions
 client-0.091
 classmate-0.086
ird-0.085
 grade-0.085
 principal-0.085
Tokenles
Token ID829
Feature activation-3.37e+00

Pos logit contributions
 Every+0.351
 refreshing+0.344
From+0.338
Years+0.335
 crunch+0.311
Neg logit contributions
 ha-0.346
 happened-0.343
 classmate-0.326
fer-0.306
 client-0.305
Token was
Token ID373
Feature activation+4.40e+00

Pos logit contributions
From+0.077
 refreshing+0.076
 Every+0.075
Years+0.074
asting+0.068
Neg logit contributions
 ha-0.088
 classmate-0.087
 happened-0.085
 grade-0.082
 babys-0.082
Token so
Token ID523
Feature activation+5.89e+00

Pos logit contributions
 happened+0.116
 ha+0.116
 classmate+0.115
 babys+0.113
 principal+0.109
Neg logit contributions
From-0.097
 Every-0.089
reci-0.088
orting-0.088
Years-0.087
Token frightened
Token ID24776
Feature activation+7.31e-01

Pos logit contributions
 happened+0.109
 ha+0.098
 classmate+0.095
 liar+0.091
 client+0.089
Neg logit contributions
orting-0.176
Years-0.174
ales-0.173
asting-0.172
rant-0.171
Token,
Token ID11
Feature activation+4.07e+00

Pos logit contributions
 happened+0.021
 ha+0.020
 Lee+0.020
 babys+0.019
 grade+0.019
Neg logit contributions
From-0.016
Years-0.015
 refreshing-0.015
 Every-0.014
asting-0.014
Token too
Token ID1165
Feature activation+3.86e+00

Pos logit contributions
 classmate+0.069
 ha+0.069
 client+0.068
 grade+0.067
 babys+0.067
Neg logit contributions
From-0.064
 refreshing-0.062
 Every-0.059
Years-0.059
asting-0.056
Token.
Token ID13
Feature activation+1.48e+00

Pos logit contributions
 ha+0.084
 client+0.083
 grade+0.080
 classmate+0.079
 liar+0.079
Neg logit contributions
From-0.104
 refreshing-0.103
Years-0.098
asting-0.095
 Every-0.094
Token\n
Token ID198
Feature activation-2.79e+00

Pos logit contributions
 babys+0.040
 Lee+0.039
 ha+0.039
 principal+0.038
 client+0.038
Neg logit contributions
 refreshing-0.034
asting-0.033
rant-0.031
Years-0.031
 tighter-0.030
Token\n
Token ID198
Feature activation-2.83e+00

Pos logit contributions
 refreshing+0.064
rant+0.057
 tighter+0.056
 crunch+0.055
ales+0.054
Neg logit contributions
 ha-0.074
 babys-0.072
 principal-0.071
 classmate-0.070
 client-0.070
TokenSc
Token ID3351
Feature activation-7.08e+00

Pos logit contributions
rant+0.061
 refreshing+0.059
 thanks+0.056
 Every+0.052
ales+0.051
Neg logit contributions
 ha-0.087
 client-0.075
 babys-0.075
 principal-0.074
ird-0.074
Tokenutt
Token ID15318
Feature activation-1.41e+01

Pos logit contributions
 refreshing+0.187
 fresh+0.170
 crunch+0.168
 autumn+0.165
 all+0.164
Neg logit contributions
 client-0.175
 classmate-0.168
ird-0.163
acher-0.162
 grade-0.162
Tokenles
Token ID829
Feature activation-3.46e+00

Pos logit contributions
 Every+0.310
 refreshing+0.304
From+0.297
Years+0.286
 crunch+0.274
Neg logit contributions
 ha-0.388
 classmate-0.368
 happened-0.362
 client-0.352
 principal-0.348
Token was
Token ID373
Feature activation+4.42e+00

Pos logit contributions
 refreshing+0.073
From+0.072
 Every+0.072
Years+0.070
asting+0.065
Neg logit contributions
 ha-0.099
 classmate-0.095
 babys-0.091
 principal-0.090
 happened-0.090
Token scared
Token ID12008
Feature activation+3.47e-01

Pos logit contributions
 ha+0.123
 babys+0.120
 classmate+0.119
 happened+0.117
 principal+0.114
Neg logit contributions
From-0.093
 Every-0.085
orting-0.084
 refreshing-0.084
Years-0.083
Token but
Token ID475
Feature activation-4.53e+00

Pos logit contributions
 happened+0.008
 ha+0.008
 grade+0.008
 babys+0.007
 Lee+0.007
Neg logit contributions
From-0.009
 refreshing-0.009
Years-0.009
asting-0.008
 Every-0.008
Token he
Token ID339
Feature activation-1.32e+00

Pos logit contributions
Years+0.102
ales+0.102
rant+0.100
asting+0.096
reci+0.095
Neg logit contributions
 ha-0.125
 babys-0.112
 classmate-0.111
 client-0.111
 happened-0.108
Token,
Token ID11
Feature activation+3.66e+00

Pos logit contributions
 classmate+0.034
 ha+0.034
 client+0.033
 grade+0.033
 principal+0.033
Neg logit contributions
From-0.028
 Every-0.026
Years-0.026
 refreshing-0.026
asting-0.025
Token but
Token ID475
Feature activation-4.59e+00

Pos logit contributions
 classmate+0.089
 babys+0.088
 Lee+0.087
 principal+0.086
 ha+0.085
Neg logit contributions
lated-0.086
rant-0.086
Years-0.084
 Later-0.082
reci-0.081
Token especially
Token ID2592
Feature activation-7.77e+00

Pos logit contributions
Years+0.093
ales+0.089
asting+0.087
 Every+0.087
 refreshing+0.087
Neg logit contributions
 ha-0.130
 happened-0.129
 client-0.125
 classmate-0.123
 babys-0.122
Token for
Token ID329
Feature activation-3.97e-01

Pos logit contributions
From+0.159
Years+0.145
 Every+0.140
 refreshing+0.133
 Later+0.130
Neg logit contributions
 classmate-0.235
 happened-0.234
 client-0.230
 ha-0.229
 grade-0.227
Token Sc
Token ID1446
Feature activation-3.75e+00

Pos logit contributions
From+0.008
 refreshing+0.008
Years+0.008
 Every+0.007
mediately+0.007
Neg logit contributions
 ha-0.012
 client-0.011
 principal-0.011
 babys-0.011
 grade-0.011
Tokenutt
Token ID15318
Feature activation-1.41e+01

Pos logit contributions
 refreshing+0.095
From+0.087
 fresh+0.087
 crunch+0.087
 sunlight+0.086
Neg logit contributions
 client-0.092
ird-0.087
 grade-0.086
acher-0.084
 principal-0.084
Tokenles
Token ID829
Feature activation-3.46e+00

Pos logit contributions
From+0.215
 Every+0.202
 refreshing+0.196
Years+0.190
 From+0.170
Neg logit contributions
 ha-0.488
 classmate-0.481
 client-0.474
 happened-0.470
 principal-0.456
Token.
Token ID13
Feature activation+1.50e+00

Pos logit contributions
From+0.072
 refreshing+0.069
 Every+0.066
Years+0.065
asting+0.062
Neg logit contributions
 classmate-0.101
 ha-0.101
 client-0.099
 grade-0.099
 principal-0.098
Token<|endoftext|>
Token ID50256
Feature activation-3.20e-01

Pos logit contributions
 Lee+0.034
 babys+0.033
 client+0.033
ird+0.032
 classmate+0.032
Neg logit contributions
 refreshing-0.040
Years-0.039
rant-0.039
asting-0.039
 tighter-0.037
TokenB
Token ID33
Feature activation-1.04e+00

Pos logit contributions
From+0.008
 refreshing+0.008
Years+0.007
asting+0.007
 Every+0.007
Neg logit contributions
 ha-0.008
 classmate-0.008
 principal-0.008
 grade-0.008
 client-0.008
Tokenoots
Token ID13880
Feature activation-7.76e+00

Pos logit contributions
 refreshing+0.026
From+0.026
Years+0.025
 crunch+0.024
 hay+0.024
Neg logit contributions
 babys-0.025
 classmate-0.025
 ha-0.024
 principal-0.023
 client-0.023
Token his
Token ID465
Feature activation+3.58e+00

Pos logit contributions
asting+0.067
From+0.063
Years+0.061
 refreshing+0.061
orting+0.059
Neg logit contributions
 classmate-0.081
 ha-0.076
 grade-0.074
 Lee-0.074
 happened-0.073
Token little
Token ID1310
Feature activation+5.62e-01

Pos logit contributions
 classmate+0.085
 ha+0.082
 client+0.082
 grade+0.079
 babys+0.079
Neg logit contributions
From-0.100
 Every-0.093
asting-0.091
 refreshing-0.090
Years-0.090
Token nose
Token ID9686
Feature activation+1.59e+00

Pos logit contributions
 client+0.014
 classmate+0.014
 principal+0.014
 grade+0.014
 ha+0.014
Neg logit contributions
From-0.014
 refreshing-0.013
 Every-0.013
asting-0.013
Years-0.013
Token and
Token ID290
Feature activation-8.06e+00

Pos logit contributions
 classmate+0.046
 ha+0.044
 happened+0.044
 grade+0.043
 babys+0.043
Neg logit contributions
From-0.033
Years-0.031
 Every-0.030
 refreshing-0.030
asting-0.029
Token sc
Token ID629
Feature activation-9.34e-01

Pos logit contributions
From+0.159
 Every+0.155
asting+0.151
 refreshing+0.151
Years+0.148
Neg logit contributions
 classmate-0.240
 ha-0.239
 client-0.233
 principal-0.231
 grade-0.230
Tokenutt
Token ID15318
Feature activation-1.41e+01

Pos logit contributions
 refreshing+0.020
From+0.019
asting+0.019
 autumn+0.019
Years+0.018
Neg logit contributions
 classmate-0.026
 ha-0.026
ird-0.026
 client-0.026
 grade-0.025
Tokenled
Token ID992
Feature activation-3.02e+00

Pos logit contributions
From+0.333
 Every+0.329
 refreshing+0.318
Years+0.305
asting+0.290
Neg logit contributions
 ha-0.362
 classmate-0.354
 happened-0.342
 client-0.332
fer-0.324
Token away
Token ID1497
Feature activation+4.51e+00

Pos logit contributions
From+0.065
asting+0.063
 refreshing+0.058
Years+0.057
orting+0.056
Neg logit contributions
 classmate-0.082
 grade-0.080
 ha-0.080
 happened-0.077
 Lee-0.075
Token.
Token ID13
Feature activation+1.92e+00

Pos logit contributions
 classmate+0.132
 client+0.130
 grade+0.130
 ha+0.130
 principal+0.128
Neg logit contributions
From-0.093
 refreshing-0.088
 Every-0.087
Years-0.084
asting-0.083
Token From
Token ID3574
Feature activation-8.21e+00

Pos logit contributions
 classmate+0.056
 Lee+0.056
 principal+0.054
 babys+0.054
 ha+0.053
Neg logit contributions
asting-0.041
 refreshing-0.038
Years-0.037
rant-0.035
From-0.035
Token that
Token ID326
Feature activation-1.48e+00

Pos logit contributions
asting+0.138
 Every+0.126
reci+0.125
lessly+0.123
From+0.121
Neg logit contributions
 ha-0.265
 classmate-0.255
 client-0.244
 happened-0.243
 principal-0.239
Token.
Token ID13
Feature activation+1.60e+00

Pos logit contributions
From+0.072
Years+0.067
 Every+0.065
 refreshing+0.065
 From+0.061
Neg logit contributions
 ha-0.078
 classmate-0.077
 Lee-0.076
 grade-0.076
 principal-0.076
Token He
Token ID679
Feature activation-7.19e+00

Pos logit contributions
 Lee+0.048
 ha+0.048
 client+0.047
 grade+0.047
 principal+0.046
Neg logit contributions
asting-0.031
 refreshing-0.029
Years-0.028
From-0.027
ock-0.027
Token screamed
Token ID25421
Feature activation+2.29e+00

Pos logit contributions
From+0.185
orting+0.185
 Every+0.184
Years+0.176
 refreshing+0.173
Neg logit contributions
 ha-0.170
 classmate-0.161
 happened-0.160
 babys-0.154
 client-0.153
Token and
Token ID290
Feature activation-8.37e+00

Pos logit contributions
 classmate+0.063
 grade+0.063
 Lee+0.063
 babys+0.063
 ha+0.062
Neg logit contributions
From-0.046
reci-0.045
 refreshing-0.045
 Every-0.044
Years-0.042
Token sc
Token ID629
Feature activation-1.02e+00

Pos logit contributions
From+0.212
 Every+0.206
Years+0.203
asting+0.200
 refreshing+0.198
Neg logit contributions
 ha-0.210
 classmate-0.198
 babys-0.195
 client-0.192
 principal-0.191
Tokenutt
Token ID15318
Feature activation-1.40e+01

Pos logit contributions
From+0.023
 refreshing+0.022
 Every+0.021
Years+0.021
 often+0.021
Neg logit contributions
 ha-0.027
ird-0.027
 classmate-0.026
 client-0.026
 grade-0.026
Tokenled
Token ID992
Feature activation-2.94e+00

Pos logit contributions
From+0.300
 Every+0.295
 refreshing+0.291
Years+0.274
asting+0.260
Neg logit contributions
 ha-0.392
 classmate-0.384
 happened-0.372
 client-0.372
 principal-0.366
Token away
Token ID1497
Feature activation+4.62e+00

Pos logit contributions
From+0.070
reci+0.063
asting+0.063
 refreshing+0.061
Years+0.060
Neg logit contributions
 ha-0.075
 grade-0.073
 classmate-0.071
 Lee-0.071
 happened-0.068
Token.
Token ID13
Feature activation+1.45e+00

Pos logit contributions
 classmate+0.130
 client+0.128
 ha+0.128
 grade+0.128
 principal+0.127
Neg logit contributions
From-0.100
 refreshing-0.095
 Every-0.095
Years-0.091
asting-0.090
Token
Token ID198
Feature activation-2.93e+00

Pos logit contributions
 Lee+0.042
 ha+0.042
 client+0.041
 classmate+0.040
 principal+0.040
Neg logit contributions
asting-0.029
 refreshing-0.029
Years-0.027
ock-0.027
From-0.026
Token
Token ID198
Feature activation-2.93e+00

Pos logit contributions
 refreshing+0.065
 tighter+0.055
asting+0.055
Years+0.054
rant+0.054
Neg logit contributions
 ha-0.080
 classmate-0.078
 principal-0.078
 client-0.078
 babys-0.077
Token tall
Token ID7331
Feature activation-3.61e+00

Pos logit contributions
 grade+0.179
 client+0.179
 principal+0.177
 classmate+0.177
ird+0.170
Neg logit contributions
 refreshing-0.095
 tight-0.089
From-0.084
 crunch-0.082
 fresh-0.081
Token tree
Token ID5509
Feature activation+3.27e-01

Pos logit contributions
From+0.072
 refreshing+0.068
 Every+0.068
Years+0.066
asting+0.065
Neg logit contributions
 ha-0.108
 client-0.108
 classmate-0.107
 grade-0.106
 principal-0.106
Token.
Token ID13
Feature activation+1.91e+00

Pos logit contributions
 ha+0.008
 grade+0.008
 desk+0.008
 classmate+0.008
 client+0.008
Neg logit contributions
From-0.008
 refreshing-0.008
Years-0.008
 Every-0.007
 Nem-0.007
Token He
Token ID679
Feature activation-6.97e+00

Pos logit contributions
 grade+0.062
 Lee+0.062
 principal+0.062
 client+0.061
 classmate+0.061
Neg logit contributions
 refreshing-0.033
asting-0.033
Years-0.031
From-0.031
 often-0.030
Token sc
Token ID629
Feature activation-9.97e-01

Pos logit contributions
 refreshing+0.164
asting+0.158
From+0.158
 Every+0.157
orting+0.151
Neg logit contributions
 ha-0.182
 classmate-0.176
 client-0.172
 happened-0.169
 babys-0.167
Tokenutt
Token ID15318
Feature activation-1.40e+01

Pos logit contributions
 refreshing+0.018
From+0.017
asting+0.016
 Every+0.016
 often+0.016
Neg logit contributions
 classmate-0.032
 client-0.032
 ha-0.032
 grade-0.031
 principal-0.031
Tokenled
Token ID992
Feature activation-2.94e+00

Pos logit contributions
 refreshing+0.342
 Every+0.341
From+0.338
Years+0.319
asting+0.308
Neg logit contributions
 happened-0.340
 ha-0.336
 classmate-0.323
 client-0.310
 Lee-0.303
Token over
Token ID625
Feature activation+1.06e+00

Pos logit contributions
asting+0.082
From+0.078
reci+0.076
 refreshing+0.075
orting+0.074
Neg logit contributions
 Lee-0.058
 happened-0.058
 ha-0.057
 classmate-0.056
 grade-0.055
Token to
Token ID284
Feature activation+4.78e+00

Pos logit contributions
 ha+0.024
 happened+0.023
 Lee+0.023
 grade+0.022
 classmate+0.020
Neg logit contributions
From-0.030
asting-0.029
 refreshing-0.028
orting-0.027
reci-0.027
Token it
Token ID340
Feature activation+7.41e-01

Pos logit contributions
 ha+0.111
 grade+0.106
 principal+0.098
 client+0.098
 Lee+0.093
Neg logit contributions
 refreshing-0.133
From-0.132
lessly-0.127
Years-0.127
orting-0.123
Token and
Token ID290
Feature activation-8.28e+00

Pos logit contributions
 desk+0.020
 Lee+0.019
 classmate+0.019
 grade+0.019
 ha+0.019
Neg logit contributions
From-0.016
 refreshing-0.016
Years-0.015
asting-0.015
reci-0.015
Token some
Token ID617
Feature activation-1.57e+00

Pos logit contributions
 ha+0.090
 classmate+0.089
 principal+0.089
 client+0.089
 babys+0.087
Neg logit contributions
From-0.056
 refreshing-0.052
 Every-0.052
Years-0.051
asting-0.050
Token small
Token ID1402
Feature activation-2.99e+00

Pos logit contributions
From+0.031
 Every+0.029
 refreshing+0.029
Years+0.028
 often+0.027
Neg logit contributions
 classmate-0.047
 client-0.047
 ha-0.047
 grade-0.046
 principal-0.046
Token colourful
Token ID44799
Feature activation-4.46e+00

Pos logit contributions
From+0.055
 refreshing+0.054
 Every+0.053
asting+0.049
 tight+0.049
Neg logit contributions
 classmate-0.092
 principal-0.092
 client-0.092
 grade-0.091
 ha-0.090
Token creatures
Token ID8109
Feature activation-6.98e-01

Pos logit contributions
From+0.083
 refreshing+0.078
 Every+0.077
Years+0.076
asting+0.073
Neg logit contributions
 ha-0.139
 classmate-0.138
 client-0.137
 grade-0.136
 babys-0.134
Token sc
Token ID629
Feature activation-1.05e+00

Pos logit contributions
From+0.016
Years+0.016
 refreshing+0.015
 Every+0.015
asting+0.014
Neg logit contributions
 classmate-0.018
 happened-0.018
 ha-0.018
 grade-0.018
 client-0.018
Tokenutt
Token ID15318
Feature activation-1.39e+01

Pos logit contributions
From+0.023
 refreshing+0.022
Years+0.021
 often+0.020
 Every+0.020
Neg logit contributions
 client-0.031
 classmate-0.030
 grade-0.030
 principal-0.030
ird-0.029
Tokenling
Token ID1359
Feature activation-6.34e+00

Pos logit contributions
From+0.297
 Every+0.269
Years+0.260
 refreshing+0.252
Even+0.244
Neg logit contributions
 happened-0.400
 classmate-0.381
 ha-0.375
 Lee-0.371
 client-0.365
Token around
Token ID1088
Feature activation-2.41e+00

Pos logit contributions
From+0.129
 refreshing+0.113
Years+0.110
 Every+0.107
asting+0.102
Neg logit contributions
 happened-0.189
 classmate-0.184
 grade-0.178
 ha-0.177
 Lee-0.175
Token.
Token ID13
Feature activation+1.87e+00

Pos logit contributions
From+0.052
 refreshing+0.049
Years+0.047
 Every+0.044
asting+0.043
Neg logit contributions
 Lee-0.070
 classmate-0.069
 ha-0.068
 principal-0.067
 grade-0.067
Token He
Token ID679
Feature activation-7.02e+00

Pos logit contributions
 principal+0.054
 Lee+0.052
 classmate+0.052
 client+0.051
 ha+0.050
Neg logit contributions
 refreshing-0.042
asting-0.040
From-0.039
Years-0.038
 often-0.036
Token was
Token ID373
Feature activation+4.43e+00

Pos logit contributions
From+0.185
 refreshing+0.185
 Every+0.184
asting+0.184
orting+0.183
Neg logit contributions
 classmate-0.158
 ha-0.157
 happened-0.149
 principal-0.146
 client-0.145
Tokenpillar
Token ID41643
Feature activation+7.90e-01

Pos logit contributions
 Every+0.140
From+0.140
 refreshing+0.140
Years+0.137
asting+0.136
Neg logit contributions
 ha-0.086
 classmate-0.085
 grade-0.080
 principal-0.080
 client-0.079
Token happily
Token ID18177
Feature activation-5.77e-01

Pos logit contributions
 ha+0.020
 classmate+0.019
 babys+0.018
 happened+0.018
 client+0.018
Neg logit contributions
asting-0.020
 Every-0.020
From-0.019
lessly-0.019
orting-0.019
Token complied
Token ID35413
Feature activation+2.09e+00

Pos logit contributions
From+0.013
 Every+0.013
 refreshing+0.012
Years+0.012
asting+0.012
Neg logit contributions
 ha-0.016
 classmate-0.016
 babys-0.015
 grade-0.015
 happened-0.015
Token and
Token ID290
Feature activation-8.08e+00

Pos logit contributions
 ha+0.060
 classmate+0.060
 Lee+0.059
 principal+0.058
 babys+0.057
Neg logit contributions
 refreshing-0.042
reci-0.040
 Every-0.039
utt-0.038
From-0.038
Token sc
Token ID629
Feature activation-9.42e-01

Pos logit contributions
From+0.154
 Every+0.151
 refreshing+0.146
asting+0.145
Years+0.143
Neg logit contributions
 ha-0.248
 classmate-0.246
 principal-0.241
 grade-0.240
 client-0.238
Tokenutt
Token ID15318
Feature activation-1.39e+01

Pos logit contributions
 refreshing+0.018
From+0.018
asting+0.017
 Every+0.017
 often+0.017
Neg logit contributions
 ha-0.029
 classmate-0.028
 grade-0.028
 client-0.028
ird-0.028
Tokenled
Token ID992
Feature activation-2.84e+00

Pos logit contributions
From+0.282
 refreshing+0.275
 Every+0.275
Years+0.256
asting+0.253
Neg logit contributions
 ha-0.404
 classmate-0.402
 client-0.393
 principal-0.385
 grade-0.383
Token closer
Token ID5699
Feature activation+3.31e+00

Pos logit contributions
From+0.067
asting+0.066
 refreshing+0.063
Years+0.061
 Every+0.059
Neg logit contributions
 ha-0.073
 classmate-0.072
 grade-0.071
 Lee-0.068
 client-0.066
Token to
Token ID284
Feature activation+5.01e+00

Pos logit contributions
 grade+0.077
 classmate+0.076
 ha+0.075
 Lee+0.072
 client+0.072
Neg logit contributions
From-0.089
 refreshing-0.083
asting-0.080
Years-0.079
 Every-0.078
Token Rose
Token ID8049
Feature activation+1.45e+00

Pos logit contributions
 classmate+0.158
 client+0.158
 grade+0.156
 principal+0.156
 ha+0.156
Neg logit contributions
From-0.092
 refreshing-0.090
 Every-0.086
asting-0.083
Years-0.083
Token.
Token ID13
Feature activation+2.18e+00

Pos logit contributions
 classmate+0.035
 ha+0.034
 grade+0.034
 client+0.034
 babys+0.033
Neg logit contributions
From-0.038
 refreshing-0.037
Years-0.036
 Every-0.034
asting-0.034
Token crab
Token ID32202
Feature activation+1.29e+00

Pos logit contributions
 client+0.021
 grade+0.021
 classmate+0.021
 principal+0.021
 ha+0.020
Neg logit contributions
From-0.020
 refreshing-0.020
 Every-0.019
 tight-0.018
 often-0.018
Token named
Token ID3706
Feature activation-5.70e-01

Pos logit contributions
 ha+0.035
 classmate+0.033
 babys+0.033
 client+0.033
 principal+0.032
Neg logit contributions
From-0.030
 refreshing-0.030
Years-0.029
 Every-0.029
asting-0.028
Token Charlie
Token ID11526
Feature activation+1.74e+00

Pos logit contributions
From+0.008
 Every+0.008
 refreshing+0.008
asting+0.007
 often+0.007
Neg logit contributions
 client-0.020
 grade-0.020
 classmate-0.020
 principal-0.020
 ha-0.020
Token,
Token ID11
Feature activation+4.07e+00

Pos logit contributions
 ha+0.044
 classmate+0.043
 client+0.043
 babys+0.043
 principal+0.042
Neg logit contributions
From-0.044
 refreshing-0.043
Years-0.041
 Every-0.040
asting-0.039
Token sc
Token ID629
Feature activation-6.80e-01

Pos logit contributions
 ha+0.098
 babys+0.097
 happened+0.097
 classmate+0.096
 Lee+0.093
Neg logit contributions
From-0.101
 refreshing-0.100
Years-0.099
asting-0.095
 Every-0.094
Tokenutt
Token ID15318
Feature activation-1.38e+01

Pos logit contributions
From+0.012
 refreshing+0.011
 Every+0.011
Years+0.010
asting+0.010
Neg logit contributions
 classmate-0.022
 client-0.022
 ha-0.022
 principal-0.022
 grade-0.022
Tokenling
Token ID1359
Feature activation-6.29e+00

Pos logit contributions
From+0.217
 refreshing+0.210
 Every+0.210
Years+0.193
asting+0.174
Neg logit contributions
 ha-0.470
 classmate-0.466
 client-0.454
 happened-0.448
 babys-0.446
Token towards
Token ID3371
Feature activation+1.80e+00

Pos logit contributions
From+0.118
 refreshing+0.115
asting+0.109
Years+0.106
 Every+0.102
Neg logit contributions
 classmate-0.194
 ha-0.188
 grade-0.186
 client-0.184
 happened-0.183
Token him
Token ID683
Feature activation+5.26e+00

Pos logit contributions
 classmate+0.055
 client+0.053
 principal+0.053
 grade+0.053
 ha+0.053
Neg logit contributions
From-0.035
 refreshing-0.035
Years-0.031
asting-0.031
lessly-0.030
Token.
Token ID13
Feature activation+2.01e+00

Pos logit contributions
 classmate+0.117
 client+0.115
 ha+0.113
 grade+0.113
 Lee+0.112
Neg logit contributions
From-0.146
 refreshing-0.140
Years-0.138
asting-0.128
 Every-0.125
Token Charlie
Token ID11526
Feature activation+1.67e+00

Pos logit contributions
 classmate+0.057
 Lee+0.056
 client+0.056
 principal+0.055
 babys+0.054
Neg logit contributions
asting-0.044
 refreshing-0.042
From-0.042
Years-0.041
lessly-0.039
Token the
Token ID262
Feature activation-1.04e+00

Pos logit contributions
From+0.022
 refreshing+0.020
Years+0.020
asting+0.020
orting+0.020
Neg logit contributions
 ha-0.032
 grade-0.031
 client-0.031
 classmate-0.031
 Lee-0.031
Token dry
Token ID5894
Feature activation-5.47e+00

Pos logit contributions
From+0.023
 Every+0.022
Years+0.021
 From+0.020
 often+0.020
Neg logit contributions
 ha-0.030
 classmate-0.030
 principal-0.029
 client-0.029
 grade-0.029
Token grass
Token ID8701
Feature activation-4.10e+00

Pos logit contributions
From+0.117
Years+0.109
 Every+0.103
 From+0.098
 often+0.095
Neg logit contributions
 classmate-0.161
 ha-0.161
 babys-0.158
 grade-0.156
 principal-0.152
Token and
Token ID290
Feature activation-8.08e+00

Pos logit contributions
From+0.105
Years+0.095
 refreshing+0.093
reci+0.089
 Every+0.088
Neg logit contributions
 classmate-0.098
 Lee-0.097
 babys-0.093
 desk-0.092
 phone-0.092
Token sc
Token ID629
Feature activation-7.56e-01

Pos logit contributions
From+0.172
 Every+0.172
Years+0.160
asting+0.152
orting+0.149
Neg logit contributions
 ha-0.233
 classmate-0.224
 happened-0.216
 grade-0.214
 principal-0.214
Tokenutt
Token ID15318
Feature activation-1.38e+01

Pos logit contributions
From+0.016
 refreshing+0.015
Years+0.015
 Every+0.015
 autumn+0.014
Neg logit contributions
 ha-0.021
ird-0.021
 grade-0.021
 classmate-0.021
 client-0.020
Tokenled
Token ID992
Feature activation-2.82e+00

Pos logit contributions
From+0.290
 Every+0.280
Years+0.261
 refreshing+0.260
 From+0.246
Neg logit contributions
 ha-0.385
 happened-0.378
 classmate-0.377
fer-0.354
 client-0.350
Token away
Token ID1497
Feature activation+4.59e+00

Pos logit contributions
From+0.069
Years+0.061
asting+0.061
orting+0.059
 Every+0.058
Neg logit contributions
 happened-0.067
 ha-0.065
 grade-0.064
 classmate-0.064
 haircut-0.057
Token with
Token ID351
Feature activation-4.40e+00

Pos logit contributions
 classmate+0.121
 grade+0.117
 ha+0.117
 client+0.117
 babys+0.115
Neg logit contributions
From-0.108
 Every-0.099
Years-0.099
 refreshing-0.098
asting-0.094
Token its
Token ID663
Feature activation-1.71e+00

Pos logit contributions
From+0.096
Years+0.086
orting+0.085
asting+0.082
 Later+0.081
Neg logit contributions
 ha-0.124
 classmate-0.124
 happened-0.120
 client-0.118
 liar-0.117
Token friends
Token ID2460
Feature activation+3.26e+00

Pos logit contributions
From+0.042
 Every+0.039
Years+0.038
 refreshing+0.038
asting+0.038
Neg logit contributions
 classmate-0.044
 ha-0.043
 client-0.043
 babys-0.042
 grade-0.042
Token learned
Token ID4499
Feature activation+6.38e-01

Pos logit contributions
From+0.202
 refreshing+0.197
asting+0.189
Years+0.188
 Every+0.180
Neg logit contributions
 ha-0.238
 client-0.233
 classmate-0.226
 principal-0.226
 babys-0.224
Token a
Token ID257
Feature activation+4.70e+00

Pos logit contributions
 client+0.019
 principal+0.019
 classmate+0.019
 babys+0.019
 grade+0.019
Neg logit contributions
From-0.012
 refreshing-0.012
 Every-0.011
asting-0.011
Years-0.011
Token hard
Token ID1327
Feature activation-2.36e+00

Pos logit contributions
 client+0.142
 classmate+0.141
 grade+0.139
 principal+0.138
 ha+0.138
Neg logit contributions
From-0.092
 refreshing-0.089
 Every-0.088
asting-0.083
Years-0.081
Token lesson
Token ID11483
Feature activation+2.67e+00

Pos logit contributions
From+0.049
 Every+0.047
 refreshing+0.046
Years+0.046
 often+0.045
Neg logit contributions
 classmate-0.068
 ha-0.068
 principal-0.068
 client-0.067
 babys-0.067
Token.
Token ID13
Feature activation+1.54e+00

Pos logit contributions
 client+0.075
 principal+0.073
 classmate+0.073
 grade+0.073
 ha+0.072
Neg logit contributions
 refreshing-0.060
From-0.057
 Every-0.054
Years-0.053
asting-0.052
Token Taking
Token ID20879
Feature activation-1.28e+01

Pos logit contributions
 Lee+0.052
 client+0.051
 principal+0.051
 classmate+0.050
 babys+0.050
Neg logit contributions
asting-0.026
 refreshing-0.026
Years-0.024
From-0.021
 tighter-0.021
Token things
Token ID1243
Feature activation+4.65e-02

Pos logit contributions
From+0.315
 refreshing+0.284
utt+0.273
Years+0.269
 Later+0.265
Neg logit contributions
 grade-0.346
 client-0.345
 ha-0.343
 classmate-0.328
 happened-0.327
Token that
Token ID326
Feature activation-1.67e+00

Pos logit contributions
 happened+0.001
 grade+0.001
 ha+0.001
 client+0.001
 classmate+0.001
Neg logit contributions
From-0.001
 refreshing-0.001
 Every-0.001
asting-0.001
Years-0.001
Token are
Token ID389
Feature activation+2.62e+00

Pos logit contributions
From+0.042
 refreshing+0.039
Years+0.039
 Every+0.038
asting+0.038
Neg logit contributions
 client-0.043
 grade-0.043
 principal-0.042
 ha-0.042
 classmate-0.042
Token not
Token ID407
Feature activation-4.14e-01

Pos logit contributions
 happened+0.060
 client+0.059
 ha+0.056
 classmate+0.056
 principal+0.056
Neg logit contributions
From-0.069
 Every-0.065
Hop-0.063
 From-0.062
utt-0.062
Token yours
Token ID12431
Feature activation+1.00e+00

Pos logit contributions
From+0.010
 refreshing+0.008
 Every+0.008
Years+0.008
 From+0.008
Neg logit contributions
 happened-0.012
 client-0.012
 grade-0.011
 liar-0.011
 classmate-0.011
Token happier
Token ID23030
Feature activation-8.42e-01

Pos logit contributions
ales+0.078
Years+0.078
utt+0.078
From+0.076
 Every+0.069
Neg logit contributions
 happened-0.093
 babys-0.089
 classmate-0.089
 ha-0.088
 grade-0.086
Token.
Token ID13
Feature activation+1.59e+00

Pos logit contributions
From+0.018
 Every+0.017
 refreshing+0.017
Years+0.017
asting+0.016
Neg logit contributions
 grade-0.023
 classmate-0.023
 ha-0.023
 client-0.023
 principal-0.022
Token Everyone
Token ID11075
Feature activation-5.35e+00

Pos logit contributions
 Lee+0.045
 babys+0.044
 grade+0.043
 classmate+0.043
 principal+0.042
Neg logit contributions
asting-0.036
 refreshing-0.033
Years-0.033
lated-0.030
From-0.030
Token agreed
Token ID4987
Feature activation+6.74e+00

Pos logit contributions
From+0.124
 refreshing+0.123
 Every+0.123
Years+0.123
asting+0.117
Neg logit contributions
 classmate-0.136
 ha-0.135
 happened-0.133
 grade-0.129
 client-0.125
Token.
Token ID13
Feature activation+1.69e+00

Pos logit contributions
 grade+0.183
 happened+0.180
 haircut+0.178
 ha+0.178
 classmate+0.176
Neg logit contributions
ales-0.135
From-0.134
 refreshing-0.134
aling-0.132
 Every-0.132
Token Taking
Token ID20879
Feature activation-1.27e+01

Pos logit contributions
 Lee+0.044
ird+0.040
 classmate+0.040
 babys+0.039
 grade+0.039
Neg logit contributions
asting-0.042
Years-0.041
 refreshing-0.040
ales-0.038
lated-0.037
Token the
Token ID262
Feature activation-1.43e+00

Pos logit contributions
From+0.301
Years+0.272
utt+0.267
 refreshing+0.258
 Later+0.252
Neg logit contributions
 ha-0.339
 happened-0.338
 grade-0.336
 classmate-0.334
 client-0.329
Token medicine
Token ID9007
Feature activation-2.08e+00

Pos logit contributions
From+0.031
 refreshing+0.030
 Every+0.030
asting+0.028
Years+0.028
Neg logit contributions
 classmate-0.040
 client-0.040
 grade-0.039
 principal-0.039
 ha-0.039
Token felt
Token ID2936
Feature activation-8.38e-01

Pos logit contributions
From+0.044
 refreshing+0.041
 Every+0.041
asting+0.039
Years+0.039
Neg logit contributions
 ha-0.060
 classmate-0.060
 grade-0.059
 principal-0.058
 client-0.058
Token nice
Token ID3621
Feature activation-4.51e-01

Pos logit contributions
ales+0.016
Years+0.016
From+0.016
utt+0.015
 Every+0.015
Neg logit contributions
 happened-0.024
 classmate-0.023
 grade-0.023
 ha-0.023
 babys-0.022
Token and
Token ID290
Feature activation-8.28e+00

Pos logit contributions
From+0.009
Years+0.008
 Every+0.008
 refreshing+0.008
utt+0.007
Neg logit contributions
 grade-0.014
 happened-0.014
 classmate-0.014
 ha-0.013
 Lee-0.013
Token be
Token ID307
Feature activation-3.36e-01

Pos logit contributions
 classmate+0.135
 client+0.135
 grade+0.133
 principal+0.133
 ha+0.133
Neg logit contributions
From-0.096
 refreshing-0.093
 Every-0.091
asting-0.087
Years-0.087
Token healthy
Token ID5448
Feature activation-2.52e+00

Pos logit contributions
From+0.005
 refreshing+0.005
 often+0.005
 Every+0.005
 tight+0.005
Neg logit contributions
 client-0.012
 grade-0.012
 principal-0.011
 classmate-0.011
 ha-0.011
Token and
Token ID290
Feature activation-8.47e+00

Pos logit contributions
 Every+0.024
From+0.024
 often+0.023
 refreshing+0.022
 all+0.022
Neg logit contributions
 classmate-0.099
 client-0.098
 principal-0.097
 grade-0.097
 babys-0.096
Token happy
Token ID3772
Feature activation-1.53e-01

Pos logit contributions
From+0.162
 Every+0.156
 refreshing+0.154
asting+0.147
Years+0.144
Neg logit contributions
 client-0.259
 ha-0.258
 grade-0.257
 classmate-0.257
 principal-0.254
Token.
Token ID13
Feature activation+1.54e+00

Pos logit contributions
 often+0.002
 Every+0.002
 all+0.002
From+0.002
 every+0.002
Neg logit contributions
 classmate-0.005
 client-0.005
 principal-0.005
 babys-0.005
 grade-0.005
Token Taking
Token ID20879
Feature activation-1.27e+01

Pos logit contributions
 client+0.041
 ha+0.040
 principal+0.039
 Lee+0.039
 grade+0.039
Neg logit contributions
 refreshing-0.038
asting-0.037
Years-0.035
From-0.034
 often-0.033
Token your
Token ID534
Feature activation+4.93e-01

Pos logit contributions
From+0.260
 refreshing+0.247
 Every+0.238
Years+0.233
asting+0.231
Neg logit contributions
 client-0.373
 classmate-0.373
 ha-0.372
 grade-0.371
 principal-0.368
Token pill
Token ID9582
Feature activation-2.74e+00

Pos logit contributions
 classmate+0.014
 client+0.014
 principal+0.014
 ha+0.013
 grade+0.013
Neg logit contributions
 Every-0.010
 refreshing-0.010
From-0.010
 tight-0.010
 often-0.009
Token is
Token ID318
Feature activation+3.79e+00

Pos logit contributions
From+0.054
 refreshing+0.052
 Every+0.049
asting+0.048
Years+0.047
Neg logit contributions
 grade-0.082
 client-0.082
 classmate-0.082
 principal-0.081
 ha-0.081
Token good
Token ID922
Feature activation+1.03e+00

Pos logit contributions
 client+0.116
 ha+0.116
 classmate+0.115
 grade+0.115
 principal+0.114
Neg logit contributions
From-0.071
 refreshing-0.068
 Every-0.067
asting-0.062
Years-0.061
Token for
Token ID329
Feature activation-3.71e-01

Pos logit contributions
 classmate+0.042
 client+0.041
 principal+0.040
 babys+0.040
 ha+0.039
Neg logit contributions
 all-0.009
 Every-0.009
 every-0.009
 around-0.008
 sometimes-0.008
Tokenoral
Token ID6864
Feature activation+1.80e+00

Pos logit contributions
 client+0.048
 grade+0.048
 classmate+0.048
 principal+0.047
 ha+0.047
Neg logit contributions
 refreshing-0.052
From-0.051
 Every-0.049
asting-0.048
 often-0.047
Token of
Token ID286
Feature activation-3.53e+00

Pos logit contributions
 client+0.056
 principal+0.056
 grade+0.056
 ha+0.055
 classmate+0.054
Neg logit contributions
 refreshing-0.035
From-0.030
Years-0.029
 Nem-0.028
asting-0.028
Token the
Token ID262
Feature activation-1.54e+00

Pos logit contributions
 refreshing+0.066
lessly+0.063
utt+0.061
Years+0.061
From+0.059
Neg logit contributions
 principal-0.102
 ha-0.102
 babys-0.101
 grade-0.100
 classmate-0.100
Token story
Token ID1621
Feature activation+1.25e+00

Pos logit contributions
 refreshing+0.062
From+0.060
ores+0.060
Years+0.059
ales+0.059
Neg logit contributions
 babys-0.013
 ha-0.013
 client-0.013
 principal-0.012
 classmate-0.012
Token:
Token ID25
Feature activation+5.17e+00

Pos logit contributions
 client+0.031
 classmate+0.031
 principal+0.031
 grade+0.030
 ha+0.030
Neg logit contributions
 refreshing-0.031
From-0.030
asting-0.029
Years-0.028
 Every-0.028
Token Taking
Token ID20879
Feature activation-1.27e+01

Pos logit contributions
 writing+0.109
 sister+0.109
 grade+0.107
 ha+0.106
 client+0.104
Neg logit contributions
 refreshing-0.156
lated-0.151
Years-0.135
ales-0.128
asting-0.125
Token regular
Token ID3218
Feature activation-2.20e+00

Pos logit contributions
From+0.323
 refreshing+0.291
Years+0.288
utt+0.285
 Later+0.273
Neg logit contributions
 ha-0.318
 grade-0.317
 client-0.310
 happened-0.302
 classmate-0.300
Token n
Token ID299
Feature activation-2.48e+00

Pos logit contributions
From+0.056
 refreshing+0.053
 Every+0.051
Years+0.050
asting+0.049
Neg logit contributions
 grade-0.058
 ha-0.055
 client-0.053
 happened-0.053
 classmate-0.052
Tokenaps
Token ID1686
Feature activation-8.10e+00

Pos logit contributions
From+0.086
 refreshing+0.086
Years+0.081
 Every+0.080
 tight+0.080
Neg logit contributions
 classmate-0.037
 grade-0.036
 client-0.035
 ha-0.034
 principal-0.034
Token can
Token ID460
Feature activation+3.99e+00

Pos logit contributions
From+0.166
 refreshing+0.160
Years+0.156
 Every+0.151
asting+0.149
Neg logit contributions
 classmate-0.231
 grade-0.229
 ha-0.226
 client-0.222
 happened-0.221
Token help
Token ID1037
Feature activation+2.76e+00

Pos logit contributions
 ha+0.094
 grade+0.093
 client+0.089
 babys+0.089
 classmate+0.086
Neg logit contributions
From-0.103
 refreshing-0.102
Years-0.096
 Every-0.092
 From-0.092
Token's
Token ID338
Feature activation+3.89e+00

Pos logit contributions
 classmate+0.019
 client+0.019
 principal+0.019
 babys+0.019
 grade+0.019
Neg logit contributions
 Every-0.008
From-0.008
 often-0.008
 refreshing-0.008
 all-0.007
Token best
Token ID1266
Feature activation+3.61e+00

Pos logit contributions
 classmate+0.127
 client+0.126
 principal+0.125
 babys+0.123
 ha+0.123
Neg logit contributions
 all-0.062
 refreshing-0.060
 Every-0.060
 tight-0.059
 around-0.058
Token for
Token ID329
Feature activation-3.51e-01

Pos logit contributions
 principal+0.148
 classmate+0.146
 client+0.145
 babys+0.144
fer+0.144
Neg logit contributions
 for-0.034
 all-0.030
 Every-0.029
 every-0.029
 over-0.028
Token you
Token ID345
Feature activation+1.62e+00

Pos logit contributions
From+0.007
 refreshing+0.007
 Every+0.007
Years+0.007
asting+0.006
Neg logit contributions
 ha-0.010
 client-0.010
 classmate-0.010
 grade-0.010
 principal-0.010
Token.
Token ID13
Feature activation+1.51e+00

Pos logit contributions
 classmate+0.050
 client+0.050
 principal+0.050
 ha+0.049
 grade+0.049
Neg logit contributions
From-0.030
 Every-0.029
 refreshing-0.029
 often-0.028
asting-0.027
Token Taking
Token ID20879
Feature activation-1.27e+01

Pos logit contributions
 ha+0.038
 client+0.038
 Lee+0.037
 grade+0.037
 principal+0.037
Neg logit contributions
 refreshing-0.038
asting-0.038
Years-0.036
From-0.035
 often-0.034
Token a
Token ID257
Feature activation+5.00e+00

Pos logit contributions
From+0.260
 refreshing+0.247
 Every+0.239
Years+0.234
asting+0.232
Neg logit contributions
 classmate-0.371
 ha-0.370
 client-0.370
 grade-0.369
 principal-0.365
Token bath
Token ID7837
Feature activation-2.03e+00

Pos logit contributions
 client+0.151
 principal+0.150
 classmate+0.149
 Lee+0.142
 ha+0.141
Neg logit contributions
 tight-0.098
 refreshing-0.097
From-0.093
 Every-0.091
 tightly-0.089
Token is
Token ID318
Feature activation+3.75e+00

Pos logit contributions
From+0.038
 refreshing+0.037
 Every+0.037
asting+0.035
Years+0.034
Neg logit contributions
 client-0.062
 classmate-0.062
 grade-0.061
 principal-0.061
 ha-0.060
Token good
Token ID922
Feature activation+1.01e+00

Pos logit contributions
 ha+0.114
 client+0.113
 classmate+0.112
 grade+0.112
 principal+0.111
Neg logit contributions
From-0.072
 refreshing-0.068
 Every-0.067
asting-0.063
Years-0.062
Token for
Token ID329
Feature activation-3.77e-01

Pos logit contributions
 client+0.036
 classmate+0.036
 principal+0.035
 babys+0.034
ird+0.034
Neg logit contributions
 all-0.013
 Every-0.013
 around-0.012
 thanks-0.012
 every-0.012
Token always
Token ID1464
Feature activation-6.54e+00

Pos logit contributions
 ha+0.073
 grade+0.071
 babys+0.069
 client+0.066
 happened+0.066
Neg logit contributions
From-0.078
 refreshing-0.078
Years-0.076
 tighter-0.073
 Every-0.073
Token have
Token ID423
Feature activation-4.38e+00

Pos logit contributions
From+0.156
 refreshing+0.147
 Every+0.147
Years+0.141
lessly+0.138
Neg logit contributions
 ha-0.171
 grade-0.169
 client-0.165
 classmate-0.164
 principal-0.160
Token its
Token ID663
Feature activation-1.70e+00

Pos logit contributions
From+0.099
 Every+0.090
Years+0.090
 refreshing+0.088
asting+0.086
Neg logit contributions
 ha-0.119
 happened-0.118
 classmate-0.117
 grade-0.116
 client-0.115
Token rewards
Token ID11530
Feature activation-1.55e+00

Pos logit contributions
From+0.039
 refreshing+0.035
 Every+0.034
Years+0.034
asting+0.034
Neg logit contributions
 ha-0.046
 client-0.046
 classmate-0.046
 grade-0.046
 babys-0.044
Token.
Token ID13
Feature activation+1.65e+00

Pos logit contributions
From+0.035
 refreshing+0.034
 Every+0.033
Years+0.032
 From+0.030
Neg logit contributions
 grade-0.042
 ha-0.042
 client-0.041
 classmate-0.041
 principal-0.041
Token Taking
Token ID20879
Feature activation-1.27e+01

Pos logit contributions
 ha+0.037
 Lee+0.034
 client+0.034
ird+0.034
 principal+0.032
Neg logit contributions
 refreshing-0.049
Years-0.047
asting-0.047
lated-0.046
 tighter-0.043
Token a
Token ID257
Feature activation+5.07e+00

Pos logit contributions
From+0.311
Years+0.285
 refreshing+0.275
utt+0.273
 Later+0.267
Neg logit contributions
 ha-0.334
 happened-0.319
 grade-0.319
 client-0.306
 classmate-0.302
Token break
Token ID2270
Feature activation-3.85e+00

Pos logit contributions
 classmate+0.139
 grade+0.139
 client+0.139
 ha+0.139
 principal+0.136
Neg logit contributions
From-0.113
 refreshing-0.108
 Every-0.107
asting-0.102
Years-0.101
Token is
Token ID318
Feature activation+3.75e+00

Pos logit contributions
From+0.091
 refreshing+0.086
Years+0.085
 Every+0.081
asting+0.077
Neg logit contributions
 ha-0.100
 grade-0.099
 classmate-0.097
 client-0.097
 principal-0.095
Token important
Token ID1593
Feature activation-1.21e+00

Pos logit contributions
 ha+0.110
 client+0.108
 classmate+0.107
 grade+0.107
 principal+0.105
Neg logit contributions
From-0.077
 refreshing-0.071
 Every-0.070
Years-0.068
asting-0.065
Token,
Token ID11
Feature activation+3.85e+00

Pos logit contributions
From+0.026
 refreshing+0.026
 Every+0.025
 often+0.024
Years+0.024
Neg logit contributions
 classmate-0.034
 client-0.033
 principal-0.033
 ha-0.033
 babys-0.032
Token share
Token ID2648
Feature activation-3.07e+00

Pos logit contributions
 client+0.133
 classmate+0.133
 ha+0.132
 grade+0.132
 principal+0.132
Neg logit contributions
From-0.087
 refreshing-0.084
 Every-0.082
asting-0.078
Years-0.078
Token with
Token ID351
Feature activation-4.61e+00

Pos logit contributions
From+0.056
 refreshing+0.055
 Every+0.051
Years+0.051
asting+0.050
Neg logit contributions
 client-0.097
 ha-0.097
 classmate-0.096
 principal-0.096
 grade-0.095
Token your
Token ID534
Feature activation+3.71e-01

Pos logit contributions
 refreshing+0.099
From+0.098
asting+0.092
Years+0.090
 Every+0.089
Neg logit contributions
 client-0.136
 ha-0.134
 principal-0.133
 classmate-0.131
 Lee-0.126
Token sister
Token ID6621
Feature activation+8.28e+00

Pos logit contributions
 grade+0.009
 client+0.009
 principal+0.009
 classmate+0.009
 ha+0.009
Neg logit contributions
 refreshing-0.009
 tight-0.008
 Every-0.008
From-0.008
 all-0.008
Token.
Token ID13
Feature activation+1.63e+00

Pos logit contributions
 grade+0.238
 classmate+0.237
 client+0.237
 principal+0.234
 ha+0.229
Neg logit contributions
 Every-0.167
From-0.166
 refreshing-0.161
 often-0.157
 tight-0.156
Token Taking
Token ID20879
Feature activation-1.27e+01

Pos logit contributions
 client+0.047
 principal+0.046
 ha+0.045
 classmate+0.045
 grade+0.045
Neg logit contributions
 refreshing-0.036
From-0.034
asting-0.034
Years-0.033
 often-0.031
Token her
Token ID607
Feature activation+4.82e+00

Pos logit contributions
From+0.269
 refreshing+0.252
Years+0.240
 Every+0.238
asting+0.237
Neg logit contributions
 ha-0.371
 client-0.369
 classmate-0.368
 grade-0.367
 principal-0.363
Token dolls
Token ID36062
Feature activation-1.19e+00

Pos logit contributions
 classmate+0.125
 principal+0.125
 client+0.123
 grade+0.120
 babys+0.119
Neg logit contributions
 Every-0.109
 tight-0.106
 refreshing-0.106
 all-0.103
From-0.102
Token was
Token ID373
Feature activation+4.19e+00

Pos logit contributions
From+0.023
 refreshing+0.022
 Every+0.021
Years+0.020
asting+0.020
Neg logit contributions
 ha-0.037
 client-0.036
 grade-0.036
 classmate-0.036
 principal-0.036
Token very
Token ID845
Feature activation+5.82e+00

Pos logit contributions
 client+0.126
 classmate+0.126
 ha+0.125
 grade+0.124
 principal+0.124
Neg logit contributions
From-0.083
 refreshing-0.079
 Every-0.077
Years-0.072
asting-0.072
Token bad
Token ID2089
Feature activation+2.89e+00

Pos logit contributions
 ha+0.171
 client+0.171
 classmate+0.170
 grade+0.168
 principal+0.168
Neg logit contributions
From-0.119
 Every-0.114
 refreshing-0.113
Years-0.107
asting-0.106
Token that
Token ID326
Feature activation-1.82e+00

Pos logit contributions
 ha+0.119
 grade+0.108
 haircut+0.107
 principal+0.105
 desk+0.104
Neg logit contributions
From-0.103
 Every-0.100
 refreshing-0.099
Years-0.098
rant-0.095
Token's
Token ID338
Feature activation+3.84e+00

Pos logit contributions
From+0.046
Years+0.045
 refreshing+0.045
 Every+0.043
asting+0.042
Neg logit contributions
 ha-0.046
 client-0.044
 classmate-0.044
 principal-0.043
 grade-0.042
Token not
Token ID407
Feature activation-3.76e-01

Pos logit contributions
 client+0.112
 classmate+0.112
 ha+0.111
 principal+0.110
 grade+0.110
Neg logit contributions
From-0.080
 refreshing-0.076
 Every-0.075
Years-0.071
asting-0.071
Token nice
Token ID3621
Feature activation-4.23e-01

Pos logit contributions
From+0.008
 refreshing+0.008
 Every+0.008
Years+0.007
asting+0.007
Neg logit contributions
 client-0.011
 ha-0.011
 grade-0.011
 classmate-0.011
 principal-0.010
Token.
Token ID13
Feature activation+1.88e+00

Pos logit contributions
 Every+0.007
 refreshing+0.007
 all+0.007
From+0.007
 often+0.007
Neg logit contributions
 classmate-0.013
 principal-0.013
 client-0.013
 grade-0.013
 ha-0.013
Token Taking
Token ID20879
Feature activation-1.27e+01

Pos logit contributions
 client+0.050
 Lee+0.049
 ha+0.049
 principal+0.048
 classmate+0.047
Neg logit contributions
Years-0.046
 refreshing-0.046
asting-0.045
From-0.044
 often-0.041
Token things
Token ID1243
Feature activation+2.06e-01

Pos logit contributions
From+0.304
Years+0.273
 refreshing+0.267
asting+0.257
 Later+0.249
Neg logit contributions
 ha-0.363
 client-0.348
 grade-0.342
 classmate-0.337
 principal-0.333
Token that
Token ID326
Feature activation-1.68e+00

Pos logit contributions
 ha+0.006
 grade+0.006
 client+0.006
 babys+0.006
 principal+0.006
Neg logit contributions
From-0.004
 refreshing-0.004
 Every-0.004
Years-0.004
asting-0.004
Token aren
Token ID3588
Feature activation+5.18e-01

Pos logit contributions
From+0.041
 refreshing+0.038
Years+0.038
 Every+0.037
asting+0.036
Neg logit contributions
 ha-0.044
 client-0.044
 classmate-0.043
 grade-0.043
 principal-0.043
Token't
Token ID470
Feature activation-1.50e-01

Pos logit contributions
 ha+0.005
 happened+0.005
 client+0.004
 babys+0.004
 grade+0.004
Neg logit contributions
 refreshing-0.021
Years-0.019
From-0.019
 shallow-0.019
 tighter-0.019
Token yours
Token ID12431
Feature activation+1.16e+00

Pos logit contributions
From+0.003
 Every+0.003
 refreshing+0.003
 From+0.003
Years+0.003
Neg logit contributions
 happened-0.004
 client-0.004
 grade-0.004
 ha-0.004
 classmate-0.004
Token learned
Token ID4499
Feature activation+5.29e-01

Pos logit contributions
From+0.002
 refreshing+0.002
 Every+0.001
Years+0.001
asting+0.001
Neg logit contributions
 ha-0.002
 classmate-0.002
 client-0.002
 principal-0.002
 grade-0.002
Token an
Token ID281
Feature activation+2.97e+00

Pos logit contributions
 client+0.016
 classmate+0.016
 ha+0.015
 principal+0.015
 grade+0.015
Neg logit contributions
From-0.010
 refreshing-0.010
 Every-0.010
asting-0.010
Years-0.009
Token important
Token ID1593
Feature activation-1.28e+00

Pos logit contributions
 client+0.084
 classmate+0.078
 grade+0.078
 principal+0.075
 ha+0.073
Neg logit contributions
 refreshing-0.068
From-0.063
 lakes-0.061
 Fin-0.059
 Nem-0.059
Token lesson
Token ID11483
Feature activation+2.87e+00

Pos logit contributions
From+0.022
Years+0.022
 Every+0.021
 tight+0.021
 often+0.020
Neg logit contributions
 classmate-0.041
 principal-0.041
 ha-0.040
 babys-0.040
 client-0.040
Token.
Token ID13
Feature activation+1.78e+00

Pos logit contributions
 client+0.085
 classmate+0.084
 grade+0.083
 principal+0.083
 ha+0.083
Neg logit contributions
 refreshing-0.058
From-0.057
 Every-0.054
Years-0.052
asting-0.052
Token Taking
Token ID20879
Feature activation-1.27e+01

Pos logit contributions
 client+0.053
 principal+0.052
 classmate+0.052
 Lee+0.051
 grade+0.051
Neg logit contributions
 refreshing-0.038
asting-0.036
Years-0.033
From-0.032
 crunch-0.032
Token things
Token ID1243
Feature activation+3.42e-01

Pos logit contributions
From+0.276
 refreshing+0.258
utt+0.243
Years+0.242
asting+0.238
Neg logit contributions
 ha-0.363
 client-0.363
 grade-0.362
 classmate-0.360
 principal-0.350
Token without
Token ID1231
Feature activation-1.32e+00

Pos logit contributions
 grade+0.009
 ha+0.009
 happened+0.009
 client+0.008
 classmate+0.008
Neg logit contributions
From-0.008
 refreshing-0.008
 Every-0.008
asting-0.008
Years-0.008
Token asking
Token ID4737
Feature activation+1.53e+00

Pos logit contributions
From+0.035
 refreshing+0.034
 Every+0.033
Years+0.033
asting+0.031
Neg logit contributions
 ha-0.031
 client-0.031
 grade-0.030
 classmate-0.029
 principal-0.029
Token was
Token ID373
Feature activation+4.33e+00

Pos logit contributions
 client+0.046
 classmate+0.045
 principal+0.045
 ha+0.045
 grade+0.044
Neg logit contributions
 refreshing-0.031
From-0.030
 Every-0.027
utt-0.027
Years-0.027
Token wrong
Token ID2642
Feature activation+5.18e+00

Pos logit contributions
 ha+0.127
 client+0.127
 classmate+0.124
 principal+0.124
 grade+0.123
Neg logit contributions
From-0.090
 refreshing-0.085
 Every-0.082
asting-0.078
Years-0.077
Token to
Token ID284
Feature activation+4.83e+00

Pos logit contributions
 classmate+0.115
 ha+0.110
 client+0.110
 grade+0.109
 principal+0.107
Neg logit contributions
From-0.100
 refreshing-0.097
Years-0.093
asting-0.091
 Every-0.090
Token a
Token ID257
Feature activation+5.33e+00

Pos logit contributions
 grade+0.136
 client+0.135
 principal+0.135
 ha+0.135
 classmate+0.134
Neg logit contributions
From-0.103
 refreshing-0.098
 Every-0.095
Years-0.094
asting-0.093
Token little
Token ID1310
Feature activation+6.05e-01

Pos logit contributions
 client+0.172
 classmate+0.171
 grade+0.170
 principal+0.169
 ha+0.168
Neg logit contributions
From-0.093
 refreshing-0.091
 Every-0.086
asting-0.082
Years-0.082
Token island
Token ID7022
Feature activation-3.09e+00

Pos logit contributions
 ha+0.017
 classmate+0.017
 client+0.017
 grade+0.017
 principal+0.017
Neg logit contributions
From-0.013
 Every-0.012
 refreshing-0.012
Years-0.012
asting-0.012
Token with
Token ID351
Feature activation-4.31e+00

Pos logit contributions
From+0.085
Years+0.080
 Every+0.079
 refreshing+0.077
 From+0.073
Neg logit contributions
 classmate-0.069
 ha-0.068
 principal-0.067
 happened-0.067
 babys-0.066
Token a
Token ID257
Feature activation+5.36e+00

Pos logit contributions
From+0.103
Years+0.099
orting+0.096
asting+0.091
ches+0.090
Neg logit contributions
 happened-0.110
 classmate-0.109
 ha-0.108
 babys-0.106
 principal-0.103
Token tree
Token ID5509
Feature activation+4.97e-01

Pos logit contributions
 client+0.182
 principal+0.181
 grade+0.179
 classmate+0.175
 Lee+0.172
Neg logit contributions
 refreshing-0.096
 tight-0.084
From-0.081
 crunch-0.078
 fresh-0.077
Token.
Token ID13
Feature activation+1.68e+00

Pos logit contributions
 ha+0.011
 classmate+0.010
 babys+0.010
 client+0.010
 desk+0.010
Neg logit contributions
From-0.014
Years-0.014
 refreshing-0.013
 Every-0.013
 From-0.012
Token When
Token ID1649
Feature activation-8.83e-01

Pos logit contributions
 Lee+0.049
 principal+0.047
 classmate+0.047
 grade+0.046
 client+0.046
Neg logit contributions
asting-0.036
Years-0.035
From-0.033
 often-0.033
 refreshing-0.032
Token the
Token ID262
Feature activation-1.19e+00

Pos logit contributions
From+0.017
Years+0.017
 Every+0.017
asting+0.017
 refreshing+0.016
Neg logit contributions
 ha-0.026
 principal-0.025
 classmate-0.025
 client-0.025
 grade-0.025
Token potato
Token ID21219
Feature activation-2.35e-01

Pos logit contributions
From+0.027
 Every+0.025
Years+0.025
 often+0.024
 From+0.024
Neg logit contributions
 ha-0.033
 classmate-0.033
 babys-0.032
 client-0.032
 principal-0.032
Token Then
Token ID3244
Feature activation+1.82e-01

Pos logit contributions
 Lee+0.053
 principal+0.052
 classmate+0.052
 grade+0.051
 babys+0.051
Neg logit contributions
 refreshing-0.041
asting-0.039
From-0.037
 tighter-0.037
Years-0.036
Token suddenly
Token ID6451
Feature activation+2.81e+00

Pos logit contributions
 classmate+0.005
 client+0.005
 principal+0.005
 grade+0.005
 babys+0.005
Neg logit contributions
From-0.004
 Every-0.004
 refreshing-0.004
 often-0.004
 all-0.004
Token,
Token ID11
Feature activation+4.03e+00

Pos logit contributions
 classmate+0.083
 client+0.083
 principal+0.082
 grade+0.081
 babys+0.080
Neg logit contributions
From-0.056
 Every-0.054
 all-0.054
 refreshing-0.053
 often-0.053
Token the
Token ID262
Feature activation-1.17e+00

Pos logit contributions
 client+0.137
 classmate+0.137
 ha+0.137
 grade+0.136
 principal+0.136
Neg logit contributions
From-0.063
 refreshing-0.060
 Every-0.058
Years-0.055
asting-0.055
Token line
Token ID1627
Feature activation-2.71e+00

Pos logit contributions
From+0.028
 Every+0.027
 From+0.025
Years+0.025
 often+0.024
Neg logit contributions
 classmate-0.031
 ha-0.031
 client-0.031
 principal-0.031
 grade-0.031
Token started
Token ID2067
Feature activation+5.38e+00

Pos logit contributions
From+0.064
 refreshing+0.062
 Every+0.061
Years+0.059
asting+0.057
Neg logit contributions
 classmate-0.071
 ha-0.070
 principal-0.069
 client-0.069
 grade-0.069
Token to
Token ID284
Feature activation+4.74e+00

Pos logit contributions
 grade+0.129
 classmate+0.129
 babys+0.127
 principal+0.127
 client+0.126
Neg logit contributions
From-0.140
 refreshing-0.130
Years-0.128
 Every-0.127
asting-0.121
Token tug
Token ID27762
Feature activation-7.11e+00

Pos logit contributions
 ha+0.114
 grade+0.113
 principal+0.110
 babys+0.109
 client+0.107
Neg logit contributions
From-0.123
Years-0.115
 Every-0.113
 From-0.113
lessly-0.112
Token!
Token ID0
Feature activation+1.49e+00

Pos logit contributions
From+0.117
 refreshing+0.111
 Every+0.107
asting+0.107
 tight+0.103
Neg logit contributions
 client-0.239
 classmate-0.238
 principal-0.237
 babys-0.233
 grade-0.233
Token He
Token ID679
Feature activation-7.01e+00

Pos logit contributions
 classmate+0.048
 Lee+0.048
 principal+0.048
 babys+0.048
 ha+0.048
Neg logit contributions
 refreshing-0.025
asting-0.024
Years-0.024
From-0.024
 often-0.023
Token yelled
Token ID22760
Feature activation+3.80e+00

Pos logit contributions
From+0.177
 Every+0.177
 refreshing+0.171
Years+0.168
orting+0.167
Neg logit contributions
 ha-0.169
 classmate-0.163
 happened-0.160
 client-0.158
 principal-0.156
Token loud
Token ID7812
Feature activation-2.65e+00

Pos logit contributions
 classmate+0.141
 client+0.141
 grade+0.140
 principal+0.139
 ha+0.136
Neg logit contributions
From-0.116
 refreshing-0.110
 Every-0.106
Years-0.105
 tight-0.104
Token noise
Token ID7838
Feature activation+6.69e-01

Pos logit contributions
From+0.078
 refreshing+0.077
 Every+0.075
Years+0.073
asting+0.073
Neg logit contributions
 ha-0.053
 grade-0.053
 client-0.053
 classmate-0.052
 principal-0.052
Token.
Token ID13
Feature activation+1.33e+00

Pos logit contributions
 client+0.020
 classmate+0.020
 principal+0.020
 ha+0.020
 grade+0.020
Neg logit contributions
From-0.013
 refreshing-0.013
 Every-0.012
asting-0.012
Years-0.012
Token It
Token ID632
Feature activation-6.22e+00

Pos logit contributions
 classmate+0.048
 client+0.048
 grade+0.048
 principal+0.048
 ha+0.047
Neg logit contributions
 refreshing-0.017
From-0.017
asting-0.016
Years-0.016
 often-0.015
Token was
Token ID373
Feature activation+4.21e+00

Pos logit contributions
From+0.128
 refreshing+0.124
 Every+0.121
Years+0.114
 often+0.113
Neg logit contributions
 client-0.187
 grade-0.184
 principal-0.183
 classmate-0.183
 ha-0.180
Token a
Token ID257
Feature activation+5.10e+00

Pos logit contributions
 classmate+0.134
 client+0.134
 grade+0.133
 principal+0.132
 ha+0.132
Neg logit contributions
From-0.075
 refreshing-0.073
 Every-0.071
asting-0.067
Years-0.067
Token grow
Token ID1663
Feature activation-3.50e+00

Pos logit contributions
 classmate+0.157
 grade+0.155
 client+0.153
 principal+0.153
ird+0.152
Neg logit contributions
 refreshing-0.095
 tight-0.093
From-0.086
 fresh-0.084
 crunch-0.083
Tokenl
Token ID75
Feature activation-3.62e+00

Pos logit contributions
From+0.071
 Every+0.067
 refreshing+0.067
Years+0.064
 From+0.061
Neg logit contributions
 classmate-0.103
 ha-0.101
 client-0.101
 principal-0.101
 grade-0.100
Token,
Token ID11
Feature activation+3.70e+00

Pos logit contributions
From+0.065
 refreshing+0.063
 Every+0.061
asting+0.058
Years+0.057
Neg logit contributions
 classmate-0.115
 client-0.114
 principal-0.113
 grade-0.113
 ha-0.113
Token like
Token ID588
Feature activation-4.41e+00

Pos logit contributions
 Lee+0.108
 grade+0.108
 babys+0.108
 principal+0.108
 classmate+0.107
Neg logit contributions
Years-0.069
From-0.069
 Every-0.067
 refreshing-0.065
asting-0.064
Token an
Token ID281
Feature activation+3.06e+00

Pos logit contributions
From+0.093
 refreshing+0.087
 Every+0.086
Years+0.084
asting+0.084
Neg logit contributions
 client-0.127
 ha-0.126
 principal-0.126
 classmate-0.126
 grade-0.124
Token that
Token ID326
Feature activation-1.66e+00

Pos logit contributions
 client+0.060
 classmate+0.060
 ha+0.060
 principal+0.060
 grade+0.059
Neg logit contributions
From-0.022
 refreshing-0.021
 Every-0.020
asting-0.020
Years-0.019
Token Ben
Token ID3932
Feature activation+2.00e+00

Pos logit contributions
From+0.037
 refreshing+0.036
Years+0.035
asting+0.034
 Every+0.034
Neg logit contributions
 client-0.046
 principal-0.046
 ha-0.046
 classmate-0.046
 grade-0.045
Token was
Token ID373
Feature activation+4.09e+00

Pos logit contributions
 classmate+0.051
 client+0.051
 happened+0.050
 ha+0.049
 principal+0.049
Neg logit contributions
From-0.049
 refreshing-0.048
Years-0.046
 Every-0.045
asting-0.045
Token a
Token ID257
Feature activation+4.83e+00

Pos logit contributions
 classmate+0.114
 client+0.114
 ha+0.113
 principal+0.113
 grade+0.109
Neg logit contributions
From-0.091
 refreshing-0.087
 Every-0.086
Years-0.084
utt-0.079
Token true
Token ID2081
Feature activation+2.39e+00

Pos logit contributions
 grade+0.164
 principal+0.161
 Lee+0.159
 classmate+0.159
ird+0.157
Neg logit contributions
 tight-0.072
From-0.071
 refreshing-0.070
 often-0.068
Years-0.065
Token friend
Token ID1545
Feature activation+7.00e+00

Pos logit contributions
 principal+0.059
 grade+0.058
 classmate+0.057
 ha+0.056
 client+0.055
Neg logit contributions
 tight-0.055
 often-0.054
 Every-0.054
 thanks-0.053
 all-0.053
Token.
Token ID13
Feature activation+1.22e+00

Pos logit contributions
 client+0.204
 classmate+0.204
 ha+0.201
 principal+0.201
 grade+0.200
Neg logit contributions
From-0.144
 refreshing-0.140
 Every-0.135
Years-0.130
asting-0.130
Token He
Token ID679
Feature activation-7.40e+00

Pos logit contributions
 classmate+0.034
 client+0.034
 principal+0.034
 grade+0.033
 Lee+0.033
Neg logit contributions
 refreshing-0.027
asting-0.026
From-0.026
Years-0.025
 often-0.024
Token thought
Token ID1807
Feature activation+3.15e+00

Pos logit contributions
 refreshing+0.175
asting+0.169
Years+0.165
 Every+0.165
From+0.164
Neg logit contributions
 classmate-0.190
 client-0.188
 ha-0.185
 principal-0.185
 happened-0.182
Token of
Token ID286
Feature activation-3.72e+00

Pos logit contributions
 grade+0.097
 principal+0.095
 classmate+0.093
 happened+0.093
 ha+0.092
Neg logit contributions
From-0.063
 refreshing-0.060
Years-0.060
asting-0.056
 Later-0.055
Token her
Token ID607
Feature activation+4.63e+00

Pos logit contributions
 refreshing+0.092
From+0.091
asting+0.089
utt+0.089
Years+0.089
Neg logit contributions
 ha-0.091
 client-0.090
 principal-0.090
 classmate-0.086
 Lee-0.085
Token and
Token ID290
Feature activation-8.23e+00

Pos logit contributions
 ha+0.012
 classmate+0.011
 principal+0.010
 client+0.010
 grade+0.010
Neg logit contributions
 refreshing-0.013
asting-0.012
ores-0.012
From-0.012
orting-0.012
Token his
Token ID465
Feature activation+3.64e+00

Pos logit contributions
 Every+0.075
From+0.068
 all+0.068
 often+0.066
 tight+0.061
Neg logit contributions
 ha-0.325
 grade-0.325
ird-0.322
 principal-0.320
 liar-0.319
Token mom
Token ID1995
Feature activation+8.24e+00

Pos logit contributions
 grade+0.086
 ha+0.082
 client+0.082
 principal+0.082
 classmate+0.080
Neg logit contributions
 refreshing-0.093
 Every-0.091
From-0.091
 tight-0.090
 often-0.088
Token spent
Token ID3377
Feature activation-5.17e+00

Pos logit contributions
 ha+0.212
ird+0.206
 classmate+0.194
 babys+0.189
 principal+0.188
Neg logit contributions
 refreshing-0.200
From-0.198
 Every-0.186
Years-0.179
ores-0.178
Token the
Token ID262
Feature activation-1.35e+00

Pos logit contributions
From+0.081
 refreshing+0.076
 Every+0.074
utt+0.073
lessly+0.069
Neg logit contributions
 ha-0.175
 classmate-0.171
 principal-0.168
 client-0.168
 grade-0.167
Token afternoon
Token ID6672
Feature activation+5.78e+00

Pos logit contributions
 Every+0.031
 refreshing+0.029
From+0.029
ores+0.028
utt+0.028
Neg logit contributions
 ha-0.036
 grade-0.034
 classmate-0.033
 happened-0.033
 client-0.032
Token playing
Token ID2712
Feature activation-3.01e+00

Pos logit contributions
 ha+0.133
 classmate+0.130
 grade+0.129
 principal+0.128
 client+0.127
Neg logit contributions
From-0.160
 Every-0.154
 refreshing-0.152
Years-0.146
asting-0.142
Token and
Token ID290
Feature activation-8.12e+00

Pos logit contributions
From+0.055
 refreshing+0.053
 Every+0.052
asting+0.048
Years+0.046
Neg logit contributions
 ha-0.095
 classmate-0.094
 principal-0.093
 grade-0.093
 client-0.092
Token having
Token ID1719
Feature activation-5.63e+00

Pos logit contributions
From+0.221
 Every+0.215
 refreshing+0.212
Years+0.207
asting+0.200
Neg logit contributions
 ha-0.185
 principal-0.185
 classmate-0.184
 client-0.180
 grade-0.179
Token fun
Token ID1257
Feature activation+1.00e+00

Pos logit contributions
From+0.120
rant+0.112
orting+0.108
 refreshing+0.107
asting+0.107
Neg logit contributions
 ha-0.158
 classmate-0.150
 principal-0.150
 happened-0.145
 grade-0.144
Token.
Token ID13
Feature activation+1.76e+00

Pos logit contributions
 ha+0.033
 classmate+0.033
 grade+0.033
 client+0.033
 principal+0.032
Neg logit contributions
From-0.017
 refreshing-0.016
 Every-0.016
Years-0.015
asting-0.015
TokenLater
Token ID18602
Feature activation-2.83e+00

Pos logit contributions
 thanks+0.059
ion+0.058
utt+0.058
ores+0.057
 refreshing+0.056
Neg logit contributions
 ha-0.056
 classmate-0.055
 client-0.049
 babys-0.048
 pupil-0.047
Token that
Token ID326
Feature activation-1.33e+00

Pos logit contributions
From+0.030
 refreshing+0.028
 Every+0.027
asting+0.024
Years+0.023
Neg logit contributions
 client-0.111
 classmate-0.111
 principal-0.110
 grade-0.110
 ha-0.109
Token day
Token ID1110
Feature activation+6.78e+00

Pos logit contributions
 refreshing+0.028
From+0.028
Years+0.027
asting+0.026
lessly+0.025
Neg logit contributions
 classmate-0.036
 client-0.036
 babys-0.036
 grade-0.036
 happened-0.036
Token,
Token ID11
Feature activation+4.11e+00

Pos logit contributions
 principal+0.210
 client+0.209
 classmate+0.202
 grade+0.199
 liar+0.191
Neg logit contributions
From-0.138
 Every-0.132
 refreshing-0.132
 all-0.129
 tight-0.127
Token Benny
Token ID44275
Feature activation-1.15e+00

Pos logit contributions
 happened+0.126
 classmate+0.126
 babys+0.125
 ha+0.125
 grade+0.122
Neg logit contributions
Years-0.067
From-0.066
asting-0.066
utt-0.064
ion-0.063
Token saw
Token ID2497
Feature activation+2.61e+00

Pos logit contributions
From+0.024
 refreshing+0.023
 Every+0.023
 often+0.022
asting+0.021
Neg logit contributions
 client-0.034
 classmate-0.033
 grade-0.033
 principal-0.033
 ha-0.033
Token his
Token ID465
Feature activation+3.85e+00

Pos logit contributions
 classmate+0.078
 client+0.078
 principal+0.077
 ha+0.077
 grade+0.076
Neg logit contributions
From-0.051
 refreshing-0.049
 Every-0.047
asting-0.046
Years-0.046
Token friend
Token ID1545
Feature activation+7.12e+00

Pos logit contributions
ird+0.107
 grade+0.105
 principal+0.104
 ha+0.104
fer+0.097
Neg logit contributions
 tight-0.086
 refreshing-0.085
 Every-0.083
 all-0.081
 tightly-0.080
Token,
Token ID11
Feature activation+4.07e+00

Pos logit contributions
 grade+0.216
 client+0.215
 principal+0.212
 classmate+0.211
 ha+0.210
Neg logit contributions
 Every-0.135
From-0.134
 refreshing-0.131
 often-0.124
asting-0.123
Token the
Token ID262
Feature activation-1.27e+00

Pos logit contributions
 classmate+0.128
 client+0.127
 principal+0.126
 ha+0.126
 grade+0.125
Neg logit contributions
From-0.074
 refreshing-0.072
 Every-0.069
Years-0.066
asting-0.066
Token squirrel
Token ID33039
Feature activation-2.99e-01

Pos logit contributions
 refreshing+0.032
 Every+0.032
From+0.032
 tight+0.031
 all+0.030
Neg logit contributions
 grade-0.030
ird-0.030
 principal-0.029
 client-0.029
 ha-0.029
Token was
Token ID373
Feature activation+4.29e+00

Pos logit contributions
From+0.169
 Every+0.158
 refreshing+0.156
Years+0.154
asting+0.150
Neg logit contributions
 classmate-0.248
 ha-0.247
 principal-0.247
 grade-0.244
 client-0.244
Token ok
Token ID12876
Feature activation+5.09e+00

Pos logit contributions
 ha+0.127
 classmate+0.127
 principal+0.124
 client+0.123
 babys+0.123
Neg logit contributions
From-0.088
 Every-0.081
 refreshing-0.081
Years-0.078
orting-0.076
Token.
Token ID13
Feature activation+1.77e+00

Pos logit contributions
 classmate+0.147
 client+0.147
 grade+0.145
 principal+0.145
 ha+0.145
Neg logit contributions
From-0.106
 refreshing-0.102
 Every-0.099
Years-0.095
asting-0.095
Token<|endoftext|>
Token ID50256
Feature activation-2.69e-01

Pos logit contributions
 principal+0.052
 Lee+0.051
 classmate+0.051
 client+0.050
 grade+0.049
Neg logit contributions
 refreshing-0.036
asting-0.035
Years-0.035
From-0.033
 tighter-0.033
TokenOnce
Token ID7454
Feature activation+3.69e+00

Pos logit contributions
From+0.005
 refreshing+0.005
 Every+0.005
Years+0.005
asting+0.005
Neg logit contributions
 classmate-0.008
 client-0.008
 ha-0.008
 principal-0.008
 grade-0.008
Token there
Token ID612
Feature activation+1.16e-01

Pos logit contributions
 classmate+0.107
 client+0.107
 grade+0.106
 principal+0.106
 ha+0.106
Neg logit contributions
From-0.075
 refreshing-0.073
 Every-0.071
asting-0.068
Years-0.068
Token was
Token ID373
Feature activation+4.54e+00

Pos logit contributions
 client+0.003
 classmate+0.003
 principal+0.003
 grade+0.003
 ha+0.003
Neg logit contributions
From-0.002
 Every-0.002
 refreshing-0.002
 often-0.002
asting-0.002
Token a
Token ID257
Feature activation+5.77e+00

Pos logit contributions
 classmate+0.123
 ha+0.122
 grade+0.121
 client+0.121
 babys+0.119
Neg logit contributions
From-0.098
 refreshing-0.096
Years-0.092
 Every-0.089
asting-0.089
Token playful
Token ID34264
Feature activation-3.22e+00

Pos logit contributions
 classmate+0.160
 client+0.159
 ha+0.158
 principal+0.157
 grade+0.157
Neg logit contributions
From-0.126
 refreshing-0.122
 Every-0.121
asting-0.115
Years-0.115
Token puppy
Token ID26188
Feature activation+2.73e+00

Pos logit contributions
From+0.099
Years+0.093
 refreshing+0.093
 Every+0.092
asting+0.092
Neg logit contributions
 ha-0.065
 classmate-0.065
 babys-0.064
 client-0.063
 principal-0.062
Token.
Token ID13
Feature activation+2.44e+00

Pos logit contributions
 ha+0.071
 client+0.071
 classmate+0.070
 babys+0.069
 principal+0.069
Neg logit contributions
 refreshing-0.064
From-0.064
Years-0.061
asting-0.060
 Every-0.060
Token Sammy
Token ID44067
Feature activation+9.13e-02

Pos logit contributions
From+0.028
 refreshing+0.027
 Every+0.026
Years+0.025
asting+0.025
Neg logit contributions
 classmate-0.047
 client-0.047
 ha-0.046
 grade-0.046
 principal-0.046
Token was
Token ID373
Feature activation+4.22e+00

Pos logit contributions
 happened+0.002
 ha+0.002
 classmate+0.002
 client+0.002
 babys+0.002
Neg logit contributions
From-0.002
Years-0.002
 Every-0.002
 refreshing-0.002
asting-0.002
Token hurt
Token ID5938
Feature activation-2.77e-01

Pos logit contributions
 ha+0.089
 classmate+0.088
 client+0.088
 babys+0.087
 grade+0.085
Neg logit contributions
From-0.122
 refreshing-0.117
Years-0.115
 Every-0.114
asting-0.111
Token and
Token ID290
Feature activation-8.48e+00

Pos logit contributions
 refreshing+0.005
From+0.005
Years+0.005
 Every+0.004
utt+0.004
Neg logit contributions
 classmate-0.009
 client-0.009
 ha-0.009
 babys-0.009
 grade-0.008
Token she
Token ID673
Feature activation-1.34e+00

Pos logit contributions
From+0.193
 refreshing+0.185
 Every+0.181
Years+0.180
asting+0.179
Neg logit contributions
 classmate-0.226
 client-0.226
 ha-0.224
 babys-0.221
 principal-0.220
Token wanted
Token ID2227
Feature activation+1.92e+00

Pos logit contributions
From+0.032
 refreshing+0.031
 Every+0.030
asting+0.030
Years+0.029
Neg logit contributions
 classmate-0.035
 client-0.035
 ha-0.035
 principal-0.034
 babys-0.034
Token to
Token ID284
Feature activation+4.76e+00

Pos logit contributions
 ha+0.045
 babys+0.044
 principal+0.044
 client+0.044
 classmate+0.044
Neg logit contributions
From-0.052
 refreshing-0.049
asting-0.048
 Every-0.047
lessly-0.047
Token help
Token ID1037
Feature activation+2.92e+00

Pos logit contributions
 classmate+0.144
 principal+0.144
 client+0.142
 grade+0.140
 babys+0.136
Neg logit contributions
From-0.094
 Every-0.091
 refreshing-0.090
 often-0.087
asting-0.084
Token.
Token ID13
Feature activation+1.67e+00

Pos logit contributions
 classmate+0.099
 client+0.098
 ha+0.098
 principal+0.097
 grade+0.097
Neg logit contributions
From-0.047
 refreshing-0.046
 Every-0.043
Years-0.041
asting-0.041
Token She
Token ID1375
Feature activation-7.74e+00

Pos logit contributions
 classmate+0.054
 client+0.054
 principal+0.053
 ha+0.053
 babys+0.053
Neg logit contributions
From-0.028
 refreshing-0.028
asting-0.027
Years-0.026
 Every-0.025
Token brought
Token ID3181
Feature activation-2.39e+00

Pos logit contributions
asting+0.203
 refreshing+0.201
 Every+0.198
From+0.191
lessly+0.190
Neg logit contributions
 ha-0.186
 classmate-0.178
 babys-0.174
 client-0.172
 happened-0.166
TokenLet
Token ID5756
Feature activation-2.55e+00

Pos logit contributions
 client+0.017
 classmate+0.017
 grade+0.017
 principal+0.016
 ha+0.016
Neg logit contributions
From-0.011
 refreshing-0.010
 Every-0.010
Years-0.010
asting-0.009
Token's
Token ID338
Feature activation+3.96e+00

Pos logit contributions
From+0.046
 refreshing+0.044
 Every+0.044
 all+0.041
 tight+0.041
Neg logit contributions
 client-0.084
 principal-0.084
 grade-0.083
 classmate-0.082
 liar-0.080
Token draw
Token ID3197
Feature activation-4.74e+00

Pos logit contributions
 client+0.113
 classmate+0.113
 ha+0.112
 grade+0.111
 principal+0.111
Neg logit contributions
From-0.084
 refreshing-0.081
 Every-0.079
asting-0.075
Years-0.075
Token a
Token ID257
Feature activation+5.19e+00

Pos logit contributions
From+0.087
 refreshing+0.084
 Every+0.081
Years+0.079
asting+0.077
Neg logit contributions
 classmate-0.147
 ha-0.147
 client-0.145
 grade-0.145
 principal-0.145
Token face
Token ID1986
Feature activation-1.21e+00

Pos logit contributions
 grade+0.153
 client+0.153
 principal+0.153
 ha+0.149
ird+0.148
Neg logit contributions
 refreshing-0.106
 tight-0.101
From-0.095
 crunch-0.094
 fresh-0.094
Token on
Token ID319
Feature activation-7.92e-01

Pos logit contributions
From+0.021
 Every+0.021
 refreshing+0.020
 all+0.020
 often+0.020
Neg logit contributions
 client-0.039
 classmate-0.038
 principal-0.038
 babys-0.038
 grade-0.037
Token it
Token ID340
Feature activation+7.21e-01

Pos logit contributions
 refreshing+0.013
From+0.013
Years+0.012
asting+0.012
utt+0.012
Neg logit contributions
 client-0.026
 ha-0.026
 classmate-0.026
 principal-0.026
 Lee-0.025
Token,"
Token ID553
Feature activation+3.37e+00

Pos logit contributions
 client+0.017
 classmate+0.017
 principal+0.017
 grade+0.017
 ha+0.017
Neg logit contributions
From-0.019
 refreshing-0.018
 Every-0.018
asting-0.017
Years-0.017
Token Sue
Token ID26089
Feature activation+1.88e+00

Pos logit contributions
 client+0.097
 grade+0.097
 classmate+0.096
 principal+0.096
 ha+0.093
Neg logit contributions
From-0.071
 Every-0.069
 refreshing-0.069
 often-0.063
 tight-0.062
Token said
Token ID531
Feature activation+7.45e+00

Pos logit contributions
 client+0.050
 classmate+0.049
 principal+0.049
 grade+0.049
 ha+0.048
Neg logit contributions
From-0.044
 Every-0.043
 refreshing-0.042
 often-0.040
Years-0.040
Token.
Token ID13
Feature activation+1.65e+00

Pos logit contributions
 client+0.259
 principal+0.257
 ha+0.251
 classmate+0.247
 grade+0.246
Neg logit contributions
 tight-0.113
 tightly-0.112
 Every-0.112
From-0.106
 all-0.105
Token massage
Token ID26900
Feature activation-2.64e+00

Pos logit contributions
 Every+0.028
From+0.028
 refreshing+0.027
 tight+0.027
 often+0.026
Neg logit contributions
 principal-0.035
 classmate-0.035
 client-0.034
 liar-0.033
 grade-0.032
Token,
Token ID11
Feature activation+4.03e+00

Pos logit contributions
 refreshing+0.051
From+0.051
 often+0.050
 Every+0.050
 all+0.048
Neg logit contributions
 client-0.078
 classmate-0.077
 principal-0.076
 grade-0.075
 liar-0.074
Token Lily
Token ID20037
Feature activation+1.97e+00

Pos logit contributions
 client+0.153
 principal+0.150
 classmate+0.149
 grade+0.148
 ha+0.145
Neg logit contributions
From-0.052
 refreshing-0.051
 all-0.050
 Every-0.049
 often-0.046
Token asked
Token ID1965
Feature activation+3.34e+00

Pos logit contributions
 client+0.058
 classmate+0.057
 grade+0.057
 principal+0.057
 ha+0.056
Neg logit contributions
From-0.041
 refreshing-0.039
 Every-0.038
asting-0.036
Years-0.036
Token if
Token ID611
Feature activation+2.04e+00

Pos logit contributions
 classmate+0.107
 client+0.106
 principal+0.104
 grade+0.103
 ha+0.103
Neg logit contributions
From-0.061
 all-0.060
 Every-0.060
 often-0.058
 every-0.057
Token she
Token ID673
Feature activation-1.26e+00

Pos logit contributions
 classmate+0.058
 grade+0.057
 client+0.057
 principal+0.056
 babys+0.055
Neg logit contributions
From-0.042
 refreshing-0.040
 often-0.040
 all-0.040
 Every-0.039
Token could
Token ID714
Feature activation+4.06e+00

Pos logit contributions
From+0.018
 Every+0.018
 refreshing+0.017
 often+0.016
 tight+0.016
Neg logit contributions
 client-0.044
 principal-0.044
 classmate-0.044
 babys-0.043
 grade-0.043
Token go
Token ID467
Feature activation+1.72e-01

Pos logit contributions
 classmate+0.119
 client+0.119
 grade+0.118
 principal+0.118
 ha+0.117
Neg logit contributions
From-0.083
 refreshing-0.080
 Every-0.078
asting-0.074
Years-0.074
Token outside
Token ID2354
Feature activation+5.22e-01

Pos logit contributions
 client+0.006
 classmate+0.006
 principal+0.006
 ha+0.005
 grade+0.005
Neg logit contributions
From-0.003
 refreshing-0.003
 Every-0.003
asting-0.003
 often-0.003
Token to
Token ID284
Feature activation+4.79e+00

Pos logit contributions
 client+0.020
 classmate+0.020
 ha+0.019
 principal+0.019
 liar+0.019
Neg logit contributions
 often-0.006
 Every-0.006
 all-0.006
 every-0.006
 around-0.006
Token play
Token ID711
Feature activation-1.46e+00

Pos logit contributions
 classmate+0.153
 client+0.153
 principal+0.151
 grade+0.150
 ha+0.150
Neg logit contributions
From-0.084
 refreshing-0.082
 Every-0.080
asting-0.075
Years-0.074
Token the
Token ID262
Feature activation-1.16e+00

Pos logit contributions
From+0.008
Years+0.008
reci+0.008
asting+0.008
 Every+0.008
Neg logit contributions
 ha-0.009
 happened-0.009
 grade-0.008
 babys-0.008
 Lee-0.008
Token hose
Token ID31489
Feature activation-2.64e+00

Pos logit contributions
From+0.029
 Every+0.026
 From+0.025
aling+0.025
 often+0.025
Neg logit contributions
 babys-0.031
 ha-0.030
 classmate-0.030
 client-0.029
 grade-0.029
Token finally
Token ID3443
Feature activation+3.10e+00

Pos logit contributions
From+0.071
Years+0.068
 Every+0.067
 refreshing+0.067
asting+0.064
Neg logit contributions
 classmate-0.060
 ha-0.059
 happened-0.059
 babys-0.058
 principal-0.056
Token stopped
Token ID5025
Feature activation+3.42e+00

Pos logit contributions
 happened+0.073
 ha+0.069
 classmate+0.069
 grade+0.067
 babys+0.064
Neg logit contributions
From-0.082
 Every-0.080
Years-0.078
 refreshing-0.076
asting-0.073
Token.
Token ID13
Feature activation+1.73e+00

Pos logit contributions
 ha+0.085
 happened+0.085
 babys+0.085
 grade+0.085
 classmate+0.085
Neg logit contributions
From-0.083
 refreshing-0.078
Years-0.078
 Every-0.076
asting-0.073
Token\n
Token ID198
Feature activation-2.87e+00

Pos logit contributions
 Lee+0.044
 ha+0.042
 happened+0.042
 babys+0.041
 client+0.041
Neg logit contributions
 refreshing-0.040
asting-0.040
Years-0.039
 tighter-0.039
ock-0.038
Token\n
Token ID198
Feature activation-2.90e+00

Pos logit contributions
 refreshing+0.064
From+0.055
Years+0.055
 tighter+0.055
asting+0.054
Neg logit contributions
 ha-0.076
 classmate-0.075
 principal-0.075
 client-0.074
 babys-0.073
TokenAt
Token ID2953
Feature activation-5.37e+00

Pos logit contributions
 refreshing+0.065
rant+0.062
reci+0.061
orting+0.060
 thanks+0.059
Neg logit contributions
 ha-0.078
 client-0.066
 liar-0.065
ggy-0.064
orman-0.063
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
From+0.093
 refreshing+0.089
asting+0.088
 Every+0.087
Years+0.082
Neg logit contributions
 ha-0.173
 principal-0.169
 grade-0.169
 classmate-0.167
 client-0.166
Token end
Token ID886
Feature activation+6.56e+00

Pos logit contributions
From+0.025
 Every+0.023
 refreshing+0.023
Years+0.022
asting+0.022
Neg logit contributions
 classmate-0.036
 ha-0.036
 client-0.035
 babys-0.035
 principal-0.035
Token of
Token ID286
Feature activation-3.41e+00

Pos logit contributions
 classmate+0.239
 ha+0.238
 client+0.237
 babys+0.234
 grade+0.234
Neg logit contributions
From-0.086
 refreshing-0.081
 Every-0.076
Years-0.076
asting-0.074
Token go
Token ID467
Feature activation+1.86e-01

Pos logit contributions
 classmate+0.148
 client+0.147
 principal+0.147
 grade+0.145
 ha+0.144
Neg logit contributions
From-0.090
 Every-0.087
 refreshing-0.085
 often-0.081
asting-0.080
Token on
Token ID319
Feature activation-9.09e-01

Pos logit contributions
 grade+0.005
 classmate+0.005
 client+0.005
 ha+0.005
 principal+0.005
Neg logit contributions
From-0.004
 refreshing-0.004
 Every-0.003
Years-0.003
asting-0.003
Token an
Token ID281
Feature activation+3.39e+00

Pos logit contributions
 Every+0.016
From+0.016
 refreshing+0.015
 all+0.015
 often+0.015
Neg logit contributions
 classmate-0.030
 client-0.029
 ha-0.029
 principal-0.029
 grade-0.029
Token adventure
Token ID8855
Feature activation-1.86e+00

Pos logit contributions
 client+0.109
 classmate+0.108
 principal+0.108
 grade+0.108
 ha+0.106
Neg logit contributions
From-0.056
 refreshing-0.054
lessly-0.050
 Every-0.050
asting-0.048
Token.
Token ID13
Feature activation+1.74e+00

Pos logit contributions
From+0.045
 refreshing+0.044
 Every+0.040
Years+0.040
asting+0.039
Neg logit contributions
 grade-0.049
 ha-0.048
 principal-0.048
 classmate-0.048
 client-0.048
Token He
Token ID679
Feature activation-6.99e+00

Pos logit contributions
 client+0.060
 grade+0.060
 Lee+0.059
 principal+0.058
 classmate+0.058
Neg logit contributions
 refreshing-0.028
From-0.028
asting-0.027
Years-0.024
rant-0.023
Token looked
Token ID3114
Feature activation+1.07e+00

Pos logit contributions
From+0.172
 refreshing+0.168
asting+0.160
 Every+0.159
orting+0.155
Neg logit contributions
 ha-0.179
 classmate-0.173
 client-0.172
 principal-0.169
 grade-0.166
Token to
Token ID284
Feature activation+4.79e+00

Pos logit contributions
 classmate+0.025
 grade+0.024
 happened+0.024
 seeker+0.023
 sister+0.023
Neg logit contributions
From-0.028
asting-0.024
Even-0.024
ion-0.024
Years-0.023
Token the
Token ID262
Feature activation-1.25e+00

Pos logit contributions
 principal+0.142
 grade+0.140
 client+0.140
 ha+0.140
 classmate+0.136
Neg logit contributions
From-0.098
 refreshing-0.092
ion-0.090
orting-0.087
lessly-0.085
Token left
Token ID1364
Feature activation-1.10e+00

Pos logit contributions
From+0.030
 Every+0.027
 refreshing+0.027
 From+0.026
ion+0.026
Neg logit contributions
 client-0.034
 principal-0.034
 ha-0.033
 classmate-0.033
 babys-0.032
Token and
Token ID290
Feature activation-8.44e+00

Pos logit contributions
From+0.023
 refreshing+0.022
Years+0.021
 Every+0.020
Hop+0.019
Neg logit contributions
 principal-0.034
 ha-0.034
 client-0.033
 grade-0.033
 classmate-0.033
Token He
Token ID679
Feature activation-7.40e+00

Pos logit contributions
 principal+0.039
 client+0.039
 classmate+0.039
 babys+0.038
 ha+0.038
Neg logit contributions
 refreshing-0.019
From-0.018
asting-0.018
Years-0.018
 often-0.016
Token waited
Token ID13488
Feature activation+3.02e+00

Pos logit contributions
From+0.187
 refreshing+0.185
 Every+0.181
asting+0.179
Years+0.174
Neg logit contributions
 classmate-0.179
 client-0.177
 ha-0.177
 principal-0.176
 babys-0.173
Token with
Token ID351
Feature activation-4.59e+00

Pos logit contributions
 grade+0.093
 babys+0.089
 Lee+0.089
 classmate+0.088
 ha+0.088
Neg logit contributions
From-0.055
 refreshing-0.053
 Every-0.051
Years-0.049
reci-0.046
Token her
Token ID607
Feature activation+4.80e+00

Pos logit contributions
From+0.109
Years+0.095
orting+0.095
asting+0.094
reci+0.093
Neg logit contributions
 ha-0.127
 Lee-0.123
 principal-0.123
 classmate-0.121
 babys-0.121
Token until
Token ID1566
Feature activation-4.92e-01

Pos logit contributions
 client+0.148
 classmate+0.148
 grade+0.145
 ha+0.145
 principal+0.145
Neg logit contributions
From-0.095
 refreshing-0.089
 Every-0.086
Years-0.084
asting-0.082
Token they
Token ID484
Feature activation-1.80e+00

Pos logit contributions
From+0.008
 refreshing+0.008
 Every+0.008
 often+0.007
asting+0.007
Neg logit contributions
 client-0.016
 classmate-0.016
 principal-0.016
 grade-0.016
 ha-0.015
Token came
Token ID1625
Feature activation+3.90e+00

Pos logit contributions
From+0.043
 refreshing+0.042
 Every+0.042
Years+0.040
asting+0.040
Neg logit contributions
 classmate-0.046
 ha-0.046
 client-0.045
 principal-0.045
 grade-0.045
Token.
Token ID13
Feature activation+1.11e+00

Pos logit contributions
 principal+0.128
 client+0.128
 classmate+0.126
 grade+0.126
 ha+0.126
Neg logit contributions
From-0.066
 Every-0.065
 refreshing-0.062
 tight-0.061
 often-0.061
Token\n
Token ID198
Feature activation-3.14e+00

Pos logit contributions
 principal+0.035
 Lee+0.035
 client+0.035
 babys+0.035
 classmate+0.035
Neg logit contributions
asting-0.019
 refreshing-0.019
From-0.019
Years-0.019
 often-0.017
Token\n
Token ID198
Feature activation-3.16e+00

Pos logit contributions
 refreshing+0.061
From+0.054
Years+0.053
asting+0.053
 Every+0.052
Neg logit contributions
 principal-0.093
 ha-0.092
 babys-0.092
 classmate-0.092
 client-0.091
TokenThe
Token ID464
Feature activation-7.93e+00

Pos logit contributions
 refreshing+0.075
 Every+0.074
 tightly+0.068
utt+0.068
From+0.068
Neg logit contributions
 ha-0.083
 principal-0.078
 client-0.077
 classmate-0.077
 babys-0.076
Tokenmy
Token ID1820
Feature activation+3.48e+00

Pos logit contributions
From+0.035
 refreshing+0.033
 Every+0.031
Years+0.030
asting+0.030
Neg logit contributions
 classmate-0.079
 ha-0.079
 client-0.079
 principal-0.078
 babys-0.078
Token,
Token ID11
Feature activation+4.08e+00

Pos logit contributions
 principal+0.096
 classmate+0.096
 grade+0.095
 client+0.095
 babys+0.092
Neg logit contributions
From-0.075
 Every-0.073
 refreshing-0.073
 often-0.070
 tight-0.070
Token if
Token ID611
Feature activation+2.03e+00

Pos logit contributions
 ha+0.115
 grade+0.101
 sister+0.099
 principal+0.099
 client+0.099
Neg logit contributions
 Every-0.100
Years-0.096
From-0.095
rant-0.094
 Later-0.094
Token you
Token ID345
Feature activation+1.69e+00

Pos logit contributions
 classmate+0.067
 client+0.066
 principal+0.066
 grade+0.066
 babys+0.064
Neg logit contributions
From-0.035
 refreshing-0.033
 Every-0.032
 often-0.030
Years-0.030
Token had
Token ID550
Feature activation-8.50e-02

Pos logit contributions
 grade+0.044
 ha+0.044
 client+0.043
 classmate+0.042
 babys+0.042
Neg logit contributions
From-0.042
 refreshing-0.039
 Every-0.038
Years-0.038
 From-0.036
Token cleaned
Token ID20750
Feature activation-5.86e+00

Pos logit contributions
 refreshing+0.002
From+0.002
 all+0.002
 Every+0.002
asting+0.002
Neg logit contributions
 client-0.002
 classmate-0.002
 principal-0.002
 grade-0.002
 Lee-0.002
Token up
Token ID510
Feature activation+1.14e+00

Pos logit contributions
From+0.112
 refreshing+0.106
Years+0.105
 Every+0.104
asting+0.101
Neg logit contributions
 ha-0.179
 client-0.175
 classmate-0.175
 principal-0.174
 grade-0.173
Token your
Token ID534
Feature activation+5.29e-01

Pos logit contributions
 ha+0.035
 grade+0.034
 client+0.034
 principal+0.034
 classmate+0.033
Neg logit contributions
From-0.022
Years-0.021
 refreshing-0.021
 Every-0.021
asting-0.020
Token toys
Token ID14958
Feature activation+1.07e+00

Pos logit contributions
 client+0.013
 classmate+0.013
 principal+0.013
 grade+0.013
 ha+0.013
Neg logit contributions
 refreshing-0.013
 Every-0.013
 tight-0.012
From-0.012
 tightly-0.012
Token like
Token ID588
Feature activation-4.17e+00

Pos logit contributions
 classmate+0.028
 client+0.028
 ha+0.028
 grade+0.028
 principal+0.028
Neg logit contributions
From-0.025
 refreshing-0.024
 Every-0.023
Years-0.022
asting-0.022
Token I
Token ID314
Feature activation-1.93e+00

Pos logit contributions
From+0.083
 Every+0.076
 refreshing+0.075
Years+0.075
asting+0.075
Neg logit contributions
 ha-0.128
 babys-0.123
 principal-0.122
 client-0.121
 grade-0.121
TokenMom
Token ID29252
Feature activation+5.99e+00

Pos logit contributions
 classmate+0.017
 client+0.017
 ha+0.017
 grade+0.017
 principal+0.017
Neg logit contributions
From-0.013
 refreshing-0.013
 Every-0.013
asting-0.012
Years-0.012
Token,
Token ID11
Feature activation+4.07e+00

Pos logit contributions
 client+0.171
 classmate+0.169
 grade+0.167
 principal+0.166
 babys+0.163
Neg logit contributions
From-0.124
 Every-0.123
 refreshing-0.123
 often-0.115
asting-0.114
Token can
Token ID460
Feature activation+3.95e+00

Pos logit contributions
 classmate+0.136
 client+0.135
 principal+0.133
 grade+0.133
 babys+0.131
Neg logit contributions
From-0.065
 refreshing-0.064
 Every-0.063
 often-0.058
asting-0.057
Token I
Token ID314
Feature activation-1.93e+00

Pos logit contributions
 ha+0.107
 client+0.105
 classmate+0.104
 principal+0.103
 grade+0.103
Neg logit contributions
From-0.091
 refreshing-0.088
Years-0.086
asting-0.085
 Every-0.085
Token have
Token ID423
Feature activation-4.33e+00

Pos logit contributions
From+0.041
 refreshing+0.040
 Every+0.038
Years+0.037
asting+0.037
Neg logit contributions
 ha-0.055
 client-0.055
 classmate-0.055
 grade-0.055
 principal-0.054
Token some
Token ID617
Feature activation-1.74e+00

Pos logit contributions
From+0.077
 refreshing+0.072
Years+0.068
asting+0.068
 Every+0.067
Neg logit contributions
 ha-0.142
 client-0.137
 principal-0.135
 babys-0.135
 grade-0.135
Token water
Token ID1660
Feature activation-3.41e+00

Pos logit contributions
 Every+0.033
 refreshing+0.032
 all+0.031
From+0.031
 tight+0.030
Neg logit contributions
 classmate-0.054
 principal-0.054
 client-0.053
 grade-0.052
fer-0.052
Token?"
Token ID1701
Feature activation+5.28e+00

Pos logit contributions
 refreshing+0.082
 Every+0.081
From+0.080
 all+0.079
asting+0.078
Neg logit contributions
 client-0.087
 classmate-0.086
 principal-0.083
 grade-0.081
 ha-0.080
Token Sara
Token ID24799
Feature activation+1.71e-01

Pos logit contributions
 client+0.183
 principal+0.182
 classmate+0.179
 grade+0.178
fer+0.173
Neg logit contributions
 Every-0.094
From-0.088
 refreshing-0.084
 often-0.078
 all-0.075
Token asked
Token ID1965
Feature activation+3.33e+00

Pos logit contributions
 client+0.006
 grade+0.006
 liar+0.006
 principal+0.006
 classmate+0.006
Neg logit contributions
 Every-0.003
 often-0.003
 tight-0.003
From-0.003
 carefully-0.003
Token.
Token ID13
Feature activation+1.80e+00

Pos logit contributions
 client+0.131
 principal+0.130
 liar+0.128
 grade+0.123
 ha+0.122
Neg logit contributions
 tight-0.040
 tightly-0.038
 over-0.038
 all-0.038
 around-0.038
Token He
Token ID679
Feature activation-6.82e+00

Pos logit contributions
 grade+0.062
 classmate+0.061
 principal+0.060
 client+0.058
 classroom+0.057
Neg logit contributions
 refreshing-0.049
asting-0.049
From-0.048
Years-0.045
ion-0.045
Token had
Token ID550
Feature activation+8.67e-02

Pos logit contributions
From+0.102
 refreshing+0.097
 Every+0.094
asting+0.088
Years+0.087
Neg logit contributions
 client-0.238
 classmate-0.238
 grade-0.236
 principal-0.235
 ha-0.235
Token a
Token ID257
Feature activation+5.67e+00

Pos logit contributions
 classmate+0.003
 client+0.003
 grade+0.003
 principal+0.003
 ha+0.003
Neg logit contributions
From-0.002
 refreshing-0.001
 Every-0.001
 often-0.001
asting-0.001
Token mild
Token ID11607
Feature activation-3.31e+00

Pos logit contributions
 principal+0.185
 grade+0.184
ird+0.184
fer+0.183
 Lee+0.179
Neg logit contributions
 tight-0.098
 refreshing-0.095
 often-0.088
 all-0.085
 fresh-0.083
Token heart
Token ID2612
Feature activation-2.19e+00

Pos logit contributions
 refreshing+0.074
 Every+0.072
From+0.071
 tight+0.070
 often+0.070
Neg logit contributions
 principal-0.088
 client-0.087
 classmate-0.087
 ha-0.085
 grade-0.084
Token and
Token ID290
Feature activation-8.10e+00

Pos logit contributions
From+0.036
 refreshing+0.033
 Every+0.032
Years+0.031
asting+0.029
Neg logit contributions
 classmate-0.074
 client-0.074
 grade-0.074
 principal-0.073
 ha-0.073
Token was
Token ID373
Feature activation+4.46e+00

Pos logit contributions
From+0.124
 refreshing+0.118
 Every+0.115
asting+0.106
Years+0.106
Neg logit contributions
 client-0.279
 classmate-0.279
 grade-0.276
 principal-0.276
 ha-0.275
Token always
Token ID1464
Feature activation-6.48e+00

Pos logit contributions
 classmate+0.141
 ha+0.141
 client+0.140
 babys+0.139
 principal+0.137
Neg logit contributions
From-0.083
orting-0.076
 refreshing-0.074
Years-0.072
asting-0.071
Token very
Token ID845
Feature activation+6.08e+00

Pos logit contributions
 refreshing+0.127
From+0.125
 Every+0.120
 often+0.118
 tight+0.117
Neg logit contributions
 client-0.196
 principal-0.193
 grade-0.192
 classmate-0.192
 ha-0.188
Token kind
Token ID1611
Feature activation-1.10e+00

Pos logit contributions
 classmate+0.173
 client+0.173
 ha+0.173
 grade+0.170
 principal+0.167
Neg logit contributions
From-0.131
 refreshing-0.121
 Every-0.121
Years-0.121
asting-0.116
Token to
Token ID284
Feature activation+4.92e+00

Pos logit contributions
From+0.024
 refreshing+0.024
 Every+0.022
Years+0.022
asting+0.022
Neg logit contributions
 classmate-0.031
 client-0.031
 principal-0.030
 ha-0.030
 grade-0.030
Token likes
Token ID7832
Feature activation-3.57e+00

Pos logit contributions
From+0.035
 refreshing+0.033
 Every+0.032
Years+0.032
asting+0.032
Neg logit contributions
 client-0.037
 ha-0.037
 grade-0.036
 classmate-0.036
 principal-0.036
Token the
Token ID262
Feature activation-1.63e+00

Pos logit contributions
From+0.071
 refreshing+0.064
asting+0.062
Years+0.062
 Every+0.060
Neg logit contributions
 client-0.109
 ha-0.108
 principal-0.108
 classmate-0.108
 babys-0.107
Token animals
Token ID4695
Feature activation-5.30e-01

Pos logit contributions
From+0.038
 refreshing+0.037
 Every+0.036
Years+0.035
asting+0.035
Neg logit contributions
 client-0.043
 classmate-0.043
 ha-0.042
 grade-0.042
 principal-0.042
Token too
Token ID1165
Feature activation+3.35e+00

Pos logit contributions
From+0.013
Years+0.012
 refreshing+0.012
 Every+0.012
asting+0.011
Neg logit contributions
 classmate-0.014
 client-0.014
 ha-0.013
 grade-0.013
 principal-0.013
Token.
Token ID13
Feature activation+1.03e+00

Pos logit contributions
 client+0.090
 classmate+0.089
 grade+0.088
 ha+0.088
 principal+0.087
Neg logit contributions
From-0.077
 refreshing-0.075
 Every-0.070
Years-0.070
asting-0.069
Token She
Token ID1375
Feature activation-8.14e+00

Pos logit contributions
 client+0.028
 classmate+0.028
 Lee+0.028
 grade+0.027
 principal+0.027
Neg logit contributions
asting-0.023
From-0.022
 refreshing-0.022
Years-0.021
 often-0.020
Token thanks
Token ID5176
Feature activation-3.38e+00

Pos logit contributions
asting+0.218
From+0.217
 refreshing+0.205
Years+0.204
lessly+0.202
Neg logit contributions
 ha-0.194
 client-0.185
 classmate-0.184
 grade-0.183
 principal-0.181
Token the
Token ID262
Feature activation-1.65e+00

Pos logit contributions
From+0.080
 refreshing+0.077
asting+0.075
Years+0.073
 Every+0.071
Neg logit contributions
 classmate-0.089
 ha-0.089
 babys-0.088
 client-0.088
 principal-0.088
Token farmer
Token ID18739
Feature activation+3.95e+00

Pos logit contributions
From+0.044
 refreshing+0.041
 Every+0.040
asting+0.039
 From+0.039
Neg logit contributions
 client-0.041
 ha-0.040
 babys-0.040
 classmate-0.039
 principal-0.039
Token for
Token ID329
Feature activation-4.74e-01

Pos logit contributions
 Lee+0.115
 client+0.114
 classmate+0.114
 ha+0.110
 principal+0.110
Neg logit contributions
From-0.090
 refreshing-0.085
Years-0.083
Even-0.078
asting-0.077
Token inviting
Token ID23310
Feature activation-3.46e+00

Pos logit contributions
From+0.011
 refreshing+0.011
 Every+0.011
Years+0.010
asting+0.010
Neg logit contributions
 principal-0.013
 ha-0.013
 babys-0.012
 Lee-0.012
 grade-0.012
Token cars
Token ID5006
Feature activation-2.58e+00

Pos logit contributions
From+0.096
Years+0.087
 Every+0.084
 refreshing+0.083
orting+0.080
Neg logit contributions
 classmate-0.115
 ha-0.114
 babys-0.114
 client-0.114
 principal-0.112
Token.
Token ID13
Feature activation+2.22e+00

Pos logit contributions
From+0.055
 refreshing+0.050
Years+0.050
 Every+0.046
 From+0.043
Neg logit contributions
 babys-0.074
 grade-0.073
 classmate-0.073
 principal-0.072
 ha-0.072
Token He
Token ID679
Feature activation-6.84e+00

Pos logit contributions
 principal+0.071
 classmate+0.071
 Lee+0.070
 client+0.070
 babys+0.070
Neg logit contributions
 refreshing-0.040
asting-0.038
From-0.037
Years-0.036
 tighter-0.034
Token looked
Token ID3114
Feature activation+1.02e+00

Pos logit contributions
From+0.160
 refreshing+0.157
 Every+0.154
orting+0.149
Years+0.147
Neg logit contributions
 ha-0.181
 classmate-0.177
 principal-0.173
 client-0.172
 grade-0.167
Token around
Token ID1088
Feature activation-2.81e+00

Pos logit contributions
 classmate+0.031
 grade+0.030
 babys+0.029
 principal+0.029
 ha+0.029
Neg logit contributions
From-0.021
Years-0.017
 Every-0.017
asting-0.017
ion-0.016
Token and
Token ID290
Feature activation-8.11e+00

Pos logit contributions
From+0.055
 refreshing+0.053
asting+0.049
utt+0.047
Years+0.046
Neg logit contributions
 grade-0.086
 classmate-0.084
 principal-0.083
 ha-0.082
 babys-0.082
Token noticed
Token ID6810
Feature activation+2.03e+00

Pos logit contributions
From+0.242
 refreshing+0.229
 Every+0.228
asting+0.226
Years+0.223
Neg logit contributions
 classmate-0.165
 ha-0.162
 principal-0.161
 grade-0.159
 client-0.155
Token lots
Token ID6041
Feature activation-6.97e-01

Pos logit contributions
 ha+0.061
 classmate+0.061
 principal+0.061
 client+0.060
 grade+0.059
Neg logit contributions
From-0.040
 refreshing-0.037
 Every-0.035
Years-0.035
asting-0.034
Token of
Token ID286
Feature activation-3.30e+00

Pos logit contributions
From+0.015
 Later+0.015
Years+0.014
 refreshing+0.014
 Every+0.013
Neg logit contributions
 grade-0.019
 babys-0.018
 classmate-0.018
 principal-0.018
 happened-0.018
Token different
Token ID1180
Feature activation-4.58e+00

Pos logit contributions
From+0.081
 refreshing+0.075
 Every+0.074
Years+0.073
 From+0.070
Neg logit contributions
 ha-0.085
 principal-0.085
 client-0.085
 classmate-0.084
 babys-0.083
Token buildings
Token ID6832
Feature activation-2.80e+00

Pos logit contributions
From+0.097
 refreshing+0.093
 Every+0.091
Years+0.087
asting+0.087
Neg logit contributions
 classmate-0.131
 client-0.131
 principal-0.129
 grade-0.129
 ha-0.129
Token.
Token ID13
Feature activation+1.22e+00

Pos logit contributions
 ha+0.081
 babys+0.076
 Lee+0.075
 client+0.075
 classmate+0.075
Neg logit contributions
Years-0.096
From-0.096
 refreshing-0.092
asting-0.087
 Every-0.086
Token She
Token ID1375
Feature activation-7.92e+00

Pos logit contributions
 classmate+0.037
 grade+0.036
 client+0.036
 principal+0.036
 Lee+0.036
Neg logit contributions
From-0.024
asting-0.024
 refreshing-0.023
Years-0.023
 often-0.022
Token felt
Token ID2936
Feature activation-9.51e-01

Pos logit contributions
 refreshing+0.198
From+0.197
asting+0.196
Years+0.192
 Every+0.192
Neg logit contributions
 ha-0.193
 classmate-0.191
 client-0.187
 principal-0.186
 babys-0.183
Token silly
Token ID14397
Feature activation-5.30e-01

Pos logit contributions
From+0.025
 refreshing+0.024
Years+0.024
 Every+0.023
asting+0.023
Neg logit contributions
 classmate-0.022
 client-0.022
 ha-0.022
 principal-0.022
 grade-0.021
Token.
Token ID13
Feature activation+1.22e+00

Pos logit contributions
From+0.007
 refreshing+0.007
 Every+0.007
asting+0.006
 often+0.006
Neg logit contributions
 client-0.019
 classmate-0.019
 principal-0.019
 ha-0.019
 grade-0.019
Token She
Token ID1375
Feature activation-7.87e+00

Pos logit contributions
 client+0.040
 classmate+0.040
 ha+0.039
 grade+0.039
 principal+0.039
Neg logit contributions
From-0.021
 Every-0.020
 refreshing-0.020
Years-0.018
 often-0.018
Token was
Token ID373
Feature activation+4.05e+00

Pos logit contributions
From+0.190
 refreshing+0.186
 Every+0.184
asting+0.180
Years+0.179
Neg logit contributions
 ha-0.198
 classmate-0.198
 client-0.196
 principal-0.194
 grade-0.191
Token not
Token ID407
Feature activation-4.47e-01

Pos logit contributions
 classmate+0.098
 ha+0.097
 client+0.097
 principal+0.096
 grade+0.095
Neg logit contributions
From-0.105
 refreshing-0.099
 Every-0.099
Years-0.097
asting-0.095
Token brave
Token ID14802
Feature activation-2.40e-01

Pos logit contributions
 all+0.008
 refreshing+0.008
asting+0.008
 crunch+0.008
 around+0.008
Neg logit contributions
 client-0.014
 principal-0.014
 grade-0.014
 classmate-0.013
 babys-0.013
Token like
Token ID588
Feature activation-4.50e+00

Pos logit contributions
 Every+0.004
 all+0.004
 often+0.004
 around+0.003
 refreshing+0.003
Neg logit contributions
 classmate-0.008
 principal-0.008
 client-0.008
 grade-0.008
 babys-0.008
Token Ben
Token ID3932
Feature activation+2.15e+00

Pos logit contributions
From+0.108
 refreshing+0.096
Years+0.095
asting+0.094
 Every+0.091
Neg logit contributions
 principal-0.126
 grade-0.123
 ha-0.123
 classmate-0.123
 client-0.123
Token to
Token ID284
Feature activation+4.46e+00

Pos logit contributions
 client+0.112
 classmate+0.112
 principal+0.110
 ha+0.110
 grade+0.110
Neg logit contributions
From-0.071
 refreshing-0.069
 Every-0.068
asting-0.064
Years-0.064
Token the
Token ID262
Feature activation-1.64e+00

Pos logit contributions
 client+0.168
 classmate+0.168
 grade+0.163
 principal+0.163
 ha+0.162
Neg logit contributions
 Every-0.054
 refreshing-0.051
From-0.050
 all-0.047
 often-0.046
Token living
Token ID2877
Feature activation-5.38e+00

Pos logit contributions
 refreshing+0.040
From+0.040
 Every+0.038
 tight+0.038
 often+0.037
Neg logit contributions
 grade-0.040
 classmate-0.040
 client-0.039
 principal-0.038
 ha-0.037
Token room
Token ID2119
Feature activation+3.21e+00

Pos logit contributions
 all+0.118
 refreshing+0.116
 Every+0.115
 tight+0.115
 around+0.112
Neg logit contributions
 client-0.144
 grade-0.143
 liar-0.137
 classmate-0.136
 happened-0.135
Token.
Token ID13
Feature activation+1.54e+00

Pos logit contributions
 classmate+0.089
 client+0.089
 grade+0.089
 principal+0.089
 ha+0.088
Neg logit contributions
From-0.070
 refreshing-0.068
 Every-0.066
Years-0.063
asting-0.063
Token She
Token ID1375
Feature activation-7.80e+00

Pos logit contributions
 classmate+0.043
 principal+0.042
 grade+0.042
 babys+0.042
 client+0.042
Neg logit contributions
asting-0.034
From-0.033
 refreshing-0.032
Years-0.031
 often-0.030
Token saw
Token ID2497
Feature activation+2.31e+00

Pos logit contributions
From+0.162
 refreshing+0.156
 Every+0.154
asting+0.151
Years+0.148
Neg logit contributions
 classmate-0.225
 ha-0.225
 principal-0.224
 babys-0.223
 client-0.220
Token Ben
Token ID3932
Feature activation+2.06e+00

Pos logit contributions
 principal+0.075
 classmate+0.074
 babys+0.073
 client+0.073
 ha+0.073
Neg logit contributions
From-0.041
 refreshing-0.039
Years-0.038
asting-0.037
 often-0.036
Token sitting
Token ID5586
Feature activation-6.71e+00

Pos logit contributions
 ha+0.058
 classmate+0.057
 client+0.057
 babys+0.057
 grade+0.057
Neg logit contributions
From-0.046
 refreshing-0.045
Years-0.044
 Every-0.042
lessly-0.039
Token on
Token ID319
Feature activation-9.81e-01

Pos logit contributions
From+0.135
 refreshing+0.130
 Every+0.127
asting+0.121
Years+0.121
Neg logit contributions
 classmate-0.200
 client-0.200
 principal-0.197
 grade-0.197
 ha-0.197
Token the
Token ID262
Feature activation-1.66e+00

Pos logit contributions
From+0.017
 refreshing+0.016
Years+0.015
asting+0.015
 Every+0.015
Neg logit contributions
 principal-0.032
 client-0.032
 ha-0.032
 classmate-0.032
 grade-0.032