TOP ACTIVATIONS
MAX = 0.129

for you ." Bill gave Bob the bread , and Bob
agreed and they gave the man some of their breakfast .
ily and Ben gave the man some bread . They taped
, and they gave the man some food and water .
! The old elephant gave Bob a hug . Bob was
she loved to give the boy lots of kisses .
The elderly man was giving daddy lots of advice .
iz thanked him , gave John a big hug , and
moment and then gave the deer a big , juicy apple
agreed , and gave the farmer some of their eggs ,
Tom gave the man a new blanket and wished
. Lily and Tom give him some treats . They pet
So , she gave the bear a sandwich .
away . He gave the children some corn . He told
you ." He gave Tim my a sandwich and some juice
man smiled and gave the owl a piece of the apple
smiled , giving his grand pa a big hug . <|endoftext|>
others . They gave each person a medal to show that
Mom nodded and gave them some bread cr umbs from
nodded eagerly . Mum gave them some food and they sprinkled

MIN ACTIVATIONS
MIN = -0.119

are going to a far away place !" Tom said ,
She had never seen a math book before , so she
red . It was a yoga mat ! Ell a was
, going to a far away place . "
. She had a big math book in her bag .
take us to a far away place ." L
can find animals in the math book . Maybe I can
take you to a far away place . You have to
a story about a far away land that was very distant
away . She has a math book with many pictures and
, she went to a yoga class with her mom .
. Lily has a pink yoga mat . She puts it
was a nightmare about a scary monster that wouldn 't go
a map of a far away land . "
it was from a far away place . She said it
to travel to a far away place and brought Spot along
. Can we play together again ?" Ben smiles
. Can we play soccer again ? I will work hard
. Can we play together again ?" he said .
. Can we play together again ?" L ila

INTERVAL 0.067 - 0.129
CONTAINS 0.331%

their favorite animals were the sheep . The sheep were soft
and hugged the t eddy bear . They thanked their mom
J ill and Jack got very excited and started playing
, so he gave Sus ie the paint and they both
thanked her mom for being so helpful . That night ,

INTERVAL 0.005 - 0.067
CONTAINS 38.907%

do it again soon ." The fat cat
looked around for the other animals he had shared his cheese
it was just the right size to sail back home .
One day , he decided to visit the living room .
to go to church with her family every Sunday . One

INTERVAL -0.057 - 0.005
CONTAINS 60.251%

falls out and they become bald . Tim my didn 't
One day , she saw a patient dog sitting in the
it was special . Every day it sat on the
Once upon a time , there was a little girl called
living room . They sit on the floor . They pretend

INTERVAL -0.119 - -0.057
CONTAINS 0.511%

mess . As she was playing , she saw a little
he could see in the dark . John
k itty . Are you lost ? You look sad and
friends . At school , Amy saw a boy
- even when she was asleep . <|endoftext|> M ummy was
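
The four INTERVAL blocks above look like equal-width histogram bins spanning MIN to MAX. The following is a minimal sketch that checks this reading against the numbers shown: it recomputes the bin edges from MIN = -0.119 and MAX = 0.129 and confirms that the CONTAINS percentages sum to 100. The variable names and the equal-width assumption are illustrative and are not taken from the tool that produced this dashboard.

```python
# Sanity check on the interval summary (assumption: 4 equal-width bins from MIN to MAX).
MIN_ACT = -0.119  # from "MIN = -0.119"
MAX_ACT = 0.129   # from "MAX = 0.129"
N_BINS = 4        # number of INTERVAL blocks listed

# Recompute the bin edges, rounded to the 3 decimals used in the listing.
width = (MAX_ACT - MIN_ACT) / N_BINS
edges = [round(MIN_ACT + i * width, 3) for i in range(N_BINS + 1)]

# Percentages copied from the CONTAINS lines, keyed by (lower, upper) edge.
contains_percent = {
    (0.067, 0.129): 0.331,
    (0.005, 0.067): 38.907,
    (-0.057, 0.005): 60.251,
    (-0.119, -0.057): 0.511,
}
total_percent = round(sum(contains_percent.values()), 3)

print("edges:", edges)
print("total percent:", total_percent)
```

If the equal-width reading is right, the recomputed edges coincide with the interval boundaries listed above, and the two outer bins together hold under 1% of all activations.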
Token for
Token ID329
Feature activation+1.36e-02

Pos logit contributions
umbs+0.046
beh+0.046
lette+0.044
known+0.042
izza+0.042
Neg logit contributions
 notice-0.021
But-0.021
 stage-0.020
 woke-0.019
 imagined-0.018
Token you
Token ID345
Feature activation+1.02e-02

Pos logit contributions
umbs+0.027
beh+0.026
lette+0.026
icking+0.024
izza+0.024
Neg logit contributions
 notice-0.015
But-0.014
 stage-0.014
 woke-0.013
Who-0.013
Token."
Token ID526
Feature activation+3.27e-02

Pos logit contributions
umbs+0.023
beh+0.022
lette+0.022
izza+0.020
igators+0.020
Neg logit contributions
 notice-0.009
But-0.009
 stage-0.008
 woke-0.008
 imagined-0.008
Token Bill
Token ID3941
Feature activation+4.25e-02

Pos logit contributions
beh+0.071
umbs+0.070
lette+0.068
igators+0.067
icking+0.066
Neg logit contributions
But-0.030
 notice-0.028
 woke-0.026
Who-0.026
 suddenly-0.025
Token gave
Token ID2921
Feature activation+2.81e-02

Pos logit contributions
umbs+0.100
beh+0.100
lette+0.094
 supposed+0.091
igators+0.091
Neg logit contributions
 woke-0.036
 suddenly-0.032
But-0.032
 heard-0.031
 imagined-0.031
Token Bob
Token ID5811
Feature activation+1.29e-01

Pos logit contributions
umbs+0.056
beh+0.054
lette+0.050
izza+0.050
ocado+0.050
Neg logit contributions
 notice-0.033
But-0.032
 suddenly-0.032
 heard-0.031
 woke-0.030
Token the
Token ID262
Feature activation+4.42e-02

Pos logit contributions
beh+0.376
ini+0.371
rol+0.366
anes+0.358
 supposed+0.354
Neg logit contributions
 again-0.055
 heard-0.047
 saw-0.043
,-0.040
 her-0.039
Token bread
Token ID8509
Feature activation+8.31e-02

Pos logit contributions
beh+0.083
umbs+0.073
rol+0.068
igators+0.066
illing+0.065
Neg logit contributions
 notice-0.065
 stage-0.063
But-0.062
 heard-0.061
 dress-0.061
Token,
Token ID11
Feature activation+3.57e-02

Pos logit contributions
beh+0.245
known+0.219
igators+0.217
 supposed+0.216
oun+0.214
Neg logit contributions
,-0.033
 again-0.033
 and-0.028
 dress-0.022
 heard-0.022
Token and
Token ID290
Feature activation+2.48e-02

Pos logit contributions
beh+0.079
umbs+0.076
lette+0.072
igators+0.070
oun+0.070
Neg logit contributions
But-0.036
 notice-0.036
 woke-0.034
 suddenly-0.033
 heard-0.032
Token Bob
Token ID5811
Feature activation+6.42e-02

Pos logit contributions
umbs+0.054
beh+0.051
lette+0.050
izza+0.048
ady+0.047
Neg logit contributions
 notice-0.025
But-0.024
 suddenly-0.024
 woke-0.023
 imagined-0.022
Token agreed
Token ID4987
Feature activation-9.06e-03

Pos logit contributions
umbs+0.094
beh+0.090
igators+0.086
lette+0.085
izza+0.083
Neg logit contributions
 woke-0.027
 heard-0.026
 suddenly-0.026
 saw-0.024
But-0.024
Token and
Token ID290
Feature activation+2.15e-02

Pos logit contributions
 notice+0.009
 stage+0.008
Who+0.008
La+0.008
Stop+0.008
Neg logit contributions
umbs-0.019
lette-0.018
beh-0.018
icking-0.018
izza-0.017
Token they
Token ID484
Feature activation+3.21e-02

Pos logit contributions
umbs+0.048
lette+0.045
beh+0.045
ady+0.043
igators+0.042
Neg logit contributions
 notice-0.021
 suddenly-0.020
 woke-0.020
But-0.020
 stage-0.019
Token gave
Token ID2921
Feature activation+2.27e-02

Pos logit contributions
umbs+0.071
beh+0.067
lette+0.067
ady+0.063
illing+0.063
Neg logit contributions
 woke-0.032
 notice-0.031
 suddenly-0.030
 heard-0.029
 imagined-0.029
Token the
Token ID262
Feature activation+1.03e-02

Pos logit contributions
umbs+0.054
lette+0.051
beh+0.051
illing+0.049
izza+0.049
Neg logit contributions
 notice-0.018
 suddenly-0.016
But-0.016
 woke-0.016
 stage-0.015
Token man
Token ID582
Feature activation+1.28e-01

Pos logit contributions
umbs+0.027
beh+0.026
lette+0.025
igators+0.024
izza+0.024
Neg logit contributions
 notice-0.006
But-0.005
 stage-0.005
 woke-0.005
Who-0.005
Token some
Token ID617
Feature activation+4.38e-02

Pos logit contributions
omp+0.336
umbs+0.336
istent+0.327
ini+0.320
umb+0.320
Neg logit contributions
 again-0.067
 her-0.058
 heard-0.057
 saw-0.049
 woke-0.044
Token of
Token ID286
Feature activation+2.58e-02

Pos logit contributions
beh+0.069
umbs+0.068
lette+0.063
icking+0.059
umb+0.058
Neg logit contributions
 notice-0.068
 stage-0.064
But-0.063
 woke-0.063
 heard-0.063
Token their
Token ID511
Feature activation+3.71e-02

Pos logit contributions
umbs+0.060
beh+0.059
lette+0.058
ocado+0.055
icking+0.055
Neg logit contributions
 notice-0.022
 stage-0.021
But-0.020
 school-0.020
Who-0.018
Token breakfast
Token ID12607
Feature activation+2.36e-02

Pos logit contributions
umbs+0.063
beh+0.060
rol+0.054
lette+0.054
umb+0.054
Neg logit contributions
 notice-0.057
 stage-0.054
 saw-0.054
 heard-0.053
 school-0.053
Token.
Token ID13
Feature activation+1.14e-02

Pos logit contributions
beh+0.057
umbs+0.056
lette+0.055
igators+0.054
known+0.052
Neg logit contributions
 notice-0.015
But-0.014
 stage-0.014
 woke-0.013
 suddenly-0.012
Tokenily
Token ID813
Feature activation-1.26e-03

Pos logit contributions
 notice+0.009
 stage+0.008
But+0.008
 suddenly+0.008
 woke+0.008
Neg logit contributions
umbs-0.009
beh-0.008
lette-0.008
izza-0.008
icking-0.008
Token and
Token ID290
Feature activation+9.70e-03

Pos logit contributions
 notice+0.001
 stage+0.001
But+0.001
Who+0.001
 woke+0.001
Neg logit contributions
umbs-0.003
lette-0.003
beh-0.003
izza-0.003
icking-0.003
Token Ben
Token ID3932
Feature activation+9.94e-03

Pos logit contributions
umbs+0.022
lette+0.021
beh+0.021
izza+0.020
igators+0.020
Neg logit contributions
 notice-0.008
 stage-0.008
But-0.007
 woke-0.007
Who-0.007
Token gave
Token ID2921
Feature activation+3.40e-02

Pos logit contributions
umbs+0.021
beh+0.020
lette+0.020
igators+0.019
izza+0.019
Neg logit contributions
 notice-0.010
But-0.009
 stage-0.009
 woke-0.009
 suddenly-0.008
Token the
Token ID262
Feature activation+1.41e-02

Pos logit contributions
umbs+0.085
lette+0.082
beh+0.081
illing+0.080
izza+0.077
Neg logit contributions
 notice-0.023
 stage-0.023
 again-0.023
 dress-0.021
 school-0.021
Token man
Token ID582
Feature activation+1.27e-01

Pos logit contributions
umbs+0.033
beh+0.032
lette+0.031
igators+0.030
icking+0.029
Neg logit contributions
 notice-0.012
But-0.011
 stage-0.011
 woke-0.010
Who-0.010
Token some
Token ID617
Feature activation+3.49e-02

Pos logit contributions
umbs+0.341
rol+0.334
 supposed+0.334
ini+0.328
istent+0.327
Neg logit contributions
 again-0.065
 her-0.062
 and-0.058
,-0.057
 dress-0.057
Token bread
Token ID8509
Feature activation+6.71e-02

Pos logit contributions
beh+0.065
umbs+0.065
lette+0.061
rol+0.057
bi+0.056
Neg logit contributions
 notice-0.045
 stage-0.044
But-0.043
 dress-0.041
 school-0.041
Token.
Token ID13
Feature activation+9.74e-03

Pos logit contributions
beh+0.202
known+0.187
 highlight+0.183
 supposed+0.182
umbs+0.181
Neg logit contributions
 again-0.015
 dress-0.014
,-0.013
 and-0.009
But-0.008
Token They
Token ID1119
Feature activation+2.16e-02

Pos logit contributions
umbs+0.015
beh+0.015
lette+0.014
icking+0.013
igators+0.013
Neg logit contributions
 notice-0.015
But-0.015
 stage-0.014
 woke-0.014
Who-0.014
Token taped
Token ID36114
Feature activation+5.28e-02

Pos logit contributions
umbs+0.045
beh+0.044
lette+0.044
izza+0.041
igators+0.040
Neg logit contributions
 notice-0.024
But-0.021
 woke-0.021
 imagined-0.021
 suddenly-0.020
Token,
Token ID11
Feature activation+6.35e-03

Pos logit contributions
 notice+0.005
 stage+0.004
Who+0.004
But+0.004
La+0.004
Neg logit contributions
umbs-0.012
lette-0.012
beh-0.011
icking-0.011
izza-0.011
Token and
Token ID290
Feature activation+8.63e-03

Pos logit contributions
umbs+0.015
beh+0.014
lette+0.014
izza+0.013
igators+0.013
Neg logit contributions
 notice-0.005
But-0.005
 stage-0.005
 woke-0.004
Who-0.004
Token they
Token ID484
Feature activation+3.33e-02

Pos logit contributions
umbs+0.019
beh+0.018
lette+0.018
izza+0.017
igators+0.017
Neg logit contributions
 notice-0.008
But-0.008
 stage-0.008
 woke-0.007
 suddenly-0.007
Token gave
Token ID2921
Feature activation+2.88e-02

Pos logit contributions
umbs+0.077
beh+0.075
lette+0.073
izza+0.070
illing+0.070
Neg logit contributions
 woke-0.031
 notice-0.028
 suddenly-0.027
 heard-0.027
 imagined-0.027
Token the
Token ID262
Feature activation+1.42e-02

Pos logit contributions
umbs+0.067
beh+0.064
lette+0.064
illing+0.062
ady+0.060
Neg logit contributions
 notice-0.024
 woke-0.022
 suddenly-0.022
 again-0.021
 heard-0.021
Token man
Token ID582
Feature activation+1.25e-01

Pos logit contributions
umbs+0.030
beh+0.029
lette+0.027
igators+0.026
izza+0.026
Neg logit contributions
 notice-0.016
 stage-0.015
But-0.015
 woke-0.015
Who-0.014
Token some
Token ID617
Feature activation+4.22e-02

Pos logit contributions
umbs+0.323
omp+0.323
istent+0.315
illing+0.307
umb+0.306
Neg logit contributions
 again-0.061
 her-0.057
 heard-0.056
 dress-0.049
 woke-0.049
Token food
Token ID2057
Feature activation+4.24e-02

Pos logit contributions
beh+0.066
umbs+0.064
lette+0.060
icking+0.056
oun+0.056
Neg logit contributions
 notice-0.065
 stage-0.063
 woke-0.062
But-0.061
 heard-0.061
Token and
Token ID290
Feature activation+9.64e-03

Pos logit contributions
beh+0.104
umbs+0.101
lette+0.098
known+0.096
igators+0.094
Neg logit contributions
 notice-0.025
 woke-0.024
But-0.023
 stage-0.022
 again-0.022
Token water
Token ID1660
Feature activation+3.70e-02

Pos logit contributions
umbs+0.018
beh+0.018
lette+0.017
izza+0.016
igators+0.016
Neg logit contributions
 notice-0.012
But-0.011
 stage-0.011
 woke-0.011
Who-0.010
Token.
Token ID13
Feature activation+1.35e-02

Pos logit contributions
beh+0.088
umbs+0.087
lette+0.085
known+0.081
ocado+0.081
Neg logit contributions
 woke-0.025
 notice-0.025
 stage-0.022
 suddenly-0.022
 heard-0.021
Token!
Token ID0
Feature activation+4.70e-02

Pos logit contributions
beh+0.189
umbs+0.189
lette+0.179
wy+0.174
umb+0.173
Neg logit contributions
 again-0.033
,-0.025
 stage-0.025
 suddenly-0.023
But-0.022
Token The
Token ID383
Feature activation+4.85e-02

Pos logit contributions
beh+0.098
lette+0.093
icking+0.091
gary+0.089
wy+0.089
Neg logit contributions
But-0.052
 But-0.048
 notice-0.046
 suddenly-0.045
 stage-0.043
Token old
Token ID1468
Feature activation+2.85e-02

Pos logit contributions
umbs+0.108
beh+0.102
rol+0.093
lette+0.093
ipp+0.089
Neg logit contributions
 stage-0.059
 school-0.058
 saw-0.057
But-0.054
 heard-0.054
Token elephant
Token ID20950
Feature activation+3.94e-02

Pos logit contributions
umbs+0.051
beh+0.047
lette+0.044
izza+0.042
icking+0.041
Neg logit contributions
 stage-0.044
 woke-0.043
 notice-0.042
But-0.042
 school-0.042
Token gave
Token ID2921
Feature activation+4.87e-02

Pos logit contributions
umbs+0.094
beh+0.094
lette+0.088
izza+0.086
igators+0.084
Neg logit contributions
 woke-0.030
 suddenly-0.029
 notice-0.029
 heard-0.028
But-0.028
Token Bob
Token ID5811
Feature activation+1.25e-01

Pos logit contributions
umbs+0.116
beh+0.112
illing+0.109
izza+0.107
lette+0.103
Neg logit contributions
 suddenly-0.039
 notice-0.038
 heard-0.038
 again-0.037
But-0.037
Token a
Token ID257
Feature activation+3.78e-02

Pos logit contributions
beh+0.361
umbs+0.339
rol+0.332
oun+0.330
umm+0.330
Neg logit contributions
 and-0.054
 again-0.047
,-0.040
 heard-0.038
 saw-0.029
Token hug
Token ID16225
Feature activation+1.15e-02

Pos logit contributions
beh+0.078
umbs+0.074
lette+0.071
icking+0.068
igators+0.068
Neg logit contributions
 notice-0.045
 stage-0.044
 woke-0.043
 heard-0.040
But-0.040
Token.
Token ID13
Feature activation+3.92e-02

Pos logit contributions
umbs+0.028
beh+0.027
lette+0.026
izza+0.025
igators+0.025
Neg logit contributions
 notice-0.008
But-0.007
 stage-0.007
 woke-0.007
 suddenly-0.006
Token Bob
Token ID5811
Feature activation+4.37e-02

Pos logit contributions
beh+0.076
lette+0.072
umbs+0.071
icking+0.069
known+0.068
Neg logit contributions
But-0.048
 notice-0.044
 stage-0.043
 suddenly-0.042
Who-0.042
Token was
Token ID373
Feature activation+4.64e-02

Pos logit contributions
umbs+0.111
beh+0.111
lette+0.104
igators+0.102
izza+0.101
Neg logit contributions
 woke-0.028
But-0.025
 suddenly-0.025
 heard-0.024
 imagined-0.024
Token she
Token ID673
Feature activation+1.69e-03

Pos logit contributions
umbs+0.069
beh+0.065
lette+0.064
izza+0.062
icking+0.061
Neg logit contributions
But-0.019
 notice-0.018
 woke-0.017
 suddenly-0.017
 stage-0.016
Token loved
Token ID6151
Feature activation+9.19e-03

Pos logit contributions
umbs+0.003
beh+0.003
lette+0.003
izza+0.003
igators+0.003
Neg logit contributions
 notice-0.002
But-0.002
 stage-0.002
Who-0.002
 woke-0.002
Token to
Token ID284
Feature activation-1.66e-02

Pos logit contributions
umbs+0.019
beh+0.018
lette+0.018
izza+0.017
icking+0.017
Neg logit contributions
 notice-0.010
But-0.009
 stage-0.009
 woke-0.009
Who-0.008
Token give
Token ID1577
Feature activation+4.41e-02

Pos logit contributions
But+0.019
 notice+0.018
Stop+0.018
Who+0.018
 suddenly+0.017
Neg logit contributions
umbs-0.030
lette-0.030
beh-0.029
izza-0.028
igators-0.028
Token the
Token ID262
Feature activation+3.73e-02

Pos logit contributions
umbs+0.106
lette+0.097
umb+0.096
icking+0.096
igators+0.096
Neg logit contributions
 notice-0.034
But-0.031
 school-0.030
 dress-0.030
 stage-0.030
Token boy
Token ID2933
Feature activation+1.24e-01

Pos logit contributions
beh+0.079
umbs+0.079
lette+0.072
rol+0.069
ady+0.069
Neg logit contributions
 stage-0.042
 school-0.041
 dress-0.040
 notice-0.039
 woke-0.038
Token lots
Token ID6041
Feature activation+5.84e-02

Pos logit contributions
umbs+0.334
icking+0.315
omp+0.307
ini+0.304
rol+0.303
Neg logit contributions
 her-0.060
 school-0.054
 heard-0.053
 dress-0.052
 woke-0.047
Token of
Token ID286
Feature activation+3.89e-02

Pos logit contributions
lette+0.095
beh+0.092
icking+0.089
umbs+0.087
bi+0.086
Neg logit contributions
 again-0.083
But-0.080
 stage-0.080
 woke-0.078
 notice-0.078
Token kisses
Token ID36544
Feature activation+1.53e-02

Pos logit contributions
umbs+0.073
lette+0.069
beh+0.067
icking+0.067
ocado+0.063
Neg logit contributions
 notice-0.051
 stage-0.051
 school-0.049
 dress-0.047
But-0.046
Token.
Token ID13
Feature activation+2.18e-02

Pos logit contributions
umbs+0.036
beh+0.036
lette+0.035
igators+0.033
izza+0.033
Neg logit contributions
 notice-0.011
But-0.010
 stage-0.010
 woke-0.010
 suddenly-0.009
Token
Token ID198
Feature activation-2.29e-02

Pos logit contributions
beh+0.050
umbs+0.049
lette+0.048
icking+0.046
ocado+0.045
Neg logit contributions
 notice-0.019
But-0.019
 stage-0.017
 woke-0.017
Who-0.016
Token The
Token ID383
Feature activation+2.38e-02

Pos logit contributions
beh+0.048
umbs+0.044
lette+0.044
icking+0.042
igators+0.041
Neg logit contributions
 notice-0.033
But-0.032
 stage-0.031
Who-0.030
 suddenly-0.030
Token elderly
Token ID13830
Feature activation+1.35e-02

Pos logit contributions
beh+0.046
umbs+0.045
lette+0.042
izza+0.040
igators+0.039
Neg logit contributions
 stage-0.032
 notice-0.031
But-0.030
 suddenly-0.029
 school-0.029
Token man
Token ID582
Feature activation+1.38e-02

Pos logit contributions
umbs+0.036
beh+0.036
lette+0.035
izza+0.034
igators+0.033
Neg logit contributions
 notice-0.006
But-0.006
 stage-0.006
 woke-0.005
Who-0.005
Token was
Token ID373
Feature activation+4.01e-02

Pos logit contributions
umbs+0.031
beh+0.030
lette+0.030
izza+0.028
igators+0.028
Neg logit contributions
 notice-0.012
But-0.011
 woke-0.011
 stage-0.011
 suddenly-0.010
Token giving
Token ID3501
Feature activation+3.86e-02

Pos logit contributions
umbs+0.075
izza+0.072
beh+0.071
ishes+0.067
lette+0.066
Neg logit contributions
 notice-0.055
 suddenly-0.054
But-0.054
 stage-0.053
 heard-0.050
Token daddy
Token ID35822
Feature activation+1.22e-01

Pos logit contributions
umbs+0.082
beh+0.078
lette+0.075
izza+0.074
illing+0.072
Neg logit contributions
 notice-0.039
 suddenly-0.038
 woke-0.037
 heard-0.036
 stage-0.036
Token lots
Token ID6041
Feature activation+6.02e-02

Pos logit contributions
beh+0.302
umbs+0.300
icking+0.296
known+0.287
oun+0.285
Neg logit contributions
 heard-0.080
 woke-0.064
 again-0.062
 dress-0.061
 saw-0.058
Token of
Token ID286
Feature activation+3.95e-02

Pos logit contributions
lette+0.101
beh+0.101
icking+0.095
umbs+0.092
izza+0.092
Neg logit contributions
 stage-0.079
But-0.079
 again-0.079
 notice-0.078
 woke-0.077
Token advice
Token ID5608
Feature activation+3.38e-02

Pos logit contributions
umbs+0.072
beh+0.068
lette+0.067
icking+0.063
umb+0.062
Neg logit contributions
 stage-0.054
 notice-0.053
But-0.049
 dress-0.049
 heard-0.049
Token.
Token ID13
Feature activation+2.35e-02

Pos logit contributions
umbs+0.084
beh+0.081
lette+0.080
izza+0.077
igators+0.076
Neg logit contributions
 notice-0.021
But-0.020
 stage-0.019
 suddenly-0.017
 woke-0.017
Token
Token ID198
Feature activation+2.95e-03

Pos logit contributions
beh+0.052
umbs+0.050
lette+0.050
icking+0.047
igators+0.047
Neg logit contributions
But-0.023
 notice-0.022
Who-0.020
 stage-0.020
 suddenly-0.020
Tokeniz
Token ID528
Feature activation+2.23e-02

Pos logit contributions
umbs+0.005
beh+0.005
lette+0.005
izza+0.004
igators+0.004
Neg logit contributions
 notice-0.010
But-0.010
 stage-0.010
 woke-0.010
Who-0.009
Token thanked
Token ID26280
Feature activation-3.65e-03

Pos logit contributions
umbs+0.050
beh+0.048
lette+0.046
izza+0.045
igators+0.045
Neg logit contributions
 notice-0.019
 woke-0.019
But-0.019
 suddenly-0.018
 heard-0.017
Token him
Token ID683
Feature activation+3.16e-02

Pos logit contributions
 notice+0.005
But+0.004
 stage+0.004
 woke+0.004
 imagined+0.004
Neg logit contributions
umbs-0.006
beh-0.006
lette-0.006
izza-0.006
icking-0.006
Token,
Token ID11
Feature activation+8.56e-03

Pos logit contributions
umbs+0.086
beh+0.085
lette+0.083
igators+0.082
izza+0.080
Neg logit contributions
But-0.013
 again-0.013
 notice-0.012
 stage-0.010
 suddenly-0.009
Token gave
Token ID2921
Feature activation+1.79e-02

Pos logit contributions
umbs+0.019
beh+0.019
lette+0.019
izza+0.018
igators+0.018
Neg logit contributions
 notice-0.007
But-0.007
 stage-0.007
 woke-0.006
Who-0.006
Token John
Token ID1757
Feature activation+1.21e-01

Pos logit contributions
umbs+0.039
beh+0.037
lette+0.036
igators+0.035
ocado+0.035
Neg logit contributions
 notice-0.018
But-0.017
 stage-0.016
 suddenly-0.016
 woke-0.016
Token a
Token ID257
Feature activation+3.97e-02

Pos logit contributions
oke+0.338
ini+0.335
beh+0.334
rol+0.331
umbs+0.327
Neg logit contributions
 again-0.066
 her-0.044
 and-0.041
,-0.028
.-0.026
Token big
Token ID1263
Feature activation+2.71e-02

Pos logit contributions
beh+0.082
umbs+0.075
lette+0.075
igators+0.073
icking+0.071
Neg logit contributions
 notice-0.048
 woke-0.047
 stage-0.044
But-0.042
 heard-0.042
Token hug
Token ID16225
Feature activation+1.29e-02

Pos logit contributions
beh+0.059
umbs+0.057
lette+0.056
igators+0.053
known+0.052
Neg logit contributions
 notice-0.027
 woke-0.026
 stage-0.025
But-0.024
 heard-0.022
Token,
Token ID11
Feature activation+1.14e-02

Pos logit contributions
umbs+0.031
beh+0.030
lette+0.029
igators+0.029
izza+0.028
Neg logit contributions
 notice-0.009
But-0.008
 stage-0.008
 woke-0.008
 suddenly-0.007
Token and
Token ID290
Feature activation+4.29e-03

Pos logit contributions
umbs+0.026
beh+0.025
lette+0.025
igators+0.024
izza+0.024
Neg logit contributions
 notice-0.009
But-0.009
 stage-0.008
 woke-0.008
Who-0.008
Token moment
Token ID2589
Feature activation-4.25e-03

Pos logit contributions
umbs+0.016
beh+0.015
lette+0.015
izza+0.014
igators+0.014
Neg logit contributions
 notice-0.007
But-0.006
 stage-0.006
 woke-0.006
Who-0.006
Token and
Token ID290
Feature activation-1.41e-02

Pos logit contributions
 notice+0.004
 stage+0.003
But+0.003
Who+0.003
Stop+0.003
Neg logit contributions
umbs-0.009
lette-0.009
beh-0.009
izza-0.009
icking-0.009
Token then
Token ID788
Feature activation-1.09e-02

Pos logit contributions
 notice+0.014
 stage+0.013
La+0.012
Stop+0.011
But+0.011
Neg logit contributions
lette-0.029
umbs-0.028
beh-0.028
izza-0.027
icking-0.027
Token gave
Token ID2921
Feature activation+2.50e-02

Pos logit contributions
 notice+0.013
 stage+0.012
But+0.012
Stop+0.011
La+0.011
Neg logit contributions
beh-0.020
umbs-0.020
lette-0.020
izza-0.019
igators-0.019
Token the
Token ID262
Feature activation+2.43e-02

Pos logit contributions
umbs+0.058
beh+0.055
lette+0.054
illing+0.053
ady+0.052
Neg logit contributions
 notice-0.021
 suddenly-0.020
But-0.019
 woke-0.019
 again-0.019
Token deer
Token ID20096
Feature activation+1.21e-01

Pos logit contributions
umbs+0.051
beh+0.050
lette+0.046
izza+0.046
icking+0.045
Neg logit contributions
But-0.027
 notice-0.026
 woke-0.026
 stage-0.025
Who-0.025
Token a
Token ID257
Feature activation+2.85e-02

Pos logit contributions
oke+0.329
umbs+0.326
rol+0.322
oun+0.319
beh+0.316
Neg logit contributions
 again-0.062
 her-0.051
.-0.048
,-0.046
 something-0.045
Token big
Token ID1263
Feature activation+2.33e-02

Pos logit contributions
beh+0.052
umbs+0.049
lette+0.047
igators+0.045
icking+0.045
Neg logit contributions
 notice-0.040
 woke-0.038
 stage-0.037
But-0.037
 heard-0.036
Token,
Token ID11
Feature activation+1.37e-02

Pos logit contributions
beh+0.043
umbs+0.042
lette+0.040
igators+0.038
icking+0.037
Neg logit contributions
 notice-0.030
 woke-0.029
 stage-0.029
But-0.028
 heard-0.027
Token juicy
Token ID35075
Feature activation+5.97e-03

Pos logit contributions
umbs+0.016
beh+0.015
lette+0.015
izza+0.013
icking+0.013
Neg logit contributions
 notice-0.027
But-0.026
 stage-0.026
 woke-0.025
Who-0.025
Token apple
Token ID17180
Feature activation+5.15e-02

Pos logit contributions
umbs+0.008
beh+0.007
lette+0.007
izza+0.007
igators+0.007
Neg logit contributions
 notice-0.011
But-0.010
 stage-0.010
 woke-0.010
Who-0.010
Token agreed
Token ID4987
Feature activation+2.00e-03

Pos logit contributions
umbs+0.071
beh+0.068
lette+0.067
izza+0.063
ocado+0.062
Neg logit contributions
 notice-0.031
 woke-0.030
 heard-0.029
 suddenly-0.029
 imagined-0.029
Token,
Token ID11
Feature activation+3.73e-02

Pos logit contributions
umbs+0.004
beh+0.004
lette+0.004
izza+0.004
icking+0.004
Neg logit contributions
 notice-0.002
But-0.002
 stage-0.002
Who-0.001
 woke-0.001
Token and
Token ID290
Feature activation+2.92e-02

Pos logit contributions
umbs+0.094
beh+0.092
lette+0.089
igators+0.087
ocado+0.087
Neg logit contributions
But-0.024
 notice-0.024
 stage-0.021
 suddenly-0.020
 woke-0.020
Token gave
Token ID2921
Feature activation+4.06e-02

Pos logit contributions
umbs+0.066
beh+0.062
lette+0.062
ady+0.059
izza+0.059
Neg logit contributions
 notice-0.027
 suddenly-0.026
But-0.025
 woke-0.025
 stage-0.024
Token the
Token ID262
Feature activation+3.62e-02

Pos logit contributions
umbs+0.100
beh+0.094
lette+0.091
illing+0.090
ady+0.090
Neg logit contributions
 notice-0.028
 suddenly-0.028
 woke-0.026
But-0.026
 heard-0.026
Token farmer
Token ID18739
Feature activation+1.20e-01

Pos logit contributions
beh+0.073
umbs+0.069
rol+0.062
izza+0.060
icking+0.060
Neg logit contributions
 notice-0.046
But-0.045
 stage-0.045
 heard-0.044
 woke-0.044
Token some
Token ID617
Feature activation+5.11e-02

Pos logit contributions
umbs+0.324
beh+0.316
illing+0.301
 supposed+0.298
rol+0.296
Neg logit contributions
 again-0.058
,-0.048
 heard-0.048
 woke-0.040
 saw-0.040
Token of
Token ID286
Feature activation+5.35e-02

Pos logit contributions
beh+0.084
umbs+0.080
lette+0.074
icking+0.072
oun+0.070
Neg logit contributions
 notice-0.077
But-0.073
 stage-0.073
 woke-0.072
 heard-0.072
Token their
Token ID511
Feature activation+4.39e-02

Pos logit contributions
umbs+0.140
beh+0.138
lette+0.131
ocado+0.130
icking+0.129
Neg logit contributions
 notice-0.033
 stage-0.032
But-0.029
 school-0.029
 suddenly-0.023
Token eggs
Token ID9653
Feature activation+7.59e-02

Pos logit contributions
umbs+0.075
beh+0.073
rol+0.065
umb+0.064
lette+0.064
Neg logit contributions
 notice-0.066
 stage-0.064
 saw-0.063
 woke-0.062
 school-0.062
Token,
Token ID11
Feature activation+4.19e-02

Pos logit contributions
beh+0.219
igators+0.208
umbs+0.205
known+0.203
ocado+0.201
Neg logit contributions
 again-0.017
,-0.017
 suddenly-0.013
 dress-0.011
 notice-0.009
Token
Token ID198
Feature activation-1.34e-02

Pos logit contributions
 notice+0.001
 stage+0.001
But+0.001
 woke+0.001
Who+0.001
Neg logit contributions
umbs-0.002
beh-0.002
lette-0.002
izza-0.002
igators-0.002
Token
Token ID198
Feature activation+1.16e-02

Pos logit contributions
 notice+0.016
 stage+0.015
 imagined+0.015
Who+0.015
 woke+0.014
Neg logit contributions
umbs-0.025
beh-0.023
lette-0.023
izza-0.023
o-0.023
TokenTom
Token ID13787
Feature activation+3.20e-02

Pos logit contributions
umbs+0.028
lette+0.027
beh+0.027
icking+0.026
igators+0.025
Neg logit contributions
But-0.009
 notice-0.009
 stage-0.007
Who-0.007
 woke-0.007
Token gave
Token ID2921
Feature activation+2.39e-02

Pos logit contributions
umbs+0.081
beh+0.078
lette+0.075
known+0.071
igators+0.071
Neg logit contributions
 woke-0.022
 imagined-0.019
 heard-0.019
 notice-0.019
But-0.018
Token the
Token ID262
Feature activation+4.37e-02

Pos logit contributions
umbs+0.055
beh+0.053
lette+0.053
illing+0.050
igators+0.050
Neg logit contributions
 notice-0.020
But-0.019
 stage-0.018
 suddenly-0.018
Who-0.018
Token man
Token ID582
Feature activation+1.20e-01

Pos logit contributions
beh+0.115
lette+0.099
umbs+0.095
rol+0.092
illing+0.092
Neg logit contributions
 woke-0.043
 stage-0.042
 dress-0.040
 saw-0.039
 man-0.038
Token a
Token ID257
Feature activation+3.25e-02

Pos logit contributions
umbs+0.336
rol+0.322
beh+0.321
ini+0.320
 supposed+0.318
Neg logit contributions
 again-0.052
 heard-0.046
,-0.044
 and-0.040
 woke-0.040
Token new
Token ID649
Feature activation+1.53e-02

Pos logit contributions
beh+0.066
umbs+0.063
lette+0.061
igators+0.058
icking+0.058
Neg logit contributions
 notice-0.038
 woke-0.038
 stage-0.037
But-0.034
 dress-0.034
Token blanket
Token ID18447
Feature activation+4.50e-02

Pos logit contributions
umbs+0.032
beh+0.032
lette+0.030
izza+0.029
igators+0.029
Neg logit contributions
 notice-0.016
 stage-0.015
But-0.015
 woke-0.014
 suddenly-0.013
Token and
Token ID290
Feature activation-1.00e-02

Pos logit contributions
beh+0.120
umbs+0.114
lette+0.111
igators+0.110
izza+0.108
Neg logit contributions
 woke-0.022
 again-0.019
 stage-0.019
 suddenly-0.019
But-0.018
Token wished
Token ID16555
Feature activation-1.04e-02

Pos logit contributions
 notice+0.012
But+0.011
 stage+0.011
Stop+0.011
Who+0.010
Neg logit contributions
umbs-0.018
beh-0.018
lette-0.018
izza-0.017
igators-0.017
Token.
Token ID13
Feature activation-7.64e-03

Pos logit contributions
beh+0.097
lette+0.096
umbs+0.094
known+0.091
icking+0.090
Neg logit contributions
 notice-0.018
But-0.015
 woke-0.015
 stage-0.014
 again-0.014
Token Lily
Token ID20037
Feature activation+1.07e-02

Pos logit contributions
 notice+0.012
 stage+0.011
But+0.010
 woke+0.010
 imagined+0.010
Neg logit contributions
umbs-0.012
beh-0.011
lette-0.011
izza-0.011
igators-0.010
Token and
Token ID290
Feature activation+7.04e-03

Pos logit contributions
umbs+0.026
beh+0.025
lette+0.025
izza+0.024
icking+0.023
Neg logit contributions
 notice-0.008
But-0.007
 stage-0.007
 woke-0.006
 imagined-0.006
Token Tom
Token ID4186
Feature activation+4.63e-02

Pos logit contributions
umbs+0.014
beh+0.014
lette+0.014
izza+0.013
igators+0.013
Neg logit contributions
 notice-0.008
But-0.007
 stage-0.007
 woke-0.007
Who-0.007
Token give
Token ID1577
Feature activation+3.40e-02

Pos logit contributions
beh+0.099
umbs+0.099
lette+0.095
izza+0.092
icking+0.092
Neg logit contributions
 notice-0.055
 woke-0.046
 heard-0.044
But-0.044
 imagined-0.044
Token him
Token ID683
Feature activation+1.20e-01

Pos logit contributions
umbs+0.078
lette+0.075
beh+0.075
illing+0.074
ady+0.073
Neg logit contributions
 woke-0.028
 notice-0.028
 suddenly-0.027
But-0.025
 heard-0.025
Token some
Token ID617
Feature activation+2.50e-02

Pos logit contributions
ini+0.279
istent+0.274
icking+0.271
umbs+0.270
illing+0.270
Neg logit contributions
 dress-0.084
 woke-0.083
 heard-0.082
 shoes-0.079
 her-0.076
Token treats
Token ID18432
Feature activation+4.04e-02

Pos logit contributions
beh+0.034
umbs+0.033
lette+0.032
icking+0.030
known+0.029
Neg logit contributions
 notice-0.043
But-0.042
 stage-0.041
 woke-0.041
 heard-0.039
Token.
Token ID13
Feature activation-1.84e-03

Pos logit contributions
beh+0.106
umbs+0.103
known+0.099
lette+0.099
ady+0.097
Neg logit contributions
But-0.019
 notice-0.018
 again-0.017
 woke-0.016
 suddenly-0.015
Token They
Token ID1119
Feature activation+3.23e-02

Pos logit contributions
 notice+0.003
 stage+0.003
But+0.003
 woke+0.003
Who+0.003
Neg logit contributions
umbs-0.003
beh-0.003
lette-0.003
izza-0.002
igators-0.002
Token pet
Token ID4273
Feature activation+2.29e-02

Pos logit contributions
umbs+0.069
beh+0.069
lette+0.067
icking+0.064
izza+0.063
Neg logit contributions
 notice-0.036
 stage-0.031
 woke-0.030
But-0.030
 imagined-0.029
Token So
Token ID1406
Feature activation-1.76e-02

Pos logit contributions
 notice+0.007
 stage+0.007
But+0.007
Who+0.006
 woke+0.006
Neg logit contributions
umbs-0.009
lette-0.008
beh-0.008
izza-0.008
icking-0.008
Token,
Token ID11
Feature activation-3.14e-03

Pos logit contributions
 notice+0.018
La+0.016
 stage+0.016
Who+0.016
Stop+0.016
Neg logit contributions
umbs-0.033
lette-0.033
beh-0.033
icking-0.032
o-0.031
Token she
Token ID673
Feature activation+1.35e-03

Pos logit contributions
 notice+0.003
But+0.003
 stage+0.003
 woke+0.002
 imagined+0.002
Neg logit contributions
umbs-0.007
beh-0.007
lette-0.006
izza-0.006
igators-0.006
Token gave
Token ID2921
Feature activation+2.48e-02

Pos logit contributions
umbs+0.003
beh+0.003
lette+0.003
izza+0.002
igators+0.002
Neg logit contributions
 notice-0.002
But-0.001
 stage-0.001
Who-0.001
 woke-0.001
Token the
Token ID262
Feature activation+1.87e-02

Pos logit contributions
umbs+0.060
beh+0.058
lette+0.058
illing+0.056
ady+0.056
Neg logit contributions
 notice-0.018
 suddenly-0.017
 stage-0.017
But-0.017
 woke-0.016
Token bear
Token ID6842
Feature activation+1.19e-01

Pos logit contributions
beh+0.038
umbs+0.038
lette+0.037
izza+0.034
illing+0.034
Neg logit contributions
 stage-0.020
 notice-0.020
 woke-0.020
But-0.020
Who-0.018
Token a
Token ID257
Feature activation+2.64e-02

Pos logit contributions
beh+0.316
rol+0.302
 fastest+0.300
omp+0.300
oun+0.297
Neg logit contributions
 again-0.054
 her-0.053
 heard-0.053
 and-0.052
 woke-0.051
Token sandwich
Token ID20433
Feature activation+3.12e-02

Pos logit contributions
beh+0.050
umbs+0.048
lette+0.046
icking+0.044
igators+0.043
Neg logit contributions
 notice-0.035
 woke-0.034
 stage-0.033
But-0.032
 heard-0.031
Token.
Token ID13
Feature activation+1.51e-02

Pos logit contributions
beh+0.079
umbs+0.075
lette+0.073
igators+0.073
known+0.072
Neg logit contributions
But-0.017
 notice-0.017
 woke-0.017
 stage-0.015
 dress-0.015
Token\n
Token ID198
Feature activation-1.74e-02

Pos logit contributions
beh+0.028
umbs+0.028
lette+0.027
igators+0.026
icking+0.025
Neg logit contributions
 notice-0.019
But-0.019
 stage-0.017
 woke-0.017
Who-0.017
Token\n
Token ID198
Feature activation+7.24e-03

Pos logit contributions
 notice+0.022
 stage+0.021
 imagined+0.020
Who+0.020
 woke+0.019
Neg logit contributions
umbs-0.032
lette-0.028
o-0.028
beh-0.028
izza-0.028
Token away
Token ID1497
Feature activation+2.60e-03

Pos logit contributions
umbs+0.202
gary+0.201
beh+0.200
izza+0.194
wy+0.193
Neg logit contributions
 again-0.030
 notice-0.029
 heard-0.029
 woke-0.029
 and-0.026
Token.
Token ID13
Feature activation-2.68e-03

Pos logit contributions
umbs+0.006
beh+0.005
lette+0.005
izza+0.005
igators+0.005
Neg logit contributions
 notice-0.002
But-0.002
 stage-0.002
 woke-0.002
Who-0.002
Token He
Token ID679
Feature activation-8.85e-03

Pos logit contributions
 notice+0.004
But+0.004
 stage+0.004
 woke+0.004
 imagined+0.004
Neg logit contributions
umbs-0.004
beh-0.004
lette-0.004
izza-0.004
igators-0.004
Token gave
Token ID2921
Feature activation+2.57e-02

Pos logit contributions
 notice+0.010
But+0.010
 stage+0.010
Who+0.009
Stop+0.009
Neg logit contributions
umbs-0.017
beh-0.016
lette-0.016
igators-0.015
izza-0.015
Token the
Token ID262
Feature activation+2.88e-02

Pos logit contributions
umbs+0.062
beh+0.059
lette+0.059
illing+0.057
izza+0.056
Neg logit contributions
 notice-0.019
But-0.018
 stage-0.017
 suddenly-0.017
 woke-0.017
Token children
Token ID1751
Feature activation+1.19e-01

Pos logit contributions
beh+0.049
umbs+0.047
lette+0.043
igators+0.042
izza+0.041
Neg logit contributions
 notice-0.042
But-0.042
 woke-0.041
 stage-0.041
Who-0.040
Token some
Token ID617
Feature activation+3.44e-02

Pos logit contributions
umbs+0.328
ini+0.316
umm+0.311
lette+0.301
oun+0.301
Neg logit contributions
 her-0.062
 dress-0.055
 heard-0.046
,-0.045
 school-0.045
Token corn
Token ID11676
Feature activation+4.86e-02

Pos logit contributions
umbs+0.053
beh+0.052
lette+0.050
icking+0.046
ocado+0.045
Neg logit contributions
 notice-0.053
 stage-0.051
But-0.050
 woke-0.050
 dress-0.049
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
beh+0.127
umbs+0.121
known+0.117
lette+0.115
gary+0.114
Neg logit contributions
 again-0.023
 woke-0.022
 notice-0.022
 stage-0.021
But-0.021
Token He
Token ID679
Feature activation-4.58e-03

Pos logit contributions
 notice+0.018
 stage+0.017
 woke+0.017
 imagined+0.017
But+0.017
Neg logit contributions
umbs-0.016
lette-0.014
izza-0.014
beh-0.014
igators-0.013
Token told
Token ID1297
Feature activation-1.81e-03

Pos logit contributions
 notice+0.005
But+0.005
 stage+0.005
Who+0.005
Stop+0.005
Neg logit contributions
umbs-0.009
beh-0.008
lette-0.008
izza-0.008
igators-0.008
Token you
Token ID345
Feature activation+4.39e-02

Pos logit contributions
beh+0.068
umbs+0.068
lette+0.065
ady+0.063
known+0.063
Neg logit contributions
But-0.026
 notice-0.025
 stage-0.024
 woke-0.023
 suddenly-0.022
Token."
Token ID526
Feature activation+3.13e-02

Pos logit contributions
beh+0.094
umbs+0.092
known+0.086
lette+0.086
ady+0.084
Neg logit contributions
But-0.042
 woke-0.039
 notice-0.039
 stage-0.037
?"-0.036
Token He
Token ID679
Feature activation+1.84e-02

Pos logit contributions
lette+0.068
beh+0.067
umbs+0.065
igators+0.063
ocado+0.062
Neg logit contributions
But-0.030
 notice-0.028
 woke-0.026
 suddenly-0.025
-0.025
Token gave
Token ID2921
Feature activation+3.03e-02

Pos logit contributions
umbs+0.037
beh+0.036
lette+0.035
igators+0.033
izza+0.033
Neg logit contributions
 notice-0.020
But-0.020
 woke-0.019
 suddenly-0.019
 stage-0.019
Token Tim
Token ID5045
Feature activation+7.63e-02

Pos logit contributions
umbs+0.070
lette+0.066
beh+0.065
illing+0.065
ocado+0.064
Neg logit contributions
 suddenly-0.026
 notice-0.025
 again-0.025
But-0.025
 woke-0.023
Tokenmy
Token ID1820
Feature activation+1.19e-01

Pos logit contributions
beh+0.140
umbs+0.130
igators+0.126
itt+0.125
illing+0.124
Neg logit contributions
 again-0.107
 heard-0.102
 woke-0.100
 saw-0.098
But-0.088
Token a
Token ID257
Feature activation+3.57e-02

Pos logit contributions
beh+0.320
rol+0.301
oke+0.301
 supposed+0.298
behavior+0.295
Neg logit contributions
 again-0.063
 woke-0.060
 heard-0.058
 her-0.054
 and-0.053
Token sandwich
Token ID20433
Feature activation+3.57e-02

Pos logit contributions
beh+0.072
umbs+0.065
lette+0.063
igators+0.062
icking+0.062
Neg logit contributions
 notice-0.045
 woke-0.044
 stage-0.042
 heard-0.041
But-0.041
Token and
Token ID290
Feature activation+8.93e-04

Pos logit contributions
beh+0.095
umbs+0.088
igators+0.086
lette+0.086
known+0.086
Neg logit contributions
But-0.018
 woke-0.017
 notice-0.017
 stage-0.015
 dress-0.015
Token some
Token ID617
Feature activation+3.31e-02

Pos logit contributions
umbs+0.002
beh+0.002
lette+0.002
izza+0.002
igators+0.001
Neg logit contributions
 notice-0.001
But-0.001
 stage-0.001
Who-0.001
 woke-0.001
Token juice
Token ID13135
Feature activation+3.12e-02

Pos logit contributions
beh+0.047
umbs+0.045
lette+0.043
icking+0.040
known+0.039
Neg logit contributions
 notice-0.056
But-0.054
 woke-0.053
 stage-0.053
 suddenly-0.052
Token man
Token ID582
Feature activation+4.07e-02

Pos logit contributions
umbs+0.027
beh+0.026
lette+0.026
izza+0.024
igators+0.024
Neg logit contributions
 notice-0.011
But-0.011
 stage-0.010
Who-0.010
 woke-0.010
Token smiled
Token ID13541
Feature activation-3.24e-03

Pos logit contributions
umbs+0.103
beh+0.100
lette+0.095
igators+0.093
 supposed+0.093
Neg logit contributions
 suddenly-0.027
 woke-0.026
But-0.026
 heard-0.026
 notice-0.023
Token and
Token ID290
Feature activation+1.05e-02

Pos logit contributions
 notice+0.003
 stage+0.003
But+0.003
Who+0.003
Stop+0.003
Neg logit contributions
umbs-0.007
lette-0.007
beh-0.007
icking-0.006
izza-0.006
Token gave
Token ID2921
Feature activation+2.77e-02

Pos logit contributions
umbs+0.023
beh+0.022
lette+0.022
izza+0.021
igators+0.021
Neg logit contributions
 notice-0.009
But-0.009
 stage-0.009
 woke-0.008
 suddenly-0.008
Token the
Token ID262
Feature activation+2.93e-02

Pos logit contributions
umbs+0.067
beh+0.063
lette+0.062
illing+0.060
ady+0.059
Neg logit contributions
 notice-0.022
 suddenly-0.020
But-0.020
 woke-0.019
 again-0.019
Token owl
Token ID39610
Feature activation+1.19e-01

Pos logit contributions
beh+0.068
umbs+0.067
rol+0.063
lette+0.061
igators+0.061
Neg logit contributions
 notice-0.026
But-0.025
 woke-0.025
Who-0.025
 heard-0.024
Token a
Token ID257
Feature activation+3.20e-02

Pos logit contributions
umbs+0.314
beh+0.308
oun+0.306
rol+0.303
icking+0.300
Neg logit contributions
 again-0.065
,-0.051
 her-0.049
.-0.047
 heard-0.042
Token piece
Token ID3704
Feature activation+6.26e-02

Pos logit contributions
beh+0.059
umbs+0.056
lette+0.054
igators+0.053
icking+0.052
Neg logit contributions
 notice-0.044
 woke-0.041
But-0.041
 stage-0.039
 heard-0.039
Token of
Token ID286
Feature activation+2.89e-02

Pos logit contributions
beh+0.111
ishes+0.100
umbs+0.098
igators+0.096
lette+0.096
Neg logit contributions
 again-0.088
But-0.084
 woke-0.082
 suddenly-0.081
 notice-0.081
Token the
Token ID262
Feature activation+2.93e-02

Pos logit contributions
beh+0.043
umbs+0.043
lette+0.039
icking+0.038
ocado+0.037
Neg logit contributions
 notice-0.048
But-0.048
 stage-0.044
 woke-0.044
 suddenly-0.044
Token apple
Token ID17180
Feature activation+4.38e-02

Pos logit contributions
beh+0.043
umbs+0.042
igators+0.038
lette+0.038
rol+0.036
Neg logit contributions
 notice-0.049
But-0.048
 woke-0.046
 suddenly-0.045
Who-0.045
Token smiled
Token ID13541
Feature activation-1.26e-02

Pos logit contributions
 notice+0.025
 stage+0.024
Stop+0.022
But+0.022
Who+0.021
Neg logit contributions
lette-0.039
umbs-0.038
beh-0.037
izza-0.036
ocado-0.036
Token,
Token ID11
Feature activation-4.75e-03

Pos logit contributions
 notice+0.013
 stage+0.012
Stop+0.012
Who+0.012
But+0.012
Neg logit contributions
umbs-0.025
lette-0.025
beh-0.024
icking-0.024
izza-0.023
Token giving
Token ID3501
Feature activation+1.54e-02

Pos logit contributions
 notice+0.006
 stage+0.005
But+0.005
Stop+0.005
Who+0.005
Neg logit contributions
umbs-0.009
lette-0.008
beh-0.008
izza-0.008
icking-0.008
Token his
Token ID465
Feature activation+1.65e-02

Pos logit contributions
umbs+0.031
beh+0.029
lette+0.028
igators+0.027
izza+0.027
Neg logit contributions
 notice-0.018
But-0.017
 stage-0.017
 woke-0.016
 suddenly-0.016
Token grand
Token ID4490
Feature activation+3.05e-02

Pos logit contributions
umbs+0.029
beh+0.027
lette+0.026
igators+0.024
icking+0.024
Neg logit contributions
 notice-0.024
 stage-0.023
But-0.023
 woke-0.022
 school-0.022
Tokenpa
Token ID8957
Feature activation+1.19e-01

Pos logit contributions
umbs+0.057
beh+0.053
lette+0.049
igators+0.049
ocado+0.049
Neg logit contributions
 notice-0.038
 stage-0.037
 woke-0.037
But-0.036
 imagined-0.035
Token a
Token ID257
Feature activation+3.09e-02

Pos logit contributions
umbs+0.299
oun+0.282
rol+0.282
beh+0.276
umb+0.275
Neg logit contributions
 heard-0.091
 again-0.085
 woke-0.073
 saw-0.072
,-0.069
Token big
Token ID1263
Feature activation+1.34e-02

Pos logit contributions
beh+0.060
umbs+0.060
lette+0.056
igators+0.055
icking+0.054
Neg logit contributions
 notice-0.039
 stage-0.036
 woke-0.036
But-0.035
 heard-0.034
Token hug
Token ID16225
Feature activation+1.20e-02

Pos logit contributions
umbs+0.027
beh+0.027
lette+0.026
igators+0.025
izza+0.024
Neg logit contributions
 notice-0.015
 stage-0.014
But-0.013
 woke-0.013
 suddenly-0.012
Token.
Token ID13
Feature activation+3.21e-03

Pos logit contributions
umbs+0.029
beh+0.028
lette+0.027
igators+0.026
izza+0.026
Neg logit contributions
 notice-0.008
But-0.008
 stage-0.007
 woke-0.007
 suddenly-0.007
Token<|endoftext|>
Token ID50256
Feature activation-1.85e-02

Pos logit contributions
umbs+0.006
beh+0.006
lette+0.006
izza+0.005
igators+0.005
Neg logit contributions
 notice-0.004
But-0.004
 stage-0.004
 woke-0.004
Who-0.004
Token others
Token ID1854
Feature activation+3.68e-02

Pos logit contributions
umbs+0.006
beh+0.006
lette+0.006
izza+0.006
igators+0.006
Neg logit contributions
 notice-0.005
But-0.004
 stage-0.004
 woke-0.004
Who-0.004
Token.
Token ID13
Feature activation+2.75e-02

Pos logit contributions
umbs+0.097
beh+0.092
lette+0.091
izza+0.088
known+0.087
Neg logit contributions
 notice-0.019
But-0.018
 stage-0.017
 woke-0.016
 imagined-0.016
Token They
Token ID1119
Feature activation+1.87e-02

Pos logit contributions
beh+0.052
umbs+0.050
lette+0.049
icking+0.045
allows+0.044
Neg logit contributions
 notice-0.036
But-0.034
 stage-0.033
 woke-0.032
 suddenly-0.032
Token gave
Token ID2921
Feature activation+4.29e-02

Pos logit contributions
umbs+0.039
beh+0.037
lette+0.036
izza+0.035
igators+0.033
Neg logit contributions
 notice-0.021
 woke-0.019
But-0.019
 imagined-0.019
 stage-0.019
Token each
Token ID1123
Feature activation+4.54e-02

Pos logit contributions
umbs+0.103
beh+0.097
lette+0.094
illing+0.092
umb+0.092
Neg logit contributions
 notice-0.035
 heard-0.031
 suddenly-0.031
 stage-0.031
 again-0.031
Token person
Token ID1048
Feature activation+1.18e-01

Pos logit contributions
beh+0.101
icking+0.096
umbs+0.095
lette+0.091
rol+0.090
Neg logit contributions
 notice-0.047
But-0.045
 heard-0.045
 woke-0.043
 school-0.043
Token a
Token ID257
Feature activation+1.85e-02

Pos logit contributions
umbs+0.301
unch+0.274
beh+0.268
umb+0.268
omp+0.267
Neg logit contributions
 heard-0.084
 saw-0.078
 dress-0.074
 woke-0.069
,-0.068
Token medal
Token ID18279
Feature activation+3.79e-02

Pos logit contributions
umbs+0.034
beh+0.033
lette+0.032
igators+0.030
icking+0.030
Neg logit contributions
 notice-0.024
 stage-0.023
But-0.022
 woke-0.022
Who-0.021
Token to
Token ID284
Feature activation-2.21e-02

Pos logit contributions
umbs+0.091
beh+0.090
lette+0.085
allows+0.082
igators+0.082
Neg logit contributions
 stage-0.026
 notice-0.025
 woke-0.024
But-0.024
 imagined-0.023
Token show
Token ID905
Feature activation+5.80e-03

Pos logit contributions
 notice+0.028
But+0.028
 woke+0.027
Who+0.027
 stage+0.027
Neg logit contributions
umbs-0.036
lette-0.036
beh-0.036
igators-0.034
izza-0.034
Token that
Token ID326
Feature activation+2.01e-02

Pos logit contributions
umbs+0.012
beh+0.011
lette+0.011
izza+0.010
igators+0.010
Neg logit contributions
 notice-0.007
But-0.006
 stage-0.006
 woke-0.006
Who-0.006
Token\n
Token ID198
Feature activation+8.44e-05

Pos logit contributions
 notice+0.020
 stage+0.020
 imagined+0.019
Stop+0.018
Who+0.018
Neg logit contributions
umbs-0.029
lette-0.026
o-0.026
izza-0.026
beh-0.026
TokenMom
Token ID29252
Feature activation+1.37e-02

Pos logit contributions
umbs+0.000
lette+0.000
beh+0.000
izza+0.000
igators+0.000
Neg logit contributions
 notice-0.000
 stage-0.000
But-0.000
 woke-0.000
Who-0.000
Token nodded
Token ID14464
Feature activation-2.27e-02

Pos logit contributions
umbs+0.032
beh+0.031
lette+0.030
igators+0.029
izza+0.029
Neg logit contributions
 notice-0.011
But-0.010
 woke-0.010
 stage-0.010
 suddenly-0.009
Token and
Token ID290
Feature activation+2.43e-03

Pos logit contributions
 stage+0.025
 notice+0.025
Stop+0.024
 concert+0.022
But+0.022
Neg logit contributions
umbs-0.043
lette-0.043
icking-0.041
allows-0.040
beh-0.040
Token gave
Token ID2921
Feature activation+2.92e-02

Pos logit contributions
umbs+0.005
beh+0.005
lette+0.005
izza+0.005
igators+0.005
Neg logit contributions
 notice-0.002
But-0.002
 stage-0.002
 woke-0.002
Who-0.002
Token them
Token ID606
Feature activation+1.18e-01

Pos logit contributions
beh+0.071
umbs+0.071
lette+0.070
illing+0.067
ady+0.066
Neg logit contributions
 notice-0.020
 again-0.018
 suddenly-0.018
 stage-0.018
Stop-0.018
Token some
Token ID617
Feature activation+4.87e-02

Pos logit contributions
beh+0.338
ini+0.334
 supposed+0.327
umbs+0.324
known+0.322
Neg logit contributions
 her-0.055
 again-0.047
 dress-0.041
 it-0.036
,-0.035
Token bread
Token ID8509
Feature activation+6.50e-02

Pos logit contributions
beh+0.080
umbs+0.074
lette+0.069
known+0.067
icking+0.067
Neg logit contributions
 notice-0.072
But-0.072
 dress-0.069
 stage-0.068
 heard-0.068
Token cr
Token ID1067
Feature activation+8.94e-02

Pos logit contributions
beh+0.183
known+0.173
umbs+0.166
 highlight+0.163
igators+0.163
Neg logit contributions
 dress-0.022
,-0.021
But-0.021
 suddenly-0.018
 again-0.017
Tokenumbs
Token ID18146
Feature activation+4.55e-02

Pos logit contributions
beh+0.094
 highlight+0.090
igators+0.089
known+0.084
 strongest+0.083
Neg logit contributions
But-0.159
ro-0.155
 ti-0.154
 dress-0.152
 woke-0.152
Token from
Token ID422
Feature activation-6.19e-03

Pos logit contributions
beh+0.126
lette+0.118
known+0.118
umbs+0.117
izza+0.114
Neg logit contributions
 notice-0.018
 again-0.015
 woke-0.015
But-0.014
 suddenly-0.013
Token nodded
Token ID14464
Feature activation-1.24e-02

Pos logit contributions
umbs+0.071
beh+0.069
lette+0.066
izza+0.064
ocado+0.064
Neg logit contributions
 notice-0.024
But-0.023
 suddenly-0.022
 woke-0.022
 stage-0.021
Token eagerly
Token ID30130
Feature activation+1.30e-02

Pos logit contributions
 notice+0.012
 stage+0.012
But+0.011
Stop+0.011
Who+0.010
Neg logit contributions
umbs-0.025
lette-0.025
beh-0.024
icking-0.023
allows-0.023
Token.
Token ID13
Feature activation+9.10e-03

Pos logit contributions
umbs+0.032
beh+0.031
lette+0.030
igators+0.029
izza+0.029
Neg logit contributions
 notice-0.009
But-0.008
 stage-0.008
 woke-0.008
 suddenly-0.007
Token Mum
Token ID38367
Feature activation+5.53e-02

Pos logit contributions
umbs+0.017
beh+0.016
lette+0.016
igators+0.015
izza+0.015
Neg logit contributions
 notice-0.012
But-0.011
 stage-0.011
 woke-0.011
Who-0.011
Token gave
Token ID2921
Feature activation+4.10e-02

Pos logit contributions
beh+0.143
umbs+0.143
 ours+0.131
lette+0.130
igators+0.129
Neg logit contributions
 woke-0.036
 heard-0.034
 suddenly-0.033
 saw-0.032
 imagined-0.030
Token them
Token ID606
Feature activation+1.18e-01

Pos logit contributions
beh+0.098
umbs+0.097
illing+0.090
lette+0.089
ocado+0.088
Neg logit contributions
 notice-0.031
 heard-0.030
 suddenly-0.030
 stage-0.030
Stop-0.029
Token some
Token ID617
Feature activation+5.03e-02

Pos logit contributions
beh+0.313
umbs+0.308
oke+0.308
 supposed+0.302
ini+0.297
Neg logit contributions
 her-0.059
 dress-0.055
 again-0.054
 school-0.047
 shoes-0.046
Token food
Token ID2057
Feature activation+5.75e-02

Pos logit contributions
beh+0.088
umbs+0.081
lette+0.076
oun+0.072
icking+0.072
Neg logit contributions
 notice-0.071
But-0.069
 heard-0.069
 stage-0.068
 woke-0.068
Token and
Token ID290
Feature activation+2.46e-02

Pos logit contributions
beh+0.161
umbs+0.152
known+0.148
lette+0.145
igators+0.143
Neg logit contributions
But-0.018
 woke-0.017
 again-0.017
 stage-0.017
 notice-0.016
Token they
Token ID484
Feature activation+4.99e-02

Pos logit contributions
umbs+0.050
beh+0.048
lette+0.047
igators+0.044
oun+0.044
Neg logit contributions
 notice-0.027
 woke-0.026
 suddenly-0.025
 stage-0.025
But-0.025
Token sprinkled
Token ID49692
Feature activation+4.50e-02

Pos logit contributions
beh+0.110
umbs+0.109
lette+0.101
igators+0.097
umb+0.097
Neg logit contributions
 woke-0.052
 heard-0.049
 suddenly-0.048
 imagined-0.048
 saw-0.047
Token are
Token ID389
Feature activation+1.20e-02

Pos logit contributions
umbs+0.003
beh+0.003
lette+0.003
izza+0.003
igators+0.003
Neg logit contributions
 notice-0.003
But-0.003
 stage-0.003
 woke-0.002
Who-0.002
Token going
Token ID1016
Feature activation-2.81e-02

Pos logit contributions
umbs+0.020
beh+0.019
lette+0.019
izza+0.018
igators+0.018
Neg logit contributions
 notice-0.017
 stage-0.016
But-0.016
 suddenly-0.015
 woke-0.015
Token to
Token ID284
Feature activation-4.32e-02

Pos logit contributions
 notice+0.036
Who+0.036
Stop+0.035
 woke+0.035
But+0.035
Neg logit contributions
umbs-0.046
lette-0.045
o-0.044
beh-0.044
igators-0.042
Token a
Token ID257
Feature activation-4.12e-02

Pos logit contributions
But+0.053
Stop+0.052
Exc+0.051
 woke+0.051
icol+0.050
Neg logit contributions
umbs-0.066
 equal-0.065
igators-0.064
lette-0.064
beh-0.062
Token far
Token ID1290
Feature activation-6.30e-02

Pos logit contributions
 notice+0.059
Stop+0.058
 imagined+0.057
But+0.057
 woke+0.056
Neg logit contributions
 dollar-0.053
izza-0.052
lette-0.052
ocado-0.052
igators-0.051
Tokenaway
Token ID8272
Feature activation-1.19e-01

Pos logit contributions
Who+0.077
La+0.077
icol+0.074
Raw+0.073
Stop+0.073
Neg logit contributions
igator-0.102
igators-0.100
ole-0.099
 dollar-0.096
melon-0.091
Token place
Token ID1295
Feature activation-3.33e-02

Pos logit contributions
 Ach+0.200
Stop+0.188
Are+0.186
Ben+0.181
Exc+0.177
Neg logit contributions
igators-0.143
 kinds-0.120
igator-0.119
 ours-0.118
 strongest-0.112
Token!"
Token ID2474
Feature activation-3.32e-02

Pos logit contributions
Who+0.037
 notice+0.036
Stop+0.035
La+0.034
 woke+0.034
Neg logit contributions
umbs-0.063
lette-0.061
igators-0.061
beh-0.058
izza-0.057
Token Tom
Token ID4186
Feature activation+8.04e-03

Pos logit contributions
 notice+0.049
 seek+0.049
 woke+0.048
Raw+0.047
 ti+0.047
Neg logit contributions
izza-0.042
umbs-0.042
lette-0.040
o-0.038
 equal-0.037
Token said
Token ID531
Feature activation+6.89e-04

Pos logit contributions
umbs+0.018
beh+0.017
lette+0.017
izza+0.016
igators+0.016
Neg logit contributions
 notice-0.007
But-0.007
 stage-0.006
 woke-0.006
 suddenly-0.006
Token,
Token ID11
Feature activation+4.04e-03

Pos logit contributions
umbs+0.002
beh+0.002
lette+0.002
izza+0.002
icking+0.002
Neg logit contributions
 notice-0.000
But-0.000
 stage-0.000
Who-0.000
 woke-0.000
Token She
Token ID1375
Feature activation-3.49e-02

Pos logit contributions
 notice+0.006
But+0.005
 stage+0.005
 imagined+0.005
Who+0.005
Neg logit contributions
umbs-0.007
lette-0.006
beh-0.006
izza-0.006
igators-0.006
Token had
Token ID550
Feature activation-6.26e-03

Pos logit contributions
 notice+0.048
 stage+0.044
 avail+0.044
 recess+0.044
 concert+0.042
Neg logit contributions
beh-0.056
o-0.056
lette-0.056
umbs-0.053
ocado-0.052
Token never
Token ID1239
Feature activation-1.71e-02

Pos logit contributions
 notice+0.009
 stage+0.009
But+0.009
Who+0.009
Stop+0.009
Neg logit contributions
umbs-0.009
lette-0.009
beh-0.009
izza-0.008
igators-0.008
Token seen
Token ID1775
Feature activation-1.17e-02

Pos logit contributions
 stage+0.029
But+0.029
 notice+0.028
Stop+0.027
La+0.027
Neg logit contributions
beh-0.022
umbs-0.022
lette-0.022
 supposed-0.020
known-0.019
Token a
Token ID257
Feature activation-5.80e-02

Pos logit contributions
 notice+0.016
But+0.015
 stage+0.014
 woke+0.014
Who+0.014
Neg logit contributions
umbs-0.021
lette-0.020
beh-0.019
igators-0.018
izza-0.018
Token math
Token ID10688
Feature activation-1.13e-01

Pos logit contributions
 notice+0.089
Stop+0.086
 imagined+0.085
La+0.084
Exc+0.084
Neg logit contributions
 dollar-0.075
 supposed-0.071
ocado-0.070
 equal-0.068
 tast-0.067
Token book
Token ID1492
Feature activation-4.65e-02

Pos logit contributions
icol+0.157
AAAA+0.156
La+0.154
Stop+0.152
Sh+0.152
Neg logit contributions
 dollar-0.154
 strongest-0.145
 dollars-0.143
 supposed-0.141
 tast-0.139
Token before
Token ID878
Feature activation-3.91e-02

Pos logit contributions
Stop+0.061
Who+0.060
 notice+0.060
AAAA+0.059
La+0.058
Neg logit contributions
umbs-0.078
lette-0.072
icking-0.070
 supposed-0.068
asting-0.067
Token,
Token ID11
Feature activation-1.93e-02

Pos logit contributions
 notice+0.048
Stop+0.045
Who+0.045
La+0.044
But+0.043
Neg logit contributions
umbs-0.072
lette-0.065
icking-0.065
allows-0.064
 supposed-0.064
Token so
Token ID523
Feature activation-2.78e-02

Pos logit contributions
 notice+0.023
 stage+0.021
La+0.021
But+0.021
 woke+0.020
Neg logit contributions
umbs-0.035
lette-0.034
beh-0.033
icking-0.031
allows-0.031
Token she
Token ID673
Feature activation-2.78e-02

Pos logit contributions
 notice+0.030
La+0.029
 concert+0.028
Stop+0.028
chool+0.027
Neg logit contributions
umbs-0.051
lette-0.049
allows-0.049
beh-0.048
icking-0.047
Token red
Token ID2266
Feature activation-3.63e-02

Pos logit contributions
 notice+0.008
But+0.008
 stage+0.008
 woke+0.008
Who+0.007
Neg logit contributions
umbs-0.008
beh-0.008
lette-0.008
igators-0.007
izza-0.007
Token.
Token ID13
Feature activation-7.30e-03

Pos logit contributions
 notice+0.055
La+0.051
But+0.051
 seek+0.049
 stage+0.048
Neg logit contributions
umbs-0.059
lette-0.058
icking-0.057
melon-0.055
beh-0.053
Token It
Token ID632
Feature activation-3.29e-02

Pos logit contributions
 notice+0.011
But+0.011
 stage+0.010
 imagined+0.010
 woke+0.010
Neg logit contributions
umbs-0.012
lette-0.011
beh-0.011
izza-0.010
allows-0.010
Token was
Token ID373
Feature activation+7.33e-03

Pos logit contributions
 notice+0.044
 concert+0.040
 recess+0.039
 stage+0.038
But+0.037
Neg logit contributions
umbs-0.053
o-0.053
igators-0.050
lette-0.050
ocado-0.048
Token a
Token ID257
Feature activation-4.40e-02

Pos logit contributions
umbs+0.013
beh+0.013
lette+0.012
izza+0.012
igators+0.012
Neg logit contributions
 notice-0.010
But-0.009
 stage-0.009
 woke-0.009
Who-0.009
Token yoga
Token ID20351
Feature activation-1.12e-01

Pos logit contributions
 notice+0.077
 imagined+0.070
But+0.070
Stop+0.069
Who+0.067
Neg logit contributions
 dollar-0.053
lette-0.051
umbs-0.051
ocado-0.050
izza-0.048
Token mat
Token ID2603
Feature activation-1.84e-02

Pos logit contributions
 notice+0.150
Exc+0.139
 inspection+0.134
Who+0.132
Stop+0.127
Neg logit contributions
 dollar-0.163
lette-0.154
ery-0.152
 best-0.152
ocado-0.151
Token!
Token ID0
Feature activation+9.93e-04

Pos logit contributions
 notice+0.020
But+0.019
Who+0.018
La+0.018
Stop+0.018
Neg logit contributions
umbs-0.036
lette-0.035
beh-0.034
icking-0.033
igators-0.033
Token Ell
Token ID7122
Feature activation+2.24e-02

Pos logit contributions
umbs+0.002
beh+0.002
lette+0.002
izza+0.002
igators+0.002
Neg logit contributions
 notice-0.001
But-0.001
 stage-0.001
 woke-0.001
Who-0.001
Tokena
Token ID64
Feature activation-1.94e-02

Pos logit contributions
umbs+0.029
beh+0.027
igators+0.026
lette+0.026
izza+0.025
Neg logit contributions
 notice-0.040
 woke-0.040
But-0.039
 imagined-0.039
 heard-0.039
Token was
Token ID373
Feature activation+1.30e-02

Pos logit contributions
 notice+0.022
 stage+0.019
But+0.019
 avail+0.019
Stop+0.018
Neg logit contributions
umbs-0.038
lette-0.036
beh-0.035
izza-0.034
o-0.034
Token,
Token ID11
Feature activation-2.04e-03

Pos logit contributions
 notice+0.000
But+0.000
 stage+0.000
Who+0.000
 woke+0.000
Neg logit contributions
umbs-0.000
beh-0.000
lette-0.000
izza-0.000
igators-0.000
Token going
Token ID1016
Feature activation-1.45e-02

Pos logit contributions
 notice+0.002
But+0.002
 stage+0.002
 woke+0.002
Who+0.002
Neg logit contributions
umbs-0.004
lette-0.004
beh-0.004
izza-0.004
igators-0.004
Token to
Token ID284
Feature activation-3.83e-02

Pos logit contributions
 notice+0.020
 woke+0.019
Who+0.019
But+0.019
Stop+0.018
Neg logit contributions
umbs-0.023
beh-0.022
lette-0.022
igators-0.022
icking-0.021
Token a
Token ID257
Feature activation-3.08e-02

Pos logit contributions
 woke+0.052
icol+0.051
Stop+0.051
But+0.051
 imagined+0.050
Neg logit contributions
umbs-0.053
igators-0.052
 equal-0.052
beh-0.050
lette-0.049
Token far
Token ID1290
Feature activation-4.85e-02

Pos logit contributions
 imagined+0.044
 notice+0.044
Stop+0.044
 woke+0.043
But+0.043
Neg logit contributions
 dollar-0.039
lette-0.039
umbs-0.039
ocado-0.038
izza-0.038
Tokenaway
Token ID8272
Feature activation-1.12e-01

Pos logit contributions
La+0.055
Who+0.053
Stop+0.051
 ti+0.051
icol+0.051
Neg logit contributions
ole-0.080
 dollar-0.080
igators-0.080
igator-0.078
melon-0.076
Token place
Token ID1295
Feature activation-2.81e-02

Pos logit contributions
 Ach+0.188
 blush+0.183
Stop+0.182
Are+0.172
Ben+0.170
Neg logit contributions
igators-0.126
 kinds-0.109
 tank-0.104
 strongest-0.100
igator-0.098
Token.
Token ID13
Feature activation-5.82e-03

Pos logit contributions
Who+0.032
Stop+0.032
 notice+0.032
Raw+0.031
La+0.031
Neg logit contributions
umbs-0.049
lette-0.049
beh-0.048
igators-0.047
icking-0.047
Token\n
Token ID198
Feature activation-8.23e-03

Pos logit contributions
 notice+0.009
 stage+0.009
But+0.008
 woke+0.008
 imagined+0.008
Neg logit contributions
umbs-0.009
lette-0.008
beh-0.008
izza-0.008
ocado-0.008
Token\n
Token ID198
Feature activation-3.95e-03

Pos logit contributions
 notice+0.010
 stage+0.009
But+0.009
Who+0.009
Stop+0.009
Neg logit contributions
umbs-0.015
lette-0.014
beh-0.014
izza-0.014
igators-0.013
Token"
Token ID1
Feature activation-1.83e-02

Pos logit contributions
 notice+0.005
 stage+0.005
 woke+0.005
 imagined+0.005
But+0.005
Neg logit contributions
umbs-0.007
beh-0.006
lette-0.006
izza-0.006
ocado-0.006
Token.
Token ID13
Feature activation-1.02e-02

Pos logit contributions
 notice+0.036
Stop+0.034
Raw+0.032
 woke+0.032
La+0.032
Neg logit contributions
beh-0.050
umbs-0.048
illing-0.048
lette-0.048
known-0.046
Token She
Token ID1375
Feature activation-5.42e-02

Pos logit contributions
 notice+0.014
But+0.013
 stage+0.013
 imagined+0.013
La+0.013
Neg logit contributions
umbs-0.018
lette-0.016
beh-0.016
izza-0.015
igators-0.015
Token had
Token ID550
Feature activation-2.25e-02

Pos logit contributions
 avail+0.086
 conclusions+0.073
icol+0.072
 notice+0.069
 stage+0.067
Neg logit contributions
o-0.087
 supposed-0.073
ocado-0.070
known-0.070
illing-0.069
Token a
Token ID257
Feature activation-4.05e-02

Pos logit contributions
 notice+0.037
+0.037
 stage+0.037
 avail+0.036
chool+0.036
Neg logit contributions
lette-0.027
 supposed-0.026
beh-0.026
umbs-0.026
o-0.023
Token big
Token ID1263
Feature activation-4.11e-02

Pos logit contributions
 notice+0.072
But+0.070
 imagined+0.070
Who+0.069
Stop+0.068
Neg logit contributions
 dollar-0.044
 best-0.040
ocado-0.039
lette-0.039
 supposed-0.037
Token math
Token ID10688
Feature activation-1.10e-01

Pos logit contributions
Who+0.060
But+0.060
 notice+0.056
 suddenly+0.055
Stop+0.054
Neg logit contributions
 dollar-0.063
ish-0.057
lette-0.056
melon-0.056
 brothers-0.056
Token book
Token ID1492
Feature activation-3.38e-02

Pos logit contributions
icol+0.172
 notice+0.170
La+0.168
Stop+0.167
Exc+0.165
Neg logit contributions
 dollar-0.144
 best-0.129
 dollars-0.120
 supposed-0.114
 strongest-0.113
Token in
Token ID287
Feature activation-2.75e-02

Pos logit contributions
 notice+0.053
Who+0.052
Stop+0.052
 avail+0.050
Raw+0.049
Neg logit contributions
umbs-0.050
lette-0.046
 dollar-0.045
o-0.045
beh-0.044
Token her
Token ID607
Feature activation-2.00e-02

Pos logit contributions
icol+0.031
 notice+0.029
 concert+0.029
+0.029
 woke+0.029
Neg logit contributions
umbs-0.047
lette-0.046
ady-0.046
beh-0.045
 equal-0.044
Token bag
Token ID6131
Feature activation-1.03e-02

Pos logit contributions
 notice+0.025
Stop+0.023
 suddenly+0.022
Who+0.022
But+0.022
Neg logit contributions
lette-0.034
umbs-0.034
ady-0.034
beh-0.033
ocado-0.033
Token.
Token ID13
Feature activation-1.21e-02

Pos logit contributions
 notice+0.010
Who+0.009
 stage+0.009
Stop+0.009
But+0.009
Neg logit contributions
umbs-0.022
lette-0.021
beh-0.021
icking-0.020
izza-0.019
Token take
Token ID1011
Feature activation-5.91e-03

Pos logit contributions
 notice+0.017
 stage+0.017
But+0.016
Who+0.016
Stop+0.016
Neg logit contributions
umbs-0.027
lette-0.026
beh-0.025
igators-0.025
izza-0.024
Token us
Token ID514
Feature activation+1.48e-02

Pos logit contributions
 notice+0.007
But+0.007
 stage+0.007
Who+0.007
 woke+0.006
Neg logit contributions
umbs-0.011
lette-0.010
beh-0.010
izza-0.010
igators-0.010
Token to
Token ID284
Feature activation-3.99e-02

Pos logit contributions
umbs+0.031
beh+0.030
lette+0.029
izza+0.028
igators+0.027
Neg logit contributions
 notice-0.016
But-0.014
 stage-0.014
 woke-0.014
 imagined-0.013
Token a
Token ID257
Feature activation-1.55e-02

Pos logit contributions
Stop+0.056
But+0.056
Who+0.055
icol+0.055
 woke+0.055
Neg logit contributions
umbs-0.056
lette-0.054
 equal-0.053
izza-0.053
igators-0.053
Token far
Token ID1290
Feature activation-5.06e-02

Pos logit contributions
 notice+0.022
But+0.021
Stop+0.020
 imagined+0.020
 woke+0.020
Neg logit contributions
umbs-0.024
lette-0.024
beh-0.023
izza-0.022
igators-0.022
Tokenaway
Token ID8272
Feature activation-1.10e-01

Pos logit contributions
La+0.061
Who+0.060
icol+0.057
Stop+0.056
Raw+0.056
Neg logit contributions
igators-0.083
ole-0.083
igator-0.083
 dollar-0.081
melon-0.079
Token place
Token ID1295
Feature activation-2.04e-02

Pos logit contributions
 Ach+0.181
Are+0.178
Stop+0.178
Exc+0.169
 blush+0.166
Neg logit contributions
igators-0.134
 ours-0.114
 kinds-0.111
igator-0.109
 equal-0.106
Token."
Token ID526
Feature activation-1.19e-02

Pos logit contributions
 notice+0.023
Who+0.022
Stop+0.021
But+0.021
La+0.021
Neg logit contributions
umbs-0.039
lette-0.037
igators-0.036
beh-0.036
izza-0.035
Token
Token ID198
Feature activation-1.46e-02

Pos logit contributions
 notice+0.015
 stage+0.014
 imagined+0.013
 woke+0.013
Stop+0.013
Neg logit contributions
umbs-0.022
lette-0.020
izza-0.020
beh-0.020
o-0.020
Token
Token ID198
Feature activation+1.40e-03

Pos logit contributions
 notice+0.018
 stage+0.017
 imagined+0.016
Who+0.016
 woke+0.016
Neg logit contributions
umbs-0.027
lette-0.024
izza-0.024
beh-0.024
o-0.024
TokenL
Token ID43
Feature activation-1.22e-02

Pos logit contributions
umbs+0.003
beh+0.003
lette+0.003
izza+0.003
igators+0.002
Neg logit contributions
 notice-0.002
 stage-0.001
But-0.001
 woke-0.001
Who-0.001
Token can
Token ID460
Feature activation-2.98e-02

Pos logit contributions
 stage+0.010
 notice+0.010
But+0.010
La+0.009
Who+0.009
Neg logit contributions
umbs-0.014
lette-0.013
beh-0.013
izza-0.012
ocado-0.012
Token find
Token ID1064
Feature activation-2.84e-02

Pos logit contributions
icol+0.036
 stage+0.036
Stop+0.036
 woke+0.036
AAAA+0.035
Neg logit contributions
lette-0.049
umbs-0.048
 equal-0.046
beh-0.044
izza-0.044
Token animals
Token ID4695
Feature activation-2.44e-02

Pos logit contributions
 notice+0.040
But+0.038
 woke+0.037
 stage+0.036
Who+0.036
Neg logit contributions
umbs-0.045
lette-0.044
izza-0.042
beh-0.041
igators-0.041
Token in
Token ID287
Feature activation-7.41e-03

Pos logit contributions
 notice+0.027
icol+0.025
 stage+0.025
Who+0.024
Stop+0.024
Neg logit contributions
umbs-0.046
lette-0.045
igators-0.044
beh-0.043
ocado-0.042
Token the
Token ID262
Feature activation-1.81e-02

Pos logit contributions
 notice+0.009
But+0.009
 woke+0.009
 stage+0.008
Who+0.008
Neg logit contributions
umbs-0.013
lette-0.013
beh-0.013
igators-0.012
izza-0.012
Token math
Token ID10688
Feature activation-1.09e-01

Pos logit contributions
 notice+0.023
 suddenly+0.021
But+0.020
 imagined+0.020
 woke+0.020
Neg logit contributions
lette-0.031
umbs-0.029
 dollar-0.029
ocado-0.029
izza-0.028
Token book
Token ID1492
Feature activation-6.31e-02

Pos logit contributions
icol+0.169
 inspection+0.145
 notice+0.143
AAAA+0.142
Stop+0.140
Neg logit contributions
 dollar-0.158
 dollars-0.151
igators-0.150
ocado-0.143
 equal-0.137
Token.
Token ID13
Feature activation-1.57e-02

Pos logit contributions
icol+0.085
 notice+0.080
Who+0.080
Stop+0.079
 avail+0.075
Neg logit contributions
umbs-0.110
igators-0.101
lette-0.098
 dollar-0.097
izza-0.096
Token Maybe
Token ID6674
Feature activation-5.57e-02

Pos logit contributions
 notice+0.019
 stage+0.018
 woke+0.017
lf+0.016
 imagined+0.016
Neg logit contributions
umbs-0.030
lette-0.029
beh-0.028
izza-0.028
ocado-0.027
Token I
Token ID314
Feature activation-5.90e-03

Pos logit contributions
icol+0.086
 notice+0.085
Stop+0.082
Raw+0.080
 woke+0.080
Neg logit contributions
lette-0.072
 dollar-0.071
izza-0.069
 ours-0.069
igators-0.068
Token can
Token ID460
Feature activation-2.63e-02

Pos logit contributions
 stage+0.008
 notice+0.008
But+0.007
 woke+0.007
Who+0.007
Neg logit contributions
umbs-0.010
lette-0.010
beh-0.010
izza-0.009
ocado-0.009
Token take
Token ID1011
Feature activation-4.40e-05

Pos logit contributions
 notice+0.017
 stage+0.016
But+0.016
Who+0.016
 woke+0.016
Neg logit contributions
umbs-0.025
lette-0.024
beh-0.024
igators-0.022
izza-0.022
Token you
Token ID345
Feature activation+1.32e-02

Pos logit contributions
 notice+0.000
But+0.000
 stage+0.000
Who+0.000
 woke+0.000
Neg logit contributions
umbs-0.000
beh-0.000
lette-0.000
izza-0.000
igators-0.000
Token to
Token ID284
Feature activation-3.36e-02

Pos logit contributions
umbs+0.027
beh+0.026
lette+0.025
izza+0.024
igators+0.024
Neg logit contributions
 notice-0.015
But-0.014
 stage-0.013
 woke-0.013
 imagined-0.012
Token a
Token ID257
Feature activation-1.64e-02

Pos logit contributions
Stop+0.044
Who+0.044
But+0.044
 woke+0.044
icol+0.043
Neg logit contributions
umbs-0.050
lette-0.050
 equal-0.048
beh-0.048
izza-0.047
Token far
Token ID1290
Feature activation-5.08e-02

Pos logit contributions
 notice+0.022
But+0.021
Stop+0.021
 imagined+0.020
 woke+0.020
Neg logit contributions
umbs-0.026
lette-0.026
beh-0.025
izza-0.025
igators-0.024
Tokenaway
Token ID8272
Feature activation-1.08e-01

Pos logit contributions
La+0.062
Who+0.061
Stop+0.057
Raw+0.055
Are+0.055
Neg logit contributions
ole-0.084
igators-0.082
 dollar-0.082
igator-0.081
umbs-0.078
Token place
Token ID1295
Feature activation-2.07e-02

Pos logit contributions
 Ach+0.173
Stop+0.168
Are+0.167
Ben+0.157
Exc+0.155
Neg logit contributions
igators-0.142
 equal-0.119
 ours-0.117
ishes-0.115
igator-0.115
Token.
Token ID13
Feature activation+7.28e-03

Pos logit contributions
 notice+0.023
Who+0.022
Stop+0.021
But+0.021
 woke+0.021
Neg logit contributions
umbs-0.040
lette-0.038
beh-0.038
igators-0.037
izza-0.036
Token You
Token ID921
Feature activation+2.45e-02

Pos logit contributions
umbs+0.012
beh+0.012
lette+0.012
izza+0.011
igators+0.011
Neg logit contributions
 notice-0.010
But-0.010
 stage-0.010
 woke-0.009
Who-0.009
Token have
Token ID423
Feature activation+3.92e-03

Pos logit contributions
umbs+0.048
beh+0.047
lette+0.046
izza+0.043
icking+0.043
Neg logit contributions
 notice-0.031
But-0.028
 woke-0.028
 stage-0.027
 imagined-0.027
Token to
Token ID284
Feature activation-1.46e-02

Pos logit contributions
umbs+0.008
beh+0.008
lette+0.007
izza+0.007
igators+0.007
Neg logit contributions
 notice-0.004
But-0.004
 stage-0.004
 woke-0.004
Who-0.004
Token a
Token ID257
Feature activation+9.66e-03

Pos logit contributions
umbs+0.071
beh+0.067
lette+0.064
ady+0.064
igators+0.064
Neg logit contributions
 notice-0.025
 woke-0.025
 stage-0.024
But-0.023
 suddenly-0.022
Token story
Token ID1621
Feature activation-1.77e-02

Pos logit contributions
umbs+0.017
beh+0.016
lette+0.016
igators+0.015
icking+0.015
Neg logit contributions
 notice-0.014
But-0.013
 stage-0.013
 woke-0.013
Who-0.012
Token about
Token ID546
Feature activation+1.81e-02

Pos logit contributions
 notice+0.031
But+0.030
Who+0.029
Stop+0.029
 stage+0.029
Neg logit contributions
umbs-0.024
lette-0.022
beh-0.021
izza-0.021
icking-0.020
Token a
Token ID257
Feature activation-2.93e-02

Pos logit contributions
umbs+0.036
beh+0.034
lette+0.033
izza+0.032
icking+0.031
Neg logit contributions
 notice-0.021
 stage-0.020
But-0.020
Who-0.019
 woke-0.019
Token far
Token ID1290
Feature activation-3.02e-02

Pos logit contributions
 notice+0.046
Stop+0.045
But+0.044
 imagined+0.043
La+0.042
Neg logit contributions
ocado-0.037
lette-0.037
 dollar-0.036
umbs-0.036
izza-0.035
Tokenaway
Token ID8272
Feature activation-1.08e-01

Pos logit contributions
La+0.036
 imagined+0.034
But+0.033
Who+0.033
 notice+0.033
Neg logit contributions
 dollar-0.050
beh-0.050
melon-0.050
igators-0.049
umbs-0.048
Token land
Token ID1956
Feature activation-1.68e-02

Pos logit contributions
Stop+0.175
 Ach+0.169
La+0.164
Ben+0.157
J+0.155
Neg logit contributions
ishes-0.116
 chicks-0.116
bi-0.113
igators-0.109
 dollars-0.104
Token that
Token ID326
Feature activation+6.27e-03

Pos logit contributions
 notice+0.020
Who+0.019
But+0.018
 stage+0.018
Stop+0.018
Neg logit contributions
umbs-0.031
lette-0.030
beh-0.030
icking-0.029
igators-0.029
Token was
Token ID373
Feature activation+1.11e-02

Pos logit contributions
umbs+0.013
beh+0.012
lette+0.012
izza+0.012
igators+0.012
Neg logit contributions
 notice-0.007
But-0.006
 stage-0.006
 woke-0.006
Who-0.006
Token very
Token ID845
Feature activation+1.02e-02

Pos logit contributions
umbs+0.019
beh+0.018
lette+0.018
izza+0.017
igators+0.016
Neg logit contributions
 notice-0.016
But-0.015
 stage-0.015
 woke-0.015
 suddenly-0.014
Token distant
Token ID12899
Feature activation-1.51e-02

Pos logit contributions
umbs+0.015
beh+0.013
lette+0.013
izza+0.012
igators+0.012
Neg logit contributions
 notice-0.018
But-0.017
 stage-0.017
 woke-0.016
Who-0.016
Token away
Token ID1497
Feature activation+5.25e-03

Pos logit contributions
 notice+0.003
But+0.003
 stage+0.003
 woke+0.003
Who+0.003
Neg logit contributions
umbs-0.003
beh-0.003
lette-0.003
izza-0.003
igators-0.003
Token.
Token ID13
Feature activation-7.50e-03

Pos logit contributions
umbs+0.010
beh+0.010
lette+0.010
izza+0.009
igators+0.009
Neg logit contributions
 notice-0.006
But-0.006
 stage-0.006
 woke-0.005
Who-0.005
Token She
Token ID1375
Feature activation-1.29e-03

Pos logit contributions
 notice+0.012
 stage+0.011
But+0.011
 imagined+0.011
 woke+0.011
Neg logit contributions
umbs-0.012
lette-0.011
beh-0.011
izza-0.010
igators-0.010
Token has
Token ID468
Feature activation-8.43e-03

Pos logit contributions
 notice+0.002
But+0.002
 stage+0.002
Who+0.002
 woke+0.002
Neg logit contributions
umbs-0.002
beh-0.002
lette-0.002
izza-0.002
igators-0.002
Token a
Token ID257
Feature activation-2.67e-02

Pos logit contributions
 notice+0.013
But+0.013
 stage+0.013
 woke+0.012
Who+0.012
Neg logit contributions
umbs-0.012
lette-0.012
beh-0.011
igators-0.011
izza-0.011
Token math
Token ID10688
Feature activation-1.08e-01

Pos logit contributions
 notice+0.044
 imagined+0.042
 woke+0.042
But+0.042
Stop+0.042
Neg logit contributions
 dollar-0.034
umbs-0.032
lette-0.031
izza-0.031
ocado-0.031
Token book
Token ID1492
Feature activation-3.28e-02

Pos logit contributions
icol+0.166
 woke+0.165
AAAA+0.164
Stop+0.161
Hi+0.160
Neg logit contributions
 dollar-0.143
 best-0.134
 dollars-0.127
 tast-0.121
 equal-0.120
Token with
Token ID351
Feature activation+4.12e-04

Pos logit contributions
 notice+0.045
Who+0.045
Stop+0.044
 woke+0.042
Raw+0.042
Neg logit contributions
umbs-0.054
lette-0.050
beh-0.048
 dollar-0.048
igators-0.047
Token many
Token ID867
Feature activation-1.62e-02

Pos logit contributions
umbs+0.001
beh+0.001
lette+0.001
izza+0.001
igators+0.001
Neg logit contributions
 notice-0.001
But-0.001
 stage-0.001
 woke-0.001
Who-0.000
Token pictures
Token ID5986
Feature activation-2.50e-03

Pos logit contributions
 notice+0.025
 woke+0.023
But+0.023
 stage+0.023
 suddenly+0.023
Neg logit contributions
umbs-0.024
beh-0.023
lette-0.023
igators-0.022
 equal-0.022
Token and
Token ID290
Feature activation-3.16e-02

Pos logit contributions
 notice+0.003
But+0.003
 stage+0.003
 woke+0.003
Who+0.002
Neg logit contributions
umbs-0.005
lette-0.005
beh-0.005
izza-0.004
igators-0.004
Token,
Token ID11
Feature activation-3.90e-02

Pos logit contributions
 notice+0.028
Who+0.027
 stage+0.026
 imagined+0.026
But+0.025
Neg logit contributions
umbs-0.048
o-0.046
lette-0.045
izza-0.045
itt-0.045
Token she
Token ID673
Feature activation-3.60e-02

Pos logit contributions
La+0.053
 notice+0.052
icol+0.052
 imagined+0.051
 concert+0.051
Neg logit contributions
melon-0.051
o-0.048
allows-0.048
oun-0.048
umbs-0.048
Token went
Token ID1816
Feature activation-9.56e-03

Pos logit contributions
 stage+0.046
 avail+0.044
chool+0.040
 notice+0.039
 speaker+0.039
Neg logit contributions
o-0.062
lette-0.058
ocado-0.058
beh-0.058
icking-0.056
Token to
Token ID284
Feature activation-3.30e-02

Pos logit contributions
 notice+0.012
But+0.011
Who+0.011
 stage+0.011
Stop+0.011
Neg logit contributions
umbs-0.017
beh-0.017
lette-0.016
icking-0.015
igators-0.015
Token a
Token ID257
Feature activation-1.92e-02

Pos logit contributions
But+0.050
 ti+0.047
icol+0.046
Stop+0.046
lf+0.046
Neg logit contributions
beh-0.039
igators-0.039
umbs-0.039
o-0.038
 supposed-0.037
Token yoga
Token ID20351
Feature activation-1.07e-01

Pos logit contributions
 notice+0.027
But+0.026
 imagined+0.025
Stop+0.025
Who+0.024
Neg logit contributions
umbs-0.028
lette-0.028
beh-0.027
 dollar-0.027
izza-0.027
Token class
Token ID1398
Feature activation-2.55e-02

Pos logit contributions
 notice+0.150
 inspection+0.148
 avail+0.144
Who+0.143
 widened+0.140
Neg logit contributions
 dollar-0.154
melon-0.142
itt-0.132
ocado-0.131
 tank-0.130
Token with
Token ID351
Feature activation-6.28e-03

Pos logit contributions
 notice+0.031
Stop+0.029
Who+0.029
La+0.028
Raw+0.027
Neg logit contributions
beh-0.045
umbs-0.045
lette-0.044
illing-0.043
icking-0.042
Token her
Token ID607
Feature activation-6.24e-03

Pos logit contributions
 notice+0.006
But+0.006
 stage+0.005
Who+0.005
 woke+0.005
Neg logit contributions
umbs-0.013
beh-0.013
lette-0.013
igators-0.012
izza-0.012
Token mom
Token ID1995
Feature activation+4.13e-02

Pos logit contributions
 notice+0.009
But+0.008
Who+0.008
 woke+0.008
Stop+0.008
Neg logit contributions
umbs-0.010
lette-0.010
beh-0.010
izza-0.009
icking-0.009
Token.
Token ID13
Feature activation-1.53e-02

Pos logit contributions
beh+0.081
umbs+0.079
igators+0.075
izza+0.073
known+0.071
Neg logit contributions
But-0.050
 notice-0.050
 heard-0.050
 woke-0.049
 saw-0.048
Token.
Token ID13
Feature activation+1.66e-02

Pos logit contributions
umbs+0.024
beh+0.024
lette+0.023
izza+0.023
igators+0.022
Neg logit contributions
 notice-0.008
But-0.007
 stage-0.007
 woke-0.007
 suddenly-0.006
Token Lily
Token ID20037
Feature activation-5.07e-03

Pos logit contributions
umbs+0.030
beh+0.030
lette+0.030
icking+0.028
izza+0.027
Neg logit contributions
 notice-0.021
But-0.020
 stage-0.019
Who-0.019
 woke-0.019
Token has
Token ID468
Feature activation-1.42e-02

Pos logit contributions
 notice+0.006
But+0.006
 stage+0.006
Who+0.006
Stop+0.006
Neg logit contributions
umbs-0.009
beh-0.009
lette-0.009
igators-0.008
izza-0.008
Token a
Token ID257
Feature activation-4.29e-02

Pos logit contributions
 notice+0.021
But+0.020
 stage+0.020
Who+0.020
 woke+0.019
Neg logit contributions
umbs-0.022
lette-0.021
beh-0.021
igators-0.020
ady-0.019
Token pink
Token ID11398
Feature activation-8.35e-02

Pos logit contributions
Who+0.075
 notice+0.075
Stop+0.074
 woke+0.072
Exc+0.072
Neg logit contributions
 dollar-0.046
ocado-0.043
 best-0.043
umbs-0.042
lette-0.042
Token yoga
Token ID20351
Feature activation-1.07e-01

Pos logit contributions
 notice+0.128
Stop+0.126
Who+0.125
Exc+0.123
 inspection+0.117
Neg logit contributions
ish-0.111
 brothers-0.109
une-0.107
 best-0.107
 dollar-0.106
Token mat
Token ID2603
Feature activation-2.05e-02

Pos logit contributions
 notice+0.139
Who+0.136
Stop+0.130
 avail+0.129
Exc+0.129
Neg logit contributions
 dollar-0.167
 best-0.161
ery-0.153
ocado-0.151
melon-0.151
Token.
Token ID13
Feature activation-1.36e-03

Pos logit contributions
 notice+0.024
Who+0.022
But+0.021
Stop+0.021
 stage+0.021
Neg logit contributions
umbs-0.040
lette-0.038
beh-0.037
igators-0.036
icking-0.036
Token She
Token ID1375
Feature activation-1.82e-02

Pos logit contributions
 notice+0.002
But+0.002
 stage+0.002
 woke+0.001
 imagined+0.001
Neg logit contributions
umbs-0.003
beh-0.002
lette-0.002
izza-0.002
igators-0.002
Token puts
Token ID7584
Feature activation-1.87e-02

Pos logit contributions
 notice+0.026
But+0.024
 stage+0.024
Who+0.024
Stop+0.023
Neg logit contributions
umbs-0.029
beh-0.027
lette-0.027
igators-0.027
izza-0.026
Token it
Token ID340
Feature activation-4.60e-03

Pos logit contributions
 notice+0.024
But+0.021
 woke+0.021
Who+0.021
 stage+0.020
Neg logit contributions
umbs-0.033
lette-0.031
beh-0.031
allows-0.030
igators-0.030
Token was
Token ID373
Feature activation-7.27e-03

Pos logit contributions
 notice+0.023
But+0.022
 stage+0.022
Who+0.020
La+0.020
Neg logit contributions
beh-0.041
umbs-0.040
igators-0.039
lette-0.038
o-0.038
Token a
Token ID257
Feature activation-3.99e-02

Pos logit contributions
 notice+0.011
But+0.010
Who+0.010
 stage+0.010
Stop+0.010
Neg logit contributions
umbs-0.011
beh-0.011
lette-0.011
igators-0.010
icking-0.010
Token nightmare
Token ID17123
Feature activation-6.29e-02

Pos logit contributions
Stop+0.068
 notice+0.068
But+0.066
 imagined+0.065
Who+0.065
Neg logit contributions
 dollar-0.043
ocado-0.041
lette-0.040
 tast-0.040
melon-0.039
Token about
Token ID546
Feature activation-4.10e-02

Pos logit contributions
Who+0.090
Stop+0.090
But+0.089
La+0.088
 notice+0.088
Neg logit contributions
umbs-0.095
icking-0.087
lette-0.086
o-0.085
 supposed-0.084
Token a
Token ID257
Feature activation-8.32e-02

Pos logit contributions
 notice+0.067
But+0.066
+0.064
Exc+0.061
Stop+0.060
Neg logit contributions
beh-0.052
 supposed-0.051
igators-0.050
umbs-0.049
allows-0.049
Token scary
Token ID14343
Feature activation-1.07e-01

Pos logit contributions
Stop+0.130
zes+0.129
chool+0.127
ulk+0.126
Fine+0.125
Neg logit contributions
 valuable-0.095
 best-0.090
 strongest-0.089
 lucky-0.089
 important-0.088
Token monster
Token ID9234
Feature activation-6.03e-02

Pos logit contributions
Sorry+0.187
Exc+0.182
But+0.180
Stop+0.179
Who+0.178
Neg logit contributions
 important-0.096
 best-0.094
 adorable-0.092
 cub-0.091
ish-0.090
Token that
Token ID326
Feature activation-4.78e-02

Pos logit contributions
 avail+0.082
 notice+0.081
But+0.079
Stop+0.079
Who+0.077
Neg logit contributions
o-0.093
umbs-0.090
beh-0.088
igators-0.086
lette-0.086
Token wouldn
Token ID3636
Feature activation-1.86e-02

Pos logit contributions
 notice+0.061
But+0.060
 avail+0.058
chool+0.057
 stage+0.055
Neg logit contributions
beh-0.078
igators-0.078
igator-0.073
o-0.072
umbs-0.071
Token't
Token ID470
Feature activation-3.68e-02

Pos logit contributions
 notice+0.028
 stage+0.027
 imagined+0.027
But+0.027
Who+0.026
Neg logit contributions
umbs-0.029
beh-0.027
lette-0.025
icking-0.024
oun-0.024
Token go
Token ID467
Feature activation-3.08e-02

Pos logit contributions
icol+0.051
chool+0.050
 stage+0.049
But+0.048
Really+0.048
Neg logit contributions
umbs-0.056
beh-0.055
igators-0.053
 supposed-0.052
asting-0.050
Token a
Token ID257
Feature activation-4.43e-02

Pos logit contributions
umbs+0.051
beh+0.049
lette+0.047
izza+0.046
oun+0.044
Neg logit contributions
 notice-0.029
 stage-0.029
 suddenly-0.027
But-0.027
 school-0.027
Token map
Token ID3975
Feature activation+3.00e-02

Pos logit contributions
 notice+0.066
Stop+0.064
But+0.062
Exc+0.061
 imagined+0.061
Neg logit contributions
 dollar-0.058
ocado-0.057
lette-0.056
melon-0.055
igators-0.054
Token of
Token ID286
Feature activation+2.36e-04

Pos logit contributions
beh+0.074
umbs+0.073
lette+0.071
allows+0.068
izza+0.068
Neg logit contributions
 stage-0.020
 notice-0.019
But-0.017
 woke-0.017
 suddenly-0.016
Token a
Token ID257
Feature activation-4.13e-02

Pos logit contributions
umbs+0.000
beh+0.000
lette+0.000
izza+0.000
igators+0.000
Neg logit contributions
 notice-0.000
But-0.000
 stage-0.000
 woke-0.000
Who-0.000
Token far
Token ID1290
Feature activation-2.72e-02

Pos logit contributions
 notice+0.059
Stop+0.056
But+0.056
 woke+0.053
Who+0.053
Neg logit contributions
 dollar-0.059
umbs-0.059
lette-0.058
ocado-0.057
igators-0.057
Tokenaway
Token ID8272
Feature activation-1.07e-01

Pos logit contributions
But+0.035
La+0.034
 notice+0.034
Who+0.033
 woke+0.032
Neg logit contributions
igators-0.043
umbs-0.042
beh-0.042
 dollar-0.041
melon-0.040
Token land
Token ID1956
Feature activation-1.54e-03

Pos logit contributions
Exc+0.162
 Ach+0.157
Stop+0.157
Sorry+0.153
Ba+0.153
Neg logit contributions
igators-0.130
 kinds-0.119
 sizes-0.110
 tank-0.109
ishes-0.108
Token.
Token ID13
Feature activation+1.35e-02

Pos logit contributions
 notice+0.001
But+0.001
 stage+0.001
Who+0.001
 woke+0.001
Neg logit contributions
umbs-0.003
lette-0.003
beh-0.003
igators-0.003
izza-0.003
Token
Token ID198
Feature activation-1.68e-02

Pos logit contributions
beh+0.027
umbs+0.027
lette+0.027
icking+0.025
izza+0.025
Neg logit contributions
 notice-0.015
But-0.014
 stage-0.013
Who-0.013
 woke-0.013
Token
Token ID198
Feature activation+1.45e-02

Pos logit contributions
 notice+0.020
 stage+0.020
 imagined+0.019
Who+0.019
 woke+0.019
Neg logit contributions
umbs-0.031
lette-0.028
beh-0.028
o-0.027
izza-0.027
Token"
Token ID1
Feature activation-9.79e-03

Pos logit contributions
umbs+0.031
beh+0.031
lette+0.030
icking+0.029
izza+0.029
Neg logit contributions
But-0.014
 notice-0.014
 stage-0.012
 suddenly-0.012
Who-0.012
Token it
Token ID340
Feature activation+1.98e-02

Pos logit contributions
umbs+0.018
beh+0.017
lette+0.017
izza+0.016
igators+0.016
Neg logit contributions
 notice-0.006
But-0.006
 stage-0.006
 woke-0.005
Who-0.005
Token was
Token ID373
Feature activation+1.71e-02

Pos logit contributions
umbs+0.051
beh+0.049
lette+0.048
izza+0.047
igators+0.046
Neg logit contributions
 notice-0.011
 woke-0.011
But-0.010
 suddenly-0.009
 stage-0.009
Token from
Token ID422
Feature activation-9.63e-03

Pos logit contributions
umbs+0.033
beh+0.031
lette+0.030
izza+0.029
igators+0.028
Neg logit contributions
 notice-0.021
But-0.020
 stage-0.020
 woke-0.019
 suddenly-0.019
Token a
Token ID257
Feature activation-1.53e-02

Pos logit contributions
 notice+0.012
But+0.012
 woke+0.011
 stage+0.011
Who+0.011
Neg logit contributions
umbs-0.017
lette-0.017
beh-0.017
igators-0.016
izza-0.016
Token far
Token ID1290
Feature activation-3.93e-02

Pos logit contributions
 notice+0.021
But+0.021
 stage+0.019
Who+0.019
 imagined+0.019
Neg logit contributions
umbs-0.025
lette-0.024
beh-0.023
izza-0.022
 dollar-0.022
Tokenaway
Token ID8272
Feature activation-1.07e-01

Pos logit contributions
La+0.058
Who+0.056
 imagined+0.054
lf+0.053
 stage+0.053
Neg logit contributions
igator-0.055
 dollar-0.054
melon-0.054
igators-0.054
umbs-0.053
Token place
Token ID1295
Feature activation-1.97e-02

Pos logit contributions
 Ach+0.158
Ben+0.153
Ba+0.150
La+0.150
Stop+0.149
Neg logit contributions
igators-0.139
bi-0.124
 chicks-0.121
 kinds-0.118
igator-0.117
Token.
Token ID13
Feature activation-2.07e-02

Pos logit contributions
 notice+0.021
Who+0.019
But+0.019
 stage+0.019
Stop+0.019
Neg logit contributions
umbs-0.039
lette-0.038
beh-0.037
icking-0.036
igators-0.036
Token She
Token ID1375
Feature activation-2.17e-02

Pos logit contributions
 notice+0.029
 imagined+0.028
 seek+0.028
 stage+0.028
lf+0.028
Neg logit contributions
umbs-0.035
izza-0.030
lette-0.030
ocado-0.028
oops-0.028
Token said
Token ID531
Feature activation+5.59e-03

Pos logit contributions
 notice+0.028
 stage+0.026
But+0.026
Stop+0.025
 avail+0.024
Neg logit contributions
umbs-0.037
lette-0.036
beh-0.036
izza-0.034
igators-0.034
Token it
Token ID340
Feature activation+1.82e-02

Pos logit contributions
umbs+0.012
beh+0.012
lette+0.012
izza+0.011
igators+0.011
Neg logit contributions
 notice-0.005
But-0.005
 stage-0.004
 woke-0.004
Who-0.004
Token to
Token ID284
Feature activation+2.49e-02

Pos logit contributions
La+0.047
 notice+0.046
 stage+0.044
Stop+0.043
Ba+0.042
Neg logit contributions
beh-0.039
lette-0.038
 supposed-0.036
o-0.036
izza-0.036
Token travel
Token ID3067
Feature activation-4.08e-02

Pos logit contributions
umbs+0.052
beh+0.050
lette+0.049
ocado+0.047
izza+0.047
Neg logit contributions
 notice-0.028
 stage-0.025
But-0.024
 woke-0.024
 dress-0.024
Token to
Token ID284
Feature activation-1.94e-02

Pos logit contributions
AAAA+0.069
icol+0.069
Who+0.069
Raw+0.068
 stage+0.068
Neg logit contributions
o-0.047
beh-0.047
asting-0.043
cream-0.042
igators-0.041
Token a
Token ID257
Feature activation-2.57e-02

Pos logit contributions
But+0.026
 notice+0.025
 woke+0.025
Stop+0.025
 imagined+0.024
Neg logit contributions
beh-0.028
umbs-0.028
lette-0.027
igators-0.027
izza-0.026
Token far
Token ID1290
Feature activation-3.73e-02

Pos logit contributions
 notice+0.039
 imagined+0.037
But+0.037
Stop+0.037
 woke+0.036
Neg logit contributions
umbs-0.033
lette-0.033
 dollar-0.032
beh-0.032
ocado-0.032
Tokenaway
Token ID8272
Feature activation-1.06e-01

Pos logit contributions
La+0.036
Who+0.036
Stop+0.035
 woke+0.034
 notice+0.034
Neg logit contributions
igators-0.068
 dollar-0.067
beh-0.065
umbs-0.065
ole-0.065
Token place
Token ID1295
Feature activation-3.56e-03

Pos logit contributions
Stop+0.178
 Ach+0.176
 blush+0.173
ged+0.169
Are+0.168
Neg logit contributions
igators-0.110
 tank-0.095
 ours-0.095
 dollar-0.093
 strongest-0.092
Token and
Token ID290
Feature activation-3.75e-02

Pos logit contributions
 notice+0.003
But+0.003
 stage+0.003
Who+0.003
 woke+0.003
Neg logit contributions
umbs-0.008
beh-0.007
lette-0.007
igators-0.007
izza-0.007
Token brought
Token ID3181
Feature activation+2.88e-02

Pos logit contributions
La+0.050
Stop+0.050
But+0.047
ara+0.047
 stage+0.047
Neg logit contributions
beh-0.058
igators-0.054
lette-0.053
 tast-0.052
known-0.051
Token Spot
Token ID15899
Feature activation+7.11e-02

Pos logit contributions
umbs+0.061
lette+0.059
beh+0.057
icking+0.056
illing+0.056
Neg logit contributions
 notice-0.029
 woke-0.027
But-0.026
 stage-0.026
 heard-0.025
Token along
Token ID1863
Feature activation-1.47e-02

Pos logit contributions
umbs+0.177
itt+0.167
lette+0.167
omp+0.166
elling+0.164
Neg logit contributions
 again-0.047
 dress-0.043
 heard-0.041
 woke-0.041
 school-0.040
Token.
Token ID13
Feature activation+1.37e-02

Pos logit contributions
umbs+0.019
beh+0.019
lette+0.018
izza+0.018
icking+0.017
Neg logit contributions
 notice-0.008
But-0.008
 stage-0.007
 woke-0.007
Who-0.007
Token Can
Token ID1680
Feature activation-1.23e-02

Pos logit contributions
umbs+0.023
beh+0.023
lette+0.022
icking+0.021
igators+0.021
Neg logit contributions
 notice-0.019
But-0.018
 stage-0.017
Who-0.017
 woke-0.017
Token we
Token ID356
Feature activation-5.14e-03

Pos logit contributions
 notice+0.017
 stage+0.016
 imagined+0.016
But+0.015
 woke+0.015
Neg logit contributions
umbs-0.020
beh-0.020
lette-0.019
izza-0.019
igators-0.018
Token play
Token ID711
Feature activation-6.39e-02

Pos logit contributions
 notice+0.006
 stage+0.006
But+0.006
Who+0.006
 woke+0.006
Neg logit contributions
umbs-0.009
lette-0.009
beh-0.009
izza-0.009
icking-0.009
Token together
Token ID1978
Feature activation-7.83e-02

Pos logit contributions
icol+0.110
 notice+0.106
 inspection+0.103
yt+0.103
 avail+0.101
Neg logit contributions
 equally-0.077
 dollar-0.075
lette-0.072
 equal-0.072
o-0.071
Token again
Token ID757
Feature activation-1.04e-01

Pos logit contributions
icol+0.112
Raw+0.096
Who+0.096
AAAA+0.094
Stop+0.093
Neg logit contributions
o-0.134
icking-0.129
 dollar-0.123
ipp-0.122
ac-0.121
Token?"
Token ID1701
Feature activation-3.64e-02

Pos logit contributions
icol+0.164
Stop+0.159
 robe+0.158
atively+0.157
 speaker+0.151
Neg logit contributions
o-0.136
 brothers-0.122
 dollar-0.119
 best-0.118
icking-0.117
Token
Token ID198
Feature activation-2.51e-02

Pos logit contributions
lf+0.062
 imagined+0.062
 stage+0.060
 woke+0.059
 concert+0.058
Neg logit contributions
umbs-0.048
o-0.041
itt-0.040
izza-0.040
imes-0.038
Token
Token ID198
Feature activation-1.21e-02

Pos logit contributions
 notice+0.036
 imagined+0.036
 stage+0.035
chool+0.033
 seek+0.033
Neg logit contributions
umbs-0.040
o-0.036
izza-0.036
itt-0.034
lette-0.034
TokenBen
Token ID11696
Feature activation-1.34e-02

Pos logit contributions
 stage+0.008
 notice+0.008
 woke+0.007
 imagined+0.007
chool+0.006
Neg logit contributions
umbs-0.028
lette-0.027
ocado-0.027
beh-0.027
izza-0.026
Token smiles
Token ID21845
Feature activation-2.61e-02

Pos logit contributions
 notice+0.018
But+0.016
 stage+0.016
Who+0.015
 imagined+0.015
Neg logit contributions
umbs-0.023
lette-0.022
beh-0.022
izza-0.021
igators-0.021
Token.
Token ID13
Feature activation+1.33e-02

Pos logit contributions
 notice+0.004
But+0.004
 stage+0.004
Who+0.004
 woke+0.004
Neg logit contributions
umbs-0.009
beh-0.009
lette-0.009
izza-0.008
igators-0.008
Token Can
Token ID1680
Feature activation-4.05e-03

Pos logit contributions
umbs+0.024
beh+0.023
lette+0.023
icking+0.022
izza+0.022
Neg logit contributions
 notice-0.017
But-0.016
 stage-0.016
Who-0.015
 woke-0.015
Token we
Token ID356
Feature activation-1.33e-02

Pos logit contributions
 notice+0.005
 stage+0.005
But+0.004
 woke+0.004
 imagined+0.004
Neg logit contributions
umbs-0.007
beh-0.007
lette-0.007
izza-0.007
igators-0.007
Token play
Token ID711
Feature activation-7.53e-02

Pos logit contributions
 notice+0.015
 stage+0.014
Who+0.014
But+0.014
 woke+0.014
Neg logit contributions
umbs-0.025
lette-0.024
beh-0.024
izza-0.023
icking-0.023
Token soccer
Token ID11783
Feature activation-8.72e-02

Pos logit contributions
 inspection+0.140
icol+0.139
yt+0.134
 notice+0.133
 woke+0.131
Neg logit contributions
 equally-0.079
 equal-0.076
 best-0.073
 brothers-0.073
 dollar-0.071
Token again
Token ID757
Feature activation-1.04e-01

Pos logit contributions
icol+0.138
Who+0.136
 avail+0.133
 inspection+0.129
 notice+0.129
Neg logit contributions
o-0.124
igators-0.117
 dollar-0.115
 equally-0.111
melon-0.110
Token?
Token ID30
Feature activation+1.08e-02

Pos logit contributions
icol+0.158
 inspection+0.155
atively+0.153
 robe+0.152
 bumped+0.150
Neg logit contributions
o-0.140
 brothers-0.126
ac-0.124
icking-0.123
aking-0.123
Token I
Token ID314
Feature activation-1.47e-02

Pos logit contributions
umbs+0.020
beh+0.019
lette+0.019
igators+0.018
icking+0.018
Neg logit contributions
 notice-0.013
But-0.013
 stage-0.013
Who-0.012
 woke-0.012
Token will
Token ID481
Feature activation-1.88e-02

Pos logit contributions
 stage+0.023
lf+0.022
But+0.022
Who+0.022
 notice+0.022
Neg logit contributions
umbs-0.021
beh-0.021
lette-0.020
ocado-0.019
izza-0.019
Token work
Token ID670
Feature activation-3.80e-02

Pos logit contributions
Who+0.023
 woke+0.023
 notice+0.023
Stop+0.022
 stage+0.022
Neg logit contributions
lette-0.031
umbs-0.030
beh-0.030
 equal-0.028
icking-0.028
Token hard
Token ID1327
Feature activation-5.91e-02

Pos logit contributions
Who+0.073
 woke+0.071
 notice+0.070
Raw+0.068
 robe+0.067
Neg logit contributions
umbs-0.038
 equal-0.038
 dollar-0.037
lette-0.037
beh-0.037
Token.
Token ID13
Feature activation+1.32e-02

Pos logit contributions
umbs+0.006
beh+0.006
lette+0.006
izza+0.005
igators+0.005
Neg logit contributions
 notice-0.003
But-0.002
 stage-0.002
 woke-0.002
Who-0.002
Token Can
Token ID1680
Feature activation-3.21e-03

Pos logit contributions
umbs+0.023
beh+0.022
lette+0.022
icking+0.021
izza+0.020
Neg logit contributions
 notice-0.018
But-0.017
 stage-0.017
Who-0.016
 woke-0.016
Token we
Token ID356
Feature activation-2.94e-02

Pos logit contributions
 notice+0.004
 stage+0.004
But+0.004
 woke+0.003
 imagined+0.003
Neg logit contributions
umbs-0.006
beh-0.006
lette-0.006
izza-0.006
igators-0.005
Token play
Token ID711
Feature activation-7.76e-02

Pos logit contributions
 stage+0.035
Who+0.035
 notice+0.035
Stop+0.034
La+0.033
Neg logit contributions
lette-0.048
umbs-0.048
beh-0.047
icking-0.045
melon-0.045
Token together
Token ID1978
Feature activation-8.08e-02

Pos logit contributions
icol+0.138
 inspection+0.133
 notice+0.131
 avail+0.130
yt+0.128
Neg logit contributions
 equally-0.092
o-0.085
 equal-0.080
umbs-0.077
ery-0.076
Token again
Token ID757
Feature activation-1.04e-01

Pos logit contributions
icol+0.120
Who+0.101
AAAA+0.100
Raw+0.097
Stop+0.094
Neg logit contributions
o-0.136
icking-0.124
umbs-0.124
 dollar-0.123
lette-0.123
Token?"
Token ID1701
Feature activation-2.64e-02

Pos logit contributions
icol+0.173
Stop+0.163
 robe+0.162
atively+0.156
 bumped+0.151
Neg logit contributions
o-0.138
 brothers-0.114
 best-0.110
allows-0.109
 dollar-0.108
Token he
Token ID339
Feature activation-3.71e-02

Pos logit contributions
 notice+0.035
 stage+0.035
 imagined+0.035
 woke+0.034
 concert+0.033
Neg logit contributions
umbs-0.041
o-0.039
beh-0.039
lette-0.038
izza-0.038
Token said
Token ID531
Feature activation+6.05e-04

Pos logit contributions
 notice+0.054
 recess+0.053
 stage+0.051
icol+0.051
 seek+0.050
Neg logit contributions
o-0.050
igators-0.049
beh-0.049
umbs-0.048
 dollar-0.047
Token.
Token ID13
Feature activation-3.06e-02

Pos logit contributions
umbs+0.001
beh+0.001
lette+0.001
izza+0.001
icking+0.001
Neg logit contributions
 notice-0.001
But-0.000
 stage-0.000
Who-0.000
 woke-0.000
Token
Token ID198
Feature activation-1.89e-02

Pos logit contributions
 notice+0.046
 stage+0.044
 imagined+0.043
 concert+0.042
 avail+0.041
Neg logit contributions
umbs-0.049
o-0.044
izza-0.044
oun-0.042
lette-0.042
Token.
Token ID13
Feature activation-1.44e-03

Pos logit contributions
 notice+0.012
 stage+0.011
But+0.010
Who+0.010
 woke+0.010
Neg logit contributions
umbs-0.019
lette-0.018
beh-0.018
igators-0.017
izza-0.016
Token Can
Token ID1680
Feature activation-9.85e-03

Pos logit contributions
 notice+0.002
 stage+0.002
But+0.002
 woke+0.002
Who+0.002
Neg logit contributions
umbs-0.003
beh-0.003
lette-0.003
izza-0.002
igators-0.002
Token we
Token ID356
Feature activation-1.81e-02

Pos logit contributions
 notice+0.013
 stage+0.012
 imagined+0.012
But+0.012
 woke+0.011
Neg logit contributions
umbs-0.017
beh-0.017
lette-0.016
izza-0.016
igators-0.015
Token play
Token ID711
Feature activation-6.99e-02

Pos logit contributions
 notice+0.022
 stage+0.021
Who+0.021
Stop+0.020
La+0.020
Neg logit contributions
umbs-0.031
lette-0.030
beh-0.029
icking-0.029
izza-0.028
Token together
Token ID1978
Feature activation-8.14e-02

Pos logit contributions
icol+0.134
 inspection+0.132
 notice+0.129
yt+0.128
 avail+0.125
Neg logit contributions
 equally-0.071
 equal-0.066
ery-0.065
o-0.064
umbs-0.063
Token again
Token ID757
Feature activation-1.04e-01

Pos logit contributions
icol+0.115
Who+0.096
 inspection+0.096
Raw+0.094
Stop+0.091
Neg logit contributions
o-0.138
icking-0.135
ac-0.128
 dollar-0.128
umbs-0.126
Token?"
Token ID1701
Feature activation-3.21e-02

Pos logit contributions
icol+0.156
 robe+0.155
atively+0.150
Stop+0.146
 inspection+0.144
Neg logit contributions
o-0.139
 brothers-0.128
 dollar-0.125
icking-0.123
ac-0.123
Token
Token ID198
Feature activation-2.78e-02

Pos logit contributions
 stage+0.052
 concert+0.049
 imagined+0.049
 notice+0.048
lf+0.048
Neg logit contributions
umbs-0.045
o-0.044
izza-0.039
oops-0.038
beh-0.037
Token
Token ID198
Feature activation-1.21e-02

Pos logit contributions
 stage+0.041
 notice+0.041
 imagined+0.039
 concert+0.038
chool+0.038
Neg logit contributions
umbs-0.043
o-0.042
izza-0.039
beh-0.039
lette-0.037
TokenL
Token ID43
Feature activation-1.39e-02

Pos logit contributions
 stage+0.010
 notice+0.010
 woke+0.008
 imagined+0.008
 concert+0.008
Neg logit contributions
umbs-0.027
beh-0.027
lette-0.026
o-0.025
izza-0.025
Tokenila
Token ID10102
Feature activation-1.93e-02

Pos logit contributions
 notice+0.019
 stage+0.019
 suddenly+0.018
But+0.018
 woke+0.017
Neg logit contributions
umbs-0.025
beh-0.023
lette-0.022
icking-0.021
ady-0.021
Token their
Token ID511
Feature activation+1.48e-02

Pos logit contributions
 notice+0.025
 stage+0.023
Stop+0.023
Who+0.023
But+0.023
Neg logit contributions
lette-0.033
beh-0.032
umbs-0.032
icking-0.031
izza-0.031
Token favorite
Token ID4004
Feature activation+5.79e-03

Pos logit contributions
umbs+0.027
beh+0.026
lette+0.025
igators+0.024
izza+0.023
Neg logit contributions
 notice-0.020
 stage-0.019
But-0.018
 woke-0.018
 suddenly-0.017
Token animals
Token ID4695
Feature activation+2.27e-02

Pos logit contributions
umbs+0.011
beh+0.011
lette+0.011
izza+0.010
igators+0.010
Neg logit contributions
 notice-0.007
But-0.006
 stage-0.006
 woke-0.006
Who-0.006
Token were
Token ID547
Feature activation+5.93e-02

Pos logit contributions
umbs+0.045
beh+0.044
lette+0.043
izza+0.041
ocado+0.039
Neg logit contributions
 notice-0.026
 woke-0.024
But-0.024
 stage-0.024
 suddenly-0.023
Token the
Token ID262
Feature activation+4.31e-02

Pos logit contributions
umbs+0.126
izza+0.121
beh+0.118
lette+0.117
umb+0.110
Neg logit contributions
 heard-0.066
 stage-0.066
 notice-0.065
 suddenly-0.064
 saw-0.063
Token sheep
Token ID15900
Feature activation+7.14e-02

Pos logit contributions
umbs+0.071
beh+0.069
izza+0.066
lette+0.062
known+0.060
Neg logit contributions
 notice-0.066
 stage-0.065
But-0.062
 suddenly-0.060
 school-0.060
Token.
Token ID13
Feature activation+2.82e-02

Pos logit contributions
beh+0.188
umbs+0.187
izza+0.181
lette+0.177
oops+0.174
Neg logit contributions
 stage-0.034
 suddenly-0.034
 woke-0.032
 heard-0.029
 notice-0.029
Token The
Token ID383
Feature activation+9.01e-03

Pos logit contributions
beh+0.059
lette+0.057
umbs+0.055
icking+0.053
known+0.052
Neg logit contributions
 notice-0.031
But-0.029
 stage-0.027
Who-0.027
 suddenly-0.027
Token sheep
Token ID15900
Feature activation+2.02e-02

Pos logit contributions
umbs+0.016
beh+0.016
lette+0.016
izza+0.015
icking+0.014
Neg logit contributions
 notice-0.012
But-0.011
 stage-0.011
 woke-0.011
Who-0.011
Token were
Token ID547
Feature activation+6.32e-02

Pos logit contributions
umbs+0.045
beh+0.044
lette+0.043
izza+0.042
icking+0.040
Neg logit contributions
 notice-0.018
But-0.016
 woke-0.016
 stage-0.016
 suddenly-0.016
Token soft
Token ID2705
Feature activation+1.15e-03

Pos logit contributions
izza+0.132
umbs+0.129
lette+0.124
beh+0.124
icking+0.121
Neg logit contributions
 notice-0.070
 heard-0.069
 stage-0.068
 saw-0.066
 suddenly-0.064
Token and
Token ID290
Feature activation-2.76e-02

Pos logit contributions
 notice+0.020
Who+0.020
Stop+0.019
But+0.019
La+0.017
Neg logit contributions
umbs-0.047
lette-0.043
icking-0.042
beh-0.041
izza-0.041
Token hugged
Token ID41130
Feature activation-2.16e-02

Pos logit contributions
But+0.035
 notice+0.034
Stop+0.033
Who+0.032
 stage+0.030
Neg logit contributions
umbs-0.046
beh-0.046
o-0.044
lette-0.044
igators-0.043
Token the
Token ID262
Feature activation+1.73e-03

Pos logit contributions
 notice+0.037
 stage+0.036
 seek+0.036
Who+0.035
 imagined+0.034
Neg logit contributions
umbs-0.026
lette-0.025
allows-0.024
beh-0.023
izza-0.023
Token t
Token ID256
Feature activation+2.95e-02

Pos logit contributions
umbs+0.004
lette+0.003
beh+0.003
izza+0.003
igators+0.003
Neg logit contributions
 notice-0.002
But-0.002
 stage-0.002
 woke-0.001
Who-0.001
Tokeneddy
Token ID21874
Feature activation+1.19e-02

Pos logit contributions
umbs+0.036
beh+0.036
lette+0.035
igators+0.035
izza+0.032
Neg logit contributions
But-0.050
 woke-0.050
 notice-0.049
 stage-0.048
Who-0.048
Token bear
Token ID6842
Feature activation+8.51e-02

Pos logit contributions
umbs+0.022
beh+0.022
lette+0.021
izza+0.020
igators+0.020
Neg logit contributions
 notice-0.014
But-0.014
 stage-0.014
 woke-0.014
Who-0.013
Token.
Token ID13
Feature activation-9.13e-03

Pos logit contributions
beh+0.254
umbs+0.239
known+0.236
ction+0.235
gary+0.234
Neg logit contributions
 again-0.027
,-0.014
.-0.011
 dress-0.010
 quietly-0.010
Token They
Token ID1119
Feature activation-8.60e-03

Pos logit contributions
 notice+0.018
 stage+0.016
But+0.016
 imagined+0.016
 woke+0.016
Neg logit contributions
umbs-0.011
izza-0.010
lette-0.009
beh-0.009
allows-0.009
Token thanked
Token ID26280
Feature activation-6.33e-03

Pos logit contributions
 notice+0.011
 stage+0.011
But+0.011
Who+0.010
Stop+0.010
Neg logit contributions
umbs-0.015
lette-0.014
beh-0.014
izza-0.013
igators-0.013
Token their
Token ID511
Feature activation+7.40e-03

Pos logit contributions
 notice+0.009
But+0.008
 stage+0.008
 imagined+0.008
 woke+0.008
Neg logit contributions
umbs-0.011
lette-0.010
beh-0.010
izza-0.010
igators-0.009
Token mom
Token ID1995
Feature activation+4.18e-02

Pos logit contributions
umbs+0.015
beh+0.015
lette+0.015
izza+0.014
igators+0.014
Neg logit contributions
 notice-0.008
 stage-0.007
But-0.007
 woke-0.007
Who-0.007
Token
Token ID198
Feature activation+1.48e-02

Pos logit contributions
umbs+0.019
beh+0.019
lette+0.018
igators+0.017
icking+0.017
Neg logit contributions
 notice-0.008
But-0.008
 stage-0.007
 woke-0.007
Who-0.007
TokenJ
Token ID41
Feature activation+3.11e-02

Pos logit contributions
umbs+0.034
beh+0.033
lette+0.033
icking+0.031
ishes+0.031
Neg logit contributions
But-0.013
 notice-0.013
Who-0.011
 stage-0.011
 suddenly-0.011
Tokenill
Token ID359
Feature activation+2.44e-02

Pos logit contributions
beh+0.052
umbs+0.050
lette+0.048
igators+0.046
known+0.046
Neg logit contributions
 notice-0.042
 woke-0.040
But-0.040
 stage-0.039
 heard-0.039
Token and
Token ID290
Feature activation-4.95e-03

Pos logit contributions
umbs+0.059
beh+0.058
lette+0.056
izza+0.054
igators+0.054
Neg logit contributions
 notice-0.016
But-0.016
 woke-0.016
 imagined-0.015
 suddenly-0.015
Token Jack
Token ID3619
Feature activation+5.08e-02

Pos logit contributions
 notice+0.008
But+0.007
 stage+0.007
 woke+0.007
Who+0.007
Neg logit contributions
umbs-0.007
lette-0.007
beh-0.007
izza-0.007
icking-0.007
Token got
Token ID1392
Feature activation+7.54e-02

Pos logit contributions
beh+0.116
umbs+0.116
igators+0.105
lette+0.105
oun+0.102
Neg logit contributions
 imagined-0.052
 heard-0.049
 woke-0.048
 saw-0.048
 suddenly-0.047
Token very
Token ID845
Feature activation+4.97e-02

Pos logit contributions
umbs+0.206
wy+0.192
umb+0.189
oun+0.187
ocado+0.187
Neg logit contributions
 it-0.055
 suddenly-0.051
 notice-0.050
 saw-0.049
 again-0.047
Token excited
Token ID6568
Feature activation+1.68e-04

Pos logit contributions
umbs+0.102
lette+0.092
beh+0.088
herer+0.085
umb+0.085
Neg logit contributions
 notice-0.071
 suddenly-0.063
But-0.061
 imagined-0.060
 saw-0.059
Token and
Token ID290
Feature activation+2.89e-02

Pos logit contributions
umbs+0.000
lette+0.000
beh+0.000
izza+0.000
icking+0.000
Neg logit contributions
 notice-0.000
But-0.000
 stage-0.000
Who-0.000
 woke-0.000
Token started
Token ID2067
Feature activation+5.43e-02

Pos logit contributions
umbs+0.071
beh+0.066
lette+0.066
igators+0.062
umb+0.062
Neg logit contributions
 notice-0.023
 suddenly-0.022
 imagined-0.021
 woke-0.020
 stage-0.020
Token playing
Token ID2712
Feature activation-2.63e-02

Pos logit contributions
umbs+0.140
ocado+0.127
wy+0.126
beh+0.125
igators+0.124
Neg logit contributions
 notice-0.045
 stage-0.043
 suddenly-0.042
 heard-0.040
 dress-0.038
Token,
Token ID11
Feature activation+1.76e-02

Pos logit contributions
umbs+0.029
beh+0.028
lette+0.028
izza+0.026
igators+0.026
Neg logit contributions
 notice-0.022
But-0.021
 stage-0.020
 woke-0.020
 suddenly-0.020
Token so
Token ID523
Feature activation-1.82e-03

Pos logit contributions
umbs+0.039
beh+0.038
lette+0.038
izza+0.036
igators+0.036
Neg logit contributions
 notice-0.016
But-0.016
 stage-0.014
 woke-0.014
 suddenly-0.014
Token he
Token ID339
Feature activation-3.63e-03

Pos logit contributions
 notice+0.002
 stage+0.002
But+0.002
 woke+0.002
Stop+0.002
Neg logit contributions
umbs-0.004
beh-0.004
lette-0.003
icking-0.003
izza-0.003
Token gave
Token ID2921
Feature activation+2.10e-03

Pos logit contributions
 notice+0.004
 stage+0.004
But+0.004
Who+0.004
 woke+0.004
Neg logit contributions
umbs-0.007
beh-0.007
lette-0.007
izza-0.006
igators-0.006
Token Sus
Token ID8932
Feature activation+4.09e-02

Pos logit contributions
umbs+0.004
beh+0.004
lette+0.004
izza+0.004
igators+0.004
Neg logit contributions
 notice-0.002
But-0.002
 stage-0.002
 woke-0.002
Who-0.002
Tokenie
Token ID494
Feature activation+9.54e-02

Pos logit contributions
umbs+0.093
izza+0.088
lette+0.086
beh+0.085
ages+0.080
Neg logit contributions
 woke-0.033
 again-0.033
 notice-0.031
 heard-0.031
But-0.030
Token the
Token ID262
Feature activation+2.36e-02

Pos logit contributions
umbs+0.255
beh+0.240
lette+0.237
 supposed+0.236
rol+0.236
Neg logit contributions
 dress-0.053
 again-0.052
 heard-0.048
 her-0.047
 school-0.046
Token paint
Token ID7521
Feature activation+2.57e-02

Pos logit contributions
umbs+0.046
beh+0.046
lette+0.043
izza+0.040
igators+0.040
Neg logit contributions
 notice-0.030
 stage-0.029
But-0.027
 dress-0.027
 school-0.026
Token and
Token ID290
Feature activation-4.40e-03

Pos logit contributions
beh+0.057
umbs+0.056
lette+0.054
igators+0.052
izza+0.052
Neg logit contributions
 notice-0.023
 stage-0.022
But-0.021
 dress-0.020
 woke-0.020
Token they
Token ID484
Feature activation+2.11e-02

Pos logit contributions
 notice+0.005
But+0.005
 stage+0.005
Who+0.004
 woke+0.004
Neg logit contributions
umbs-0.008
beh-0.008
lette-0.008
izza-0.008
igators-0.008
Token both
Token ID1111
Feature activation+2.74e-02

Pos logit contributions
umbs+0.041
beh+0.038
lette+0.038
izza+0.036
igators+0.035
Neg logit contributions
 notice-0.026
 imagined-0.025
 woke-0.024
 suddenly-0.024
But-0.024
Token thanked
Token ID26280
Feature activation-1.57e-02

Pos logit contributions
 notice+0.001
But+0.001
 stage+0.001
Who+0.001
 woke+0.001
Neg logit contributions
umbs-0.002
beh-0.002
lette-0.001
izza-0.001
igators-0.001
Token her
Token ID607
Feature activation-6.48e-03

Pos logit contributions
 notice+0.020
But+0.019
 imagined+0.018
 woke+0.018
 stage+0.018
Neg logit contributions
umbs-0.027
beh-0.027
lette-0.026
igators-0.025
izza-0.025
Token mom
Token ID1995
Feature activation+2.97e-02

Pos logit contributions
 notice+0.008
But+0.008
Stop+0.008
 stage+0.008
 woke+0.007
Neg logit contributions
umbs-0.011
beh-0.011
lette-0.011
izza-0.011
icking-0.011
Token for
Token ID329
Feature activation+9.92e-03

Pos logit contributions
umbs+0.064
beh+0.063
lette+0.060
igators+0.059
izza+0.058
Neg logit contributions
 notice-0.028
But-0.027
 again-0.027
 woke-0.026
 stage-0.025
Token being
Token ID852
Feature activation+3.69e-02

Pos logit contributions
umbs+0.020
beh+0.019
lette+0.018
izza+0.018
igators+0.017
Neg logit contributions
 notice-0.011
But-0.011
 stage-0.011
 woke-0.010
Who-0.010
Token so
Token ID523
Feature activation+7.39e-02

Pos logit contributions
umbs+0.074
izza+0.071
oun+0.067
icking+0.066
beh+0.065
Neg logit contributions
But-0.047
 notice-0.044
 suddenly-0.043
 heard-0.042
Who-0.039
Token helpful
Token ID7613
Feature activation-5.83e-03

Pos logit contributions
umbs+0.180
umb+0.146
izza+0.141
unes+0.141
ogram+0.138
Neg logit contributions
But-0.079
 she-0.076
Who-0.072
 seek-0.072
 quietly-0.070
Token.
Token ID13
Feature activation+1.66e-02

Pos logit contributions
 notice+0.005
 stage+0.005
But+0.005
Who+0.005
La+0.004
Neg logit contributions
umbs-0.013
lette-0.012
beh-0.012
icking-0.012
igators-0.011
Token That
Token ID1320
Feature activation+3.05e-02

Pos logit contributions
umbs+0.032
beh+0.032
lette+0.031
icking+0.029
igators+0.029
Neg logit contributions
 notice-0.019
But-0.018
 stage-0.017
 woke-0.017
Who-0.017
Token night
Token ID1755
Feature activation-1.77e-02

Pos logit contributions
umbs+0.068
beh+0.066
lette+0.061
izza+0.059
allows+0.058
Neg logit contributions
 woke-0.032
 stage-0.031
 notice-0.030
 suddenly-0.030
But-0.029
Token,
Token ID11
Feature activation+1.14e-02

Pos logit contributions
 notice+0.016
Who+0.014
But+0.014
 stage+0.014
Stop+0.013
Neg logit contributions
umbs-0.038
lette-0.038
beh-0.037
icking-0.036
izza-0.036
Token do
Token ID466
Feature activation-1.26e-02

Pos logit contributions
 notice+0.008
But+0.007
 stage+0.007
Who+0.007
 woke+0.007
Neg logit contributions
umbs-0.011
lette-0.011
beh-0.011
icking-0.010
igators-0.010
Token it
Token ID340
Feature activation-3.01e-02

Pos logit contributions
 notice+0.012
But+0.011
Who+0.011
Stop+0.010
 woke+0.010
Neg logit contributions
umbs-0.026
lette-0.026
beh-0.025
izza-0.025
allows-0.024
Token again
Token ID757
Feature activation-4.60e-02

Pos logit contributions
 notice+0.031
Who+0.031
Stop+0.030
 stage+0.030
But+0.029
Neg logit contributions
umbs-0.060
lette-0.056
beh-0.056
icking-0.055
allows-0.055
Token soon
Token ID2582
Feature activation-2.78e-02

Pos logit contributions
Stop+0.056
 stage+0.055
 robe+0.053
Who+0.053
 notice+0.051
Neg logit contributions
lette-0.081
umbs-0.076
 equal-0.075
o-0.075
allows-0.074
Token."
Token ID526
Feature activation+1.45e-02

Pos logit contributions
Who+0.032
 notice+0.032
But+0.032
 stage+0.031
Stop+0.030
Neg logit contributions
umbs-0.052
lette-0.050
beh-0.048
icking-0.046
izza-0.046
Token
Token ID220
Feature activation+3.72e-02

Pos logit contributions
umbs+0.029
beh+0.029
lette+0.028
igators+0.026
icking+0.026
Neg logit contributions
 notice-0.016
But-0.015
 stage-0.015
 woke-0.015
Who-0.014
Token
Token ID198
Feature activation+9.53e-03

Pos logit contributions
beh+0.093
lette+0.090
umbs+0.089
igators+0.087
known+0.084
Neg logit contributions
But-0.023
 notice-0.022
 stage-0.018
-0.018
 suddenly-0.017
Token
Token ID198
Feature activation+2.30e-02

Pos logit contributions
umbs+0.021
beh+0.020
lette+0.020
igators+0.019
icking+0.019
Neg logit contributions
 notice-0.009
But-0.009
 stage-0.008
 woke-0.008
Who-0.007
TokenThe
Token ID464
Feature activation+1.90e-02

Pos logit contributions
umbs+0.047
beh+0.046
lette+0.044
icking+0.043
ishes+0.043
Neg logit contributions
But-0.029
 notice-0.025
Who-0.024
 suddenly-0.024
 stage-0.023
Token fat
Token ID3735
Feature activation+7.91e-03

Pos logit contributions
umbs+0.037
beh+0.036
lette+0.034
igators+0.033
izza+0.032
Neg logit contributions
 stage-0.023
But-0.023
 notice-0.023
 woke-0.021
 suddenly-0.021
Token cat
Token ID3797
Feature activation-8.25e-03

Pos logit contributions
umbs+0.015
beh+0.015
lette+0.015
izza+0.014
igators+0.014
Neg logit contributions
 notice-0.009
But-0.009
 stage-0.009
 woke-0.008
Who-0.008
Token looked
Token ID3114
Feature activation+1.47e-02

Pos logit contributions
 notice+0.016
 stage+0.016
But+0.015
La+0.014
Who+0.014
Neg logit contributions
umbs-0.028
lette-0.028
beh-0.028
izza-0.026
igators-0.026
Token around
Token ID1088
Feature activation-3.04e-02

Pos logit contributions
umbs+0.027
beh+0.026
lette+0.026
izza+0.024
igators+0.023
Neg logit contributions
 notice-0.020
But-0.018
 stage-0.018
 woke-0.018
 suddenly-0.017
Token for
Token ID329
Feature activation+5.34e-03

Pos logit contributions
La+0.037
Who+0.035
 notice+0.035
Stop+0.034
 woke+0.034
Neg logit contributions
umbs-0.049
lette-0.048
 equal-0.048
icking-0.047
melon-0.046
Token the
Token ID262
Feature activation-1.56e-02

Pos logit contributions
umbs+0.010
beh+0.010
lette+0.010
izza+0.009
igators+0.009
Neg logit contributions
 notice-0.006
But-0.006
 stage-0.006
 woke-0.006
Who-0.006
Token other
Token ID584
Feature activation+1.54e-04

Pos logit contributions
 notice+0.021
La+0.019
But+0.019
 stage+0.019
 imagined+0.019
Neg logit contributions
lette-0.026
umbs-0.025
beh-0.024
izza-0.023
 dollar-0.023
Token animals
Token ID4695
Feature activation+3.98e-02

Pos logit contributions
umbs+0.000
lette+0.000
beh+0.000
izza+0.000
igators+0.000
Neg logit contributions
 notice-0.000
But-0.000
 stage-0.000
 woke-0.000
Who-0.000
Token he
Token ID339
Feature activation-1.69e-02

Pos logit contributions
umbs+0.099
beh+0.096
lette+0.092
izza+0.091
oun+0.089
Neg logit contributions
 notice-0.029
But-0.025
 stage-0.024
 heard-0.024
 woke-0.024
Token had
Token ID550
Feature activation-1.22e-02

Pos logit contributions
 stage+0.018
 notice+0.018
But+0.017
La+0.016
Stop+0.016
Neg logit contributions
umbs-0.032
lette-0.032
beh-0.032
ocado-0.030
o-0.030
Token shared
Token ID4888
Feature activation+2.81e-03

Pos logit contributions
 stage+0.018
 notice+0.018
But+0.017
Who+0.017
La+0.016
Neg logit contributions
umbs-0.019
lette-0.019
beh-0.019
icking-0.017
izza-0.017
Token his
Token ID465
Feature activation+1.29e-02

Pos logit contributions
umbs+0.006
beh+0.006
lette+0.006
izza+0.006
igators+0.006
Neg logit contributions
 notice-0.002
But-0.002
 stage-0.002
 woke-0.002
Who-0.002
Token cheese
Token ID9891
Feature activation+4.42e-02

Pos logit contributions
umbs+0.019
beh+0.018
lette+0.017
igators+0.016
icking+0.016
Neg logit contributions
 notice-0.021
But-0.021
 stage-0.020
 woke-0.020
Who-0.020
Token it
Token ID340
Feature activation+9.33e-04

Pos logit contributions
 notice+0.005
But+0.004
 stage+0.004
 woke+0.004
Who+0.004
Neg logit contributions
umbs-0.010
lette-0.009
beh-0.009
izza-0.009
icking-0.009
Token was
Token ID373
Feature activation+1.75e-02

Pos logit contributions
umbs+0.002
beh+0.002
lette+0.002
izza+0.002
igators+0.002
Neg logit contributions
 notice-0.001
But-0.001
 stage-0.001
Who-0.001
 woke-0.001
Token just
Token ID655
Feature activation+1.90e-02

Pos logit contributions
umbs+0.034
beh+0.034
lette+0.033
izza+0.032
igators+0.031
Neg logit contributions
 notice-0.021
But-0.020
 suddenly-0.019
 stage-0.019
 woke-0.018
Token the
Token ID262
Feature activation+3.85e-02

Pos logit contributions
umbs+0.038
beh+0.036
lette+0.035
izza+0.034
igators+0.033
Neg logit contributions
 notice-0.023
 stage-0.021
But-0.021
 suddenly-0.021
 woke-0.020
Token right
Token ID826
Feature activation-1.42e-02

Pos logit contributions
umbs+0.071
beh+0.068
lette+0.059
izza+0.058
itt+0.058
Neg logit contributions
 notice-0.057
 stage-0.056
But-0.054
 suddenly-0.053
 school-0.051
Token size
Token ID2546
Feature activation+2.19e-02

Pos logit contributions
 notice+0.020
 stage+0.019
But+0.019
La+0.019
Stop+0.019
Neg logit contributions
lette-0.023
umbs-0.023
beh-0.022
icking-0.021
bi-0.021
Token to
Token ID284
Feature activation-1.20e-02

Pos logit contributions
umbs+0.047
beh+0.046
lette+0.045
izza+0.043
igators+0.042
Neg logit contributions
 notice-0.021
 stage-0.019
But-0.019
 woke-0.018
 suddenly-0.018
Token sail
Token ID14936
Feature activation-1.51e-02

Pos logit contributions
 notice+0.015
But+0.014
 stage+0.014
Who+0.014
Stop+0.014
Neg logit contributions
umbs-0.022
lette-0.021
beh-0.020
icking-0.019
igators-0.019
Token back
Token ID736
Feature activation-1.03e-02

Pos logit contributions
 notice+0.017
Who+0.016
But+0.016
 stage+0.016
La+0.015
Neg logit contributions
umbs-0.029
lette-0.028
beh-0.027
igators-0.026
icking-0.026
Token home
Token ID1363
Feature activation+3.21e-03

Pos logit contributions
 notice+0.014
But+0.013
Who+0.013
 stage+0.013
Stop+0.013
Neg logit contributions
umbs-0.018
beh-0.017
lette-0.017
icking-0.016
igators-0.016
Token.
Token ID13
Feature activation+2.83e-02

Pos logit contributions
umbs+0.007
beh+0.007
lette+0.007
izza+0.007
igators+0.007
Neg logit contributions
 notice-0.003
But-0.002
 stage-0.002
 woke-0.002
Who-0.002
TokenOne
Token ID3198
Feature activation+3.00e-02

Pos logit contributions
umbs+0.019
beh+0.019
lette+0.018
izza+0.017
icking+0.017
Neg logit contributions
 notice-0.009
But-0.009
 stage-0.008
 woke-0.008
Who-0.008
Token day
Token ID1110
Feature activation-8.09e-03

Pos logit contributions
umbs+0.064
beh+0.063
lette+0.060
ishes+0.059
rol+0.058
Neg logit contributions
But-0.033
 notice-0.032
 woke-0.032
 school-0.032
 heard-0.029
Token,
Token ID11
Feature activation-7.49e-04

Pos logit contributions
 notice+0.006
 stage+0.006
But+0.006
Who+0.006
La+0.006
Neg logit contributions
umbs-0.018
lette-0.017
beh-0.017
izza-0.017
icking-0.017
Token he
Token ID339
Feature activation-3.49e-02

Pos logit contributions
 notice+0.001
 stage+0.001
But+0.001
 woke+0.001
Who+0.001
Neg logit contributions
umbs-0.001
beh-0.001
lette-0.001
izza-0.001
igators-0.001
Token decided
Token ID3066
Feature activation-3.35e-02

Pos logit contributions
 stage+0.045
 avail+0.044
 notice+0.039
 ti+0.039
 mat+0.039
Neg logit contributions
o-0.060
beh-0.056
ocado-0.056
ady-0.055
lette-0.055
Token to
Token ID284
Feature activation+1.58e-02

Pos logit contributions
La+0.056
 notice+0.053
chool+0.052
 stage+0.052
Stop+0.052
Neg logit contributions
beh-0.040
lette-0.039
 supposed-0.037
o-0.036
izza-0.036
Token visit
Token ID3187
Feature activation-1.74e-02

Pos logit contributions
umbs+0.033
beh+0.031
lette+0.031
izza+0.030
ocado+0.029
Neg logit contributions
 notice-0.018
 stage-0.016
But-0.016
 woke-0.016
 suddenly-0.015
Token the
Token ID262
Feature activation-2.06e-03

Pos logit contributions
 notice+0.028
Stop+0.027
But+0.026
La+0.026
Who+0.025
Neg logit contributions
umbs-0.023
beh-0.023
lette-0.022
izza-0.022
igators-0.022
Token living
Token ID2877
Feature activation-3.83e-02

Pos logit contributions
 notice+0.002
But+0.002
 stage+0.002
 woke+0.002
 imagined+0.002
Neg logit contributions
umbs-0.004
lette-0.004
beh-0.004
izza-0.004
igators-0.004
Token room
Token ID2119
Feature activation-9.67e-03

Pos logit contributions
 notice+0.058
La+0.053
icol+0.052
Stop+0.052
Exc+0.052
Neg logit contributions
lette-0.053
ishes-0.052
beh-0.050
o-0.049
asp-0.049
Token.
Token ID13
Feature activation+2.23e-02

Pos logit contributions
 notice+0.010
But+0.010
 stage+0.010
Stop+0.009
Who+0.009
Neg logit contributions
umbs-0.019
lette-0.018
beh-0.018
icking-0.018
izza-0.017
Token to
Token ID284
Feature activation-1.46e-02

Pos logit contributions
umbs+0.036
beh+0.035
lette+0.034
icking+0.033
izza+0.033
Neg logit contributions
 notice-0.014
 stage-0.013
But-0.013
 woke-0.013
Who-0.012
Token go
Token ID467
Feature activation-3.90e-03

Pos logit contributions
But+0.017
 notice+0.016
Stop+0.016
 stage+0.015
 suddenly+0.015
Neg logit contributions
umbs-0.027
lette-0.026
beh-0.025
igators-0.025
izza-0.025
Token to
Token ID284
Feature activation-2.23e-02

Pos logit contributions
 notice+0.005
But+0.004
 stage+0.004
Who+0.004
Stop+0.004
Neg logit contributions
umbs-0.007
beh-0.007
lette-0.007
igators-0.007
icking-0.006
Token church
Token ID4928
Feature activation-2.16e-02

Pos logit contributions
But+0.031
 notice+0.030
Stop+0.029
 suddenly+0.029
 woke+0.029
Neg logit contributions
umbs-0.033
beh-0.032
igators-0.031
lette-0.030
allows-0.030
Token with
Token ID351
Feature activation-3.60e-03

Pos logit contributions
Stop+0.026
 notice+0.026
Who+0.025
 woke+0.024
Raw+0.024
Neg logit contributions
lette-0.038
umbs-0.038
beh-0.037
allows-0.036
icking-0.035
Token her
Token ID607
Feature activation+1.32e-02

Pos logit contributions
 notice+0.003
But+0.003
 stage+0.003
Who+0.003
 woke+0.003
Neg logit contributions
umbs-0.008
lette-0.008
beh-0.008
izza-0.007
igators-0.007
Token family
Token ID1641
Feature activation+3.73e-02

Pos logit contributions
umbs+0.028
beh+0.027
lette+0.027
izza+0.025
igators+0.025
Neg logit contributions
 notice-0.013
 stage-0.013
But-0.013
 woke-0.012
 school-0.012
Token every
Token ID790
Feature activation-6.72e-04

Pos logit contributions
umbs+0.110
beh+0.107
izza+0.101
lette+0.101
known+0.101
Neg logit contributions
But-0.012
 notice-0.008
 stage-0.008
 school-0.008
?"-0.007
Token Sunday
Token ID3502
Feature activation-4.70e-03

Pos logit contributions
 notice+0.001
But+0.001
 stage+0.001
Who+0.001
 woke+0.001
Neg logit contributions
umbs-0.001
lette-0.001
beh-0.001
igators-0.001
izza-0.001
Token.
Token ID13
Feature activation-3.19e-03

Pos logit contributions
 notice+0.004
But+0.004
 stage+0.004
Who+0.004
Stop+0.004
Neg logit contributions
umbs-0.010
lette-0.010
beh-0.010
izza-0.009
icking-0.009
Token One
Token ID1881
Feature activation+2.25e-02

Pos logit contributions
 notice+0.005
But+0.004
 stage+0.004
 imagined+0.004
 woke+0.004
Neg logit contributions
umbs-0.005
lette-0.005
beh-0.005
izza-0.005
igators-0.005
Token falls
Token ID8953
Feature activation-2.20e-02

Pos logit contributions
umbs+0.024
beh+0.023
lette+0.023
izza+0.021
igators+0.021
Neg logit contributions
 notice-0.010
But-0.009
 stage-0.009
 woke-0.009
 suddenly-0.009
Token out
Token ID503
Feature activation-1.50e-02

Pos logit contributions
 notice+0.028
But+0.028
Who+0.027
 stage+0.027
La+0.025
Neg logit contributions
umbs-0.039
beh-0.036
lette-0.036
igators-0.035
icking-0.035
Token and
Token ID290
Feature activation-7.23e-03

Pos logit contributions
 notice+0.024
But+0.023
 stage+0.023
Who+0.023
La+0.022
Neg logit contributions
umbs-0.022
beh-0.020
lette-0.020
icking-0.019
igators-0.019
Token they
Token ID484
Feature activation+2.14e-02

Pos logit contributions
 notice+0.009
But+0.009
 stage+0.008
Who+0.008
La+0.008
Neg logit contributions
umbs-0.013
beh-0.013
lette-0.012
igators-0.012
izza-0.012
Token become
Token ID1716
Feature activation+4.64e-02

Pos logit contributions
umbs+0.044
lette+0.042
beh+0.042
izza+0.040
ady+0.039
Neg logit contributions
 notice-0.024
 woke-0.022
But-0.021
 stage-0.021
 suddenly-0.021
Token bald
Token ID28273
Feature activation+1.97e-03

Pos logit contributions
umbs+0.088
izza+0.081
oun+0.077
lette+0.075
igators+0.074
Neg logit contributions
 notice-0.068
 suddenly-0.064
 heard-0.060
But-0.060
 imagined-0.059
Token.
Token ID13
Feature activation+2.04e-02

Pos logit contributions
umbs+0.004
beh+0.004
lette+0.004
izza+0.004
igators+0.004
Neg logit contributions
 notice-0.002
But-0.002
 stage-0.001
 woke-0.001
Who-0.001
Token Tim
Token ID5045
Feature activation+1.29e-02

Pos logit contributions
umbs+0.039
beh+0.038
lette+0.038
ocado+0.035
ady+0.034
Neg logit contributions
But-0.025
 notice-0.024
 stage-0.023
 woke-0.023
 suddenly-0.023
Tokenmy
Token ID1820
Feature activation-1.62e-02

Pos logit contributions
umbs+0.014
beh+0.013
lette+0.013
izza+0.012
igators+0.012
Neg logit contributions
 notice-0.026
But-0.025
 woke-0.025
 stage-0.024
 imagined-0.024
Token didn
Token ID1422
Feature activation+3.00e-02

Pos logit contributions
 notice+0.019
 stage+0.018
Who+0.017
But+0.017
Stop+0.017
Neg logit contributions
lette-0.029
umbs-0.029
beh-0.029
izza-0.027
icking-0.027
Token't
Token ID470
Feature activation-8.56e-03

Pos logit contributions
lette+0.064
beh+0.063
umbs+0.063
ishes+0.062
ocado+0.062
Neg logit contributions
 notice-0.030
But-0.029
 stage-0.027
 suddenly-0.026
 woke-0.024
Token One
Token ID1881
Feature activation+2.50e-02

Pos logit contributions
 notice+0.022
 stage+0.021
 imagined+0.021
But+0.021
Who+0.020
Neg logit contributions
umbs-0.019
lette-0.018
izza-0.017
oun-0.016
beh-0.016
Token day
Token ID1110
Feature activation-3.52e-02

Pos logit contributions
umbs+0.051
beh+0.050
lette+0.048
izza+0.047
ishes+0.046
Neg logit contributions
But-0.029
 woke-0.029
 notice-0.029
 school-0.027
 stage-0.026
Token,
Token ID11
Feature activation-4.46e-02

Pos logit contributions
 notice+0.042
Who+0.042
 stage+0.041
 imagined+0.040
La+0.040
Neg logit contributions
umbs-0.058
o-0.055
 dollar-0.054
itt-0.054
izza-0.054
Token she
Token ID673
Feature activation-4.86e-02

Pos logit contributions
icol+0.063
Raw+0.062
La+0.062
 imagined+0.061
 portal+0.060
Neg logit contributions
melon-0.055
umbs-0.053
allows-0.049
o-0.049
 highlight-0.049
Token saw
Token ID2497
Feature activation-7.17e-03

Pos logit contributions
 avail+0.062
 stage+0.061
Really+0.054
But+0.054
chool+0.053
Neg logit contributions
o-0.084
ocado-0.077
lette-0.075
icking-0.075
ady-0.075
Token a
Token ID257
Feature activation-2.88e-02

Pos logit contributions
 notice+0.011
But+0.010
 stage+0.010
 imagined+0.009
 woke+0.009
Neg logit contributions
umbs-0.011
lette-0.011
beh-0.011
igators-0.010
izza-0.010
Token patient
Token ID5827
Feature activation-1.39e-02

Pos logit contributions
 notice+0.047
 imagined+0.044
But+0.044
Stop+0.042
La+0.041
Neg logit contributions
umbs-0.038
lette-0.037
 dollar-0.036
ocado-0.036
izza-0.035
Token dog
Token ID3290
Feature activation+6.08e-03

Pos logit contributions
 notice+0.016
But+0.015
Who+0.014
Stop+0.014
La+0.014
Neg logit contributions
umbs-0.026
lette-0.026
beh-0.025
igators-0.024
icking-0.023
Token sitting
Token ID5586
Feature activation-1.44e-02

Pos logit contributions
umbs+0.012
beh+0.011
lette+0.011
izza+0.010
igators+0.010
Neg logit contributions
 notice-0.007
But-0.007
 stage-0.007
 woke-0.007
Who-0.007
Token in
Token ID287
Feature activation+2.72e-03

Pos logit contributions
 notice+0.021
 stage+0.020
 woke+0.019
Who+0.019
La+0.018
Neg logit contributions
beh-0.023
umbs-0.023
lette-0.022
icking-0.022
o-0.021
Token the
Token ID262
Feature activation+1.95e-04

Pos logit contributions
umbs+0.005
beh+0.005
lette+0.005
izza+0.005
igators+0.005
Neg logit contributions
 notice-0.003
But-0.003
 stage-0.003
 woke-0.003
Who-0.003
Token
Token ID251
Feature activation-9.37e-03

Pos logit contributions
umbs+0.006
beh+0.005
lette+0.005
izza+0.005
igators+0.005
Neg logit contributions
 notice-0.005
But-0.005
 stage-0.005
 woke-0.005
Who-0.005
Tokenit
Token ID270
Feature activation-2.16e-02

Pos logit contributions
 notice+0.011
 stage+0.010
Who+0.009
Stop+0.009
But+0.009
Neg logit contributions
umbs-0.018
lette-0.018
beh-0.017
izza-0.017
allows-0.016
Token was
Token ID373
Feature activation+2.45e-02

Pos logit contributions
 notice+0.022
 stage+0.021
Stop+0.019
But+0.018
 recess+0.018
Neg logit contributions
lette-0.044
umbs-0.043
beh-0.043
o-0.041
ocado-0.040
Token special
Token ID2041
Feature activation-9.16e-03

Pos logit contributions
umbs+0.044
beh+0.041
lette+0.040
izza+0.040
ocado+0.037
Neg logit contributions
 notice-0.034
But-0.033
 suddenly-0.033
 stage-0.032
 woke-0.031
Token.
Token ID13
Feature activation+5.49e-03

Pos logit contributions
 notice+0.010
Stop+0.009
But+0.009
Who+0.009
La+0.009
Neg logit contributions
umbs-0.018
lette-0.018
beh-0.018
icking-0.017
igators-0.017
Token Every
Token ID3887
Feature activation-6.76e-03

Pos logit contributions
umbs+0.010
beh+0.009
lette+0.009
izza+0.008
igators+0.008
Neg logit contributions
 notice-0.008
But-0.007
 stage-0.007
 woke-0.007
Who-0.007
Token day
Token ID1110
Feature activation-9.37e-03

Pos logit contributions
 notice+0.011
But+0.010
 stage+0.010
Who+0.010
Stop+0.010
Neg logit contributions
umbs-0.010
lette-0.009
beh-0.009
igators-0.008
 dollar-0.008
Token it
Token ID340
Feature activation-7.37e-03

Pos logit contributions
 notice+0.008
But+0.008
 stage+0.007
Who+0.007
 imagined+0.007
Neg logit contributions
umbs-0.020
lette-0.019
beh-0.019
izza-0.019
icking-0.018
Token sat
Token ID3332
Feature activation-3.91e-02

Pos logit contributions
 notice+0.008
 stage+0.008
But+0.008
Who+0.007
Stop+0.007
Neg logit contributions
umbs-0.014
beh-0.014
lette-0.014
igators-0.013
izza-0.013
Token on
Token ID319
Feature activation-1.21e-02

Pos logit contributions
 notice+0.069
La+0.064
Who+0.064
 stage+0.063
 speaker+0.063
Neg logit contributions
icking-0.051
ishes-0.049
o-0.048
aking-0.047
ish-0.047
Token the
Token ID262
Feature activation-6.83e-04

Pos logit contributions
 notice+0.016
But+0.015
Who+0.015
 imagined+0.014
 woke+0.014
Neg logit contributions
umbs-0.021
lette-0.020
beh-0.019
izza-0.019
igators-0.018
TokenOnce
Token ID7454
Feature activation+1.78e-02

Pos logit contributions
 notice+0.020
 stage+0.019
 imagined+0.018
But+0.018
Who+0.018
Neg logit contributions
umbs-0.031
lette-0.028
beh-0.027
o-0.027
izza-0.027
Token upon
Token ID2402
Feature activation-1.26e-02

Pos logit contributions
umbs+0.035
beh+0.035
lette+0.034
izza+0.032
igators+0.031
Neg logit contributions
 notice-0.020
But-0.019
 stage-0.019
 woke-0.018
 suddenly-0.018
Token a
Token ID257
Feature activation-1.30e-03

Pos logit contributions
 notice+0.019
 stage+0.019
But+0.018
Stop+0.017
 woke+0.017
Neg logit contributions
umbs-0.020
lette-0.018
beh-0.018
allows-0.016
izza-0.016
Token time
Token ID640
Feature activation+1.37e-03

Pos logit contributions
 notice+0.001
 stage+0.001
But+0.001
Who+0.001
 woke+0.001
Neg logit contributions
umbs-0.003
lette-0.002
beh-0.002
izza-0.002
igators-0.002
Token,
Token ID11
Feature activation+2.56e-03

Pos logit contributions
umbs+0.003
beh+0.003
lette+0.003
izza+0.003
igators+0.003
Neg logit contributions
 notice-0.001
But-0.001
 stage-0.001
Who-0.001
 woke-0.001
Token there
Token ID612
Feature activation-1.37e-02

Pos logit contributions
umbs+0.005
beh+0.005
lette+0.005
izza+0.005
igators+0.004
Neg logit contributions
 notice-0.003
But-0.003
 stage-0.003
 woke-0.003
Who-0.003
Token was
Token ID373
Feature activation+3.66e-02

Pos logit contributions
 notice+0.013
 stage+0.012
But+0.012
La+0.011
Who+0.011
Neg logit contributions
umbs-0.029
lette-0.027
beh-0.026
icking-0.026
o-0.026
Token a
Token ID257
Feature activation-8.38e-04

Pos logit contributions
beh+0.086
umbs+0.081
icking+0.080
lette+0.080
ishes+0.078
Neg logit contributions
But-0.033
 notice-0.031
 suddenly-0.029
 school-0.029
 stage-0.029
Token little
Token ID1310
Feature activation+2.54e-02

Pos logit contributions
 notice+0.001
But+0.001
 stage+0.001
 woke+0.001
Who+0.001
Neg logit contributions
umbs-0.001
lette-0.001
beh-0.001
izza-0.001
igators-0.001
Token girl
Token ID2576
Feature activation-9.26e-03

Pos logit contributions
umbs+0.060
beh+0.059
lette+0.057
izza+0.055
icking+0.054
Neg logit contributions
 stage-0.020
 notice-0.020
But-0.019
 woke-0.018
 school-0.017
Token called
Token ID1444
Feature activation-6.85e-03

Pos logit contributions
 notice+0.014
 stage+0.013
Stop+0.012
But+0.012
Who+0.012
Neg logit contributions
lette-0.015
umbs-0.014
beh-0.014
o-0.013
ocado-0.013
Token living
Token ID2877
Feature activation-2.49e-02

Pos logit contributions
 notice+0.039
 imagined+0.037
But+0.036
La+0.035
 avail+0.034
Neg logit contributions
umbs-0.043
lette-0.042
melon-0.041
izza-0.040
igators-0.040
Token room
Token ID2119
Feature activation-1.38e-05

Pos logit contributions
 notice+0.034
But+0.031
Who+0.030
Stop+0.030
La+0.029
Neg logit contributions
umbs-0.041
lette-0.040
ishes-0.039
asp-0.038
allows-0.038
Token.
Token ID13
Feature activation-8.07e-03

Pos logit contributions
 notice+0.000
But+0.000
 stage+0.000
Who+0.000
 woke+0.000
Neg logit contributions
umbs-0.000
lette-0.000
beh-0.000
izza-0.000
igators-0.000
Token They
Token ID1119
Feature activation+2.61e-02

Pos logit contributions
 notice+0.015
 stage+0.014
 woke+0.014
But+0.014
 imagined+0.014
Neg logit contributions
umbs-0.010
lette-0.009
izza-0.009
beh-0.008
igators-0.008
Token sit
Token ID1650
Feature activation-1.43e-02

Pos logit contributions
umbs+0.055
beh+0.054
lette+0.053
izza+0.049
icking+0.049
Neg logit contributions
 notice-0.030
 stage-0.027
 woke-0.025
But-0.025
 dress-0.024
Token on
Token ID319
Feature activation-1.11e-02

Pos logit contributions
 notice+0.020
 stage+0.018
Stop+0.018
 woke+0.018
Who+0.018
Neg logit contributions
umbs-0.024
lette-0.023
icking-0.023
allows-0.023
beh-0.022
Token the
Token ID262
Feature activation-1.14e-02

Pos logit contributions
 notice+0.015
But+0.014
 woke+0.013
 imagined+0.013
Who+0.013
Neg logit contributions
umbs-0.019
lette-0.018
izza-0.017
beh-0.017
allows-0.017
Token floor
Token ID4314
Feature activation-1.15e-02

Pos logit contributions
 notice+0.014
But+0.012
 imagined+0.012
Who+0.012
Stop+0.012
Neg logit contributions
umbs-0.022
lette-0.021
izza-0.020
igators-0.020
allows-0.020
Token.
Token ID13
Feature activation-5.88e-03

Pos logit contributions
 notice+0.010
Who+0.010
Stop+0.010
But+0.009
 stage+0.009
Neg logit contributions
umbs-0.025
lette-0.024
beh-0.023
icking-0.022
izza-0.022
Token They
Token ID1119
Feature activation+1.39e-02

Pos logit contributions
 notice+0.011
 stage+0.010
But+0.010
 woke+0.010
 imagined+0.010
Neg logit contributions
umbs-0.008
lette-0.007
izza-0.007
beh-0.007
igators-0.006
Token pretend
Token ID16614
Feature activation-1.01e-03

Pos logit contributions
umbs+0.028
beh+0.027
lette+0.026
izza+0.025
icking+0.025
Neg logit contributions
 notice-0.017
 stage-0.015
But-0.015
 woke-0.015
 imagined-0.014
Token mess
Token ID2085
Feature activation-3.04e-02

Pos logit contributions
But+0.047
 notice+0.046
La+0.044
Who+0.043
 imagined+0.042
Neg logit contributions
umbs-0.054
lette-0.052
igators-0.051
melon-0.050
 dollar-0.048
Token.
Token ID13
Feature activation-5.53e-03

Pos logit contributions
 notice+0.035
But+0.033
La+0.032
Who+0.032
 stage+0.031
Neg logit contributions
umbs-0.059
lette-0.055
beh-0.055
igators-0.053
icking-0.053
Token As
Token ID1081
Feature activation-2.09e-02

Pos logit contributions
 notice+0.008
 stage+0.007
But+0.007
 woke+0.007
 imagined+0.007
Neg logit contributions
umbs-0.009
beh-0.009
lette-0.009
izza-0.008
igators-0.008
Token she
Token ID673
Feature activation-3.77e-02

Pos logit contributions
 notice+0.023
But+0.022
Who+0.021
Stop+0.021
 stage+0.021
Neg logit contributions
umbs-0.037
beh-0.036
lette-0.036
igators-0.036
izza-0.035
Token was
Token ID373
Feature activation-1.59e-02

Pos logit contributions
 avail+0.049
 notice+0.048
But+0.046
 stage+0.044
 speaker+0.042
Neg logit contributions
o-0.061
beh-0.059
umbs-0.059
igators-0.058
lette-0.058
Token playing
Token ID2712
Feature activation-5.81e-02

Pos logit contributions
Who+0.025
But+0.025
 notice+0.024
Stop+0.024
+0.024
Neg logit contributions
umbs-0.022
lette-0.022
beh-0.021
icking-0.021
ady-0.020
Token,
Token ID11
Feature activation-2.98e-02

Pos logit contributions
La+0.086
icol+0.084
 imagined+0.080
Ba+0.076
Who+0.075
Neg logit contributions
 best-0.079
melon-0.078
 equal-0.077
 dollar-0.077
igator-0.076
Token she
Token ID673
Feature activation-4.55e-02

Pos logit contributions
 notice+0.029
But+0.028
 woke+0.027
 imagined+0.026
Stop+0.026
Neg logit contributions
umbs-0.056
beh-0.055
lette-0.054
allows-0.051
icking-0.051
Token saw
Token ID2497
Feature activation-2.37e-02

Pos logit contributions
 avail+0.045
 stage+0.044
 notice+0.042
Stop+0.041
icol+0.040
Neg logit contributions
lette-0.087
o-0.087
ocado-0.083
icking-0.082
umbs-0.081
Token a
Token ID257
Feature activation-2.47e-02

Pos logit contributions
 notice+0.036
But+0.033
 stage+0.032
+0.032
 seek+0.032
Neg logit contributions
lette-0.034
umbs-0.034
beh-0.033
igators-0.031
icking-0.031
Token little
Token ID1310
Feature activation-6.59e-03

Pos logit contributions
 notice+0.038
 imagined+0.037
But+0.037
La+0.035
Stop+0.035
Neg logit contributions
umbs-0.034
lette-0.033
 dollar-0.031
izza-0.031
ocado-0.031
Token he
Token ID339
Feature activation-1.10e-02

Pos logit contributions
 notice+0.008
 stage+0.007
But+0.007
Who+0.007
 woke+0.007
Neg logit contributions
umbs-0.009
beh-0.009
lette-0.009
izza-0.008
igators-0.008
Token could
Token ID714
Feature activation-2.75e-02

Pos logit contributions
 notice+0.013
 stage+0.013
But+0.012
Who+0.011
Stop+0.011
Neg logit contributions
beh-0.020
umbs-0.020
lette-0.020
igators-0.019
o-0.019
Token see
Token ID766
Feature activation-2.01e-02

Pos logit contributions
Stop+0.035
Who+0.034
La+0.033
 notice+0.032
But+0.032
Neg logit contributions
umbs-0.043
lette-0.043
beh-0.042
icking-0.041
igators-0.041
Token in
Token ID287
Feature activation-1.63e-02

Pos logit contributions
 notice+0.026
But+0.023
La+0.023
 stage+0.023
 seek+0.023
Neg logit contributions
umbs-0.034
beh-0.033
lette-0.033
igators-0.032
icking-0.032
Token the
Token ID262
Feature activation-2.87e-02

Pos logit contributions
 notice+0.021
But+0.020
La+0.020
 woke+0.020
Who+0.019
Neg logit contributions
umbs-0.026
beh-0.025
lette-0.025
 equal-0.024
igators-0.024
Token dark
Token ID3223
Feature activation-6.32e-02

Pos logit contributions
 notice+0.040
lf+0.037
 imagined+0.037
zes+0.037
 stage+0.036
Neg logit contributions
lette-0.041
umbs-0.040
igators-0.040
ady-0.039
 highlight-0.039
Token.
Token ID13
Feature activation-1.37e-03

Pos logit contributions
 notice+0.087
 concert+0.085
La+0.085
Who+0.084
zes+0.084
Neg logit contributions
umbs-0.089
ady-0.086
umb-0.085
igators-0.085
o-0.084
Token
Token ID220
Feature activation+1.24e-02

Pos logit contributions
 notice+0.002
 stage+0.002
But+0.002
 woke+0.002
Who+0.001
Neg logit contributions
umbs-0.003
lette-0.002
beh-0.002
izza-0.002
igators-0.002
Token
Token ID198
Feature activation-1.86e-02

Pos logit contributions
umbs+0.027
beh+0.027
lette+0.026
igators+0.025
izza+0.025
Neg logit contributions
 notice-0.011
But-0.011
 stage-0.010
 woke-0.009
Who-0.009
Token
Token ID198
Feature activation+4.40e-03

Pos logit contributions
 notice+0.023
 stage+0.022
 woke+0.021
 imagined+0.021
Who+0.021
Neg logit contributions
umbs-0.034
lette-0.031
beh-0.031
o-0.030
izza-0.030
TokenJohn
Token ID7554
Feature activation+1.81e-02

Pos logit contributions
umbs+0.008
beh+0.007
lette+0.007
izza+0.007
igators+0.007
Neg logit contributions
 notice-0.006
But-0.006
 stage-0.006
 woke-0.005
Who-0.005
Token k
Token ID479
Feature activation+4.93e-03

Pos logit contributions
umbs+0.028
beh+0.027
lette+0.027
izza+0.025
icking+0.025
Neg logit contributions
 notice-0.014
 stage-0.013
But-0.013
 woke-0.012
Who-0.012
Tokenitty
Token ID9760
Feature activation-1.98e-02

Pos logit contributions
umbs+0.004
beh+0.004
lette+0.004
izza+0.003
igators+0.003
Neg logit contributions
 notice-0.011
But-0.011
 stage-0.010
 woke-0.010
Who-0.010
Token.
Token ID13
Feature activation+7.39e-03

Pos logit contributions
 notice+0.021
 stage+0.018
Who+0.018
But+0.017
 imagined+0.017
Neg logit contributions
umbs-0.040
lette-0.039
beh-0.039
o-0.038
igators-0.037
Token Are
Token ID4231
Feature activation+1.26e-02

Pos logit contributions
umbs+0.014
beh+0.013
lette+0.013
izza+0.012
icking+0.012
Neg logit contributions
 notice-0.009
But-0.009
 stage-0.009
 woke-0.008
Who-0.008
Token you
Token ID345
Feature activation+3.03e-02

Pos logit contributions
umbs+0.026
lette+0.024
beh+0.024
izza+0.023
icking+0.023
Neg logit contributions
 notice-0.014
But-0.013
 stage-0.013
 woke-0.012
Who-0.012
Token lost
Token ID2626
Feature activation-6.89e-02

Pos logit contributions
umbs+0.048
beh+0.045
izza+0.044
lette+0.043
oun+0.042
Neg logit contributions
 notice-0.049
 suddenly-0.046
 stage-0.045
But-0.044
 heard-0.044
Token?
Token ID30
Feature activation+8.22e-03

Pos logit contributions
 notice+0.097
icol+0.093
Exc+0.089
Who+0.089
 inspection+0.088
Neg logit contributions
 ours-0.100
o-0.099
lette-0.095
igator-0.092
igators-0.092
Token You
Token ID921
Feature activation+1.90e-02

Pos logit contributions
umbs+0.016
beh+0.015
lette+0.015
izza+0.014
icking+0.014
Neg logit contributions
 notice-0.010
But-0.009
 stage-0.009
 woke-0.009
Who-0.009
Token look
Token ID804
Feature activation+2.02e-03

Pos logit contributions
umbs+0.034
lette+0.033
beh+0.032
izza+0.031
illing+0.030
Neg logit contributions
 notice-0.026
 woke-0.025
But-0.024
 heard-0.024
 stage-0.023
Token sad
Token ID6507
Feature activation-6.50e-02

Pos logit contributions
umbs+0.003
beh+0.003
lette+0.003
izza+0.003
igators+0.003
Neg logit contributions
 notice-0.003
But-0.003
 stage-0.003
 woke-0.003
Who-0.003
Token and
Token ID290
Feature activation+2.58e-03

Pos logit contributions
lf+0.101
Who+0.099
La+0.098
 avail+0.098
 robe+0.096
Neg logit contributions
icking-0.084
lette-0.082
 brothers-0.082
igators-0.082
o-0.081
Token friends
Token ID2460
Feature activation-4.81e-03

Pos logit contributions
umbs+0.019
beh+0.018
lette+0.018
izza+0.017
igators+0.017
Neg logit contributions
 notice-0.019
 stage-0.017
But-0.017
 woke-0.017
 imagined-0.017
Token.
Token ID13
Feature activation-8.32e-03

Pos logit contributions
 notice+0.005
But+0.004
 stage+0.004
Who+0.004
Stop+0.004
Neg logit contributions
umbs-0.010
lette-0.010
beh-0.010
igators-0.009
icking-0.009
Token
Token ID198
Feature activation-3.19e-02

Pos logit contributions
 notice+0.010
 stage+0.009
But+0.009
 imagined+0.009
Who+0.009
Neg logit contributions
umbs-0.016
lette-0.015
beh-0.015
izza-0.014
igators-0.014
Token
Token ID198
Feature activation-1.20e-02

Pos logit contributions
 notice+0.043
 stage+0.042
 imagined+0.040
Who+0.040
Stop+0.040
Neg logit contributions
umbs-0.055
o-0.049
lette-0.048
izza-0.047
beh-0.047
TokenAt
Token ID2953
Feature activation-3.85e-02

Pos logit contributions
 notice+0.016
 stage+0.015
 woke+0.014
 ti+0.014
icol+0.014
Neg logit contributions
umbs-0.021
lette-0.020
beh-0.019
igators-0.019
ocado-0.018
Token school
Token ID1524
Feature activation-5.77e-02

Pos logit contributions
 woke+0.051
Who+0.050
Raw+0.050
Exc+0.050
But+0.049
Neg logit contributions
melon-0.056
icking-0.055
o-0.053
igators-0.053
umbs-0.053
Token,
Token ID11
Feature activation-2.30e-02

Pos logit contributions
Raw+0.071
icol+0.070
Who+0.070
 avail+0.069
AAAA+0.065
Neg logit contributions
icking-0.087
 dollar-0.086
known-0.084
lette-0.083
 equal-0.083
Token Amy
Token ID14235
Feature activation-3.36e-02

Pos logit contributions
 notice+0.032
icol+0.030
 stage+0.029
La+0.029
 woke+0.029
Neg logit contributions
umbs-0.036
igators-0.035
lette-0.035
beh-0.034
allows-0.033
Token saw
Token ID2497
Feature activation-4.66e-03

Pos logit contributions
 stage+0.039
 avail+0.039
 notice+0.039
Who+0.039
But+0.039
Neg logit contributions
umbs-0.056
lette-0.056
o-0.054
beh-0.054
ocado-0.053
Token a
Token ID257
Feature activation-3.07e-02

Pos logit contributions
 notice+0.007
But+0.006
 stage+0.006
 woke+0.006
Who+0.006
Neg logit contributions
umbs-0.008
lette-0.007
beh-0.007
igators-0.007
izza-0.007
Token boy
Token ID2933
Feature activation+1.36e-02

Pos logit contributions
 notice+0.047
But+0.046
 imagined+0.046
Stop+0.044
 woke+0.043
Neg logit contributions
umbs-0.040
lette-0.040
 dollar-0.039
ocado-0.039
izza-0.038
Token -
Token ID532
Feature activation+2.15e-04

Pos logit contributions
 notice+0.043
Who+0.041
 woke+0.041
But+0.041
Stop+0.039
Neg logit contributions
umbs-0.072
lette-0.068
beh-0.067
igators-0.065
icking-0.065
Token even
Token ID772
Feature activation-2.14e-02

Pos logit contributions
umbs+0.000
beh+0.000
lette+0.000
izza+0.000
igators+0.000
Neg logit contributions
 notice-0.000
But-0.000
 stage-0.000
 woke-0.000
Who-0.000
Token when
Token ID618
Feature activation-3.18e-02

Pos logit contributions
 stage+0.028
 notice+0.027
 woke+0.027
But+0.027
+0.026
Neg logit contributions
beh-0.035
umbs-0.034
lette-0.033
igators-0.032
known-0.031
Token she
Token ID673
Feature activation-4.50e-02

Pos logit contributions
 woke+0.032
 notice+0.031
icol+0.029
But+0.029
Exc+0.029
Neg logit contributions
beh-0.058
lette-0.057
umbs-0.056
 supposed-0.055
 equal-0.055
Token was
Token ID373
Feature activation-7.88e-03

Pos logit contributions
 stage+0.053
But+0.051
 notice+0.050
 speaker+0.049
 avail+0.049
Neg logit contributions
umbs-0.077
o-0.075
beh-0.075
lette-0.075
known-0.071
Token asleep
Token ID16039
Feature activation-8.94e-02

Pos logit contributions
 notice+0.012
But+0.012
 stage+0.011
Who+0.011
 woke+0.011
Neg logit contributions
umbs-0.012
lette-0.011
beh-0.011
icking-0.010
igators-0.010
Token.
Token ID13
Feature activation-1.01e-02

Pos logit contributions
Raw+0.123
AAAA+0.120
Stop+0.116
La+0.114
Who+0.111
Neg logit contributions
icking-0.123
lette-0.122
 best-0.117
 equal-0.115
known-0.115
Token<|endoftext|>
Token ID50256
Feature activation-1.81e-02

Pos logit contributions
 notice+0.013
 stage+0.013
But+0.012
 woke+0.011
Stop+0.011
Neg logit contributions
umbs-0.018
lette-0.017
beh-0.017
izza-0.016
ocado-0.016
TokenM
Token ID44
Feature activation+2.11e-02

Pos logit contributions
 notice+0.022
 stage+0.019
But+0.019
 imagined+0.019
Who+0.018
Neg logit contributions
umbs-0.036
lette-0.032
beh-0.031
o-0.031
izza-0.031
Tokenummy
Token ID13513
Feature activation-1.12e-02

Pos logit contributions
umbs+0.030
beh+0.030
lette+0.028
igators+0.027
ocado+0.026
Neg logit contributions
 notice-0.035
 stage-0.034
 woke-0.034
But-0.034
 school-0.034
Token was
Token ID373
Feature activation+1.96e-02

Pos logit contributions
 notice+0.010
But+0.009
 stage+0.009
Who+0.008
La+0.008
Neg logit contributions
umbs-0.024
lette-0.024
beh-0.023
izza-0.023
igators-0.023