TOP ACTIVATIONS
MAX = 0.107

Why do you want a bisc uit ?" The
" Where are we going mom my ?" asked Lucy .
They write " Thank you " and " We miss you
" What are you doing sir ?" The man
, " I demand a bisc uit !" The
" Maybe it is a birthday cake !" They
you want to go this morning John ?" his mom asked
do I have to do mom my ?". Her
crying and say " sorry " to each other . They
okay to say " sorry " if you make a mistake
, hey ! What is going on here ?" Mom my
my friend send me a birthday card now ?" Her mom
The flame said " bye " too . Amy went home
me to finish my bath " said her mom .
. He didn 't want Farmer Dan to be mad .
will be this glad come winter season ?" asked Mama .
my saw her and said " G abby , what 's
we have a perfectly good tea set ?" he asked .
" Please let me go " said Jim .
, " What is it mom ?!" Her mom replied ,

MIN ACTIVATIONS
MIN = -0.116

Tim enjoyed hitting the ball very hard . One
a delicate bird named Fl appy . She liked to fly
pillar loved to move around very quickly . One day ,
had a toy he loved very much . One day ,
and could kick the ball very far . She wanted to
to go around the track very fast . One day ,
liked playing in the sun very much . One day ,
shiny shield that he loved very much . He would always
had a toy he loved very much . The toy was
loved to ride his bike very fast . One day ,
toy bunny that she loved very much . The bunny belonged
toy fork that he loved very much . Tim 's fork
She ran up the stairs very quickly , but her mom
toy car that he loved very much . It was his
a small bird called Fl appy . Fl appy was always
L ily learned to sail very well . The boat was
, how do you zip so fast ?" B
blue sweater that he loved very much . Tim wore his
toy car that he loved very much . It was red
toy shield that he loved very much . He would play

INTERVAL 0.051 - 0.107
CONTAINS 0.555%

and saw an even bigger surprise . There was a beautiful
Emma and Jack were so excited to see the black board
" Hey , what 's going on here ?" a voice
apologised to Tom and said " sorry , little brother ,
L ily 's friend , Tim my , said "

INTERVAL -0.005 - 0.051
CONTAINS 58.469%

onions in the food . Tim my didn 't like onions
more excited . Finally , they found the ball
away from the scary man . Finally , Lucy
Mom hugged Sam . She was not angry . She was
OK ?" Sam said . He ran to Tom . He

INTERVAL -0.060 - -0.005
CONTAINS 40.693%

toys and they should give some to other children who had
She was walking in the park when she saw a j
Candy she was so excited that she ran right into the
focus on the road . He did not like the noise
It is a blanket ." He gave her the

INTERVAL -0.116 - -0.060
CONTAINS 0.283%

string . He tied it really tight so it wouldn 't
He was very gr umpy because he couldn 't help anyone
her eyes . She balanced very carefully and took a deep
girl . She was three years old and very curious .
two kids played happily in their new sandbox for the rest
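
NOTE ON THE INTERVALS
The four INTERVAL boundaries above are consistent with equal-width bins spanning MIN to MAX: (0.107 - (-0.116)) / 4 = 0.05575 per bin, and the four CONTAINS percentages sum to 100%. The sketch below reproduces that binning under this assumption; the dashboard's actual binning code is not shown in this dump, and the function names `equal_width_edges` and `bin_percentages` are hypothetical.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced bin edges from lo to hi (assumed scheme)."""
    width = (hi - lo) / n_bins
    # Use hi itself as the final edge so rounding error cannot shift it.
    return [lo + i * width for i in range(n_bins)] + [hi]


def bin_percentages(values, edges):
    """Percentage of values falling in each bin.

    Bins are half-open [edge_i, edge_{i+1}), except the last bin,
    which also includes its upper edge so the maximum is counted.
    Values outside [edges[0], edges[-1]] are not assigned to any bin.
    """
    counts = [0] * (len(edges) - 1)
    for v in values:
        for i in range(len(counts)):
            is_last = i == len(counts) - 1
            if edges[i] <= v < edges[i + 1] or (is_last and v == edges[-1]):
                counts[i] += 1
                break
    return [100.0 * c / len(values) for c in counts]


# MIN and MAX as reported in this dump.
edges = equal_width_edges(-0.116, 0.107, 4)

# Illustrative activations only; these are not values taken from the dataset.
sample = [-0.116, -0.1, 0.0, 0.0, 0.107]
percentages = bin_percentages(sample, edges)
```

The computed edges are approximately -0.116, -0.06025, -0.0045, 0.05125 and 0.107, which match the displayed boundaries once rounded to three decimals.
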
TokenWhy
Token ID5195
Feature activation+4.32e-03

Pos logit contributions
 comm+0.073
 aback+0.069
 happened+0.066
 Birthday+0.064
 begs+0.064
Neg logit contributions
 often-0.082
His-0.074
ot-0.069
ind-0.068
nam-0.068
Token do
Token ID466
Feature activation+1.59e-03

Pos logit contributions
 written+0.006
 aback+0.005
Gu+0.005
 comm+0.005
 source+0.005
Neg logit contributions
igs-0.007
 often-0.007
ot-0.006
 solar-0.006
 lived-0.006
Token you
Token ID345
Feature activation+1.17e-02

Pos logit contributions
 written+0.002
 aback+0.002
Gu+0.002
 comm+0.002
 source+0.002
Neg logit contributions
igs-0.003
 often-0.003
ot-0.002
 solar-0.002
 lived-0.002
Token want
Token ID765
Feature activation+1.13e-02

Pos logit contributions
 written+0.015
Gu+0.014
 aback+0.014
 comm+0.014
 source+0.013
Neg logit contributions
 often-0.019
igs-0.019
ot-0.017
 lived-0.017
 solar-0.017
Token a
Token ID257
Feature activation+6.09e-02

Pos logit contributions
 written+0.014
 aback+0.014
 comm+0.013
Gu+0.013
hered+0.013
Neg logit contributions
igs-0.019
 often-0.019
ot-0.017
 solar-0.016
 lived-0.016
Token bisc
Token ID36228
Feature activation+1.07e-01

Pos logit contributions
 aback+0.090
 comm+0.088
 written+0.084
ctions+0.083
umbs+0.083
Neg logit contributions
ot-0.090
igs-0.079
 tight-0.079
 often-0.078
oppy-0.075
Tokenuit
Token ID5013
Feature activation+2.99e-02

Pos logit contributions
 comm+0.191
 aback+0.167
Gu+0.164
 payoff+0.156
 source+0.155
Neg logit contributions
ot-0.139
belt-0.124
igs-0.121
ittens-0.120
ig-0.119
Token?"
Token ID1701
Feature activation+9.78e-03

Pos logit contributions
 comm+0.049
Gu+0.048
 aback+0.047
 written+0.047
 source+0.044
Neg logit contributions
 often-0.039
igs-0.037
ot-0.035
 lived-0.032
belt-0.030
Token
Token ID198
Feature activation+2.44e-04

Pos logit contributions
 written+0.014
 comm+0.013
 aback+0.013
Gu+0.013
 source+0.012
Neg logit contributions
igs-0.015
 often-0.015
ot-0.013
 solar-0.013
 lived-0.013
Token
Token ID198
Feature activation-1.50e-02

Pos logit contributions
 written+0.000
 aback+0.000
Gu+0.000
 comm+0.000
 source+0.000
Neg logit contributions
igs-0.000
 often-0.000
ot-0.000
 solar-0.000
 lived-0.000
TokenThe
Token ID464
Feature activation+9.25e-03

Pos logit contributions
igs+0.022
 often+0.019
 solar+0.018
ot+0.017
 lived+0.017
Neg logit contributions
Gu-0.024
 aback-0.024
 written-0.024
 comm-0.022
Sur-0.022
Token"
Token ID1
Feature activation+3.97e-02

Pos logit contributions
igs+0.035
 often+0.031
 solar+0.030
 gut+0.029
 lived+0.028
Neg logit contributions
Gu-0.029
 written-0.029
 aback-0.028
 comm-0.027
Sur-0.027
TokenWhere
Token ID8496
Feature activation+1.97e-02

Pos logit contributions
 aback+0.054
 comm+0.051
 written+0.050
 source+0.050
 begs+0.049
Neg logit contributions
 often-0.060
ot-0.055
 lived-0.053
apy-0.051
igs-0.051
Token are
Token ID389
Feature activation+5.01e-03

Pos logit contributions
 aback+0.028
Gu+0.028
 written+0.028
 comm+0.027
hered+0.026
Neg logit contributions
 often-0.029
igs-0.029
 lived-0.027
ot-0.026
apy-0.025
Token we
Token ID356
Feature activation+2.47e-02

Pos logit contributions
 written+0.007
 aback+0.007
Gu+0.006
 comm+0.006
 source+0.006
Neg logit contributions
igs-0.008
 often-0.008
ot-0.007
 solar-0.007
 lived-0.007
Token going
Token ID1016
Feature activation+4.10e-02

Pos logit contributions
 aback+0.032
Gu+0.031
 comm+0.031
 written+0.030
 bisc+0.029
Neg logit contributions
 often-0.042
igs-0.039
 lived-0.037
ot-0.036
 solar-0.036
Token mom
Token ID1995
Feature activation+1.04e-01

Pos logit contributions
 comm+0.073
Gu+0.071
 written+0.070
 aback+0.069
 bisc+0.066
Neg logit contributions
 often-0.053
igs-0.048
ot-0.042
ug-0.040
 solar-0.040
Tokenmy
Token ID1820
Feature activation+6.89e-02

Pos logit contributions
 comm+0.233
 aback+0.226
 bisc+0.219
Gu+0.215
Uh+0.212
Neg logit contributions
 often-0.105
 own-0.068
 every-0.067
ot-0.057
ind-0.055
Token?"
Token ID1701
Feature activation+2.10e-03

Pos logit contributions
 comm+0.143
Gu+0.140
 aback+0.138
 bisc+0.136
 written+0.132
Neg logit contributions
 often-0.070
igs-0.054
ot-0.049
 lived-0.047
ind-0.042
Token asked
Token ID1965
Feature activation+6.47e-04

Pos logit contributions
 written+0.003
 aback+0.003
Gu+0.003
 comm+0.003
 source+0.003
Neg logit contributions
igs-0.003
 often-0.003
ot-0.003
 solar-0.003
 lived-0.003
Token Lucy
Token ID22162
Feature activation-4.69e-04

Pos logit contributions
 written+0.001
 aback+0.001
 comm+0.001
Gu+0.001
 source+0.001
Neg logit contributions
igs-0.001
 often-0.001
ot-0.001
 solar-0.001
 lived-0.001
Token.
Token ID13
Feature activation+3.40e-03

Pos logit contributions
igs+0.001
 often+0.001
ot+0.000
 solar+0.000
 lived+0.000
Neg logit contributions
 written-0.001
 aback-0.001
 comm-0.001
Gu-0.001
 source-0.001
Token They
Token ID1119
Feature activation-1.15e-02

Pos logit contributions
 written+0.019
 aback+0.019
 comm+0.019
Gu+0.018
 source+0.018
Neg logit contributions
 often-0.011
igs-0.011
ot-0.009
 solar-0.009
 lived-0.009
Token write
Token ID3551
Feature activation+1.64e-02

Pos logit contributions
igs+0.022
 often+0.020
ot+0.019
ug+0.019
 gut+0.019
Neg logit contributions
 written-0.013
 aback-0.011
 comm-0.011
Gu-0.011
 source-0.011
Token "
Token ID366
Feature activation+3.27e-02

Pos logit contributions
Gu+0.024
 aback+0.024
 comm+0.024
 written+0.023
 source+0.022
Neg logit contributions
 often-0.023
igs-0.023
ot-0.021
 lived-0.020
 solar-0.020
TokenThank
Token ID10449
Feature activation+9.05e-03

Pos logit contributions
 comm+0.036
 aback+0.035
 source+0.032
 written+0.032
 calling+0.031
Neg logit contributions
 often-0.058
ot-0.057
igs-0.057
 lived-0.053
 solar-0.052
Token you
Token ID345
Feature activation-3.37e-02

Pos logit contributions
 written+0.012
 aback+0.012
 comm+0.011
Gu+0.011
 source+0.011
Neg logit contributions
igs-0.014
 often-0.014
ot-0.013
 solar-0.013
 lived-0.012
Token"
Token ID1
Feature activation+1.04e-01

Pos logit contributions
igs+0.056
apy+0.047
 often+0.047
 solar+0.045
 lived+0.044
Neg logit contributions
 written-0.047
 Birthday-0.042
 aback-0.042
 calling-0.041
 come-0.040
Token and
Token ID290
Feature activation-1.63e-02

Pos logit contributions
 comm+0.187
 aback+0.176
ction+0.176
ctions+0.174
ograph+0.173
Neg logit contributions
 often-0.119
ot-0.113
 every-0.098
 nicely-0.093
 tight-0.088
Token "
Token ID366
Feature activation+3.33e-02

Pos logit contributions
igs+0.028
 often+0.025
ug+0.024
ind+0.023
ot+0.023
Neg logit contributions
 written-0.023
Gu-0.020
 comm-0.020
 aback-0.020
 saying-0.020
TokenWe
Token ID1135
Feature activation+5.06e-03

Pos logit contributions
 aback+0.035
 comm+0.035
 source+0.031
 written+0.030
 calling+0.030
Neg logit contributions
 often-0.062
ot-0.060
igs-0.059
 lived-0.056
 solar-0.055
Token miss
Token ID2051
Feature activation-1.98e-02

Pos logit contributions
 written+0.007
 aback+0.007
Gu+0.007
 comm+0.006
 source+0.006
Neg logit contributions
igs-0.008
 often-0.008
ot-0.007
 solar-0.007
 lived-0.007
Token you
Token ID345
Feature activation-3.96e-02

Pos logit contributions
igs+0.030
 often+0.027
 lived+0.024
 solar+0.024
 gut+0.024
Neg logit contributions
 written-0.030
 aback-0.026
 comm-0.026
Gu-0.026
 calling-0.026
Token "
Token ID366
Feature activation+7.15e-02

Pos logit contributions
 written+0.022
 aback+0.021
 comm+0.021
Gu+0.021
hered+0.020
Neg logit contributions
 often-0.015
igs-0.015
ot-0.013
 solar-0.012
 lived-0.012
TokenWhat
Token ID2061
Feature activation+3.27e-02

Pos logit contributions
 comm+0.105
 begs+0.101
 aback+0.098
 happened+0.097
 Birthday+0.096
Neg logit contributions
 often-0.088
His-0.088
ind-0.076
 solar-0.073
 nicely-0.071
Token are
Token ID389
Feature activation-7.32e-03

Pos logit contributions
 aback+0.049
 comm+0.048
Gu+0.047
hered+0.047
 source+0.046
Neg logit contributions
 often-0.047
 lived-0.045
igs-0.042
 goals-0.040
 solar-0.040
Token you
Token ID345
Feature activation+4.52e-02

Pos logit contributions
igs+0.013
 often+0.013
ot+0.012
 gut+0.012
 solar+0.012
Neg logit contributions
 written-0.008
Gu-0.007
 comm-0.007
 aback-0.007
 source-0.007
Token doing
Token ID1804
Feature activation+3.77e-02

Pos logit contributions
 aback+0.063
 bisc+0.061
Gu+0.059
umbs+0.059
ives+0.057
Neg logit contributions
 often-0.071
igs-0.065
 lived-0.064
ot-0.059
 solar-0.059
Token sir
Token ID15967
Feature activation+1.03e-01

Pos logit contributions
Gu+0.068
 aback+0.067
 comm+0.067
 bisc+0.067
 written+0.066
Neg logit contributions
 often-0.045
igs-0.041
 lived-0.034
ot-0.034
 gut-0.032
Token?"
Token ID1701
Feature activation+2.04e-02

Pos logit contributions
 aback+0.208
Gu+0.207
 comm+0.207
Uh+0.205
Sur+0.204
Neg logit contributions
 often-0.096
ot-0.065
 lived-0.062
 nicely-0.057
 every-0.056
Token
Token ID198
Feature activation-9.00e-04

Pos logit contributions
 written+0.030
 aback+0.029
Gu+0.029
 comm+0.029
 source+0.028
Neg logit contributions
 often-0.029
igs-0.027
ot-0.026
 lived-0.025
 solar-0.024
Token
Token ID198
Feature activation-2.97e-03

Pos logit contributions
igs+0.002
 often+0.001
ot+0.001
 solar+0.001
 lived+0.001
Neg logit contributions
 written-0.001
 aback-0.001
Gu-0.001
 comm-0.001
 source-0.001
TokenThe
Token ID464
Feature activation-5.78e-03

Pos logit contributions
igs+0.004
 often+0.004
 solar+0.003
ot+0.003
 lived+0.003
Neg logit contributions
 written-0.005
Gu-0.005
 aback-0.005
 comm-0.005
hered-0.004
Token man
Token ID582
Feature activation+2.18e-02

Pos logit contributions
igs+0.009
 often+0.009
ot+0.008
 solar+0.008
 lived+0.008
Neg logit contributions
 written-0.008
 comm-0.008
 aback-0.007
Gu-0.007
 source-0.007
Token,
Token ID11
Feature activation-5.19e-03

Pos logit contributions
igs+0.045
 often+0.038
ind+0.038
 solar+0.037
ot+0.035
Neg logit contributions
 written-0.045
 aback-0.042
Gu-0.039
 comm-0.038
 come-0.038
Token "
Token ID366
Feature activation+4.35e-02

Pos logit contributions
igs+0.007
 often+0.007
ot+0.006
 solar+0.006
 gut+0.006
Neg logit contributions
 written-0.008
 aback-0.008
Gu-0.008
 comm-0.008
 source-0.007
TokenI
Token ID40
Feature activation+3.07e-02

Pos logit contributions
 comm+0.060
 aback+0.056
 calling+0.054
 source+0.054
 begs+0.054
Neg logit contributions
 often-0.064
ot-0.060
His-0.057
 lived-0.056
 solar-0.055
Token demand
Token ID3512
Feature activation-2.57e-03

Pos logit contributions
 comm+0.044
 written+0.042
Gu+0.042
 aback+0.041
umbs+0.041
Neg logit contributions
 often-0.047
 lived-0.042
ot-0.040
igs-0.040
 solar-0.038
Token a
Token ID257
Feature activation+2.83e-02

Pos logit contributions
igs+0.004
 often+0.004
ot+0.004
 solar+0.004
 lived+0.004
Neg logit contributions
 written-0.003
 aback-0.003
 comm-0.003
Gu-0.003
 source-0.003
Token bisc
Token ID36228
Feature activation+9.90e-02

Pos logit contributions
 aback+0.042
 written+0.042
 comm+0.041
Gu+0.041
hered+0.040
Neg logit contributions
 often-0.038
ot-0.036
igs-0.035
 solar-0.034
 tight-0.031
Tokenuit
Token ID5013
Feature activation+1.09e-02

Pos logit contributions
 comm+0.174
 aback+0.153
Gu+0.152
 source+0.144
 written+0.141
Neg logit contributions
ot-0.131
belt-0.111
ig-0.109
kin-0.109
igs-0.109
Token!"
Token ID2474
Feature activation+2.14e-02

Pos logit contributions
 written+0.015
 aback+0.015
 comm+0.015
Gu+0.015
 source+0.014
Neg logit contributions
igs-0.016
 often-0.016
ot-0.015
 solar-0.014
 lived-0.014
Token
Token ID198
Feature activation+1.16e-02

Pos logit contributions
 written+0.040
Gu+0.039
 comm+0.038
 aback+0.037
 bisc+0.037
Neg logit contributions
 often-0.021
igs-0.020
ot-0.019
 lived-0.017
 solar-0.017
Token
Token ID198
Feature activation-1.02e-02

Pos logit contributions
 written+0.015
 aback+0.015
 comm+0.015
Gu+0.014
 bisc+0.014
Neg logit contributions
igs-0.018
 often-0.018
ot-0.017
 solar-0.016
 lived-0.016
TokenThe
Token ID464
Feature activation+1.28e-02

Pos logit contributions
igs+0.014
 often+0.012
 solar+0.011
 lived+0.011
ot+0.011
Neg logit contributions
 written-0.017
Gu-0.017
 aback-0.017
 comm-0.016
 source-0.016
Token "
Token ID366
Feature activation+2.88e-02

Pos logit contributions
igs+0.008
 often+0.008
 solar+0.007
ot+0.007
ind+0.007
Neg logit contributions
 written-0.008
Gu-0.008
 aback-0.008
 comm-0.007
 source-0.007
TokenMaybe
Token ID13300
Feature activation+2.10e-02

Pos logit contributions
 comm+0.037
 aback+0.036
 written+0.036
 calling+0.034
 source+0.033
Neg logit contributions
 often-0.047
igs-0.046
ot-0.044
apy-0.041
 solar-0.041
Token it
Token ID340
Feature activation+4.71e-02

Pos logit contributions
 aback+0.036
 comm+0.035
 written+0.035
hered+0.034
Gu+0.033
Neg logit contributions
 often-0.027
igs-0.027
ot-0.023
 lived-0.022
apy-0.021
Token is
Token ID318
Feature activation+3.50e-02

Pos logit contributions
 written+0.079
 aback+0.078
 comm+0.078
hered+0.077
oph+0.076
Neg logit contributions
 often-0.060
igs-0.055
 lived-0.053
 tight-0.045
 goals-0.044
Token a
Token ID257
Feature activation+4.07e-02

Pos logit contributions
 aback+0.059
hered+0.057
Gu+0.055
 comm+0.054
ives+0.053
Neg logit contributions
 often-0.049
igs-0.047
 lived-0.040
ot-0.040
 tight-0.038
Token birthday
Token ID10955
Feature activation+9.60e-02

Pos logit contributions
 aback+0.063
 written+0.057
hered+0.056
Gu+0.055
 comm+0.054
Neg logit contributions
 often-0.062
igs-0.055
 solar-0.054
ot-0.054
 tight-0.050
Token cake
Token ID12187
Feature activation+7.64e-02

Pos logit contributions
 aback+0.187
 comm+0.170
arre+0.165
 source+0.163
hered+0.161
Neg logit contributions
 often-0.121
ot-0.102
 lived-0.093
 nicely-0.092
 throne-0.092
Token!"
Token ID2474
Feature activation+9.67e-03

Pos logit contributions
 aback+0.164
 comm+0.149
Gu+0.145
hered+0.140
viously+0.139
Neg logit contributions
 often-0.090
ot-0.067
igs-0.066
 lived-0.064
 every-0.063
Token
Token ID198
Feature activation-1.59e-02

Pos logit contributions
 written+0.013
 aback+0.012
Gu+0.012
 comm+0.012
hered+0.012
Neg logit contributions
igs-0.015
 often-0.015
ot-0.014
 lived-0.013
 solar-0.013
Token
Token ID198
Feature activation-1.65e-02

Pos logit contributions
igs+0.032
 often+0.031
 solar+0.029
 gut+0.028
belt+0.028
Neg logit contributions
 written-0.014
 aback-0.013
 comm-0.013
Gu-0.013
 source-0.012
TokenThey
Token ID2990
Feature activation-2.03e-02

Pos logit contributions
igs+0.032
 often+0.031
 solar+0.030
 gut+0.029
 lived+0.028
Neg logit contributions
Gu-0.016
Sur-0.015
 written-0.014
 comm-0.014
 aback-0.014
Token you
Token ID345
Feature activation+1.43e-02

Pos logit contributions
igs+0.007
 often+0.006
ot+0.006
 solar+0.006
 gut+0.006
Neg logit contributions
 written-0.005
 aback-0.005
Gu-0.005
 comm-0.004
 source-0.004
Token want
Token ID765
Feature activation+2.09e-02

Pos logit contributions
 written+0.018
Gu+0.018
 aback+0.018
 comm+0.018
 bisc+0.017
Neg logit contributions
 often-0.023
igs-0.023
ot-0.020
 lived-0.020
 solar-0.020
Token to
Token ID284
Feature activation+4.59e-03

Pos logit contributions
 comm+0.027
 written+0.027
 aback+0.027
Gu+0.027
 bisc+0.025
Neg logit contributions
 often-0.033
igs-0.033
ot-0.029
 lived-0.028
 solar-0.028
Token go
Token ID467
Feature activation+3.09e-02

Pos logit contributions
 written+0.006
 aback+0.006
Gu+0.006
 comm+0.006
 source+0.005
Neg logit contributions
igs-0.008
 often-0.007
ot-0.007
 solar-0.007
 lived-0.006
Token this
Token ID428
Feature activation+1.99e-02

Pos logit contributions
 written+0.051
 comm+0.051
Gu+0.051
 aback+0.049
 bisc+0.047
Neg logit contributions
 often-0.040
igs-0.038
ot-0.034
 lived-0.031
ug-0.030
Token morning
Token ID3329
Feature activation+9.55e-02

Pos logit contributions
 written+0.029
 aback+0.029
Gu+0.028
 comm+0.028
hered+0.027
Neg logit contributions
 often-0.029
igs-0.028
ot-0.025
 solar-0.025
 lived-0.024
Token John
Token ID1757
Feature activation+5.09e-02

Pos logit contributions
Gu+0.217
 comm+0.209
 aback+0.207
ives+0.204
eared+0.202
Neg logit contributions
 often-0.071
ot-0.047
 lived-0.046
igs-0.041
 every-0.039
Token?"
Token ID1701
Feature activation+1.24e-02

Pos logit contributions
 comm+0.093
 aback+0.090
Gu+0.090
 written+0.088
 bisc+0.086
Neg logit contributions
 often-0.060
igs-0.049
 lived-0.049
ot-0.048
ind-0.041
Token his
Token ID465
Feature activation-1.03e-02

Pos logit contributions
 written+0.018
 aback+0.017
Gu+0.017
 comm+0.017
 bisc+0.016
Neg logit contributions
 often-0.018
igs-0.018
ot-0.016
 lived-0.015
 solar-0.015
Token mom
Token ID1995
Feature activation+2.31e-02

Pos logit contributions
igs+0.013
 often+0.012
 solar+0.010
ug+0.010
ot+0.010
Neg logit contributions
 written-0.018
 comm-0.017
 aback-0.017
 source-0.017
Gu-0.016
Token asked
Token ID1965
Feature activation+9.21e-03

Pos logit contributions
 aback+0.032
Gu+0.032
 written+0.032
 comm+0.031
 bisc+0.031
Neg logit contributions
 often-0.035
igs-0.032
ot-0.030
 lived-0.028
ug-0.027
Token do
Token ID466
Feature activation+1.20e-02

Pos logit contributions
 aback+0.022
 written+0.022
 comm+0.022
Gu+0.021
hered+0.021
Neg logit contributions
 often-0.027
igs-0.026
 lived-0.024
ot-0.024
 solar-0.023
Token I
Token ID314
Feature activation+4.05e-02

Pos logit contributions
 written+0.017
 aback+0.017
 comm+0.016
Gu+0.016
 source+0.015
Neg logit contributions
igs-0.019
 often-0.018
ot-0.016
 lived-0.016
 solar-0.015
Token have
Token ID423
Feature activation+3.24e-02

Pos logit contributions
oph+0.071
Gu+0.070
 aback+0.069
umbs+0.068
hered+0.068
Neg logit contributions
 often-0.049
ot-0.044
igs-0.044
 lived-0.043
 exc-0.039
Token to
Token ID284
Feature activation-1.83e-02

Pos logit contributions
 aback+0.049
 comm+0.049
Gu+0.049
 bisc+0.047
umbs+0.046
Neg logit contributions
 often-0.047
igs-0.045
 lived-0.039
ot-0.039
 solar-0.037
Token do
Token ID466
Feature activation+1.70e-02

Pos logit contributions
igs+0.030
 often+0.028
 solar+0.028
ind+0.026
ug+0.025
Neg logit contributions
 written-0.023
 come-0.021
 comm-0.021
 aback-0.021
 source-0.021
Token mom
Token ID1995
Feature activation+9.50e-02

Pos logit contributions
 written+0.028
 comm+0.027
 aback+0.027
Gu+0.027
 bisc+0.026
Neg logit contributions
 often-0.022
igs-0.022
ot-0.019
 lived-0.018
 solar-0.017
Tokenmy
Token ID1820
Feature activation+6.00e-02

Pos logit contributions
 comm+0.178
 aback+0.177
 bisc+0.167
oph+0.167
Sur+0.166
Neg logit contributions
 often-0.117
ot-0.094
 own-0.089
 nicely-0.082
 alone-0.082
Token?".
Token ID43634
Feature activation+6.22e-03

Pos logit contributions
 comm+0.103
 aback+0.102
Gu+0.102
 written+0.097
 bisc+0.097
Neg logit contributions
 often-0.076
ot-0.066
igs-0.065
 lived-0.056
 nicely-0.054
Token
Token ID198
Feature activation+1.61e-02

Pos logit contributions
 written+0.011
 aback+0.011
Gu+0.011
 comm+0.011
 source+0.010
Neg logit contributions
igs-0.007
 often-0.007
ot-0.006
 solar-0.006
 lived-0.006
Token
Token ID198
Feature activation-9.50e-03

Pos logit contributions
 written+0.023
 aback+0.022
 comm+0.022
Gu+0.021
 bisc+0.020
Neg logit contributions
 often-0.024
igs-0.024
ot-0.022
 lived-0.021
 solar-0.021
TokenHer
Token ID9360
Feature activation+2.60e-03

Pos logit contributions
igs+0.013
 often+0.011
 solar+0.011
ug+0.010
ot+0.010
Neg logit contributions
 written-0.016
Gu-0.015
 aback-0.015
 comm-0.015
 Birthday-0.014
Token crying
Token ID13774
Feature activation-1.25e-02

Pos logit contributions
igs+0.000
 often+0.000
ot+0.000
 solar+0.000
 lived+0.000
Neg logit contributions
 written-0.000
 aback-0.000
 comm-0.000
Gu-0.000
 source-0.000
Token and
Token ID290
Feature activation+7.78e-03

Pos logit contributions
igs+0.021
 often+0.019
 solar+0.018
ot+0.017
 lived+0.017
Neg logit contributions
 written-0.017
 comm-0.016
 aback-0.016
 source-0.015
hered-0.015
Token say
Token ID910
Feature activation-2.60e-02

Pos logit contributions
 written+0.010
 aback+0.010
Gu+0.010
 comm+0.009
 source+0.009
Neg logit contributions
igs-0.013
 often-0.013
ot-0.011
 solar-0.011
 lived-0.011
Token "
Token ID366
Feature activation+3.65e-02

Pos logit contributions
igs+0.045
ind+0.040
 often+0.039
 solar+0.038
 ig+0.037
Neg logit contributions
 written-0.035
 come-0.031
 aback-0.030
hered-0.030
Gu-0.029
Tokensorry
Token ID41599
Feature activation+2.55e-02

Pos logit contributions
 written+0.059
 aback+0.057
 calling+0.056
 source+0.055
 comm+0.054
Neg logit contributions
 often-0.047
ot-0.044
igs-0.040
apy-0.039
 lived-0.037
Token"
Token ID1
Feature activation+9.38e-02

Pos logit contributions
 written+0.053
 comm+0.051
 aback+0.050
Gu+0.050
 source+0.049
Neg logit contributions
igs-0.021
 often-0.021
ot-0.020
ug-0.017
 lived-0.016
Token to
Token ID284
Feature activation-1.69e-02

Pos logit contributions
 comm+0.169
ctions+0.162
 aback+0.161
Gu+0.161
ction+0.159
Neg logit contributions
 often-0.107
ot-0.101
 tight-0.084
 every-0.083
 nicely-0.082
Token each
Token ID1123
Feature activation+1.51e-02

Pos logit contributions
igs+0.028
 often+0.027
 solar+0.026
 lived+0.025
belt+0.024
Neg logit contributions
 comm-0.020
 written-0.019
 calling-0.019
 aback-0.018
 begs-0.018
Token other
Token ID584
Feature activation-6.85e-03

Pos logit contributions
 written+0.032
 aback+0.032
Gu+0.031
 comm+0.031
 source+0.029
Neg logit contributions
 often-0.012
igs-0.012
ot-0.010
 lived-0.009
 solar-0.008
Token.
Token ID13
Feature activation-2.54e-02

Pos logit contributions
igs+0.009
 often+0.008
 solar+0.007
ot+0.007
 lived+0.007
Neg logit contributions
 written-0.012
 aback-0.011
 comm-0.011
Gu-0.011
hered-0.011
Token They
Token ID1119
Feature activation-1.50e-02

Pos logit contributions
igs+0.045
 often+0.038
ind+0.037
ug+0.035
 gut+0.035
Neg logit contributions
Gu-0.032
 aback-0.032
 written-0.031
 How-0.031
 comm-0.030
Token okay
Token ID8788
Feature activation+7.78e-03

Pos logit contributions
igs+0.007
 often+0.006
ot+0.006
 solar+0.006
 gut+0.005
Neg logit contributions
 written-0.006
 aback-0.006
 comm-0.006
Gu-0.006
 source-0.005
Token to
Token ID284
Feature activation-6.61e-03

Pos logit contributions
 written+0.010
 aback+0.010
Gu+0.010
 comm+0.010
 source+0.009
Neg logit contributions
igs-0.012
 often-0.012
ot-0.011
 solar-0.011
 lived-0.011
Token say
Token ID910
Feature activation-7.58e-03

Pos logit contributions
igs+0.011
 often+0.010
 solar+0.010
ot+0.010
apy+0.009
Neg logit contributions
 written-0.008
 aback-0.008
 comm-0.008
Gu-0.008
 source-0.008
Token "
Token ID366
Feature activation+6.98e-02

Pos logit contributions
igs+0.011
 often+0.010
ot+0.009
 solar+0.009
 lived+0.009
Neg logit contributions
 written-0.011
 aback-0.011
 comm-0.011
Gu-0.010
hered-0.010
Tokensorry
Token ID41599
Feature activation+3.29e-02

Pos logit contributions
 comm+0.088
 bisc+0.083
 written+0.082
 aback+0.081
 begs+0.080
Neg logit contributions
 often-0.111
ot-0.106
igs-0.097
ug-0.097
ind-0.093
Token"
Token ID1
Feature activation+9.28e-02

Pos logit contributions
 written+0.054
 comm+0.052
Gu+0.051
 aback+0.051
 bisc+0.049
Neg logit contributions
 often-0.041
igs-0.040
ot-0.039
 lived-0.035
ug-0.035
Token if
Token ID611
Feature activation+3.57e-03

Pos logit contributions
 comm+0.147
ctions+0.144
ction+0.143
Gu+0.143
 bisc+0.142
Neg logit contributions
 often-0.116
ot-0.112
 tight-0.097
belt-0.096
 every-0.095
Token you
Token ID345
Feature activation+3.49e-03

Pos logit contributions
 written+0.006
 aback+0.005
Gu+0.005
 comm+0.005
 source+0.005
Neg logit contributions
igs-0.005
 often-0.005
ot-0.004
 solar-0.004
 lived-0.004
Token make
Token ID787
Feature activation+3.68e-02

Pos logit contributions
 written+0.004
 aback+0.004
Gu+0.004
 comm+0.004
 source+0.004
Neg logit contributions
igs-0.006
 often-0.006
ot-0.005
 solar-0.005
 lived-0.005
Token a
Token ID257
Feature activation-8.23e-04

Pos logit contributions
 written+0.057
Gu+0.056
 aback+0.056
cake+0.055
 bisc+0.055
Neg logit contributions
igs-0.050
 often-0.048
ot-0.043
 goals-0.042
 tight-0.040
Token mistake
Token ID7457
Feature activation+6.28e-03

Pos logit contributions
igs+0.001
 often+0.001
ot+0.001
 solar+0.001
 lived+0.001
Neg logit contributions
 written-0.001
 aback-0.001
 comm-0.001
Gu-0.001
 source-0.001
Token,
Token ID11
Feature activation+3.61e-02

Pos logit contributions
 written+0.053
 aback+0.053
Gu+0.052
 comm+0.050
 source+0.050
Neg logit contributions
igs-0.026
 often-0.024
ot-0.021
 lived-0.020
 gut-0.018
Token hey
Token ID17207
Feature activation+3.23e-02

Pos logit contributions
 written+0.060
 aback+0.055
Gu+0.054
 comm+0.052
cake+0.052
Neg logit contributions
igs-0.047
 often-0.046
ot-0.044
 lived-0.042
 gut-0.042
Token!
Token ID0
Feature activation+2.75e-02

Pos logit contributions
 aback+0.061
 written+0.061
Gu+0.060
 source+0.058
 comm+0.058
Neg logit contributions
igs-0.035
 often-0.032
ot-0.030
 lived-0.028
 gut-0.025
Token What
Token ID1867
Feature activation+4.39e-02

Pos logit contributions
 written+0.036
 aback+0.032
 comm+0.031
 source+0.031
 bisc+0.029
Neg logit contributions
igs-0.045
ot-0.043
 often-0.043
 lived-0.042
apy-0.041
Token is
Token ID318
Feature activation+1.25e-03

Pos logit contributions
 aback+0.078
 comm+0.074
 source+0.074
Gu+0.071
 written+0.071
Neg logit contributions
 lived-0.053
 often-0.050
igs-0.047
ot-0.046
 solar-0.044
Token going
Token ID1016
Feature activation+9.23e-02

Pos logit contributions
 written+0.002
 aback+0.002
Gu+0.002
 comm+0.002
 source+0.002
Neg logit contributions
igs-0.002
 often-0.002
ot-0.002
 solar-0.002
 lived-0.002
Token on
Token ID319
Feature activation+3.61e-02

Pos logit contributions
oph+0.177
umbs+0.177
 aback+0.167
andra+0.166
Gu+0.166
Neg logit contributions
 often-0.083
 high-0.080
ot-0.078
 alone-0.076
igs-0.072
Token here
Token ID994
Feature activation+4.98e-02

Pos logit contributions
 written+0.047
 aback+0.046
Gu+0.046
 comm+0.044
 source+0.042
Neg logit contributions
igs-0.061
 often-0.057
ot-0.053
ind-0.051
 lived-0.050
Token?"
Token ID1701
Feature activation-3.43e-02

Pos logit contributions
Gu+0.102
 aback+0.099
 comm+0.098
 written+0.096
hered+0.092
Neg logit contributions
 often-0.046
igs-0.043
 lived-0.041
ot-0.038
 nicely-0.032
Token Mom
Token ID11254
Feature activation+4.38e-02

Pos logit contributions
igs+0.060
 often+0.055
belt+0.054
ug+0.051
 solar+0.050
Neg logit contributions
Gu-0.041
 comm-0.040
 written-0.039
 begs-0.037
 Sandra-0.037
Tokenmy
Token ID1820
Feature activation+3.82e-02

Pos logit contributions
 aback+0.048
 written+0.047
 bisc+0.046
oph+0.046
hered+0.045
Neg logit contributions
 often-0.079
igs-0.075
ot-0.069
ind-0.068
ug-0.064
Token my
Token ID616
Feature activation+3.33e-02

Pos logit contributions
igs+0.033
 often+0.031
 solar+0.030
ot+0.029
ug+0.029
Neg logit contributions
 written-0.022
 comm-0.020
Gu-0.019
 aback-0.019
 come-0.018
Token friend
Token ID1545
Feature activation+6.75e-02

Pos logit contributions
 aback+0.059
Gu+0.056
hered+0.054
 written+0.053
 comm+0.053
Neg logit contributions
 often-0.042
 solar-0.038
igs-0.037
ot-0.036
 lived-0.034
Token send
Token ID3758
Feature activation+2.98e-02

Pos logit contributions
 aback+0.113
 comm+0.110
Gu+0.110
 bisc+0.103
 written+0.101
Neg logit contributions
 often-0.095
ot-0.081
 lived-0.079
 nicely-0.070
 own-0.069
Token me
Token ID502
Feature activation+1.60e-02

Pos logit contributions
Gu+0.049
 comm+0.048
 aback+0.047
 written+0.046
hered+0.045
Neg logit contributions
 often-0.038
igs-0.036
ot-0.035
 lived-0.031
 solar-0.030
Token a
Token ID257
Feature activation+4.94e-02

Pos logit contributions
 written+0.022
 comm+0.022
Gu+0.022
 aback+0.022
hered+0.020
Neg logit contributions
igs-0.024
 often-0.024
ot-0.022
 solar-0.021
 lived-0.021
Token birthday
Token ID10955
Feature activation+9.23e-02

Pos logit contributions
 aback+0.070
Gu+0.068
 comm+0.063
umbs+0.063
hered+0.063
Neg logit contributions
 often-0.071
ot-0.069
 solar-0.068
 tight-0.066
igs-0.065
Token card
Token ID2657
Feature activation+6.52e-02

Pos logit contributions
 aback+0.158
 comm+0.147
Gu+0.146
viously+0.137
hered+0.135
Neg logit contributions
 often-0.127
ot-0.116
 collection-0.103
 nicely-0.098
 every-0.097
Token now
Token ID783
Feature activation+3.51e-02

Pos logit contributions
 aback+0.120
 comm+0.120
Gu+0.119
 source+0.108
Uh+0.107
Neg logit contributions
 often-0.078
ot-0.064
 lived-0.062
igs-0.062
 nicely-0.056
Token?"
Token ID1701
Feature activation+9.38e-03

Pos logit contributions
 aback+0.067
 comm+0.066
Gu+0.066
 written+0.064
 bisc+0.063
Neg logit contributions
 often-0.037
igs-0.033
 lived-0.031
ot-0.029
 solar-0.025
Token Her
Token ID2332
Feature activation+4.07e-03

Pos logit contributions
 written+0.014
 aback+0.014
 comm+0.013
Gu+0.013
 source+0.013
Neg logit contributions
 often-0.013
igs-0.013
ot-0.012
 lived-0.011
 solar-0.011
Token mom
Token ID1995
Feature activation+3.02e-02

Pos logit contributions
 written+0.008
 aback+0.008
Gu+0.008
 comm+0.008
 source+0.008
Neg logit contributions
igs-0.004
 often-0.004
ot-0.003
 solar-0.003
 lived-0.003
Token The
Token ID383
Feature activation-1.17e-02

Pos logit contributions
 written+0.028
 aback+0.027
 comm+0.026
Gu+0.026
 source+0.026
Neg logit contributions
 often-0.026
igs-0.024
ot-0.023
 lived-0.023
 solar-0.022
Token flame
Token ID16506
Feature activation+2.30e-02

Pos logit contributions
igs+0.017
 often+0.015
ot+0.015
ug+0.014
apy+0.013
Neg logit contributions
 written-0.018
 comm-0.018
 source-0.018
Gu-0.017
 aback-0.017
Token said
Token ID531
Feature activation+8.46e-04

Pos logit contributions
 written+0.033
Gu+0.032
 aback+0.032
 comm+0.031
 source+0.029
Neg logit contributions
 often-0.035
igs-0.034
 lived-0.031
ot-0.030
 solar-0.029
Token "
Token ID366
Feature activation+6.78e-02

Pos logit contributions
 written+0.001
 aback+0.001
Gu+0.001
 comm+0.001
 source+0.001
Neg logit contributions
igs-0.001
 often-0.001
ot-0.001
 solar-0.001
 lived-0.001
Tokenbye
Token ID16390
Feature activation+3.52e-02

Pos logit contributions
 comm+0.092
 aback+0.087
 source+0.084
 begs+0.084
 calling+0.083
Neg logit contributions
 often-0.108
ot-0.101
 lived-0.092
ug-0.085
igs-0.083
Token"
Token ID1
Feature activation+9.18e-02

Pos logit contributions
 comm+0.061
 written+0.060
 aback+0.060
Gu+0.057
 bisc+0.056
Neg logit contributions
 often-0.043
igs-0.040
ot-0.039
 lived-0.035
 solar-0.034
Token too
Token ID1165
Feature activation-3.97e-02

Pos logit contributions
 comm+0.156
Gu+0.154
ction+0.151
ctions+0.151
 bisc+0.150
Neg logit contributions
 often-0.115
ot-0.103
 every-0.090
 tight-0.087
 lived-0.082
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
igs+0.069
 solar+0.059
apy+0.056
 lived+0.056
belt+0.056
Neg logit contributions
 written-0.052
 come-0.048
 calling-0.047
 alright-0.046
 saying-0.046
Token Amy
Token ID14235
Feature activation+4.16e-03

Pos logit contributions
 written+0.016
 aback+0.015
Gu+0.015
 comm+0.015
 source+0.014
Neg logit contributions
 often-0.021
igs-0.020
ot-0.019
 lived-0.018
 solar-0.018
Token went
Token ID1816
Feature activation+9.70e-03

Pos logit contributions
 written+0.005
 aback+0.005
Gu+0.005
 comm+0.005
 source+0.005
Neg logit contributions
igs-0.007
 often-0.007
ot-0.006
 solar-0.006
 lived-0.006
Token home
Token ID1363
Feature activation-1.27e-02

Pos logit contributions
 written+0.013
 aback+0.012
 comm+0.012
Gu+0.012
 source+0.011
Neg logit contributions
igs-0.016
 often-0.015
ot-0.014
 solar-0.013
 gut-0.013
Token me
Token ID502
Feature activation+1.34e-02

Pos logit contributions
igs+0.039
 often+0.038
 solar+0.037
 lived+0.036
ug+0.035
Neg logit contributions
 written-0.029
 aback-0.028
 comm-0.028
 calling-0.028
Gu-0.027
Token to
Token ID284
Feature activation-2.53e-03

Pos logit contributions
 written+0.018
Gu+0.018
 comm+0.017
 aback+0.017
hered+0.016
Neg logit contributions
igs-0.021
 often-0.021
ot-0.019
 solar-0.018
 lived-0.018
Token finish
Token ID5461
Feature activation-2.33e-02

Pos logit contributions
igs+0.004
 often+0.004
ot+0.004
 solar+0.004
 lived+0.004
Neg logit contributions
 written-0.003
 aback-0.003
 comm-0.003
Gu-0.003
 source-0.003
Token my
Token ID616
Feature activation-1.15e-02

Pos logit contributions
igs+0.032
 often+0.031
 solar+0.030
 lived+0.030
ot+0.030
Neg logit contributions
 written-0.034
 aback-0.032
 comm-0.031
 calling-0.031
 source-0.031
Token bath
Token ID7837
Feature activation+9.92e-03

Pos logit contributions
igs+0.018
 often+0.018
ot+0.016
 solar+0.016
 lived+0.016
Neg logit contributions
 comm-0.015
 written-0.014
 source-0.014
 aback-0.014
Gu-0.014
Token"
Token ID1
Feature activation+9.18e-02

Pos logit contributions
 written+0.017
Gu+0.016
 aback+0.016
 comm+0.016
 source+0.015
Neg logit contributions
igs-0.012
 often-0.012
ot-0.011
 solar-0.010
 lived-0.010
Token said
Token ID531
Feature activation+8.46e-04

Pos logit contributions
Gu+0.146
ctions+0.140
 comm+0.139
ction+0.137
hered+0.136
Neg logit contributions
 often-0.122
ot-0.109
 tight-0.101
 every-0.099
belt-0.095
Token her
Token ID607
Feature activation-1.61e-02

Pos logit contributions
 written+0.001
 aback+0.001
 comm+0.001
Gu+0.001
 source+0.001
Neg logit contributions
igs-0.001
 often-0.001
ot-0.001
 solar-0.001
 lived-0.001
Token mom
Token ID1995
Feature activation+1.55e-02

Pos logit contributions
igs+0.021
 often+0.019
 solar+0.017
ind+0.016
ug+0.016
Neg logit contributions
 written-0.027
 comm-0.026
 aback-0.025
 source-0.025
 bisc-0.025
Token.
Token ID13
Feature activation+1.84e-02

Pos logit contributions
Gu+0.017
 aback+0.017
 written+0.017
 comm+0.017
 source+0.015
Neg logit contributions
 often-0.028
igs-0.027
ot-0.025
 lived-0.024
 solar-0.024
Token
Token ID198
Feature activation-5.25e-03

Pos logit contributions
 written+0.030
 aback+0.029
Gu+0.029
 comm+0.028
 bisc+0.027
Neg logit contributions
 often-0.024
igs-0.022
ot-0.021
 lived-0.020
 solar-0.019
Token.
Token ID13
Feature activation+2.05e-02

Pos logit contributions
 written+0.023
Gu+0.023
 aback+0.022
 comm+0.022
 bisc+0.021
Neg logit contributions
 often-0.016
igs-0.015
ot-0.014
 gut-0.013
 solar-0.012
Token He
Token ID679
Feature activation+3.05e-03

Pos logit contributions
 written+0.040
 comm+0.038
hered+0.037
Gu+0.037
 aback+0.037
Neg logit contributions
 often-0.020
igs-0.019
ot-0.016
 lived-0.015
 gut-0.015
Token didn
Token ID1422
Feature activation+1.13e-02

Pos logit contributions
 written+0.004
 aback+0.004
Gu+0.004
 comm+0.004
 source+0.004
Neg logit contributions
igs-0.005
 often-0.004
ot-0.004
 solar-0.004
 lived-0.004
Token't
Token ID470
Feature activation+3.42e-02

Pos logit contributions
 written+0.012
 aback+0.011
Gu+0.011
 comm+0.011
 source+0.010
Neg logit contributions
igs-0.021
 often-0.021
ot-0.019
 solar-0.018
 gut-0.018
Token want
Token ID765
Feature activation+2.61e-02

Pos logit contributions
Gu+0.052
oph+0.052
 written+0.051
 comm+0.050
 Birthday+0.050
Neg logit contributions
 often-0.048
ot-0.042
igs-0.039
 exc-0.039
 lived-0.036
Token Farmer
Token ID37755
Feature activation+9.15e-02

Pos logit contributions
 comm+0.037
 written+0.037
Gu+0.037
hered+0.036
arre+0.035
Neg logit contributions
 often-0.039
igs-0.037
ot-0.034
 gut-0.031
 solar-0.031
Token Dan
Token ID6035
Feature activation+5.27e-04

Pos logit contributions
Uh+0.157
Gu+0.153
Ta+0.142
umbs+0.142
arre+0.142
Neg logit contributions
 often-0.113
ot-0.088
 lived-0.086
 gut-0.084
 toes-0.084
Token to
Token ID284
Feature activation+1.45e-02

Pos logit contributions
 written+0.001
 aback+0.001
Gu+0.001
 comm+0.001
 source+0.001
Neg logit contributions
igs-0.001
 often-0.001
ot-0.001
 solar-0.001
 lived-0.001
Token be
Token ID307
Feature activation+2.73e-02

Pos logit contributions
 written+0.020
Gu+0.020
 aback+0.019
 comm+0.019
 bisc+0.018
Neg logit contributions
 often-0.023
igs-0.022
ot-0.020
 gut-0.019
 lived-0.019
Token mad
Token ID8805
Feature activation+2.18e-02

Pos logit contributions
Gu+0.041
 written+0.041
oph+0.041
cake+0.041
 bisc+0.039
Neg logit contributions
 often-0.038
igs-0.037
ot-0.033
 gut-0.030
 tight-0.030
Token.
Token ID13
Feature activation+2.18e-02

Pos logit contributions
Gu+0.038
 written+0.038
 aback+0.037
 comm+0.037
 bisc+0.036
Neg logit contributions
 often-0.026
igs-0.023
ot-0.021
 gut-0.020
 goals-0.019
Token will
Token ID481
Feature activation+1.35e-02

Pos logit contributions
igs+0.013
 often+0.012
ot+0.011
 solar+0.011
 gut+0.011
Neg logit contributions
 written-0.010
 aback-0.010
Gu-0.009
 comm-0.009
 source-0.009
Token be
Token ID307
Feature activation+3.40e-02

Pos logit contributions
 written+0.017
 aback+0.017
Gu+0.017
 comm+0.016
hered+0.015
Neg logit contributions
 often-0.022
igs-0.022
ot-0.020
 solar-0.019
 lived-0.019
Token this
Token ID428
Feature activation+2.87e-02

Pos logit contributions
hered+0.048
Gu+0.048
 written+0.048
cake+0.047
oph+0.047
Neg logit contributions
 often-0.052
igs-0.051
ot-0.042
 tight-0.042
 gut-0.042
Token glad
Token ID9675
Feature activation+4.45e-02

Pos logit contributions
 written+0.045
 aback+0.043
Gu+0.043
hered+0.042
 comm+0.042
Neg logit contributions
 often-0.041
igs-0.038
ot-0.034
 lived-0.032
 solar-0.032
Token come
Token ID1282
Feature activation+4.30e-02

Pos logit contributions
Gu+0.075
 aback+0.074
 written+0.074
hered+0.074
 bisc+0.072
Neg logit contributions
 often-0.060
igs-0.050
ot-0.047
 lived-0.043
 gut-0.041
Token winter
Token ID7374
Feature activation+9.15e-02

Pos logit contributions
 comm+0.068
 written+0.065
Gu+0.064
 aback+0.063
oph+0.062
Neg logit contributions
 often-0.063
ot-0.057
igs-0.056
ind-0.051
 lived-0.050
Token season
Token ID1622
Feature activation+5.24e-02

Pos logit contributions
Gu+0.187
 aback+0.184
 written+0.180
Sur+0.176
eared+0.175
Neg logit contributions
 often-0.101
igs-0.075
ot-0.064
 every-0.062
 lived-0.059
Token?"
Token ID1701
Feature activation+9.10e-03

Pos logit contributions
Gu+0.102
 written+0.100
 aback+0.097
 comm+0.097
eared+0.094
Neg logit contributions
 often-0.059
igs-0.052
ot-0.046
 lived-0.041
ug-0.040
Token asked
Token ID1965
Feature activation+3.94e-03

Pos logit contributions
 written+0.012
 aback+0.012
Gu+0.012
 comm+0.012
 source+0.011
Neg logit contributions
igs-0.014
 often-0.014
ot-0.013
 solar-0.012
 lived-0.012
Token Mama
Token ID40084
Feature activation+8.78e-03

Pos logit contributions
 written+0.006
 aback+0.006
Gu+0.006
 comm+0.006
 source+0.006
Neg logit contributions
igs-0.005
 often-0.005
ot-0.004
 solar-0.004
 lived-0.004
Token.
Token ID13
Feature activation+2.31e-02

Pos logit contributions
 written+0.016
 aback+0.016
Gu+0.016
 comm+0.016
 source+0.015
Neg logit contributions
igs-0.009
 often-0.009
ot-0.008
 solar-0.007
 lived-0.007
Tokenmy
Token ID1820
Feature activation+2.78e-02

Pos logit contributions
 comm+0.044
Gu+0.042
 aback+0.041
 written+0.041
 bisc+0.041
Neg logit contributions
 often-0.069
 lived-0.059
ot-0.059
igs-0.059
 tight-0.055
Token saw
Token ID2497
Feature activation+4.17e-02

Pos logit contributions
Gu+0.034
 comm+0.034
 written+0.034
 aback+0.034
 source+0.032
Neg logit contributions
 often-0.047
igs-0.043
 lived-0.042
ot-0.040
 tight-0.038
Token her
Token ID607
Feature activation+3.65e-02

Pos logit contributions
 comm+0.084
Gu+0.082
 bisc+0.079
hered+0.078
 source+0.078
Neg logit contributions
 often-0.036
igs-0.033
 lived-0.033
ot-0.029
 every-0.026
Token and
Token ID290
Feature activation-2.14e-03

Pos logit contributions
Gu+0.067
 aback+0.060
 written+0.058
umbs+0.057
eared+0.057
Neg logit contributions
 often-0.045
igs-0.040
 tight-0.039
ot-0.039
 lived-0.038
Token said
Token ID531
Feature activation-1.40e-02

Pos logit contributions
igs+0.004
 often+0.004
 solar+0.004
ot+0.004
 gut+0.004
Neg logit contributions
 written-0.002
 aback-0.002
 comm-0.002
Gu-0.002
 source-0.002
Token "
Token ID366
Feature activation+9.14e-02

Pos logit contributions
igs+0.019
 often+0.017
 solar+0.016
ind+0.016
ot+0.015
Neg logit contributions
 written-0.023
 aback-0.021
Gu-0.021
 comm-0.021
hered-0.020
TokenG
Token ID38
Feature activation-2.10e-02

Pos logit contributions
 begs+0.140
ives+0.133
 comm+0.131
 Birthday+0.130
 bisc+0.126
Neg logit contributions
 often-0.114
ind-0.106
ot-0.102
 every-0.088
 rainy-0.087
Tokenabby
Token ID42457
Feature activation-3.36e-02

Pos logit contributions
 often+0.035
igs+0.034
 solar+0.034
apy+0.033
 lived+0.031
Neg logit contributions
 written-0.025
 aback-0.024
Gu-0.023
oph-0.023
cake-0.022
Token,
Token ID11
Feature activation+2.45e-02

Pos logit contributions
igs+0.048
 often+0.045
 solar+0.042
 gut+0.040
apy+0.040
Neg logit contributions
 written-0.050
 aback-0.047
 Birthday-0.046
 come-0.045
 appeared-0.044
Token what
Token ID644
Feature activation+3.53e-02

Pos logit contributions
 written+0.040
 aback+0.040
 comm+0.039
Gu+0.039
hered+0.039
Neg logit contributions
 often-0.032
igs-0.031
ot-0.028
 lived-0.027
 gut-0.026
Token's
Token ID338
Feature activation+1.60e-02

Pos logit contributions
 aback+0.052
 comm+0.051
hered+0.048
Gu+0.048
 source+0.047
Neg logit contributions
 often-0.052
 lived-0.051
igs-0.047
 solar-0.045
ot-0.045
Token we
Token ID356
Feature activation-1.88e-02

Pos logit contributions
igs+0.046
 often+0.043
ot+0.041
 solar+0.041
 gut+0.040
Neg logit contributions
 written-0.029
Gu-0.027
 source-0.026
 come-0.026
 comm-0.026
Token have
Token ID423
Feature activation+1.72e-02

Pos logit contributions
igs+0.032
 often+0.030
 gut+0.028
apy+0.027
ot+0.027
Neg logit contributions
 written-0.024
 source-0.021
 come-0.021
 aback-0.021
 comm-0.021
Token a
Token ID257
Feature activation+3.28e-02

Pos logit contributions
 written+0.022
 aback+0.022
Gu+0.021
 comm+0.021
hered+0.021
Neg logit contributions
igs-0.028
 often-0.027
ot-0.024
 solar-0.024
 lived-0.023
Token perfectly
Token ID7138
Feature activation+1.43e-02

Pos logit contributions
 written+0.049
 aback+0.048
Gu+0.046
eared+0.044
 comm+0.044
Neg logit contributions
 often-0.049
igs-0.047
 solar-0.043
ot-0.042
 tight-0.040
Token good
Token ID922
Feature activation+3.30e-02

Pos logit contributions
 written+0.023
 aback+0.023
Gu+0.022
 comm+0.022
 source+0.021
Neg logit contributions
igs-0.019
 often-0.018
ot-0.016
 solar-0.015
 lived-0.015
Token tea
Token ID8887
Feature activation+9.12e-02

Pos logit contributions
 written+0.051
Gu+0.051
 aback+0.048
 comm+0.046
cake+0.046
Neg logit contributions
igs-0.048
 often-0.047
ot-0.045
 solar-0.041
 lived-0.039
Token set
Token ID900
Feature activation+3.20e-02

Pos logit contributions
Gu+0.162
 aback+0.155
 comm+0.137
Uh+0.135
viously+0.134
Neg logit contributions
 often-0.120
igs-0.107
ot-0.101
 collection-0.095
 lived-0.095
Token?"
Token ID1701
Feature activation+6.40e-03

Pos logit contributions
Gu+0.057
 comm+0.056
 aback+0.054
 written+0.053
hered+0.051
Neg logit contributions
 often-0.039
igs-0.038
ot-0.034
ug-0.031
 lived-0.030
Token he
Token ID339
Feature activation+5.93e-03

Pos logit contributions
 written+0.010
 aback+0.009
Gu+0.009
 comm+0.009
 source+0.009
Neg logit contributions
igs-0.009
 often-0.009
ot-0.008
 solar-0.008
 lived-0.007
Token asked
Token ID1965
Feature activation-1.75e-02

Pos logit contributions
 written+0.007
 aback+0.007
Gu+0.007
 comm+0.007
 source+0.006
Neg logit contributions
igs-0.010
 often-0.010
ot-0.009
 solar-0.009
 lived-0.009
Token.
Token ID13
Feature activation+6.86e-03

Pos logit contributions
igs+0.027
 solar+0.024
 often+0.023
ug+0.021
ind+0.021
Neg logit contributions
 aback-0.026
 written-0.026
 comm-0.025
Gu-0.024
hered-0.024
Token"
Token ID1
Feature activation+4.04e-02

Pos logit contributions
 written+0.010
 aback+0.009
Gu+0.009
 comm+0.009
 source+0.009
Neg logit contributions
igs-0.006
 often-0.006
ot-0.005
 solar-0.005
 lived-0.005
TokenPlease
Token ID5492
Feature activation-1.15e-02

Pos logit contributions
 Birthday+0.053
 written+0.050
 aback+0.048
 comm+0.046
 begs+0.046
Neg logit contributions
 often-0.061
His-0.058
 lived-0.057
apy-0.055
ot-0.055
Token let
Token ID1309
Feature activation+4.53e-02

Pos logit contributions
igs+0.017
 often+0.016
 solar+0.014
ot+0.014
 goals+0.014
Neg logit contributions
 aback-0.017
 written-0.017
 comm-0.016
 source-0.016
hered-0.016
Token me
Token ID502
Feature activation+2.70e-02

Pos logit contributions
 written+0.088
bows+0.083
Gu+0.081
elt+0.081
 comm+0.080
Neg logit contributions
 often-0.046
igs-0.044
 lived-0.042
ot-0.039
 tight-0.037
Token go
Token ID467
Feature activation+2.46e-02

Pos logit contributions
 written+0.040
Gu+0.040
 comm+0.039
 aback+0.037
oph+0.037
Neg logit contributions
igs-0.039
 often-0.039
ot-0.035
ug-0.033
 lived-0.033
Token"
Token ID1
Feature activation+9.12e-02

Pos logit contributions
 written+0.040
 comm+0.038
Gu+0.038
oph+0.036
 aback+0.036
Neg logit contributions
 often-0.034
igs-0.033
ot-0.029
ug-0.027
 lived-0.027
Token said
Token ID531
Feature activation+4.75e-03

Pos logit contributions
Gu+0.147
ograph+0.145
ctions+0.145
ction+0.144
 Birthday+0.143
Neg logit contributions
 often-0.108
 his-0.090
 every-0.088
 tight-0.087
 high-0.086
Token Jim
Token ID5395
Feature activation-1.51e-02

Pos logit contributions
 written+0.008
 aback+0.007
Gu+0.007
 comm+0.007
 source+0.007
Neg logit contributions
igs-0.006
 often-0.006
ot-0.006
 solar-0.005
 lived-0.005
Token.
Token ID13
Feature activation+2.05e-02

Pos logit contributions
igs+0.020
 often+0.018
 solar+0.017
ot+0.016
apy+0.015
Neg logit contributions
 written-0.025
 aback-0.025
 comm-0.024
Gu-0.023
 source-0.023
Token
Token ID198
Feature activation+1.44e-02

Pos logit contributions
 written+0.032
 comm+0.031
 aback+0.030
Gu+0.030
 bisc+0.030
Neg logit contributions
 often-0.028
igs-0.026
ot-0.024
 lived-0.023
 solar-0.023
Token
Token ID198
Feature activation+1.39e-02

Pos logit contributions
 written+0.020
 aback+0.019
Gu+0.019
 comm+0.019
hered+0.018
Neg logit contributions
 often-0.022
igs-0.022
ot-0.020
 lived-0.019
 solar-0.019
Token,
Token ID11
Feature activation+9.55e-03

Pos logit contributions
 comm+0.031
Gu+0.031
 aback+0.030
 written+0.030
hered+0.029
Neg logit contributions
 often-0.038
igs-0.035
ot-0.032
 lived-0.031
 solar-0.030
Token "
Token ID366
Feature activation+6.69e-02

Pos logit contributions
 written+0.016
 aback+0.016
Gu+0.016
 comm+0.016
 source+0.015
Neg logit contributions
 often-0.011
igs-0.011
ot-0.010
 lived-0.009
 solar-0.009
TokenWhat
Token ID2061
Feature activation+3.01e-02

Pos logit contributions
 comm+0.128
 aback+0.119
 begs+0.117
 happened+0.111
 forgiven+0.108
Neg logit contributions
 often-0.070
ind-0.061
His-0.058
ot-0.057
 solar-0.053
Token is
Token ID318
Feature activation-2.61e-03

Pos logit contributions
 aback+0.046
 comm+0.046
hered+0.045
Gu+0.043
 source+0.043
Neg logit contributions
 often-0.044
 lived-0.041
igs-0.039
 solar-0.037
ot-0.037
Token it
Token ID340
Feature activation+3.25e-02

Pos logit contributions
igs+0.004
 often+0.004
ot+0.003
 solar+0.003
 gut+0.003
Neg logit contributions
 written-0.004
 aback-0.004
Gu-0.004
 comm-0.004
 source-0.004
Token mom
Token ID1995
Feature activation+9.09e-02

Pos logit contributions
Gu+0.061
 aback+0.060
 comm+0.060
 written+0.059
hered+0.058
Neg logit contributions
 often-0.035
igs-0.033
 lived-0.029
ot-0.029
 gut-0.025
Token?!"
Token ID30823
Feature activation+2.81e-02

Pos logit contributions
 comm+0.146
 aback+0.142
Gu+0.140
Sur+0.139
 bisc+0.139
Neg logit contributions
 often-0.127
 lived-0.103
ot-0.097
igs-0.093
 tight-0.093
Token Her
Token ID2332
Feature activation+3.50e-03

Pos logit contributions
Gu+0.052
 comm+0.051
 aback+0.051
 written+0.051
 bisc+0.049
Neg logit contributions
 often-0.030
ot-0.026
 lived-0.024
igs-0.024
apy-0.022
Token mom
Token ID1995
Feature activation+2.40e-02

Pos logit contributions
 written+0.007
 aback+0.007
Gu+0.007
 comm+0.007
 source+0.006
Neg logit contributions
igs-0.003
 often-0.003
ot-0.003
 solar-0.003
 lived-0.003
Token replied
Token ID8712
Feature activation-3.61e-03

Pos logit contributions
Gu+0.030
 aback+0.030
 written+0.029
 comm+0.028
 source+0.028
Neg logit contributions
 often-0.040
igs-0.037
ot-0.036
 lived-0.035
 tight-0.032
Token,
Token ID11
Feature activation+9.94e-03

Pos logit contributions
igs+0.004
 often+0.004
 solar+0.004
ot+0.003
 lived+0.003
Neg logit contributions
 written-0.006
 aback-0.006
 comm-0.006
Gu-0.006
hered-0.006
Token Tim
Token ID5045
Feature activation-2.80e-02

Pos logit contributions
igs+0.005
 often+0.005
ot+0.004
 gut+0.004
 solar+0.004
Neg logit contributions
 written-0.007
Gu-0.006
 aback-0.006
 comm-0.006
 source-0.006
Token enjoyed
Token ID8359
Feature activation-1.02e-02

Pos logit contributions
igs+0.048
 gut+0.044
apy+0.042
 solar+0.042
ot+0.041
Neg logit contributions
 written-0.035
Gu-0.031
 aback-0.030
 comm-0.030
hered-0.029
Token hitting
Token ID9008
Feature activation-4.45e-03

Pos logit contributions
igs+0.016
 often+0.016
ot+0.015
 solar+0.014
 gut+0.014
Neg logit contributions
 written-0.013
Gu-0.013
 comm-0.012
 aback-0.012
hered-0.012
Token the
Token ID262
Feature activation+1.93e-02

Pos logit contributions
igs+0.005
 often+0.005
ot+0.004
 solar+0.004
 lived+0.004
Neg logit contributions
 written-0.008
 aback-0.008
 comm-0.007
Gu-0.007
 source-0.007
Token ball
Token ID2613
Feature activation-4.86e-02

Pos logit contributions
 written+0.033
 aback+0.033
Gu+0.032
 comm+0.031
 calling+0.029
Neg logit contributions
igs-0.024
 often-0.023
ot-0.021
 solar-0.020
 goals-0.020
Token very
Token ID845
Feature activation-1.16e-01

Pos logit contributions
igs+0.089
 gut+0.088
ind+0.087
 solar+0.084
oppy+0.080
Neg logit contributions
 written-0.051
 right-0.047
 source-0.046
 aback-0.046
 come-0.045
Token hard
Token ID1327
Feature activation-3.68e-02

Pos logit contributions
 solar+0.206
ocol+0.193
 Acc+0.187
 lived+0.187
 wardrobe+0.183
Neg logit contributions
 sure-0.089
 right-0.087
 so-0.086
 too-0.085
t-0.080
Token.
Token ID13
Feature activation+4.42e-03

Pos logit contributions
igs+0.060
 gut+0.057
ot+0.054
 solar+0.054
apy+0.054
Neg logit contributions
 written-0.049
 aback-0.042
 comm-0.041
 source-0.041
 saying-0.039
Token
Token ID198
Feature activation+1.72e-02

Pos logit contributions
 written+0.006
 aback+0.006
Gu+0.006
 comm+0.006
 source+0.005
Neg logit contributions
igs-0.007
 often-0.007
ot-0.006
 solar-0.006
 lived-0.006
Token
Token ID198
Feature activation-1.61e-02

Pos logit contributions
 written+0.023
 aback+0.023
 comm+0.022
Gu+0.022
hered+0.021
Neg logit contributions
 often-0.027
igs-0.027
ot-0.024
 lived-0.023
 solar-0.023
TokenOne
Token ID3198
Feature activation-2.60e-02

Pos logit contributions
igs+0.023
 gut+0.021
 solar+0.020
 often+0.019
ot+0.019
Neg logit contributions
Gu-0.025
 written-0.024
 source-0.022
 bisc-0.022
 comm-0.022
Token a
Token ID257
Feature activation-3.63e-02

Pos logit contributions
 aback+0.053
 comm+0.050
 written+0.050
 begs+0.050
Gu+0.048
Neg logit contributions
 often-0.047
igs-0.044
 lived-0.042
ot-0.041
 solar-0.039
Token delicate
Token ID19217
Feature activation-3.47e-02

Pos logit contributions
igs+0.066
 often+0.059
ug+0.056
lo+0.055
aro+0.055
Neg logit contributions
Gu-0.041
 comm-0.040
 written-0.039
 bisc-0.039
 source-0.037
Token bird
Token ID6512
Feature activation-4.12e-02

Pos logit contributions
igs+0.062
 often+0.057
apy+0.053
 solar+0.052
ug+0.052
Neg logit contributions
 written-0.040
 comm-0.038
 source-0.037
 bisc-0.037
Gu-0.035
Token named
Token ID3706
Feature activation+6.23e-03

Pos logit contributions
igs+0.060
 gut+0.054
 goals+0.052
apy+0.051
belt+0.050
Neg logit contributions
 written-0.059
 calling-0.057
 source-0.056
 appeared-0.054
 saying-0.054
Token Fl
Token ID1610
Feature activation-2.55e-02

Pos logit contributions
 written+0.010
 aback+0.009
 comm+0.009
Gu+0.009
 source+0.009
Neg logit contributions
igs-0.009
 often-0.008
ot-0.008
 solar-0.007
 lived-0.007
Tokenappy
Token ID7774
Feature activation-1.14e-01

Pos logit contributions
igs+0.036
 often+0.035
 solar+0.031
 gut+0.031
nam+0.030
Neg logit contributions
hered-0.036
 written-0.035
 source-0.033
 Birthday-0.033
 bisc-0.033
Token.
Token ID13
Feature activation-5.29e-02

Pos logit contributions
igs+0.276
apy+0.260
belt+0.258
 gut+0.255
agons+0.239
Neg logit contributions
's-0.049
 appeared-0.045
 once-0.044
m-0.039
 written-0.035
Token She
Token ID1375
Feature activation-4.25e-02

Pos logit contributions
igs+0.098
 gut+0.091
belt+0.087
ind+0.086
ot+0.085
Neg logit contributions
 How-0.057
Gu-0.057
 aback-0.053
 bisc-0.053
 Would-0.051
Token liked
Token ID8288
Feature activation+1.17e-02

Pos logit contributions
igs+0.054
 gut+0.051
 goals+0.045
belt+0.045
 rocky+0.042
Neg logit contributions
 written-0.074
 happened-0.070
 appeared-0.070
 baked-0.066
Gu-0.065
Token to
Token ID284
Feature activation-2.72e-02

Pos logit contributions
 written+0.014
 aback+0.014
 comm+0.013
Gu+0.013
hered+0.012
Neg logit contributions
igs-0.020
 often-0.020
ot-0.018
 solar-0.018
 lived-0.018
Token fly
Token ID6129
Feature activation-1.51e-02

Pos logit contributions
igs+0.049
 often+0.046
belt+0.045
bers+0.045
ind+0.045
Neg logit contributions
 come-0.027
 written-0.027
 source-0.026
 comm-0.025
Gu-0.025
Tokenpillar
Token ID41643
Feature activation-6.08e-02

Pos logit contributions
igs+0.029
 often+0.028
ot+0.027
 gut+0.026
 solar+0.026
Neg logit contributions
 written-0.011
 aback-0.011
 comm-0.010
Gu-0.010
hered-0.009
Token loved
Token ID6151
Feature activation-1.84e-02

Pos logit contributions
igs+0.125
ug+0.115
belt+0.115
apy+0.114
 gut+0.114
Neg logit contributions
 written-0.056
 happened-0.048
 appeared-0.048
 meant-0.046
 revealed-0.043
Token to
Token ID284
Feature activation-4.13e-02

Pos logit contributions
igs+0.040
belt+0.037
 often+0.036
ot+0.036
ug+0.036
Neg logit contributions
 written-0.014
Gu-0.012
 comm-0.012
 aback-0.012
 source-0.011
Token move
Token ID1445
Feature activation-4.82e-02

Pos logit contributions
igs+0.076
ind+0.073
ug+0.073
belt+0.073
bers+0.072
Neg logit contributions
 come-0.048
 bisc-0.041
 smell-0.040
 source-0.039
 taste-0.039
Token around
Token ID1088
Feature activation-4.33e-02

Pos logit contributions
igs+0.088
belt+0.081
oppy+0.079
 gut+0.078
apy+0.078
Neg logit contributions
 aback-0.055
 written-0.053
 such-0.047
 come-0.043
 comm-0.043
Token very
Token ID845
Feature activation-1.13e-01

Pos logit contributions
belt+0.091
igs+0.089
apy+0.085
ot+0.084
oppy+0.081
Neg logit contributions
 written-0.034
 coming-0.033
 such-0.032
 come-0.032
 calling-0.032
Token quickly
Token ID2952
Feature activation-5.76e-02

Pos logit contributions
belt+0.209
 lived+0.206
agons+0.202
 Acc+0.199
isher+0.196
Neg logit contributions
 out-0.072
 so-0.069
 sure-0.064
 right-0.063
 to-0.059
Token.
Token ID13
Feature activation-2.03e-02

Pos logit contributions
igs+0.099
apy+0.089
oppy+0.089
ind+0.087
ot+0.087
Neg logit contributions
 written-0.070
 calling-0.064
 saying-0.063
 come-0.061
 source-0.061
Token One
Token ID1881
Feature activation-2.51e-02

Pos logit contributions
igs+0.034
 gut+0.029
ot+0.028
ind+0.028
 often+0.028
Neg logit contributions
Gu-0.028
 written-0.027
 aback-0.026
 How-0.025
 source-0.025
Token day
Token ID1110
Feature activation-2.64e-02

Pos logit contributions
igs+0.044
 often+0.036
 gut+0.035
belt+0.035
 solar+0.035
Neg logit contributions
 written-0.031
 source-0.030
Gu-0.030
 bisc-0.029
 comm-0.029
Token,
Token ID11
Feature activation-1.62e-02

Pos logit contributions
igs+0.044
ind+0.042
 often+0.042
 solar+0.040
belt+0.039
Neg logit contributions
 written-0.034
 happened-0.033
hered-0.032
 appeared-0.032
 aback-0.032
Token had
Token ID550
Feature activation+2.07e-02

Pos logit contributions
igs+0.061
 gut+0.053
apy+0.051
aro+0.051
ug+0.050
Neg logit contributions
 written-0.049
 happened-0.042
Gu-0.042
 baked-0.042
 meant-0.041
Token a
Token ID257
Feature activation-2.00e-02

Pos logit contributions
 aback+0.031
 written+0.028
 comm+0.028
Gu+0.028
hered+0.027
Neg logit contributions
 often-0.031
igs-0.030
 lived-0.028
ot-0.027
 solar-0.027
Token toy
Token ID13373
Feature activation-7.07e-02

Pos logit contributions
igs+0.029
 often+0.026
ug+0.025
ot+0.024
aro+0.024
Neg logit contributions
 written-0.030
 comm-0.029
 bisc-0.029
Gu-0.029
 source-0.028
Token he
Token ID339
Feature activation-4.47e-02

Pos logit contributions
igs+0.131
 often+0.109
lo+0.108
ts+0.107
ot+0.107
Neg logit contributions
 written-0.081
 source-0.073
 bisc-0.073
 saying-0.073
 appeared-0.070
Token loved
Token ID6151
Feature activation-3.83e-02

Pos logit contributions
igs+0.067
lights+0.056
apy+0.055
 gut+0.055
aro+0.054
Neg logit contributions
 written-0.077
Gu-0.067
 baked-0.065
 meant-0.063
 come-0.063
Token very
Token ID845
Feature activation-1.12e-01

Pos logit contributions
igs+0.093
apy+0.085
oppy+0.084
 gut+0.083
ug+0.082
Neg logit contributions
 written-0.026
 saying-0.020
 coming-0.019
 calling-0.018
 telling-0.018
Token much
Token ID881
Feature activation-4.29e-02

Pos logit contributions
stack+0.230
 prope+0.222
 solar+0.218
 mast+0.213
 gut+0.212
Neg logit contributions
 news-0.063
 written-0.048
t-0.048
 supposed-0.042
 taste-0.041
Token.
Token ID13
Feature activation-1.57e-02

Pos logit contributions
igs+0.078
apy+0.072
ot+0.068
 solar+0.065
 lived+0.065
Neg logit contributions
 written-0.054
 aback-0.048
 comm-0.047
 saying-0.045
 such-0.045
Token One
Token ID1881
Feature activation-1.76e-02

Pos logit contributions
igs+0.022
 often+0.020
ind+0.018
 gut+0.018
ot+0.018
Neg logit contributions
 written-0.025
Gu-0.025
 aback-0.025
 bisc-0.023
 source-0.023
Token day
Token ID1110
Feature activation-2.80e-02

Pos logit contributions
igs+0.029
 often+0.024
 solar+0.023
 gut+0.023
ug+0.022
Neg logit contributions
 written-0.025
Gu-0.024
 source-0.023
 bisc-0.023
 comm-0.023
Token,
Token ID11
Feature activation-1.59e-02

Pos logit contributions
igs+0.043
 often+0.042
 solar+0.041
ind+0.040
 gut+0.039
Neg logit contributions
 written-0.037
hered-0.036
 happened-0.036
 aback-0.035
 appeared-0.035
Token and
Token ID290
Feature activation-4.32e-02

Pos logit contributions
igs+0.014
 often+0.013
ot+0.012
 solar+0.012
apy+0.011
Neg logit contributions
 written-0.016
 aback-0.016
 comm-0.015
Gu-0.015
hered-0.015
Token could
Token ID714
Feature activation-1.05e-02

Pos logit contributions
igs+0.066
ind+0.054
 ig+0.053
 gut+0.053
 solar+0.052
Neg logit contributions
 written-0.071
Gu-0.061
 aback-0.060
 come-0.059
 funn-0.058
Token kick
Token ID4829
Feature activation-4.74e-03

Pos logit contributions
igs+0.017
 often+0.015
 solar+0.015
ot+0.015
ug+0.014
Neg logit contributions
 written-0.014
 comm-0.013
Gu-0.013
 aback-0.013
 source-0.013
Token the
Token ID262
Feature activation+2.25e-03

Pos logit contributions
igs+0.007
 often+0.006
ot+0.006
 solar+0.006
 gut+0.006
Neg logit contributions
 written-0.007
 aback-0.007
 comm-0.007
Gu-0.007
hered-0.007
Token ball
Token ID2613
Feature activation-4.05e-02

Pos logit contributions
 written+0.004
 aback+0.003
Gu+0.003
 comm+0.003
 source+0.003
Neg logit contributions
igs-0.003
 often-0.003
ot-0.003
 solar-0.003
 lived-0.002
Token very
Token ID845
Feature activation-1.12e-01

Pos logit contributions
igs+0.069
 gut+0.064
 solar+0.062
ind+0.060
apy+0.060
Neg logit contributions
 written-0.050
 aback-0.047
hered-0.046
 come-0.045
 source-0.045
Token far
Token ID1290
Feature activation-2.39e-02

Pos logit contributions
 solar+0.194
 lived+0.192
 ig+0.191
 chim+0.187
 Acc+0.183
Neg logit contributions
 right-0.103
 so-0.090
 written-0.089
 too-0.087
 sure-0.087
Token.
Token ID13
Feature activation-1.50e-02

Pos logit contributions
igs+0.041
apy+0.036
 often+0.034
 solar+0.034
 lived+0.033
Neg logit contributions
 aback-0.032
 written-0.032
 source-0.029
 comm-0.029
hered-0.028
Token She
Token ID1375
Feature activation-3.27e-02

Pos logit contributions
igs+0.020
 often+0.017
 gut+0.017
ot+0.016
ug+0.016
Neg logit contributions
 written-0.024
Gu-0.024
 aback-0.023
 source-0.022
 How-0.022
Token wanted
Token ID2227
Feature activation+1.51e-02

Pos logit contributions
igs+0.057
 gut+0.049
ind+0.048
apy+0.047
ot+0.047
Neg logit contributions
 written-0.042
Gu-0.036
 announced-0.036
 aback-0.036
 comm-0.035
Token to
Token ID284
Feature activation+8.41e-03

Pos logit contributions
 written+0.018
 aback+0.018
 comm+0.018
Gu+0.018
 bisc+0.017
Neg logit contributions
 often-0.025
igs-0.025
ot-0.022
 solar-0.022
 gut-0.021
Token to
Token ID284
Feature activation-6.26e-02

Pos logit contributions
igs+0.050
 gut+0.046
oppy+0.046
apy+0.045
 often+0.044
Neg logit contributions
 written-0.015
Gu-0.013
 come-0.012
 calling-0.012
 saying-0.011
Token go
Token ID467
Feature activation-2.95e-02

Pos logit contributions
igs+0.123
 gut+0.120
oppy+0.119
bers+0.119
 goals+0.117
Neg logit contributions
 come-0.074
 say-0.054
 smell-0.051
 surprise-0.051
 calling-0.050
Token around
Token ID1088
Feature activation-4.63e-02

Pos logit contributions
igs+0.056
 gut+0.055
apy+0.054
oppy+0.053
 goals+0.053
Neg logit contributions
 come-0.026
 calling-0.025
 source-0.025
 written-0.025
 coming-0.022
Token the
Token ID262
Feature activation-1.86e-02

Pos logit contributions
igs+0.087
oppy+0.085
apy+0.084
ot+0.083
belt+0.081
Neg logit contributions
 calling-0.046
 coming-0.045
 written-0.045
 source-0.045
 come-0.042
Token track
Token ID2610
Feature activation-4.42e-02

Pos logit contributions
igs+0.030
 often+0.027
apy+0.027
 gut+0.025
ot+0.025
Neg logit contributions
Gu-0.025
 written-0.025
 source-0.024
 comm-0.024
 calling-0.023
Token very
Token ID845
Feature activation-1.11e-01

Pos logit contributions
igs+0.076
 gut+0.066
apy+0.065
oppy+0.064
ind+0.063
Neg logit contributions
 written-0.061
 source-0.053
 come-0.052
 calling-0.051
 Birthday-0.049
Token fast
Token ID3049
Feature activation-5.31e-02

Pos logit contributions
ocol+0.233
 microsc+0.194
 solar+0.193
 Acc+0.193
charged+0.188
Neg logit contributions
 so-0.078
 sure-0.065
 once-0.061
 out-0.061
 just-0.059
Token.
Token ID13
Feature activation-1.08e-02

Pos logit contributions
igs+0.103
apy+0.094
oppy+0.090
 gut+0.086
aro+0.086
Neg logit contributions
 written-0.061
 aback-0.054
 comm-0.051
 source-0.051
 such-0.050
Token One
Token ID1881
Feature activation-3.41e-02

Pos logit contributions
igs+0.016
 often+0.014
ot+0.013
 gut+0.013
ind+0.013
Neg logit contributions
 written-0.017
Gu-0.017
 aback-0.016
 source-0.015
 comm-0.015
Token day
Token ID1110
Feature activation-3.56e-02

Pos logit contributions
igs+0.065
 gut+0.052
aro+0.051
 often+0.051
 solar+0.051
Neg logit contributions
 written-0.038
 source-0.037
 come-0.037
Gu-0.036
 bisc-0.035
Token,
Token ID11
Feature activation-2.39e-02

Pos logit contributions
igs+0.062
ind+0.060
 often+0.059
 solar+0.057
 gut+0.056
Neg logit contributions
 happened-0.042
 written-0.042
hered-0.040
 appeared-0.040
 come-0.038
Token liked
Token ID8288
Feature activation+1.01e-02

Pos logit contributions
igs+0.052
 gut+0.044
ug+0.043
 goals+0.043
aro+0.041
Neg logit contributions
 written-0.044
 happened-0.040
Gu-0.040
 appeared-0.038
 come-0.038
Token playing
Token ID2712
Feature activation+1.17e-02

Pos logit contributions
 written+0.012
 aback+0.011
 comm+0.011
Gu+0.011
hered+0.010
Neg logit contributions
igs-0.018
 often-0.018
ot-0.016
 solar-0.016
 lived-0.015
Token in
Token ID287
Feature activation-1.16e-02

Pos logit contributions
 written+0.016
 aback+0.016
Gu+0.015
 comm+0.015
 source+0.015
Neg logit contributions
igs-0.019
 often-0.018
ot-0.016
 lived-0.016
 solar-0.016
Token the
Token ID262
Feature activation-4.34e-03

Pos logit contributions
igs+0.016
 often+0.016
 gut+0.014
ot+0.014
ug+0.014
Neg logit contributions
 written-0.017
Gu-0.017
 aback-0.016
 comm-0.016
 source-0.016
Token sun
Token ID4252
Feature activation+2.58e-02

Pos logit contributions
igs+0.007
 often+0.007
ot+0.006
 gut+0.006
 solar+0.006
Neg logit contributions
 written-0.006
Gu-0.006
 comm-0.005
 aback-0.005
 source-0.005
Token very
Token ID845
Feature activation-1.11e-01

Pos logit contributions
 aback+0.052
Gu+0.051
 comm+0.050
 written+0.049
hered+0.046
Neg logit contributions
 often-0.027
igs-0.025
ot-0.022
 lived-0.020
 solar-0.020
Token much
Token ID881
Feature activation-4.91e-02

Pos logit contributions
charged+0.189
rum+0.182
hetic+0.182
 solar+0.178
 gut+0.178
Neg logit contributions
 out-0.096
 so-0.095
cake-0.094
 up-0.084
 right-0.084
Token.
Token ID13
Feature activation-2.13e-02

Pos logit contributions
igs+0.093
apy+0.085
aro+0.079
ot+0.077
 gut+0.077
Neg logit contributions
 written-0.059
 aback-0.050
 source-0.047
 such-0.046
 calling-0.046
Token One
Token ID1881
Feature activation-1.65e-02

Pos logit contributions
igs+0.033
 often+0.030
ot+0.029
 gut+0.027
ts+0.027
Neg logit contributions
 written-0.030
 aback-0.029
Gu-0.028
 source-0.028
 comm-0.025
Token day
Token ID1110
Feature activation-2.50e-02

Pos logit contributions
igs+0.026
 often+0.022
 solar+0.021
 gut+0.021
ug+0.020
Neg logit contributions
 written-0.024
 source-0.023
Gu-0.023
 bisc-0.022
 comm-0.022
Token,
Token ID11
Feature activation-1.90e-02

Pos logit contributions
igs+0.036
 often+0.035
 solar+0.033
ot+0.031
 goals+0.031
Neg logit contributions
 written-0.037
 aback-0.036
hered-0.035
 appeared-0.033
 source-0.033
Token shiny
Token ID22441
Feature activation-3.49e-02

Pos logit contributions
igs+0.020
 often+0.018
ot+0.017
ug+0.017
apy+0.016
Neg logit contributions
 written-0.025
 comm-0.024
Gu-0.024
 source-0.024
 bisc-0.023
Token shield
Token ID7614
Feature activation-1.35e-02

Pos logit contributions
igs+0.038
 often+0.037
apy+0.036
aro+0.035
lo+0.034
Neg logit contributions
 written-0.060
 source-0.057
 bisc-0.056
 aback-0.054
 comm-0.054
Token that
Token ID326
Feature activation-3.88e-02

Pos logit contributions
igs+0.021
 often+0.019
ot+0.018
apy+0.017
 solar+0.017
Neg logit contributions
 written-0.020
 aback-0.018
 comm-0.018
 source-0.018
Gu-0.017
Token he
Token ID339
Feature activation-4.88e-02

Pos logit contributions
igs+0.060
ind+0.053
ug+0.050
 gut+0.049
 goals+0.048
Neg logit contributions
 written-0.059
 come-0.052
 happened-0.052
 appeared-0.051
Gu-0.051
Token loved
Token ID6151
Feature activation-3.39e-02

Pos logit contributions
igs+0.072
apy+0.068
 gut+0.067
lights+0.065
 goals+0.065
Neg logit contributions
 written-0.071
 baked-0.063
 appeared-0.060
 prevented-0.060
Gu-0.060
Token very
Token ID845
Feature activation-1.10e-01

Pos logit contributions
igs+0.080
apy+0.076
ot+0.073
oppy+0.073
 gut+0.073
Neg logit contributions
 written-0.023
 come-0.016
 coming-0.015
 saying-0.015
 calling-0.014
Token much
Token ID881
Feature activation-4.25e-02

Pos logit contributions
stack+0.206
belt+0.203
 solar+0.203
 gut+0.203
 mast+0.201
Neg logit contributions
 sure-0.058
 written-0.051
 news-0.050
umbs-0.044
ata-0.043
Token.
Token ID13
Feature activation-2.00e-02

Pos logit contributions
igs+0.076
apy+0.070
ot+0.069
 solar+0.063
oppy+0.063
Neg logit contributions
 written-0.055
 aback-0.048
 such-0.046
 comm-0.045
 saying-0.044
Token He
Token ID679
Feature activation-5.73e-02

Pos logit contributions
igs+0.028
 often+0.024
ot+0.023
ind+0.023
apy+0.023
Neg logit contributions
Gu-0.032
 written-0.031
 aback-0.031
 source-0.029
 How-0.029
Token would
Token ID561
Feature activation-3.72e-02

Pos logit contributions
igs+0.102
 gut+0.098
apy+0.098
lo+0.093
lights+0.093
Neg logit contributions
 written-0.069
 baked-0.061
 appeared-0.060
 come-0.056
 seen-0.056
Token always
Token ID1464
Feature activation-4.56e-03

Pos logit contributions
igs+0.065
apy+0.060
lo+0.059
ug+0.058
 gut+0.057
Neg logit contributions
 come-0.044
 written-0.044
 source-0.038
 calling-0.038
 comm-0.036
Token had
Token ID550
Feature activation+2.67e-02

Pos logit contributions
igs+0.050
 gut+0.042
aro+0.041
apy+0.041
ug+0.040
Neg logit contributions
 written-0.045
Gu-0.040
 comm-0.040
 happened-0.039
 aback-0.039
Token a
Token ID257
Feature activation-1.73e-02

Pos logit contributions
 aback+0.042
 comm+0.038
oph+0.038
 written+0.038
Gu+0.038
Neg logit contributions
 often-0.038
igs-0.035
 lived-0.034
ot-0.033
 solar-0.032
Token toy
Token ID13373
Feature activation-7.17e-02

Pos logit contributions
igs+0.025
 often+0.022
ug+0.021
ot+0.020
aro+0.020
Neg logit contributions
 written-0.026
 comm-0.026
 bisc-0.025
Gu-0.025
 source-0.025
Token he
Token ID339
Feature activation-4.19e-02

Pos logit contributions
igs+0.133
ot+0.110
lo+0.110
 often+0.110
ts+0.109
Neg logit contributions
 written-0.080
 source-0.074
 saying-0.073
 bisc-0.072
 appeared-0.070
Token loved
Token ID6151
Feature activation-3.66e-02

Pos logit contributions
igs+0.061
apy+0.051
lights+0.050
 gut+0.050
ug+0.049
Neg logit contributions
 written-0.073
Gu-0.064
 baked-0.062
 come-0.060
 meant-0.059
Token very
Token ID845
Feature activation-1.09e-01

Pos logit contributions
igs+0.090
apy+0.082
oppy+0.081
ot+0.080
 gut+0.079
Neg logit contributions
 written-0.026
 saying-0.019
 coming-0.018
 calling-0.017
 telling-0.017
Token much
Token ID881
Feature activation-3.50e-02

Pos logit contributions
stack+0.218
 gut+0.211
 solar+0.211
 prope+0.210
 mast+0.206
Neg logit contributions
 news-0.063
 written-0.055
cake-0.049
 supposed-0.048
t-0.047
Token.
Token ID13
Feature activation-1.62e-02

Pos logit contributions
igs+0.060
apy+0.054
ot+0.052
 solar+0.050
ug+0.049
Neg logit contributions
 written-0.049
 aback-0.044
 comm-0.042
 source-0.040
 saying-0.040
Token The
Token ID383
Feature activation-5.85e-03

Pos logit contributions
igs+0.023
 often+0.020
ind+0.019
 gut+0.019
ot+0.019
Neg logit contributions
 written-0.025
Gu-0.025
 aback-0.025
 bisc-0.023
 source-0.023
Token toy
Token ID13373
Feature activation-4.28e-02

Pos logit contributions
igs+0.008
 often+0.008
ot+0.007
ug+0.007
 gut+0.007
Neg logit contributions
 written-0.009
Gu-0.009
 comm-0.009
 aback-0.009
 source-0.009
Token was
Token ID373
Feature activation-2.34e-02

Pos logit contributions
igs+0.091
ot+0.077
 gut+0.077
 often+0.076
ind+0.075
Neg logit contributions
 written-0.043
 appeared-0.038
 happened-0.035
 meant-0.033
 comm-0.033
Token loved
Token ID6151
Feature activation-1.61e-03

Pos logit contributions
igs+0.054
 gut+0.045
aro+0.044
ug+0.044
 goals+0.043
Neg logit contributions
 written-0.048
Gu-0.045
 happened-0.045
 comm-0.044
 baked-0.043
Token to
Token ID284
Feature activation-3.01e-02

Pos logit contributions
igs+0.003
 often+0.003
ot+0.003
 solar+0.003
 gut+0.003
Neg logit contributions
 written-0.002
Gu-0.002
 aback-0.002
 comm-0.002
 source-0.001
Token ride
Token ID6594
Feature activation-1.89e-02

Pos logit contributions
igs+0.052
 often+0.049
ind+0.049
 gut+0.048
ug+0.048
Neg logit contributions
 come-0.036
 bisc-0.034
 written-0.033
Gu-0.032
 Birthday-0.031
Token his
Token ID465
Feature activation-1.46e-02

Pos logit contributions
igs+0.022
 often+0.020
 gut+0.019
ot+0.019
 solar+0.019
Neg logit contributions
 written-0.034
 aback-0.031
 comm-0.030
 source-0.029
 bisc-0.029
Token bike
Token ID7161
Feature activation-3.43e-02

Pos logit contributions
igs+0.018
 often+0.017
ot+0.016
ug+0.015
 solar+0.015
Neg logit contributions
 written-0.024
 comm-0.023
 source-0.023
 bisc-0.022
Gu-0.022
Token very
Token ID845
Feature activation-1.09e-01

Pos logit contributions
igs+0.062
apy+0.059
 gut+0.057
ot+0.055
 solar+0.054
Neg logit contributions
 written-0.041
 source-0.037
 aback-0.034
 come-0.033
hered-0.032
Token fast
Token ID3049
Feature activation-4.62e-02

Pos logit contributions
ocol+0.211
 lived+0.196
 downs+0.194
 solar+0.193
 Acc+0.191
Neg logit contributions
 so-0.075
 right-0.071
t-0.068
 sure-0.065
 out-0.064
Token.
Token ID13
Feature activation-1.51e-02

Pos logit contributions
igs+0.085
apy+0.078
oppy+0.073
ot+0.072
aro+0.072
Neg logit contributions
 written-0.058
 aback-0.053
 source-0.049
hered-0.048
 comm-0.048
Token One
Token ID1881
Feature activation-2.29e-02

Pos logit contributions
igs+0.019
 often+0.016
 gut+0.015
ot+0.015
 solar+0.014
Neg logit contributions
Gu-0.026
 written-0.026
 aback-0.025
 source-0.025
 bisc-0.024
Token day
Token ID1110
Feature activation-1.87e-02

Pos logit contributions
igs+0.039
 often+0.032
 solar+0.032
 gut+0.031
aro+0.030
Neg logit contributions
 written-0.031
Gu-0.029
 source-0.028
 bisc-0.028
 comm-0.027
Token,
Token ID11
Feature activation-9.67e-03

Pos logit contributions
igs+0.027
 often+0.026
 solar+0.024
ind+0.023
 gut+0.023
Neg logit contributions
 written-0.028
hered-0.027
 aback-0.027
Gu-0.025
 appeared-0.025
Token toy
Token ID13373
Feature activation-5.61e-02

Pos logit contributions
igs+0.004
 often+0.004
ot+0.004
ug+0.003
 solar+0.003
Neg logit contributions
 written-0.005
 comm-0.005
Gu-0.005
 aback-0.005
 source-0.005
Token bunny
Token ID44915
Feature activation-4.86e-02

Pos logit contributions
igs+0.099
 often+0.086
 gut+0.082
ot+0.082
 goals+0.081
Neg logit contributions
 written-0.067
 bisc-0.061
 saying-0.060
 appeared-0.060
 source-0.059
Token that
Token ID326
Feature activation-4.46e-02

Pos logit contributions
igs+0.078
lo+0.073
 gut+0.069
 goals+0.068
lights+0.067
Neg logit contributions
 written-0.065
 bisc-0.060
 appeared-0.059
hered-0.055
 saying-0.055
Token she
Token ID673
Feature activation-4.55e-02

Pos logit contributions
igs+0.086
ug+0.080
 goals+0.076
ind+0.076
 gut+0.074
Neg logit contributions
 written-0.047
Gu-0.046
 appeared-0.045
 happened-0.045
 come-0.044
Token loved
Token ID6151
Feature activation-3.26e-02

Pos logit contributions
igs+0.067
lights+0.059
ug+0.057
 gut+0.057
 rocky+0.055
Neg logit contributions
 written-0.069
Gu-0.066
 baked-0.065
 appeared-0.063
 come-0.062
Token very
Token ID845
Feature activation-1.09e-01

Pos logit contributions
igs+0.077
ug+0.071
apy+0.070
 gut+0.070
belt+0.070
Neg logit contributions
 written-0.022
 saying-0.017
 calling-0.016
 comm-0.016
 such-0.015
Token much
Token ID881
Feature activation-3.36e-02

Pos logit contributions
 gut+0.224
 mast+0.223
belt+0.220
stack+0.215
 prope+0.214
Neg logit contributions
umbs-0.058
 news-0.044
 written-0.043
cake-0.043
 sure-0.043
Token.
Token ID13
Feature activation-1.99e-02

Pos logit contributions
igs+0.058
apy+0.052
ot+0.051
ug+0.048
belt+0.048
Neg logit contributions
 written-0.047
 aback-0.041
 comm-0.040
 calling-0.038
 saying-0.038
Token The
Token ID383
Feature activation-2.98e-03

Pos logit contributions
igs+0.029
ug+0.025
 often+0.024
 gut+0.024
ind+0.024
Neg logit contributions
Gu-0.031
 written-0.029
 aback-0.029
 How-0.028
 bisc-0.028
Token bunny
Token ID44915
Feature activation-4.32e-02

Pos logit contributions
igs+0.004
 often+0.004
ot+0.004
 solar+0.004
ug+0.004
Neg logit contributions
 written-0.004
 aback-0.004
Gu-0.004
 comm-0.004
 source-0.004
Token belonged
Token ID19611
Feature activation-9.29e-03

Pos logit contributions
igs+0.089
ug+0.081
 gut+0.081
lo+0.080
 solar+0.077
Neg logit contributions
 appeared-0.038
 written-0.037
 revealed-0.033
 come-0.032
Gu-0.032
Token toy
Token ID13373
Feature activation-7.26e-02

Pos logit contributions
igs+0.036
 often+0.032
ug+0.032
aro+0.030
lo+0.030
Neg logit contributions
 written-0.034
 comm-0.034
 bisc-0.034
 source-0.033
Gu-0.033
Token fork
Token ID15563
Feature activation-3.95e-02

Pos logit contributions
igs+0.133
 often+0.111
ot+0.110
lo+0.109
ts+0.109
Neg logit contributions
 written-0.081
 source-0.074
 saying-0.073
 bisc-0.072
 appeared-0.072
Token that
Token ID326
Feature activation-4.13e-02

Pos logit contributions
igs+0.073
 gut+0.062
ot+0.061
 solar+0.060
lo+0.060
Neg logit contributions
 written-0.052
 appeared-0.042
 source-0.041
 come-0.040
 saying-0.040
Token he
Token ID339
Feature activation-5.46e-02

Pos logit contributions
igs+0.068
ug+0.060
ind+0.059
 gut+0.055
 goals+0.055
Neg logit contributions
 written-0.062
 happened-0.055
 come-0.054
Gu-0.053
 baked-0.053
Token loved
Token ID6151
Feature activation-3.75e-02

Pos logit contributions
igs+0.083
lights+0.074
 gut+0.073
ug+0.072
aro+0.071
Neg logit contributions
 written-0.085
 baked-0.078
Gu-0.073
 happened-0.071
 appeared-0.070
Token very
Token ID845
Feature activation-1.09e-01

Pos logit contributions
igs+0.087
apy+0.080
ug+0.079
 gut+0.079
ot+0.078
Neg logit contributions
 written-0.029
 saying-0.021
 coming-0.020
 such-0.020
 come-0.020
Token much
Token ID881
Feature activation-3.22e-02

Pos logit contributions
stack+0.235
 prope+0.215
 gut+0.210
 mast+0.210
ifully+0.209
Neg logit contributions
 news-0.047
 sure-0.045
 written-0.042
 supposed-0.032
t-0.032
Token.
Token ID13
Feature activation-2.34e-02

Pos logit contributions
igs+0.054
apy+0.048
ot+0.047
 solar+0.044
ug+0.043
Neg logit contributions
 written-0.046
 aback-0.042
 comm-0.040
 source-0.038
 such-0.038
Token Tim
Token ID5045
Feature activation-3.69e-02

Pos logit contributions
igs+0.036
ind+0.031
 often+0.031
 gut+0.030
ot+0.030
Neg logit contributions
Gu-0.034
 aback-0.033
 written-0.033
 How-0.032
 bisc-0.031
Token's
Token ID338
Feature activation-1.07e-02

Pos logit contributions
igs+0.066
apy+0.061
aro+0.060
 gut+0.059
ug+0.055
Neg logit contributions
 written-0.047
 baked-0.042
my-0.041
Gu-0.041
 meant-0.041
Token fork
Token ID15563
Feature activation-4.65e-02

Pos logit contributions
igs+0.014
 often+0.013
ug+0.012
ot+0.012
 solar+0.011
Neg logit contributions
 written-0.018
 source-0.017
 comm-0.017
 aback-0.017
Gu-0.016
Token She
Token ID1375
Feature activation-3.55e-02

Pos logit contributions
igs+0.014
 often+0.012
ug+0.011
apy+0.011
 solar+0.011
Neg logit contributions
 written-0.011
 aback-0.011
Gu-0.010
 source-0.010
 comm-0.010
Token ran
Token ID4966
Feature activation+3.45e-03

Pos logit contributions
igs+0.067
 gut+0.063
apy+0.061
 solar+0.060
 goals+0.059
Neg logit contributions
 written-0.036
Gu-0.032
 comm-0.031
 come-0.030
 appeared-0.030
Token up
Token ID510
Feature activation+4.18e-03

Pos logit contributions
 written+0.004
 aback+0.004
Gu+0.004
 comm+0.004
 source+0.004
Neg logit contributions
igs-0.006
 often-0.006
ot-0.005
 solar-0.005
 lived-0.005
Token the
Token ID262
Feature activation-1.84e-02

Pos logit contributions
 written+0.005
 aback+0.005
Gu+0.005
 comm+0.005
 source+0.005
Neg logit contributions
igs-0.007
 often-0.007
ot-0.006
 solar-0.006
 lived-0.006
Token stairs
Token ID16046
Feature activation-2.41e-02

Pos logit contributions
igs+0.021
 often+0.019
ot+0.018
apy+0.018
 lived+0.017
Neg logit contributions
 source-0.033
 written-0.032
 comm-0.031
Gu-0.030
 aback-0.030
Token very
Token ID845
Feature activation-1.08e-01

Pos logit contributions
igs+0.048
 solar+0.042
 lived+0.042
ot+0.041
 gut+0.041
Neg logit contributions
 written-0.024
 aback-0.024
 source-0.022
hered-0.020
 calling-0.020
Token quickly
Token ID2952
Feature activation-3.31e-02

Pos logit contributions
ocol+0.218
 lived+0.201
 solar+0.197
rum+0.188
 microsc+0.187
Neg logit contributions
 right-0.084
 up-0.081
 source-0.081
 so-0.074
 out-0.071
Token,
Token ID11
Feature activation+1.57e-02

Pos logit contributions
igs+0.062
 often+0.056
apy+0.056
ot+0.055
 solar+0.055
Neg logit contributions
 written-0.036
 aback-0.032
 source-0.032
 calling-0.032
 comm-0.031
Token but
Token ID475
Feature activation-2.69e-02

Pos logit contributions
 written+0.018
Gu+0.017
 aback+0.017
 comm+0.016
 bisc+0.015
Neg logit contributions
 often-0.029
igs-0.028
ot-0.025
 solar-0.025
 lived-0.025
Token her
Token ID607
Feature activation-2.91e-02

Pos logit contributions
igs+0.049
 solar+0.044
ot+0.042
ind+0.041
ug+0.041
Neg logit contributions
 written-0.033
 aback-0.029
 source-0.028
 calling-0.028
 appeared-0.027
Token mom
Token ID1995
Feature activation+3.35e-02

Pos logit contributions
igs+0.041
 often+0.037
ot+0.037
ug+0.036
aro+0.035
Neg logit contributions
 source-0.042
 comm-0.042
 written-0.041
 Birthday-0.039
 surprise-0.038
Token toy
Token ID13373
Feature activation-7.31e-02

Pos logit contributions
igs+0.032
 often+0.029
ug+0.028
aro+0.027
ot+0.027
Neg logit contributions
 written-0.032
 comm-0.031
 bisc-0.031
 source-0.031
Gu-0.030
Token car
Token ID1097
Feature activation-4.08e-02

Pos logit contributions
igs+0.134
 often+0.113
lo+0.112
ot+0.112
ts+0.110
Neg logit contributions
 written-0.081
 source-0.075
 saying-0.075
 bisc-0.074
 appeared-0.073
Token that
Token ID326
Feature activation-5.06e-02

Pos logit contributions
igs+0.082
 gut+0.071
apy+0.069
 goals+0.066
oppy+0.066
Neg logit contributions
 written-0.050
 come-0.040
 source-0.039
 calling-0.039
 saying-0.038
Token he
Token ID339
Feature activation-5.51e-02

Pos logit contributions
igs+0.090
ind+0.080
 goals+0.075
ug+0.075
 gut+0.074
Neg logit contributions
 written-0.067
 happened-0.063
 come-0.062
 meant-0.059
Gu-0.059
Token loved
Token ID6151
Feature activation-3.66e-02

Pos logit contributions
igs+0.086
 gut+0.073
apy+0.070
 rocky+0.070
lights+0.070
Neg logit contributions
 written-0.082
 baked-0.075
Gu-0.074
 come-0.070
 been-0.070
Token very
Token ID845
Feature activation-1.08e-01

Pos logit contributions
igs+0.091
oppy+0.086
apy+0.083
 gut+0.082
ot+0.081
Neg logit contributions
 written-0.025
 saying-0.018
 coming-0.018
 come-0.016
 calling-0.016
Token much
Token ID881
Feature activation-3.80e-02

Pos logit contributions
stack+0.219
 gut+0.215
 mast+0.204
 solar+0.202
ifully+0.200
Neg logit contributions
 written-0.052
 news-0.050
 sure-0.045
 supposed-0.044
umbs-0.040
Token.
Token ID13
Feature activation-2.68e-02

Pos logit contributions
igs+0.068
apy+0.062
ot+0.060
oppy+0.056
 solar+0.056
Neg logit contributions
 written-0.050
 aback-0.043
 comm-0.042
 saying-0.040
 calling-0.040
Token It
Token ID632
Feature activation-3.10e-02

Pos logit contributions
igs+0.042
 often+0.037
ind+0.037
ot+0.036
 gut+0.036
Neg logit contributions
Gu-0.038
 aback-0.036
 written-0.036
 How-0.035
 bisc-0.034
Token was
Token ID373
Feature activation-1.41e-02

Pos logit contributions
igs+0.073
 gut+0.064
 goals+0.061
ug+0.061
ind+0.060
Neg logit contributions
 written-0.024
Gu-0.024
 come-0.022
 meant-0.021
 appeared-0.020
Token his
Token ID465
Feature activation-2.01e-02

Pos logit contributions
igs+0.023
 often+0.021
 gut+0.020
ot+0.020
 solar+0.020
Neg logit contributions
 written-0.020
 source-0.017
Gu-0.017
 aback-0.017
 comm-0.017
Token a
Token ID257
Feature activation-4.13e-02

Pos logit contributions
 aback+0.052
 comm+0.050
 written+0.049
 begs+0.049
Gu+0.048
Neg logit contributions
 often-0.046
igs-0.043
 lived-0.041
ot-0.040
 solar-0.038
Token small
Token ID1402
Feature activation-3.60e-02

Pos logit contributions
igs+0.076
 often+0.067
ug+0.065
belt+0.064
aro+0.064
Neg logit contributions
Gu-0.046
 comm-0.044
 written-0.043
 bisc-0.043
 source-0.041
Token bird
Token ID6512
Feature activation-4.40e-02

Pos logit contributions
igs+0.073
 often+0.067
apy+0.063
 gut+0.063
belt+0.062
Neg logit contributions
 written-0.031
 comm-0.031
 bisc-0.029
 source-0.029
Gu-0.028
Token called
Token ID1444
Feature activation+1.01e-03

Pos logit contributions
igs+0.064
 gut+0.058
 goals+0.056
belt+0.054
apy+0.053
Neg logit contributions
 written-0.063
 calling-0.061
 source-0.060
 appeared-0.059
 come-0.058
Token Fl
Token ID1610
Feature activation-2.47e-02

Pos logit contributions
 written+0.001
 aback+0.001
Gu+0.001
 comm+0.001
 source+0.001
Neg logit contributions
igs-0.002
 often-0.002
ot-0.001
 solar-0.001
 lived-0.001
Tokenappy
Token ID7774
Feature activation-1.08e-01

Pos logit contributions
 often+0.035
igs+0.034
 solar+0.030
nam+0.029
 gut+0.029
Neg logit contributions
hered-0.035
 written-0.034
 source-0.033
Gu-0.033
 bisc-0.032
Token.
Token ID13
Feature activation-5.31e-02

Pos logit contributions
igs+0.257
belt+0.238
apy+0.236
 gut+0.232
agons+0.226
Neg logit contributions
 once-0.051
 appeared-0.050
's-0.048
 written-0.044
 come-0.042
Token Fl
Token ID1610
Feature activation-2.52e-02

Pos logit contributions
igs+0.095
 gut+0.087
ind+0.085
ot+0.084
belt+0.082
Neg logit contributions
 How-0.061
Gu-0.061
 aback-0.056
 Would-0.055
 bisc-0.055
Tokenappy
Token ID7774
Feature activation-8.01e-02

Pos logit contributions
 often+0.028
igs+0.025
 gut+0.022
belt+0.021
 solar+0.021
Neg logit contributions
 written-0.043
hered-0.043
 aback-0.042
Gu-0.042
elt-0.041
Token was
Token ID373
Feature activation+3.20e-04

Pos logit contributions
igs+0.141
 gut+0.134
belt+0.125
nam+0.124
apy+0.123
Neg logit contributions
 written-0.099
 happened-0.096
 appeared-0.095
's-0.090
 meant-0.090
Token always
Token ID1464
Feature activation+1.21e-02

Pos logit contributions
 written+0.001
 aback+0.001
Gu+0.001
 comm+0.001
 source+0.000
Neg logit contributions
igs-0.000
 often-0.000
ot-0.000
 solar-0.000
 gut-0.000
TokenL
Token ID43
Feature activation-4.16e-04

Pos logit contributions
igs+0.050
 often+0.044
 solar+0.043
 gut+0.042
 toes+0.040
Neg logit contributions
Gu-0.040
 written-0.038
Sur-0.037
 comm-0.035
 aback-0.035
Tokenily
Token ID813
Feature activation+1.11e-03

Pos logit contributions
igs+0.001
 often+0.001
 solar+0.001
ot+0.001
 gut+0.001
Neg logit contributions
 written-0.001
 aback-0.001
Gu-0.001
 comm-0.001
 source-0.001
Token learned
Token ID4499
Feature activation-1.42e-02

Pos logit contributions
 written+0.001
 aback+0.001
 comm+0.001
Gu+0.001
 source+0.001
Neg logit contributions
igs-0.002
 often-0.002
ot-0.002
 solar-0.002
 gut-0.002
Token to
Token ID284
Feature activation-2.61e-02

Pos logit contributions
igs+0.024
 often+0.023
 solar+0.022
ot+0.021
ug+0.021
Neg logit contributions
 written-0.019
 comm-0.017
 aback-0.017
 source-0.016
 calling-0.016
Token sail
Token ID14936
Feature activation-1.17e-02

Pos logit contributions
igs+0.047
 solar+0.044
apy+0.042
ug+0.041
 often+0.041
Neg logit contributions
 written-0.028
 come-0.027
 comm-0.027
 aback-0.027
 calling-0.026
Token very
Token ID845
Feature activation-1.08e-01

Pos logit contributions
igs+0.019
 often+0.017
 solar+0.016
ot+0.016
 gut+0.015
Neg logit contributions
 written-0.016
 aback-0.016
 comm-0.014
 source-0.014
Gu-0.014
Token well
Token ID880
Feature activation-5.02e-02

Pos logit contributions
 ig+0.176
 chim+0.168
 solar+0.164
 throne+0.159
 ships+0.152
Neg logit contributions
 written-0.109
 right-0.101
 sure-0.100
 just-0.098
 once-0.097
Token.
Token ID13
Feature activation-2.34e-02

Pos logit contributions
igs+0.088
 solar+0.078
apy+0.077
 gut+0.075
ot+0.074
Neg logit contributions
 written-0.061
 aback-0.056
 comm-0.050
 source-0.050
hered-0.050
Token The
Token ID383
Feature activation-2.42e-02

Pos logit contributions
igs+0.043
 often+0.038
ind+0.036
aro+0.036
ts+0.035
Neg logit contributions
 written-0.028
 aback-0.028
Gu-0.027
 How-0.025
 comm-0.024
Token boat
Token ID8848
Feature activation-7.80e-03

Pos logit contributions
igs+0.036
 often+0.034
ot+0.031
apy+0.030
ug+0.029
Neg logit contributions
 written-0.036
 source-0.035
 comm-0.035
Gu-0.032
 bisc-0.032
Token was
Token ID373
Feature activation-5.25e-03

Pos logit contributions
igs+0.013
 often+0.012
ot+0.011
 solar+0.011
 gut+0.011
Neg logit contributions
 written-0.010
 aback-0.010
 comm-0.009
Gu-0.009
 source-0.009
Token,
Token ID11
Feature activation+7.94e-03

Pos logit contributions
igs+0.048
 often+0.043
 solar+0.042
apy+0.042
nam+0.042
Neg logit contributions
 written-0.034
 aback-0.032
hered-0.032
 comm-0.032
 appeared-0.031
Token how
Token ID703
Feature activation-4.12e-02

Pos logit contributions
 written+0.011
 aback+0.010
Gu+0.010
 comm+0.010
 source+0.010
Neg logit contributions
igs-0.012
 often-0.012
ot-0.011
 solar-0.011
 lived-0.011
Token do
Token ID466
Feature activation-2.29e-02

Pos logit contributions
igs+0.072
ind+0.064
 solar+0.063
 often+0.062
aro+0.061
Neg logit contributions
 come-0.048
 written-0.048
 happened-0.043
 news-0.042
 aback-0.042
Token you
Token ID345
Feature activation-1.14e-02

Pos logit contributions
igs+0.043
 often+0.041
 solar+0.040
ot+0.040
 gut+0.038
Neg logit contributions
Gu-0.022
 written-0.022
 comm-0.021
 come-0.021
 aback-0.019
Token zip
Token ID19974
Feature activation-4.33e-02

Pos logit contributions
igs+0.019
 often+0.018
 solar+0.017
ot+0.017
apy+0.017
Neg logit contributions
 written-0.014
 aback-0.014
 comm-0.013
 source-0.013
Gu-0.013
Token so
Token ID523
Feature activation-1.08e-01

Pos logit contributions
igs+0.080
 often+0.077
 solar+0.073
apy+0.073
ot+0.073
Neg logit contributions
 aback-0.045
 written-0.042
 source-0.039
 come-0.037
 such-0.036
Token fast
Token ID3049
Feature activation-1.54e-02

Pos logit contributions
ocol+0.199
 solar+0.199
alks+0.189
alk+0.188
mo+0.185
Neg logit contributions
 come-0.089
 right-0.087
It-0.080
 won-0.077
 sure-0.077
Token?"
Token ID1701
Feature activation-1.89e-02

Pos logit contributions
igs+0.025
 often+0.024
ot+0.022
 solar+0.022
 gut+0.021
Neg logit contributions
 written-0.020
 aback-0.019
 comm-0.019
Gu-0.018
 source-0.018
Token\n
Token ID198
Feature activation-1.14e-02

Pos logit contributions
igs+0.033
 often+0.030
 solar+0.029
ind+0.029
ug+0.028
Neg logit contributions
 written-0.023
Gu-0.023
 comm-0.023
 aback-0.023
 begs-0.020
Token\n
Token ID198
Feature activation-4.40e-02

Pos logit contributions
igs+0.022
 often+0.021
 solar+0.020
 gut+0.020
ot+0.020
Neg logit contributions
 written-0.011
Gu-0.011
 aback-0.011
 comm-0.010
 source-0.009
TokenB
Token ID33
Feature activation-2.13e-02

Pos logit contributions
igs+0.087
 goals+0.080
 solar+0.079
 ig+0.078
ug+0.078
Neg logit contributions
Gu-0.048
Sur-0.044
hered-0.037
 comm-0.036
Jennifer-0.035
Token blue
Token ID4171
Feature activation-1.91e-02

Pos logit contributions
igs+0.030
 often+0.027
ug+0.026
aro+0.025
ot+0.025
Neg logit contributions
 written-0.031
 comm-0.030
 bisc-0.030
 source-0.029
Gu-0.029
Token sweater
Token ID35685
Feature activation-3.80e-02

Pos logit contributions
igs+0.031
 often+0.029
ot+0.026
ug+0.026
 gut+0.026
Neg logit contributions
 written-0.025
 bisc-0.024
 source-0.024
Gu-0.023
 comm-0.023
Token that
Token ID326
Feature activation-4.72e-02

Pos logit contributions
igs+0.074
 gut+0.061
ot+0.060
 goals+0.060
 solar+0.059
Neg logit contributions
 written-0.049
 source-0.041
cake-0.041
 Birthday-0.038
 come-0.038
Token he
Token ID339
Feature activation-5.90e-02

Pos logit contributions
igs+0.077
ind+0.065
ug+0.065
 goals+0.065
 gut+0.064
Neg logit contributions
 written-0.068
 come-0.062
 happened-0.061
 bisc-0.060
 meant-0.059
Token loved
Token ID6151
Feature activation-3.89e-02

Pos logit contributions
igs+0.089
 gut+0.078
ug+0.078
 goals+0.076
lights+0.075
Neg logit contributions
 written-0.087
 baked-0.081
Gu-0.078
 come-0.076
 appeared-0.075
Token very
Token ID845
Feature activation-1.08e-01

Pos logit contributions
igs+0.093
 gut+0.085
oppy+0.085
apy+0.083
ubby+0.082
Neg logit contributions
 written-0.027
 saying-0.022
 come-0.021
 coming-0.020
 telling-0.019
Token much
Token ID881
Feature activation-4.11e-02

Pos logit contributions
stack+0.220
 gut+0.216
ifully+0.206
 mast+0.205
 prope+0.205
Neg logit contributions
 news-0.049
 sure-0.046
 written-0.045
 supposed-0.040
umbs-0.038
Token.
Token ID13
Feature activation-2.01e-02

Pos logit contributions
igs+0.075
apy+0.068
ot+0.066
 solar+0.062
oppy+0.062
Neg logit contributions
 written-0.054
 aback-0.045
 such-0.044
 comm-0.044
 saying-0.043
Token Tim
Token ID5045
Feature activation-3.50e-02

Pos logit contributions
igs+0.030
 often+0.027
ind+0.026
 gut+0.026
ot+0.026
Neg logit contributions
Gu-0.030
 written-0.028
 aback-0.028
 bisc-0.027
 How-0.027
Token wore
Token ID12408
Feature activation-1.47e-02

Pos logit contributions
igs+0.058
apy+0.053
 gut+0.052
aro+0.051
 solar+0.049
Neg logit contributions
 written-0.046
 baked-0.042
Gu-0.042
my-0.042
 seen-0.041
Token his
Token ID465
Feature activation-2.06e-02

Pos logit contributions
igs+0.024
 often+0.023
ot+0.021
 solar+0.021
ug+0.021
Neg logit contributions
 written-0.020
 aback-0.018
Gu-0.018
 source-0.017
 comm-0.017
Token toy
Token ID13373
Feature activation-6.94e-02

Pos logit contributions
igs+0.018
 often+0.017
ug+0.015
ot+0.015
 gut+0.014
Neg logit contributions
 written-0.021
 comm-0.020
Gu-0.020
 bisc-0.019
 source-0.019
Token car
Token ID1097
Feature activation-3.86e-02

Pos logit contributions
igs+0.126
 often+0.104
lo+0.104
 gut+0.103
ot+0.103
Neg logit contributions
 written-0.080
 source-0.074
 bisc-0.073
 saying-0.073
 appeared-0.071
Token that
Token ID326
Feature activation-4.43e-02

Pos logit contributions
igs+0.074
 gut+0.065
apy+0.061
 solar+0.060
 goals+0.059
Neg logit contributions
 written-0.050
 come-0.040
 source-0.040
 calling-0.039
 saying-0.039
Token he
Token ID339
Feature activation-6.11e-02

Pos logit contributions
igs+0.076
ind+0.067
 gut+0.064
ug+0.063
charged+0.060
Neg logit contributions
 written-0.064
 happened-0.058
 come-0.057
 meant-0.056
Gu-0.055
Token loved
Token ID6151
Feature activation-4.40e-02

Pos logit contributions
igs+0.098
 gut+0.087
lights+0.083
 rocky+0.082
apy+0.082
Neg logit contributions
 written-0.087
 baked-0.081
 prevented-0.076
Gu-0.076
 meant-0.075
Token very
Token ID845
Feature activation-1.07e-01

Pos logit contributions
igs+0.113
oppy+0.105
apy+0.102
 gut+0.100
ubby+0.100
Neg logit contributions
 written-0.029
 coming-0.021
 saying-0.020
 come-0.020
 telling-0.018
Token much
Token ID881
Feature activation-3.86e-02

Pos logit contributions
stack+0.218
 gut+0.209
 solar+0.204
 mast+0.199
 prope+0.199
Neg logit contributions
 news-0.060
 written-0.055
cake-0.049
 sure-0.049
umbs-0.047
Token.
Token ID13
Feature activation-1.53e-02

Pos logit contributions
igs+0.068
apy+0.060
ot+0.058
 solar+0.056
oppy+0.054
Neg logit contributions
 written-0.053
 aback-0.045
 comm-0.044
 such-0.044
cake-0.042
Token It
Token ID632
Feature activation-3.59e-02

Pos logit contributions
igs+0.021
 often+0.019
ind+0.018
ot+0.018
 gut+0.018
Neg logit contributions
Gu-0.024
 written-0.024
 aback-0.024
 source-0.022
 bisc-0.022
Token was
Token ID373
Feature activation-1.37e-02

Pos logit contributions
igs+0.087
 gut+0.078
 goals+0.074
ind+0.073
ug+0.072
Neg logit contributions
 written-0.026
Gu-0.024
 meant-0.024
 come-0.024
 appeared-0.022
Token red
Token ID2266
Feature activation-1.29e-02

Pos logit contributions
igs+0.023
 often+0.021
 gut+0.020
 solar+0.020
ot+0.020
Neg logit contributions
 written-0.018
 source-0.016
Gu-0.016
 aback-0.015
 comm-0.015
Token toy
Token ID13373
Feature activation-6.95e-02

Pos logit contributions
igs+0.027
 often+0.024
ug+0.023
ot+0.022
aro+0.022
Neg logit contributions
 written-0.028
 comm-0.027
 bisc-0.027
Gu-0.026
 source-0.026
Token shield
Token ID7614
Feature activation-1.70e-02

Pos logit contributions
igs+0.124
 often+0.106
lo+0.103
ot+0.103
 gut+0.102
Neg logit contributions
 written-0.079
 bisc-0.073
 saying-0.072
 source-0.072
 appeared-0.071
Token that
Token ID326
Feature activation-4.49e-02

Pos logit contributions
igs+0.027
 often+0.024
ot+0.023
apy+0.022
 gut+0.022
Neg logit contributions
 written-0.025
 source-0.022
 aback-0.022
 comm-0.022
hered-0.021
Token he
Token ID339
Feature activation-5.81e-02

Pos logit contributions
igs+0.071
ind+0.065
 goals+0.061
ug+0.061
 gut+0.060
Neg logit contributions
 written-0.063
 come-0.059
 happened-0.058
 appeared-0.057
 meant-0.056
Token loved
Token ID6151
Feature activation-3.40e-02

Pos logit contributions
igs+0.085
 goals+0.080
 gut+0.079
lights+0.078
apy+0.077
Neg logit contributions
 written-0.082
 baked-0.080
 prevented-0.073
 seen-0.073
 appeared-0.073
Token very
Token ID845
Feature activation-1.07e-01

Pos logit contributions
igs+0.083
apy+0.078
oppy+0.078
ot+0.078
 gut+0.077
Neg logit contributions
 written-0.020
 coming-0.013
 saying-0.013
 come-0.012
 such-0.012
Token much
Token ID881
Feature activation-3.29e-02

Pos logit contributions
stack+0.220
 gut+0.213
 solar+0.206
 prope+0.204
 wrists+0.203
Neg logit contributions
 sure-0.042
 news-0.040
 written-0.039
 supposed-0.039
t-0.036
Token.
Token ID13
Feature activation-2.59e-02

Pos logit contributions
igs+0.056
apy+0.051
ot+0.051
 solar+0.047
oppy+0.046
Neg logit contributions
 written-0.046
 aback-0.041
 comm-0.039
 source-0.037
 calling-0.037
Token He
Token ID679
Feature activation-5.90e-02

Pos logit contributions
igs+0.040
 often+0.035
ind+0.035
ot+0.034
 gut+0.034
Neg logit contributions
Gu-0.037
 aback-0.035
 written-0.035
 How-0.035
 bisc-0.034
Token would
Token ID561
Feature activation-3.92e-02

Pos logit contributions
igs+0.107
 gut+0.102
apy+0.101
lo+0.100
aro+0.098
Neg logit contributions
 written-0.065
 baked-0.065
 brought-0.060
 appeared-0.059
 been-0.059
Token play
Token ID711
Feature activation+1.69e-02

Pos logit contributions
igs+0.071
lo+0.065
ug+0.063
apy+0.062
 gut+0.062
Neg logit contributions
 come-0.045
 written-0.044
 source-0.038
 calling-0.037
 smell-0.036
Token and
Token ID290
Feature activation+2.67e-02

Pos logit contributions
Gu+0.041
 comm+0.041
 written+0.038
 aback+0.038
 funn+0.036
Neg logit contributions
 often-0.036
igs-0.036
ot-0.032
 tight-0.030
 solar-0.030
Token saw
Token ID2497
Feature activation+7.83e-03

Pos logit contributions
Gu+0.035
 comm+0.035
 aback+0.034
umbs+0.034
 bisc+0.034
Neg logit contributions
 often-0.041
igs-0.039
 lived-0.038
ot-0.037
 solar-0.035
Token an
Token ID281
Feature activation-3.01e-02

Pos logit contributions
 written+0.012
 aback+0.012
Gu+0.012
 comm+0.012
hered+0.011
Neg logit contributions
igs-0.011
 often-0.010
ot-0.009
 solar-0.009
 lived-0.009
Token even
Token ID772
Feature activation+2.67e-02

Pos logit contributions
igs+0.042
 often+0.037
apy+0.036
ot+0.035
 solar+0.033
Neg logit contributions
 written-0.051
 comm-0.048
 aback-0.046
 source-0.046
Gu-0.044
Token bigger
Token ID5749
Feature activation-1.40e-03

Pos logit contributions
hered+0.039
 aback+0.039
Gu+0.039
 written+0.038
oph+0.038
Neg logit contributions
 often-0.037
igs-0.036
ot-0.031
 dusty-0.031
 solar-0.031
Token surprise
Token ID5975
Feature activation+5.35e-02

Pos logit contributions
igs+0.002
 often+0.002
ot+0.002
 solar+0.002
 lived+0.002
Neg logit contributions
 written-0.002
 aback-0.002
 comm-0.002
Gu-0.002
 source-0.002
Token.
Token ID13
Feature activation+2.27e-02

Pos logit contributions
 aback+0.111
Gu+0.110
arre+0.107
 comm+0.103
umbs+0.100
Neg logit contributions
 often-0.050
 lived-0.041
ot-0.040
igs-0.040
 solar-0.032
Token There
Token ID1318
Feature activation+2.41e-02

Pos logit contributions
 comm+0.032
 aback+0.032
 written+0.031
hered+0.031
 bisc+0.031
Neg logit contributions
 often-0.032
igs-0.030
ot-0.029
 solar-0.029
 lived-0.028
Token was
Token ID373
Feature activation+2.63e-02

Pos logit contributions
Gu+0.039
 comm+0.038
 aback+0.038
 written+0.037
 bisc+0.037
Neg logit contributions
 often-0.032
igs-0.030
 lived-0.029
ot-0.028
apy-0.026
Token a
Token ID257
Feature activation+1.34e-02

Pos logit contributions
Gu+0.035
 aback+0.034
hered+0.033
 comm+0.033
 written+0.032
Neg logit contributions
 often-0.041
igs-0.040
 lived-0.036
ot-0.036
 solar-0.036
Token beautiful
Token ID4950
Feature activation+8.92e-03

Pos logit contributions
 written+0.022
 aback+0.022
Gu+0.022
 comm+0.021
hered+0.021
Neg logit contributions
igs-0.016
 often-0.016
ot-0.014
 solar-0.014
 lived-0.013
Token Emma
Token ID18966
Feature activation-3.29e-03

Pos logit contributions
 written+0.003
 aback+0.002
Gu+0.002
 comm+0.002
 source+0.002
Neg logit contributions
igs-0.002
 often-0.002
ot-0.002
 solar-0.002
 lived-0.002
Token and
Token ID290
Feature activation-3.37e-03

Pos logit contributions
igs+0.006
 often+0.005
ot+0.005
 solar+0.005
 gut+0.005
Neg logit contributions
 written-0.004
 aback-0.004
 comm-0.004
Gu-0.004
 source-0.004
Token Jack
Token ID3619
Feature activation-1.78e-02

Pos logit contributions
igs+0.006
 often+0.006
ot+0.005
 solar+0.005
 gut+0.005
Neg logit contributions
 written-0.004
 aback-0.004
 comm-0.004
Gu-0.004
 source-0.004
Token were
Token ID547
Feature activation+1.72e-02

Pos logit contributions
igs+0.033
 often+0.028
 gut+0.028
ot+0.027
 solar+0.027
Neg logit contributions
 written-0.023
 aback-0.021
 comm-0.021
 announced-0.020
Gu-0.019
Token so
Token ID523
Feature activation-1.26e-02

Pos logit contributions
 aback+0.030
Gu+0.029
 written+0.029
 bisc+0.028
hered+0.028
Neg logit contributions
 often-0.022
igs-0.020
 lived-0.017
ot-0.017
 solar-0.017
Token excited
Token ID6568
Feature activation+5.65e-02

Pos logit contributions
igs+0.022
 often+0.019
 solar+0.018
 gut+0.018
ug+0.017
Neg logit contributions
 written-0.016
 comm-0.016
Gu-0.015
 aback-0.015
 calling-0.014
Token to
Token ID284
Feature activation-5.72e-03

Pos logit contributions
Gu+0.136
Uh+0.131
 aback+0.128
Ta+0.128
 bisc+0.126
Neg logit contributions
 often-0.046
 every-0.029
ot-0.023
igs-0.022
.-0.021
Token see
Token ID766
Feature activation+2.37e-02

Pos logit contributions
igs+0.010
 often+0.010
ot+0.009
 solar+0.009
 gut+0.009
Neg logit contributions
 written-0.007
 aback-0.006
Gu-0.006
 comm-0.006
 source-0.006
Token the
Token ID262
Feature activation+2.98e-02

Pos logit contributions
Gu+0.040
 aback+0.038
 written+0.038
 comm+0.038
hered+0.038
Neg logit contributions
 often-0.029
igs-0.028
 lived-0.025
ot-0.024
 solar-0.024
Token black
Token ID2042
Feature activation+9.79e-03

Pos logit contributions
 aback+0.055
hered+0.053
Gu+0.052
 written+0.050
umbs+0.049
Neg logit contributions
 often-0.031
igs-0.030
 solar-0.029
ot-0.028
 dusty-0.027
Tokenboard
Token ID3526
Feature activation+1.28e-02

Pos logit contributions
 written+0.013
 aback+0.013
Gu+0.012
 comm+0.012
 source+0.011
Neg logit contributions
igs-0.016
 often-0.015
ot-0.014
 solar-0.014
 lived-0.014
Token"
Token ID1
Feature activation+4.84e-02

Pos logit contributions
igs+0.041
 often+0.038
 solar+0.037
 gut+0.035
 collection+0.034
Neg logit contributions
Gu-0.031
 written-0.031
Sur-0.031
 aback-0.028
Time-0.028
TokenHey
Token ID10814
Feature activation+4.10e-02

Pos logit contributions
 aback+0.071
 written+0.066
 Birthday+0.064
 calling+0.063
 source+0.063
Neg logit contributions
apy-0.065
 often-0.064
ot-0.064
igs-0.063
 tops-0.059
Token,
Token ID11
Feature activation+3.49e-02

Pos logit contributions
 written+0.096
 aback+0.093
 comm+0.092
Gu+0.092
 source+0.090
Neg logit contributions
igs-0.024
 often-0.021
ot-0.019
 lived-0.016
 gut-0.014
Token what
Token ID644
Feature activation+4.25e-02

Pos logit contributions
 written+0.059
 aback+0.056
Gu+0.055
andra+0.052
 comm+0.052
Neg logit contributions
igs-0.043
 often-0.043
ot-0.041
 lived-0.038
 gut-0.038
Token's
Token ID338
Feature activation+2.50e-02

Pos logit contributions
 aback+0.077
 source+0.071
 comm+0.071
Gu+0.069
 bisc+0.068
Neg logit contributions
 lived-0.052
 often-0.049
igs-0.047
ot-0.045
 goals-0.043
Token going
Token ID1016
Feature activation+8.40e-02

Pos logit contributions
 aback+0.045
 written+0.044
Gu+0.044
umbs+0.043
 comm+0.043
Neg logit contributions
igs-0.030
 often-0.028
 lived-0.024
ot-0.024
 solar-0.023
Token on
Token ID319
Feature activation+3.45e-02

Pos logit contributions
oph+0.162
 comm+0.157
umbs+0.155
andra+0.153
Gu+0.150
Neg logit contributions
 often-0.081
 lived-0.073
ot-0.072
 high-0.071
 alone-0.069
Token here
Token ID994
Feature activation+6.30e-02

Pos logit contributions
 written+0.054
Gu+0.052
 comm+0.052
 aback+0.051
 bisc+0.048
Neg logit contributions
igs-0.050
 often-0.048
ot-0.042
ug-0.041
 lived-0.041
Token?"
Token ID1701
Feature activation+1.88e-04

Pos logit contributions
Gu+0.144
 comm+0.140
 written+0.137
 aback+0.136
umbs+0.134
Neg logit contributions
 often-0.047
igs-0.038
 lived-0.038
ot-0.033
 nicely-0.032
Token a
Token ID257
Feature activation-2.03e-02

Pos logit contributions
 written+0.000
 aback+0.000
Gu+0.000
 comm+0.000
 source+0.000
Neg logit contributions
igs-0.000
 often-0.000
ot-0.000
 solar-0.000
 gut-0.000
Token voice
Token ID3809
Feature activation+2.91e-02

Pos logit contributions
igs+0.039
 often+0.038
 solar+0.035
 lived+0.034
ot+0.033
Neg logit contributions
 comm-0.021
Gu-0.020
 written-0.019
 aback-0.019
 source-0.019
Token apologised
Token ID46397
Feature activation-5.05e-03

Pos logit contributions
Gu+0.026
 written+0.026
 aback+0.025
 comm+0.024
hered+0.024
Neg logit contributions
 often-0.037
igs-0.034
ot-0.032
 lived-0.031
 gut-0.030
Token to
Token ID284
Feature activation+4.77e-04

Pos logit contributions
igs+0.009
 often+0.009
 solar+0.008
ot+0.008
 lived+0.008
Neg logit contributions
 written-0.006
 aback-0.006
 comm-0.005
Gu-0.005
 source-0.005
Token Tom
Token ID4186
Feature activation+6.00e-03

Pos logit contributions
 written+0.001
 aback+0.001
 comm+0.001
Gu+0.001
 source+0.001
Neg logit contributions
igs-0.001
 often-0.001
ot-0.001
 solar-0.001
 lived-0.001
Token and
Token ID290
Feature activation+1.95e-03

Pos logit contributions
 written+0.010
 aback+0.010
Gu+0.010
 comm+0.010
 source+0.009
Neg logit contributions
igs-0.007
 often-0.007
ot-0.006
 solar-0.006
 lived-0.006
Token said
Token ID531
Feature activation-3.99e-02

Pos logit contributions
 written+0.003
 aback+0.003
 comm+0.003
Gu+0.003
 source+0.003
Neg logit contributions
igs-0.003
 often-0.003
ot-0.003
 solar-0.002
 lived-0.002
Token "
Token ID366
Feature activation+5.33e-02

Pos logit contributions
igs+0.069
 solar+0.064
ind+0.062
 rocky+0.056
 often+0.056
Neg logit contributions
 written-0.053
 aback-0.049
 alright-0.048
 saying-0.046
 come-0.046
Tokensorry
Token ID41599
Feature activation+3.39e-02

Pos logit contributions
 Birthday+0.071
 comm+0.070
 written+0.069
 aback+0.069
 calling+0.067
Neg logit contributions
 often-0.080
ot-0.073
igs-0.065
His-0.065
ind-0.064
Token,
Token ID11
Feature activation+6.48e-04

Pos logit contributions
 written+0.063
 comm+0.062
Gu+0.061
 aback+0.060
 source+0.059
Neg logit contributions
 often-0.034
igs-0.033
ot-0.033
 lived-0.028
ug-0.028
Token little
Token ID1310
Feature activation+3.43e-03

Pos logit contributions
 written+0.001
 aback+0.001
Gu+0.001
 comm+0.001
 source+0.001
Neg logit contributions
igs-0.001
 often-0.001
ot-0.001
 solar-0.001
 lived-0.001
Token brother
Token ID3956
Feature activation-1.18e-02

Pos logit contributions
 written+0.005
 aback+0.004
Gu+0.004
 comm+0.004
 source+0.004
Neg logit contributions
igs-0.006
 often-0.005
ot-0.005
 solar-0.005
 lived-0.005
Token,
Token ID11
Feature activation+8.49e-03

Pos logit contributions
igs+0.019
 often+0.018
 solar+0.016
ot+0.016
 gut+0.016
Neg logit contributions
 written-0.016
 aback-0.016
Gu-0.015
 comm-0.015
hered-0.014
Token\n
Token ID198
Feature activation+2.03e-03

Pos logit contributions
 written+0.046
 comm+0.044
 aback+0.044
Gu+0.043
hered+0.042
Neg logit contributions
 often-0.042
igs-0.040
ot-0.038
 lived-0.037
 solar-0.034
TokenL
Token ID43
Feature activation+3.31e-02

Pos logit contributions
 written+0.003
 aback+0.003
Gu+0.003
 comm+0.003
 source+0.003
Neg logit contributions
igs-0.003
 often-0.003
ot-0.002
 solar-0.002
 gut-0.002
Tokenily
Token ID813
Feature activation-1.58e-02

Pos logit contributions
 written+0.063
 aback+0.063
 comm+0.062
 source+0.062
 bisc+0.061
Neg logit contributions
igs-0.035
 often-0.033
ot-0.031
 lived-0.028
ug-0.025
Token's
Token ID338
Feature activation+4.54e-03

Pos logit contributions
igs+0.032
 often+0.029
 solar+0.029
apy+0.029
ot+0.028
Neg logit contributions
 written-0.015
 comm-0.014
Gu-0.013
 aback-0.013
 source-0.012
Token friend
Token ID1545
Feature activation+2.23e-02

Pos logit contributions
 written+0.008
 aback+0.008
Gu+0.007
 comm+0.007
 source+0.007
Neg logit contributions
igs-0.006
 often-0.005
ot-0.005
 solar-0.005
 lived-0.004
Token,
Token ID11
Feature activation+5.15e-02

Pos logit contributions
 written+0.043
 aback+0.042
Gu+0.042
 comm+0.042
 source+0.041
Neg logit contributions
 often-0.022
igs-0.020
ot-0.020
 lived-0.019
ug-0.016
Token Tim
Token ID5045
Feature activation+1.61e-02

Pos logit contributions
umbs+0.096
ives+0.091
arre+0.090
ctions+0.089
 Birthday+0.089
Neg logit contributions
 lived-0.054
ot-0.049
 often-0.048
 his-0.041
 Sp-0.041
Tokenmy
Token ID1820
Feature activation+4.91e-03

Pos logit contributions
 written+0.015
 aback+0.014
 comm+0.014
Gu+0.014
 source+0.013
Neg logit contributions
 often-0.033
igs-0.032
ot-0.030
 lived-0.029
 solar-0.028
Token,
Token ID11
Feature activation+5.11e-02

Pos logit contributions
 written+0.009
 aback+0.009
Gu+0.009
 comm+0.009
 source+0.008
Neg logit contributions
igs-0.005
 often-0.005
ot-0.004
 solar-0.004
 lived-0.004
Token said
Token ID531
Feature activation-2.07e-02

Pos logit contributions
umbs+0.082
ctions+0.078
 aback+0.076
cake+0.075
ives+0.075
Neg logit contributions
 lived-0.073
 often-0.065
 loved-0.058
ot-0.057
 named-0.053
Token "
Token ID366
Feature activation+3.92e-02

Pos logit contributions
igs+0.031
 solar+0.028
ind+0.028
 often+0.028
ot+0.025
Neg logit contributions
 written-0.030
 aback-0.029
Gu-0.028
hered-0.028
 come-0.028
Token onions
Token ID24681
Feature activation+1.04e-02

Pos logit contributions
Gu+0.040
 written+0.039
 comm+0.039
 Birthday+0.038
 aback+0.038
Neg logit contributions
igs-0.028
 often-0.027
ot-0.024
 lived-0.023
apy-0.022
Token in
Token ID287
Feature activation-3.21e-03

Pos logit contributions
 written+0.014
Gu+0.014
 aback+0.014
 comm+0.014
 source+0.013
Neg logit contributions
igs-0.016
 often-0.016
ot-0.014
 solar-0.014
 lived-0.013
Token the
Token ID262
Feature activation-4.78e-03

Pos logit contributions
igs+0.004
 often+0.004
ot+0.003
 solar+0.003
 lived+0.003
Neg logit contributions
 written-0.005
 aback-0.005
Gu-0.005
 comm-0.005
 source-0.005
Token food
Token ID2057
Feature activation-1.56e-02

Pos logit contributions
igs+0.008
 often+0.008
ot+0.007
ug+0.007
 solar+0.007
Neg logit contributions
 written-0.006
 aback-0.006
Gu-0.006
 comm-0.006
 source-0.005
Token.
Token ID13
Feature activation-1.69e-02

Pos logit contributions
igs+0.021
 often+0.019
ot+0.017
 solar+0.017
 gut+0.017
Neg logit contributions
 written-0.026
 aback-0.024
 comm-0.024
 source-0.023
 bisc-0.023
Token Tim
Token ID5045
Feature activation+1.07e-02

Pos logit contributions
igs+0.029
 often+0.026
 solar+0.025
ot+0.025
 gut+0.024
Neg logit contributions
 written-0.021
Gu-0.021
 aback-0.021
 comm-0.021
cake-0.019
Tokenmy
Token ID1820
Feature activation-1.11e-02

Pos logit contributions
 written+0.008
 aback+0.007
Gu+0.007
 comm+0.007
hered+0.006
Neg logit contributions
 often-0.023
igs-0.023
ot-0.022
 lived-0.021
 solar-0.021
Token didn
Token ID1422
Feature activation-2.60e-03

Pos logit contributions
igs+0.020
 often+0.019
ot+0.018
 solar+0.018
ug+0.018
Neg logit contributions
 written-0.013
 comm-0.012
Gu-0.012
 aback-0.012
 source-0.011
Token't
Token ID470
Feature activation+1.06e-02

Pos logit contributions
igs+0.005
 often+0.005
ot+0.005
 solar+0.005
 lived+0.005
Neg logit contributions
 written-0.002
 aback-0.002
Gu-0.002
 comm-0.002
 source-0.002
Token like
Token ID588
Feature activation-4.05e-03

Pos logit contributions
 written+0.013
 aback+0.013
 comm+0.012
Gu+0.012
 source+0.011
Neg logit contributions
 often-0.018
igs-0.018
ot-0.016
 lived-0.015
 solar-0.015
Token onions
Token ID24681
Feature activation+8.07e-03

Pos logit contributions
igs+0.006
 often+0.006
ot+0.005
 solar+0.005
 lived+0.005
Neg logit contributions
 written-0.006
 aback-0.006
 comm-0.006
Gu-0.006
 source-0.005
Token more
Token ID517
Feature activation+6.50e-03

Pos logit contributions
 written+0.016
Gu+0.016
 aback+0.015
 comm+0.015
hered+0.015
Neg logit contributions
igs-0.018
 often-0.018
ot-0.016
 solar-0.015
 gut-0.015
Token excited
Token ID6568
Feature activation+4.07e-02

Pos logit contributions
 written+0.009
 aback+0.009
Gu+0.009
 comm+0.009
 source+0.008
Neg logit contributions
igs-0.010
 often-0.010
ot-0.009
 solar-0.008
 lived-0.008
Token.
Token ID13
Feature activation-1.99e-03

Pos logit contributions
Gu+0.081
 aback+0.074
 written+0.071
 bisc+0.071
Uh+0.070
Neg logit contributions
 often-0.049
igs-0.041
ot-0.038
 every-0.034
 toes-0.034
Token\n
Token ID198
Feature activation-1.17e-02

Pos logit contributions
igs+0.003
 often+0.003
ot+0.003
 solar+0.002
 gut+0.002
Neg logit contributions
 written-0.003
 aback-0.003
Gu-0.003
 comm-0.003
 source-0.003
Token\n
Token ID198
Feature activation-1.71e-02

Pos logit contributions
igs+0.021
 often+0.020
 solar+0.019
 gut+0.019
ot+0.019
Neg logit contributions
 written-0.013
Gu-0.012
 aback-0.012
 comm-0.012
 source-0.011
TokenFinally
Token ID11158
Feature activation-4.16e-03

Pos logit contributions
igs+0.027
 often+0.025
 solar+0.023
 gut+0.023
ot+0.022
Neg logit contributions
 written-0.023
Gu-0.023
 aback-0.022
 comm-0.022
Sur-0.021
Token,
Token ID11
Feature activation+3.27e-02

Pos logit contributions
igs+0.005
 often+0.005
 solar+0.004
ot+0.004
 gut+0.004
Neg logit contributions
 written-0.007
 aback-0.007
 comm-0.007
Gu-0.007
hered-0.007
Token they
Token ID484
Feature activation+9.31e-03

Pos logit contributions
 aback+0.063
Gu+0.063
 written+0.062
 comm+0.061
ives+0.058
Neg logit contributions
 often-0.033
igs-0.030
ot-0.028
apy-0.026
 lived-0.025
Token found
Token ID1043
Feature activation+1.50e-02

Pos logit contributions
 written+0.012
 aback+0.012
Gu+0.012
 comm+0.012
 source+0.011
Neg logit contributions
 often-0.015
igs-0.015
ot-0.014
 lived-0.013
 solar-0.013
Token the
Token ID262
Feature activation+2.99e-02

Pos logit contributions
Gu+0.021
 written+0.021
 comm+0.021
 aback+0.020
hered+0.019
Neg logit contributions
igs-0.022
 often-0.022
ot-0.020
 lived-0.019
 solar-0.019
Token ball
Token ID2613
Feature activation+4.95e-03

Pos logit contributions
Gu+0.051
 aback+0.050
 written+0.050
 comm+0.046
 begs+0.045
Neg logit contributions
 often-0.036
igs-0.036
ot-0.031
 solar-0.031
 goals-0.030
Token away
Token ID1497
Feature activation-1.44e-02

Pos logit contributions
igs+0.026
 often+0.024
 solar+0.023
ot+0.023
 lived+0.023
Neg logit contributions
 aback-0.016
 written-0.015
 comm-0.014
 source-0.014
hered-0.014
Token from
Token ID422
Feature activation-1.99e-02

Pos logit contributions
igs+0.023
 often+0.020
 solar+0.020
ot+0.019
 lived+0.019
Neg logit contributions
 written-0.020
 aback-0.020
 comm-0.020
 source-0.019
 calling-0.018
Token the
Token ID262
Feature activation-3.93e-03

Pos logit contributions
igs+0.031
 often+0.027
ot+0.027
 solar+0.026
apy+0.026
Neg logit contributions
 written-0.028
 source-0.027
 aback-0.027
 comm-0.027
 calling-0.027
Token scary
Token ID14343
Feature activation+7.06e-03

Pos logit contributions
igs+0.006
 often+0.005
ot+0.005
 lived+0.005
 solar+0.005
Neg logit contributions
 written-0.006
 comm-0.006
 aback-0.006
Gu-0.006
 source-0.006
Token man
Token ID582
Feature activation+6.10e-03

Pos logit contributions
 written+0.010
 aback+0.010
Gu+0.010
 comm+0.010
 source+0.009
Neg logit contributions
igs-0.011
 often-0.010
ot-0.009
 solar-0.009
 lived-0.009
Token.
Token ID13
Feature activation+9.20e-03

Pos logit contributions
 written+0.011
 aback+0.010
Gu+0.010
 comm+0.010
 source+0.010
Neg logit contributions
igs-0.007
 often-0.007
ot-0.006
 solar-0.006
 lived-0.006
Token\n
Token ID198
Feature activation-1.71e-04

Pos logit contributions
 written+0.013
 aback+0.012
 comm+0.012
Gu+0.012
 source+0.012
Neg logit contributions
igs-0.014
 often-0.014
ot-0.012
 solar-0.012
 lived-0.012
Token\n
Token ID198
Feature activation-4.38e-03

Pos logit contributions
igs+0.000
 often+0.000
ot+0.000
 solar+0.000
 gut+0.000
Neg logit contributions
 written-0.000
 aback-0.000
Gu-0.000
 comm-0.000
 source-0.000
TokenFinally
Token ID11158
Feature activation+3.62e-03

Pos logit contributions
igs+0.006
 often+0.006
 solar+0.005
ot+0.005
 lived+0.005
Neg logit contributions
 written-0.007
 aback-0.007
Gu-0.007
 comm-0.007
hered-0.006
Token,
Token ID11
Feature activation+3.03e-02

Pos logit contributions
 written+0.007
 aback+0.007
Gu+0.006
 comm+0.006
 source+0.006
Neg logit contributions
igs-0.004
 often-0.004
ot-0.003
 solar-0.003
 lived-0.003
Token Lucy
Token ID22162
Feature activation+2.89e-02

Pos logit contributions
 written+0.055
Gu+0.054
 comm+0.053
 aback+0.053
 bisc+0.052
Neg logit contributions
 often-0.036
igs-0.030
 lived-0.028
apy-0.028
ot-0.027
TokenMom
Token ID29252
Feature activation+3.35e-02

Pos logit contributions
igs+0.061
 often+0.055
 solar+0.055
 goals+0.052
 toes+0.052
Neg logit contributions
Gu-0.038
Sur-0.037
 aback-0.035
 bisc-0.033
Meg-0.032
Token hugged
Token ID41130
Feature activation+1.75e-02

Pos logit contributions
Gu+0.048
 source+0.047
 aback+0.047
 written+0.047
 comm+0.046
Neg logit contributions
 often-0.047
ot-0.044
igs-0.043
 lived-0.042
 tight-0.039
Token Sam
Token ID3409
Feature activation-6.54e-03

Pos logit contributions
 written+0.025
 comm+0.024
Gu+0.024
 aback+0.024
 source+0.023
Neg logit contributions
igs-0.026
 often-0.026
ot-0.024
 solar-0.022
ug-0.022
Token.
Token ID13
Feature activation-9.89e-03

Pos logit contributions
igs+0.012
 often+0.011
 solar+0.011
ot+0.011
apy+0.011
Neg logit contributions
 written-0.007
 aback-0.007
 comm-0.007
Gu-0.006
 source-0.006
Token She
Token ID1375
Feature activation+1.59e-02

Pos logit contributions
igs+0.016
 often+0.015
ind+0.014
 solar+0.014
ot+0.013
Neg logit contributions
 written-0.014
 aback-0.013
Gu-0.013
 comm-0.012
 bisc-0.012
Token was
Token ID373
Feature activation+9.65e-03

Pos logit contributions
 aback+0.017
 written+0.017
Gu+0.016
 source+0.016
 comm+0.016
Neg logit contributions
 often-0.029
igs-0.029
 lived-0.027
ot-0.026
 solar-0.025
Token not
Token ID407
Feature activation+3.22e-02

Pos logit contributions
 written+0.017
 aback+0.016
Gu+0.016
 comm+0.016
hered+0.016
Neg logit contributions
igs-0.011
 often-0.011
ot-0.010
 lived-0.009
 solar-0.009
Token angry
Token ID7954
Feature activation+3.30e-02

Pos logit contributions
Gu+0.061
oph+0.060
hered+0.059
 bisc+0.059
cake+0.059
Neg logit contributions
 often-0.032
igs-0.030
ot-0.028
 exc-0.025
 tight-0.025
Token.
Token ID13
Feature activation-1.51e-02

Pos logit contributions
 bisc+0.064
Gu+0.063
 written+0.063
andra+0.062
 aback+0.062
Neg logit contributions
 often-0.034
igs-0.026
 goals-0.025
ot-0.024
 nicely-0.024
Token She
Token ID1375
Feature activation+1.68e-02

Pos logit contributions
igs+0.027
 often+0.025
ind+0.023
 solar+0.022
ug+0.022
Neg logit contributions
 written-0.019
 aback-0.019
Gu-0.018
 comm-0.017
 How-0.016
Token was
Token ID373
Feature activation+2.51e-02

Pos logit contributions
 aback+0.018
 written+0.018
Gu+0.017
 source+0.017
hered+0.017
Neg logit contributions
 often-0.031
igs-0.029
 lived-0.029
ot-0.028
 solar-0.026
Token OK
Token ID7477
Feature activation+4.17e-02

Pos logit contributions
 aback+0.028
 written+0.028
Gu+0.026
oph+0.025
 bisc+0.025
Neg logit contributions
igs-0.045
 often-0.044
ot-0.040
 lived-0.039
ug-0.039
Token?"
Token ID1701
Feature activation+5.82e-04

Pos logit contributions
 written+0.089
 aback+0.089
ives+0.087
Gu+0.086
 comm+0.085
Neg logit contributions
 often-0.034
ot-0.033
igs-0.030
 lived-0.025
ug-0.025
Token Sam
Token ID3409
Feature activation+8.77e-03

Pos logit contributions
 written+0.001
 aback+0.001
Gu+0.001
 comm+0.001
 source+0.001
Neg logit contributions
igs-0.001
 often-0.001
ot-0.001
 solar-0.001
 lived-0.001
Token said
Token ID531
Feature activation-2.70e-03

Pos logit contributions
 written+0.011
 aback+0.010
Gu+0.010
 comm+0.010
 source+0.010
Neg logit contributions
igs-0.015
 often-0.015
ot-0.013
 lived-0.013
 solar-0.013
Token.
Token ID13
Feature activation+6.29e-03

Pos logit contributions
igs+0.003
 often+0.003
 solar+0.003
ot+0.003
 lived+0.002
Neg logit contributions
 written-0.005
 aback-0.005
 comm-0.005
Gu-0.005
 source-0.005
Token He
Token ID679
Feature activation+2.48e-03

Pos logit contributions
 written+0.012
 aback+0.011
Gu+0.011
 comm+0.011
 source+0.011
Neg logit contributions
igs-0.007
 often-0.007
ot-0.006
 solar-0.005
 lived-0.005
Token ran
Token ID4966
Feature activation+1.84e-02

Pos logit contributions
 written+0.003
 aback+0.003
Gu+0.003
 comm+0.003
 source+0.003
Neg logit contributions
igs-0.004
 often-0.004
ot-0.004
 solar-0.004
 lived-0.003
Token to
Token ID284
Feature activation+1.11e-02

Pos logit contributions
 written+0.024
Gu+0.023
 comm+0.023
oph+0.022
 Birthday+0.022
Neg logit contributions
 often-0.030
igs-0.028
ot-0.027
 tight-0.025
 gut-0.025
Token Tom
Token ID4186
Feature activation+1.24e-02

Pos logit contributions
 written+0.020
 aback+0.019
Gu+0.019
 comm+0.019
 Birthday+0.018
Neg logit contributions
igs-0.012
 often-0.012
ot-0.011
 solar-0.010
 gut-0.010
Token.
Token ID13
Feature activation+1.37e-03

Pos logit contributions
 written+0.023
Gu+0.022
 aback+0.022
 comm+0.022
 source+0.021
Neg logit contributions
 often-0.013
igs-0.013
ot-0.011
 solar-0.010
 lived-0.010
Token He
Token ID679
Feature activation+4.28e-03

Pos logit contributions
 written+0.002
 aback+0.002
Gu+0.002
 comm+0.002
 source+0.002
Neg logit contributions
igs-0.002
 often-0.002
ot-0.002
 solar-0.002
 lived-0.002
Token toys
Token ID14958
Feature activation+4.70e-03

Pos logit contributions
igs+0.052
ot+0.049
 often+0.048
oppy+0.047
 gut+0.047
Neg logit contributions
 written-0.043
 comm-0.039
 aback-0.038
 ingredients-0.038
 calling-0.037
Token and
Token ID290
Feature activation-1.93e-02

Pos logit contributions
 written+0.006
 aback+0.006
Gu+0.006
 comm+0.006
 source+0.006
Neg logit contributions
igs-0.007
 often-0.007
ot-0.007
 solar-0.006
 lived-0.006
Token they
Token ID484
Feature activation-3.24e-02

Pos logit contributions
igs+0.030
 often+0.028
 solar+0.026
ot+0.025
ug+0.025
Neg logit contributions
 written-0.027
 comm-0.025
 aback-0.024
Gu-0.024
 come-0.023
Token should
Token ID815
Feature activation-4.45e-02

Pos logit contributions
igs+0.056
 gut+0.048
ind+0.046
 solar+0.046
ot+0.045
Neg logit contributions
 written-0.046
 aback-0.038
 come-0.038
 announced-0.038
 comm-0.038
Token give
Token ID1577
Feature activation+1.19e-02

Pos logit contributions
igs+0.084
 solar+0.076
ug+0.075
apy+0.074
bers+0.074
Neg logit contributions
 come-0.049
 written-0.044
 calling-0.040
 source-0.039
 alright-0.039
Token some
Token ID617
Feature activation-6.42e-03

Pos logit contributions
 written+0.018
Gu+0.017
 aback+0.017
 comm+0.017
 source+0.016
Neg logit contributions
igs-0.017
 often-0.017
ot-0.015
 solar-0.015
 lived-0.015
Token to
Token ID284
Feature activation-2.35e-02

Pos logit contributions
igs+0.010
 often+0.010
ot+0.009
 solar+0.009
 gut+0.009
Neg logit contributions
 written-0.009
 aback-0.008
 comm-0.008
Gu-0.008
 source-0.008
Token other
Token ID584
Feature activation+1.24e-02

Pos logit contributions
igs+0.034
 often+0.028
 solar+0.027
 gut+0.026
ug+0.026
Neg logit contributions
 written-0.036
 calling-0.034
 come-0.034
 comm-0.034
 source-0.033
Token children
Token ID1751
Feature activation-2.42e-02

Pos logit contributions
 written+0.021
 aback+0.020
Gu+0.020
 comm+0.019
hered+0.019
Neg logit contributions
igs-0.016
 often-0.016
ot-0.014
 solar-0.013
 lived-0.013
Token who
Token ID508
Feature activation-2.50e-02

Pos logit contributions
igs+0.038
 often+0.034
 solar+0.033
 gut+0.032
ug+0.031
Neg logit contributions
 written-0.034
 aback-0.032
 comm-0.029
 calling-0.029
Gu-0.029
Token had
Token ID550
Feature activation-1.09e-03

Pos logit contributions
igs+0.042
 often+0.037
 gut+0.036
ot+0.036
 solar+0.035
Neg logit contributions
 written-0.034
 aback-0.030
Gu-0.029
 comm-0.028
 come-0.028
Token She
Token ID1375
Feature activation-3.35e-02

Pos logit contributions
igs+0.038
 gut+0.034
aro+0.033
ind+0.033
ug+0.033
Neg logit contributions
 written-0.024
Gu-0.023
 aback-0.022
cake-0.021
 How-0.021
Token was
Token ID373
Feature activation+1.90e-02

Pos logit contributions
igs+0.055
 gut+0.045
ug+0.044
 goals+0.044
aro+0.043
Neg logit contributions
 written-0.052
 happened-0.049
Gu-0.047
 comm-0.047
 baked-0.047
Token walking
Token ID6155
Feature activation-1.77e-03

Pos logit contributions
 aback+0.033
 written+0.033
 comm+0.032
Gu+0.032
 bisc+0.032
Neg logit contributions
 often-0.023
igs-0.021
ot-0.019
 lived-0.018
 solar-0.018
Token in
Token ID287
Feature activation+6.18e-03

Pos logit contributions
igs+0.003
 often+0.003
ot+0.003
 solar+0.003
 gut+0.003
Neg logit contributions
 written-0.002
 aback-0.002
Gu-0.002
 comm-0.002
 source-0.002
Token the
Token ID262
Feature activation-9.78e-03

Pos logit contributions
 written+0.011
 aback+0.011
Gu+0.011
 comm+0.011
 source+0.010
Neg logit contributions
igs-0.007
 often-0.007
ot-0.006
 solar-0.006
 lived-0.006
Token park
Token ID3952
Feature activation-1.06e-02

Pos logit contributions
igs+0.018
 often+0.017
ot+0.016
ug+0.015
apy+0.015
Neg logit contributions
Gu-0.011
 written-0.011
 comm-0.011
 source-0.010
 aback-0.010
Token when
Token ID618
Feature activation-4.21e-02

Pos logit contributions
igs+0.020
 often+0.018
 solar+0.017
ot+0.017
 gut+0.017
Neg logit contributions
 written-0.012
Gu-0.011
 aback-0.011
 source-0.011
 comm-0.011
Token she
Token ID673
Feature activation+7.58e-03

Pos logit contributions
igs+0.084
ug+0.076
 gut+0.075
ind+0.074
 solar+0.074
Neg logit contributions
 written-0.040
 happened-0.038
 source-0.036
 appeared-0.036
Gu-0.035
Token saw
Token ID2497
Feature activation+2.58e-03

Pos logit contributions
 written+0.009
 aback+0.009
 comm+0.008
Gu+0.008
 source+0.008
Neg logit contributions
igs-0.013
 often-0.013
ot-0.012
 lived-0.012
 solar-0.012
Token a
Token ID257
Feature activation+4.32e-03

Pos logit contributions
 written+0.003
 aback+0.003
Gu+0.003
 comm+0.003
 source+0.003
Neg logit contributions
igs-0.005
 often-0.005
ot-0.004
 solar-0.004
 lived-0.004
Token j
Token ID474
Feature activation-5.07e-02

Pos logit contributions
 written+0.007
 aback+0.007
Gu+0.007
 comm+0.007
 source+0.006
Neg logit contributions
igs-0.006
 often-0.006
ot-0.005
 solar-0.005
 lived-0.005
Token Candy
Token ID24680
Feature activation+1.07e-02

Pos logit contributions
Gu+0.047
 written+0.047
 bisc+0.045
 comm+0.044
 aback+0.044
Neg logit contributions
 often-0.033
igs-0.031
 lived-0.028
ot-0.027
 solar-0.026
Token she
Token ID673
Feature activation-2.57e-03

Pos logit contributions
 written+0.020
 aback+0.020
Gu+0.019
 comm+0.019
 source+0.018
Neg logit contributions
igs-0.011
 often-0.011
ot-0.009
 lived-0.009
 solar-0.009
Token was
Token ID373
Feature activation+3.34e-02

Pos logit contributions
igs+0.004
 often+0.004
ot+0.004
 solar+0.004
 gut+0.004
Neg logit contributions
 written-0.003
 aback-0.003
 comm-0.003
Gu-0.003
 source-0.003
Token so
Token ID523
Feature activation-7.75e-03

Pos logit contributions
 bisc+0.055
Gu+0.055
 aback+0.055
 written+0.053
 comm+0.052
Neg logit contributions
 often-0.046
igs-0.037
ot-0.036
 lived-0.035
 tight-0.033
Token excited
Token ID6568
Feature activation+5.50e-02

Pos logit contributions
igs+0.012
 often+0.011
 solar+0.010
ot+0.010
 gut+0.010
Neg logit contributions
 written-0.011
 aback-0.011
 comm-0.010
Gu-0.010
 source-0.010
Token that
Token ID326
Feature activation-1.19e-02

Pos logit contributions
Gu+0.117
Uh+0.113
 aback+0.108
 bisc+0.108
hered+0.107
Neg logit contributions
 often-0.059
 every-0.041
 lived-0.041
igs-0.040
ot-0.037
Token she
Token ID673
Feature activation-1.13e-03

Pos logit contributions
igs+0.021
 often+0.018
ug+0.017
 solar+0.017
ot+0.017
Neg logit contributions
 written-0.016
 aback-0.015
Gu-0.014
 comm-0.014
 calling-0.014
Token ran
Token ID4966
Feature activation+2.81e-02

Pos logit contributions
igs+0.002
 often+0.002
ot+0.002
 solar+0.001
 gut+0.001
Neg logit contributions
 written-0.002
 aback-0.002
 comm-0.002
Gu-0.002
 source-0.001
Token right
Token ID826
Feature activation+6.14e-03

Pos logit contributions
Gu+0.037
 written+0.037
 comm+0.036
 bisc+0.035
 Birthday+0.034
Neg logit contributions
 often-0.045
igs-0.040
ot-0.039
 tight-0.039
 solar-0.036
Token into
Token ID656
Feature activation+8.15e-03

Pos logit contributions
 written+0.007
 aback+0.007
Gu+0.007
 comm+0.007
 source+0.006
Neg logit contributions
igs-0.011
 often-0.011
ot-0.010
 solar-0.009
 lived-0.009
Token the
Token ID262
Feature activation-1.20e-02

Pos logit contributions
 written+0.015
 aback+0.014
Gu+0.014
 comm+0.014
 source+0.013
Neg logit contributions
igs-0.009
 often-0.009
ot-0.008
 solar-0.007
 lived-0.007
Token focus
Token ID2962
Feature activation-3.53e-02

Pos logit contributions
igs+0.039
 often+0.035
 solar+0.034
ot+0.033
apy+0.033
Neg logit contributions
 written-0.023
 comm-0.021
 source-0.021
 come-0.021
 aback-0.021
Token on
Token ID319
Feature activation-3.23e-02

Pos logit contributions
igs+0.072
 solar+0.066
 often+0.064
oppy+0.064
ot+0.063
Neg logit contributions
 written-0.038
 aback-0.030
 saying-0.029
 alright-0.027
 calling-0.027
Token the
Token ID262
Feature activation-2.84e-03

Pos logit contributions
igs+0.048
 often+0.042
ot+0.042
 solar+0.041
 lived+0.040
Neg logit contributions
 written-0.049
 calling-0.045
 comm-0.044
 saying-0.044
 aback-0.044
Token road
Token ID2975
Feature activation-3.37e-02

Pos logit contributions
igs+0.004
 often+0.004
ot+0.003
 solar+0.003
 lived+0.003
Neg logit contributions
 written-0.005
 comm-0.004
 aback-0.004
Gu-0.004
 source-0.004
Token.
Token ID13
Feature activation-9.64e-03

Pos logit contributions
igs+0.051
 solar+0.043
 often+0.043
apy+0.042
ot+0.041
Neg logit contributions
 written-0.052
 aback-0.048
 source-0.046
 calling-0.045
 comm-0.045
Token He
Token ID679
Feature activation-2.22e-02

Pos logit contributions
igs+0.014
 often+0.013
 solar+0.012
ot+0.011
ind+0.011
Neg logit contributions
 written-0.015
 aback-0.015
Gu-0.015
 comm-0.014
 source-0.013
Token did
Token ID750
Feature activation-2.75e-02

Pos logit contributions
igs+0.042
 often+0.038
 solar+0.036
ot+0.036
 gut+0.036
Neg logit contributions
 written-0.026
 comm-0.023
 aback-0.022
Gu-0.022
 announced-0.022
Token not
Token ID407
Feature activation+8.69e-03

Pos logit contributions
igs+0.052
 solar+0.051
ind+0.049
ot+0.049
ug+0.048
Neg logit contributions
 written-0.029
 come-0.025
 source-0.025
 aback-0.025
 calling-0.024
Token like
Token ID588
Feature activation-2.47e-02

Pos logit contributions
 written+0.011
 aback+0.010
Gu+0.010
 comm+0.010
 source+0.010
Neg logit contributions
igs-0.015
 often-0.015
ot-0.013
 solar-0.013
 lived-0.013
Token the
Token ID262
Feature activation-3.01e-03

Pos logit contributions
igs+0.039
 often+0.035
ot+0.034
 solar+0.033
 lived+0.033
Neg logit contributions
 written-0.034
 calling-0.031
 comm-0.031
 aback-0.030
 source-0.030
Token noise
Token ID7838
Feature activation-9.48e-03

Pos logit contributions
igs+0.004
 often+0.004
ot+0.003
 solar+0.003
 lived+0.003
Neg logit contributions
 written-0.005
 comm-0.005
 aback-0.005
Gu-0.005
 source-0.005
Token It
Token ID632
Feature activation+7.33e-03

Pos logit contributions
 written+0.009
 aback+0.008
 comm+0.008
Gu+0.008
 source+0.008
Neg logit contributions
igs-0.011
 often-0.010
ot-0.009
 solar-0.009
 lived-0.009
Token is
Token ID318
Feature activation+5.37e-03

Pos logit contributions
 written+0.011
 aback+0.010
Gu+0.010
 comm+0.010
 source+0.010
Neg logit contributions
igs-0.011
 often-0.011
ot-0.009
 solar-0.009
 lived-0.009
Token a
Token ID257
Feature activation+1.89e-02

Pos logit contributions
 written+0.008
 aback+0.007
Gu+0.007
 comm+0.007
 source+0.007
Neg logit contributions
igs-0.008
 often-0.008
ot-0.007
 solar-0.007
 lived-0.007
Token blanket
Token ID18447
Feature activation-2.58e-03

Pos logit contributions
 aback+0.029
 written+0.028
Gu+0.027
 comm+0.027
hered+0.026
Neg logit contributions
 often-0.026
igs-0.026
 solar-0.024
ot-0.024
 lived-0.023
Token."
Token ID526
Feature activation-1.31e-04

Pos logit contributions
igs+0.003
 often+0.003
ot+0.003
 solar+0.003
 gut+0.003
Neg logit contributions
 written-0.004
 aback-0.004
Gu-0.004
 comm-0.004
 source-0.004
Token\n
Token ID198
Feature activation-2.65e-02

Pos logit contributions
igs+0.000
 often+0.000
ot+0.000
 solar+0.000
 lived+0.000
Neg logit contributions
 written-0.000
 aback-0.000
Gu-0.000
 comm-0.000
 source-0.000
Token\n
Token ID198
Feature activation-3.57e-02

Pos logit contributions
igs+0.060
 often+0.057
belt+0.055
 solar+0.055
 gut+0.054
Neg logit contributions
Gu-0.017
 aback-0.016
 written-0.016
 comm-0.016
 source-0.013
TokenHe
Token ID1544
Feature activation+2.55e-02

Pos logit contributions
igs+0.073
 solar+0.068
 chim+0.066
 often+0.066
ug+0.066
Neg logit contributions
Gu-0.035
Sur-0.034
 written-0.029
Are-0.029
Time-0.028
Token gave
Token ID2921
Feature activation+9.48e-04

Pos logit contributions
 aback+0.035
 written+0.034
Gu+0.033
 source+0.033
 comm+0.032
Neg logit contributions
 often-0.039
igs-0.039
 lived-0.036
ot-0.035
 solar-0.033
Token her
Token ID607
Feature activation-1.12e-02

Pos logit contributions
 written+0.001
 aback+0.001
Gu+0.001
 comm+0.001
 source+0.001
Neg logit contributions
igs-0.001
 often-0.001
ot-0.001
 solar-0.001
 lived-0.001
Token the
Token ID262
Feature activation+2.15e-02

Pos logit contributions
igs+0.020
 often+0.020
 solar+0.019
ug+0.018
ot+0.018
Neg logit contributions
 written-0.012
 aback-0.012
 source-0.011
Gu-0.011
 bisc-0.011
Token string
Token ID4731
Feature activation-1.73e-02

Pos logit contributions
 written+0.009
 aback+0.009
Gu+0.009
 comm+0.008
hered+0.008
Neg logit contributions
igs-0.012
 often-0.012
ot-0.011
 solar-0.011
 lived-0.011
Token.
Token ID13
Feature activation+1.01e-02

Pos logit contributions
igs+0.024
 often+0.023
 solar+0.021
ot+0.021
 gut+0.020
Neg logit contributions
 written-0.028
 aback-0.025
 source-0.024
hered-0.024
 comm-0.024
Token He
Token ID679
Feature activation-1.07e-02

Pos logit contributions
 written+0.016
 comm+0.015
 aback+0.015
Gu+0.015
 source+0.015
Neg logit contributions
igs-0.014
 often-0.013
ot-0.012
 lived-0.011
 solar-0.011
Token tied
Token ID8165
Feature activation+3.21e-03

Pos logit contributions
igs+0.019
 often+0.018
ot+0.017
 solar+0.017
 gut+0.017
Neg logit contributions
 written-0.013
 comm-0.011
 aback-0.011
Gu-0.011
 announced-0.011
Token it
Token ID340
Feature activation-2.19e-02

Pos logit contributions
 written+0.005
 aback+0.005
Gu+0.005
 comm+0.005
 source+0.005
Neg logit contributions
igs-0.004
 often-0.004
ot-0.004
 solar-0.004
 lived-0.004
Token really
Token ID1107
Feature activation-7.40e-02

Pos logit contributions
igs+0.041
 often+0.037
 solar+0.037
ot+0.036
 gut+0.036
Neg logit contributions
 written-0.024
 aback-0.024
 source-0.021
hered-0.020
 comm-0.020
Token tight
Token ID5381
Feature activation-3.08e-02

Pos logit contributions
 ig+0.110
ind+0.109
 solar+0.102
 chim+0.100
charged+0.100
Neg logit contributions
 written-0.099
 aback-0.098
 right-0.094
 sure-0.089
 come-0.087
Token so
Token ID523
Feature activation-8.37e-02

Pos logit contributions
igs+0.058
 often+0.052
 solar+0.051
ot+0.051
apy+0.050
Neg logit contributions
 written-0.037
 aback-0.035
 source-0.030
hered-0.030
 comm-0.029
Token it
Token ID340
Feature activation-3.08e-02

Pos logit contributions
ind+0.154
 solar+0.154
agons+0.149
igs+0.147
 apartment+0.145
Neg logit contributions
 written-0.080
 sure-0.072
 once-0.066
 come-0.066
 happened-0.065
Token wouldn
Token ID3636
Feature activation-1.96e-02

Pos logit contributions
igs+0.047
 often+0.043
ind+0.042
 solar+0.041
 gut+0.040
Neg logit contributions
 written-0.044
 aback-0.042
 come-0.038
 appeared-0.038
 comm-0.038
Token't
Token ID470
Feature activation-1.01e-02

Pos logit contributions
igs+0.044
 often+0.042
 solar+0.041
ot+0.039
 gut+0.039
Neg logit contributions
 written-0.014
 aback-0.013
hered-0.012
 comm-0.011
 source-0.011
Token He
Token ID679
Feature activation-4.46e-02

Pos logit contributions
igs+0.040
 often+0.036
 gut+0.032
ind+0.032
ot+0.031
Neg logit contributions
 written-0.034
Gu-0.034
 aback-0.032
 bisc-0.030
 How-0.030
Token was
Token ID373
Feature activation+2.10e-02

Pos logit contributions
igs+0.087
 gut+0.076
oppy+0.071
 goals+0.070
lights+0.070
Neg logit contributions
 written-0.052
 happened-0.049
Gu-0.049
 appeared-0.049
 baked-0.048
Token very
Token ID845
Feature activation-4.60e-02

Pos logit contributions
 aback+0.040
 written+0.039
oph+0.039
Gu+0.039
 comm+0.038
Neg logit contributions
 often-0.022
igs-0.020
ot-0.018
 lived-0.018
 solar-0.017
Token gr
Token ID1036
Feature activation-2.63e-02

Pos logit contributions
igs+0.082
 goals+0.071
 solar+0.070
 gut+0.070
 often+0.069
Neg logit contributions
 written-0.048
Gu-0.047
 come-0.046
 baked-0.044
 comm-0.044
Tokenumpy
Token ID32152
Feature activation-2.84e-02

Pos logit contributions
igs+0.044
 often+0.040
 gut+0.037
 solar+0.037
apy+0.036
Neg logit contributions
 written-0.036
 aback-0.033
eared-0.033
umbs-0.032
 comm-0.032
Token because
Token ID780
Feature activation-6.13e-02

Pos logit contributions
igs+0.048
ot+0.041
apy+0.041
 often+0.040
 solar+0.040
Neg logit contributions
 written-0.040
 comm-0.036
 aback-0.035
 source-0.035
 calling-0.034
Token he
Token ID339
Feature activation-2.74e-02

Pos logit contributions
igs+0.110
oppy+0.099
 solar+0.095
ot+0.094
 gut+0.091
Neg logit contributions
 written-0.077
 such-0.067
 come-0.067
 meant-0.064
 coming-0.060
Token couldn
Token ID3521
Feature activation-1.44e-02

Pos logit contributions
igs+0.049
ot+0.040
 gut+0.040
 goals+0.040
ug+0.040
Neg logit contributions
 written-0.036
Gu-0.033
 comm-0.033
 appeared-0.032
 happened-0.031
Token't
Token ID470
Feature activation-3.80e-02

Pos logit contributions
igs+0.031
 often+0.030
 solar+0.029
ot+0.028
 gut+0.028
Neg logit contributions
 written-0.011
Gu-0.010
 aback-0.010
 comm-0.009
hered-0.009
Token help
Token ID1037
Feature activation-3.81e-03

Pos logit contributions
igs+0.075
 solar+0.065
 gut+0.064
 goals+0.063
 often+0.062
Neg logit contributions
 written-0.041
 come-0.040
hered-0.036
 source-0.036
 calling-0.035
Token anyone
Token ID2687
Feature activation+1.34e-02

Pos logit contributions
igs+0.005
 often+0.004
 solar+0.004
ot+0.004
 lived+0.004
Neg logit contributions
 written-0.007
 aback-0.006
 comm-0.006
Gu-0.006
 source-0.006
Token her
Token ID607
Feature activation-2.02e-02

Pos logit contributions
igs+0.079
 solar+0.073
 often+0.071
ug+0.069
 lived+0.069
Neg logit contributions
 written-0.073
 aback-0.073
 source-0.068
 saying-0.067
 such-0.065
Token eyes
Token ID2951
Feature activation-1.34e-02

Pos logit contributions
igs+0.031
 often+0.030
ug+0.027
ot+0.027
 lived+0.026
Neg logit contributions
 written-0.028
 comm-0.027
 source-0.027
 aback-0.026
 bisc-0.025
Token.
Token ID13
Feature activation-2.05e-02

Pos logit contributions
igs+0.018
 often+0.016
ot+0.014
 solar+0.014
 gut+0.014
Neg logit contributions
 written-0.023
 aback-0.022
 comm-0.021
 source-0.021
Gu-0.020
Token She
Token ID1375
Feature activation-1.89e-02

Pos logit contributions
igs+0.036
 often+0.032
ot+0.030
 gut+0.030
 solar+0.029
Neg logit contributions
 written-0.026
Gu-0.025
 aback-0.025
 comm-0.023
 bisc-0.022
Token balanced
Token ID12974
Feature activation-7.03e-04

Pos logit contributions
igs+0.033
 often+0.030
ot+0.029
ug+0.028
 solar+0.028
Neg logit contributions
 written-0.024
 comm-0.022
 aback-0.021
Gu-0.021
 announced-0.020
Token very
Token ID845
Feature activation-8.19e-02

Pos logit contributions
igs+0.001
 often+0.001
ot+0.001
 solar+0.001
 lived+0.001
Neg logit contributions
 written-0.001
 aback-0.001
 comm-0.001
Gu-0.001
 source-0.001
Token carefully
Token ID7773
Feature activation-3.31e-03

Pos logit contributions
 solar+0.110
 ships+0.108
 ig+0.108
igs+0.106
 lived+0.102
Neg logit contributions
 written-0.113
 right-0.102
 sure-0.099
 aback-0.096
 once-0.095
Token and
Token ID290
Feature activation+4.33e-03

Pos logit contributions
igs+0.005
 often+0.005
ot+0.005
 solar+0.004
 lived+0.004
Neg logit contributions
 written-0.005
 aback-0.004
 comm-0.004
Gu-0.004
 source-0.004
Token took
Token ID1718
Feature activation+2.21e-02

Pos logit contributions
 written+0.006
 aback+0.006
Gu+0.006
 comm+0.006
 source+0.005
Neg logit contributions
igs-0.007
 often-0.006
ot-0.006
 solar-0.006
 lived-0.006
Token a
Token ID257
Feature activation+1.83e-02

Pos logit contributions
Gu+0.032
 written+0.032
 comm+0.032
 aback+0.030
 source+0.030
Neg logit contributions
igs-0.032
 often-0.032
ot-0.030
 solar-0.027
apy-0.027
Token deep
Token ID2769
Feature activation-1.72e-03

Pos logit contributions
 written+0.032
Gu+0.031
 aback+0.031
 comm+0.031
 Birthday+0.029
Neg logit contributions
igs-0.021
 often-0.020
ot-0.019
 solar-0.017
apy-0.017
Token girl
Token ID2576
Feature activation-5.08e-02

Pos logit contributions
igs+0.085
 often+0.084
 gut+0.077
belt+0.077
 goals+0.076
Neg logit contributions
 bisc-0.022
 comm-0.021
Gu-0.020
 written-0.020
hered-0.018
Token.
Token ID13
Feature activation-1.21e-02

Pos logit contributions
igs+0.091
 goals+0.085
 gut+0.078
belt+0.075
apy+0.075
Neg logit contributions
 calling-0.065
 written-0.063
 saying-0.058
 coming-0.058
 telling-0.058
Token She
Token ID1375
Feature activation-4.61e-02

Pos logit contributions
igs+0.021
 often+0.018
ug+0.017
 gut+0.017
ind+0.017
Neg logit contributions
 written-0.017
 aback-0.016
Gu-0.016
 source-0.015
 comm-0.015
Token was
Token ID373
Feature activation+1.30e-02

Pos logit contributions
igs+0.084
 gut+0.075
aro+0.070
ug+0.070
 instincts+0.069
Neg logit contributions
 happened-0.065
 written-0.063
 baked-0.059
 appeared-0.057
Gu-0.056
Token three
Token ID1115
Feature activation-5.39e-02

Pos logit contributions
 written+0.023
 aback+0.022
 comm+0.022
Gu+0.022
hered+0.021
Neg logit contributions
 often-0.015
igs-0.015
ot-0.013
 lived-0.013
 solar-0.013
Token years
Token ID812
Feature activation-7.13e-02

Pos logit contributions
igs+0.078
 solar+0.075
belt+0.073
apy+0.069
 quartz+0.068
Neg logit contributions
 comm-0.075
 written-0.068
 ingredients-0.068
m-0.067
 Birthday-0.067
Token old
Token ID1468
Feature activation-9.77e-03

Pos logit contributions
 lived+0.128
igs+0.123
apy+0.121
 often+0.119
charged+0.118
Neg logit contributions
 news-0.067
cake-0.063
 written-0.061
 comm-0.060
 ingredients-0.060
Token and
Token ID290
Feature activation-5.62e-02

Pos logit contributions
igs+0.017
 often+0.016
ot+0.015
 solar+0.015
ug+0.014
Neg logit contributions
 written-0.012
 comm-0.012
 aback-0.011
Gu-0.011
 source-0.011
Token very
Token ID845
Feature activation-7.65e-02

Pos logit contributions
igs+0.107
ug+0.090
ind+0.084
alk+0.084
 goals+0.083
Neg logit contributions
 written-0.080
 meant-0.072
 come-0.070
 baked-0.069
 happened-0.068
Token curious
Token ID11040
Feature activation-3.05e-02

Pos logit contributions
igs+0.159
ug+0.138
lights+0.137
alk+0.136
 prope+0.135
Neg logit contributions
 smell-0.074
 sure-0.065
Gu-0.060
 come-0.059
 baked-0.058
Token.
Token ID13
Feature activation-3.46e-05

Pos logit contributions
igs+0.051
 often+0.040
aro+0.040
ug+0.040
ot+0.040
Neg logit contributions
 written-0.047
 comm-0.042
 news-0.040
 Birthday-0.040
 aback-0.040
Token two
Token ID734
Feature activation-2.55e-03

Pos logit contributions
igs+0.002
 often+0.002
ot+0.002
 solar+0.002
ug+0.002
Neg logit contributions
 written-0.003
 comm-0.003
Gu-0.003
 aback-0.003
 source-0.003
Token kids
Token ID3988
Feature activation-3.17e-02

Pos logit contributions
igs+0.004
 often+0.004
 solar+0.004
ot+0.004
 gut+0.003
Neg logit contributions
 written-0.003
 aback-0.003
 comm-0.003
Gu-0.003
hered-0.003
Token played
Token ID2826
Feature activation+1.81e-02

Pos logit contributions
igs+0.056
 solar+0.047
 gut+0.046
 often+0.046
ot+0.045
Neg logit contributions
 written-0.041
 aback-0.036
hered-0.036
 announced-0.036
 appeared-0.035
Token happily
Token ID18177
Feature activation-8.60e-03

Pos logit contributions
 written+0.028
Gu+0.027
 aback+0.027
 comm+0.027
 source+0.025
Neg logit contributions
igs-0.026
 often-0.026
ot-0.023
 lived-0.022
 solar-0.022
Token in
Token ID287
Feature activation-2.17e-02

Pos logit contributions
igs+0.015
 often+0.014
ot+0.013
 solar+0.013
 gut+0.013
Neg logit contributions
 written-0.011
 aback-0.010
 comm-0.009
hered-0.009
 source-0.009
Token their
Token ID511
Feature activation-7.04e-02

Pos logit contributions
igs+0.035
 often+0.030
ug+0.029
ind+0.029
belt+0.029
Neg logit contributions
 written-0.029
 aback-0.029
 calling-0.028
 source-0.028
 comm-0.027
Token new
Token ID649
Feature activation-4.41e-02

Pos logit contributions
ug+0.122
igs+0.111
aro+0.103
apy+0.100
 gut+0.100
Neg logit contributions
 surprise-0.083
 treats-0.081
 ingredients-0.078
 come-0.077
 bisc-0.075
Token sandbox
Token ID35204
Feature activation-2.75e-02

Pos logit contributions
igs+0.064
ug+0.061
apy+0.059
 gut+0.058
 often+0.056
Neg logit contributions
 ingredients-0.054
 come-0.054
 bisc-0.053
 treats-0.052
 surprise-0.052
Token for
Token ID329
Feature activation-8.32e-03

Pos logit contributions
igs+0.039
 often+0.034
 solar+0.034
ot+0.033
 gut+0.033
Neg logit contributions
 written-0.040
 aback-0.038
 comm-0.037
 source-0.036
 Birthday-0.035
Token the
Token ID262
Feature activation+1.29e-02

Pos logit contributions
igs+0.011
 often+0.010
ot+0.010
 gut+0.009
 solar+0.009
Neg logit contributions
 written-0.013
 comm-0.013
 aback-0.013
Gu-0.012
 source-0.012
Token rest
Token ID1334
Feature activation-2.32e-02

Pos logit contributions
 written+0.020
 aback+0.020
Gu+0.020
 comm+0.019
hered+0.018
Neg logit contributions
 often-0.017
igs-0.017
ot-0.015
 solar-0.015
 lived-0.014