TOP ACTIVATIONS
MAX = 0.089

. I will trade nap kins with you ."
start to paint the pump kins . They paint
, round , orange pump kins . They wanted to have
. They grew many pump kins on their farm . Tim
They picked up their nap kins and wiped their mouths and
. They make the pump kins look very colorful and bright
Hello all igator ! â .
, like paper , nap kins , and shirts .
you can 't . Nap kins are not for playing .
decided to make more pump kins like Jack . They took
you want to trade nap kins with me ?" she offered
to make more pic n ics and looked in the fridge
They swapped their nap kins and finished their lunch .
you can fold these nap kins and put them on the
to go on pic n ics with her family . One
Don 't hurt the pump kins ." Tim
We can share the cher ries . There are many cher
They quickly put their nap kins over their salad . Then
. He 'd fold nap kins and make them into hats
. They love their pump kins . They love to paint

MIN ACTIVATIONS
MIN = -0.121

lately . With work and err ands , they didn 't
Emma who loved learning about outer space . Everyday , she
of himself for using his talent to scare away the noise
her mom were out running err ands . Mary was three
by helping her mum with err ands and doing odd jobs
at his salon and running err ands . But every day
sle igh with eight re ind eer . The re ind
what the rocket discovered in outer space . It was a
looked like it came from outer space . L
. He said they had talent and potential .
. Lily had a magn ifying glass that she found in
it for what felt like hours . After a while ,
needed to go do some err ands . Mom
't been used for many years . One day , we
and asked if Bun - B un could teach it how
some carrots for his re ind eer ," Anna says .
been there for hundreds of years ! P olly
grandma had kept for many years . They only knew that
girl who was about three years old . She asked him
, it kept on flying err atically with inexplicable speed .

INTERVAL 0.036 - 0.089
CONTAINS 0.367%

Tony and Tommy were the best of friends ! <|endoftext|> Once
He suddenly got a brilliant idea ; he used the soap
Mom my . Let 's build a grocery store together !"
day , Lily got an idea . She decided to find
to share his sa us ages with his brother . He

INTERVAL -0.016 - 0.036
CONTAINS 88.648%

really wanted to play in the mud , so she decided
to look out of the window ." Jimmy smiled
gone . Lily cries . She is alone . She is
dog lots of c udd les and took her for walks
big flower with lots of honey in it . Tweet wanted

INTERVAL -0.068 - -0.016
CONTAINS 10.984%

The girl never saw her home again . <|endoftext|> Once upon
a sense of accomplishment and pride . She learned that had
he saw one . One day , his mom took him
and saw a big , black cow . Farmer David was
looks around the room and tries to find Mia . He

INTERVAL -0.121 - -0.068
CONTAINS 0.001%

at his salon and running err ands . But every day
needed to go do some err ands . Mom
girl who was about three years old . She asked him
doctor , which was an err and he didn 't understand
girl who was turning three years old . She was so
Token.
Token ID13
Feature activation+1.05e-02

Pos logit contributions
iring+0.018
 God+0.016
 please+0.016
ying+0.015
 who+0.015
Neg logit contributions
seek-0.041
 habitat-0.038
powers-0.037
growth-0.036
icol-0.036
Token I
Token ID314
Feature activation+9.17e-04

Pos logit contributions
iring+0.009
 please+0.008
 God+0.008
 who+0.007
ying+0.007
Neg logit contributions
seek-0.026
 habitat-0.024
growth-0.024
powers-0.023
icol-0.023
Token will
Token ID481
Feature activation+7.41e-03

Pos logit contributions
 do+0.001
 please+0.001
?"+0.001
 also+0.001
 are+0.001
Neg logit contributions
growth-0.002
seek-0.002
 habitat-0.002
icol-0.002
 arrest-0.002
Token trade
Token ID3292
Feature activation+1.84e-02

Pos logit contributions
 please+0.006
?"+0.006
 do+0.006
 who+0.005
 God+0.005
Neg logit contributions
seek-0.019
growth-0.018
 habitat-0.018
powers-0.018
icol-0.018
Token nap
Token ID25422
Feature activation-1.55e-03

Pos logit contributions
 please+0.012
iring+0.012
 God+0.012
 who+0.012
?"+0.011
Neg logit contributions
seek-0.049
 habitat-0.046
icol-0.045
growth-0.045
powers-0.044
Tokenkins
Token ID5331
Feature activation+8.89e-02

Pos logit contributions
seek+0.004
 habitat+0.004
powers+0.004
growth+0.003
icol+0.003
Neg logit contributions
iring-0.001
 God-0.001
 please-0.001
ying-0.001
 who-0.001
Token with
Token ID351
Feature activation-3.67e-03

Pos logit contributions
iring+0.071
 please+0.067
 God+0.065
 who+0.064
?"+0.062
Neg logit contributions
seek-0.223
 habitat-0.211
growth-0.205
icol-0.203
powers-0.202
Token you
Token ID345
Feature activation+2.97e-03

Pos logit contributions
seek+0.011
 habitat+0.010
icol+0.010
growth+0.010
powers+0.010
Neg logit contributions
 please-0.001
 God-0.001
 who-0.001
?"-0.001
iring-0.001
Token."
Token ID526
Feature activation-4.43e-03

Pos logit contributions
 please+0.002
 who+0.002
iring+0.002
?"+0.002
 God+0.002
Neg logit contributions
seek-0.008
 habitat-0.007
icol-0.007
growth-0.007
powers-0.007
Token
Token ID198
Feature activation+3.79e-03

Pos logit contributions
seek+0.011
 habitat+0.011
growth+0.010
icol+0.010
powers+0.010
Neg logit contributions
iring-0.004
 please-0.003
 God-0.003
 who-0.003
?"-0.003
Token
Token ID198
Feature activation+3.00e-03

Pos logit contributions
iring+0.005
nam+0.005
 please+0.004
ying+0.004
 God+0.004
Neg logit contributions
seek-0.008
 habitat-0.007
powers-0.007
growth-0.007
icol-0.007
Token start
Token ID923
Feature activation+3.14e-03

Pos logit contributions
seek+0.016
 habitat+0.015
growth+0.014
powers+0.014
icol+0.014
Neg logit contributions
iring-0.007
 God-0.006
 please-0.006
 who-0.006
ying-0.006
Token to
Token ID284
Feature activation-9.37e-03

Pos logit contributions
 who+0.004
 God+0.004
 by+0.004
iring+0.004
 please+0.004
Neg logit contributions
seek-0.006
 habitat-0.006
icol-0.006
growth-0.006
powers-0.006
Token paint
Token ID7521
Feature activation+1.33e-02

Pos logit contributions
seek+0.019
 habitat+0.018
growth+0.017
powers+0.017
icol+0.017
Neg logit contributions
iring-0.012
 please-0.011
 God-0.011
 who-0.011
ying-0.011
Token the
Token ID262
Feature activation+1.12e-04

Pos logit contributions
iring+0.012
 God+0.011
 please+0.011
 who+0.010
?"+0.010
Neg logit contributions
seek-0.033
 habitat-0.031
powers-0.030
growth-0.029
icol-0.029
Token pump
Token ID8901
Feature activation+2.30e-02

Pos logit contributions
iring+0.000
 please+0.000
 God+0.000
 who+0.000
ying+0.000
Neg logit contributions
seek-0.000
 habitat-0.000
growth-0.000
powers-0.000
icol-0.000
Tokenkins
Token ID5331
Feature activation+8.17e-02

Pos logit contributions
 please+0.021
 Amen+0.021
 God+0.021
iring+0.018
 okay+0.018
Neg logit contributions
seek-0.050
 habitat-0.046
growth-0.046
 bumps-0.046
ro-0.045
Token.
Token ID13
Feature activation+8.00e-03

Pos logit contributions
iring+0.078
 God+0.076
 please+0.070
mm+0.067
?"+0.065
Neg logit contributions
seek-0.187
 habitat-0.172
powers-0.169
growth-0.167
icol-0.161
Token
Token ID198
Feature activation+2.69e-03

Pos logit contributions
 please+0.006
iring+0.006
 God+0.006
 who+0.006
?"+0.005
Neg logit contributions
seek-0.021
 habitat-0.019
 arrest-0.019
powers-0.019
growth-0.018
Token
Token ID198
Feature activation+1.45e-03

Pos logit contributions
iring+0.003
 please+0.003
 God+0.003
 who+0.002
ying+0.002
Neg logit contributions
seek-0.006
 habitat-0.006
growth-0.005
powers-0.005
icol-0.005
TokenThey
Token ID2990
Feature activation-7.32e-03

Pos logit contributions
iring+0.001
 God+0.001
 please+0.001
ying+0.001
 who+0.001
Neg logit contributions
seek-0.004
 habitat-0.004
powers-0.004
growth-0.004
icol-0.004
Token paint
Token ID7521
Feature activation+3.27e-04

Pos logit contributions
seek+0.016
 habitat+0.015
growth+0.014
powers+0.014
icol+0.014
Neg logit contributions
iring-0.008
 please-0.008
 God-0.008
 who-0.007
?"-0.007
Token,
Token ID11
Feature activation+1.43e-02

Pos logit contributions
iring+0.001
 please+0.001
 God+0.001
 who+0.001
?"+0.001
Neg logit contributions
seek-0.003
 habitat-0.003
growth-0.002
powers-0.002
icol-0.002
Token round
Token ID2835
Feature activation+6.63e-03

Pos logit contributions
iring+0.017
ying+0.016
 please+0.015
 God+0.015
mm+0.015
Neg logit contributions
seek-0.028
 habitat-0.028
icol-0.027
growth-0.027
powers-0.027
Token,
Token ID11
Feature activation+7.55e-03

Pos logit contributions
iring+0.006
 who+0.006
 God+0.006
 please+0.006
?"+0.006
Neg logit contributions
seek-0.016
 habitat-0.015
icol-0.014
powers-0.014
growth-0.014
Token orange
Token ID10912
Feature activation+9.88e-03

Pos logit contributions
iring+0.009
ying+0.008
 please+0.008
 God+0.008
?"+0.008
Neg logit contributions
seek-0.015
 habitat-0.015
icol-0.014
growth-0.014
powers-0.014
Token pump
Token ID8901
Feature activation+1.80e-02

Pos logit contributions
iring+0.007
 who+0.007
 please+0.007
 God+0.007
?"+0.006
Neg logit contributions
seek-0.026
 habitat-0.024
icol-0.023
powers-0.023
growth-0.023
Tokenkins
Token ID5331
Feature activation+8.10e-02

Pos logit contributions
 please+0.011
iring+0.011
 God+0.010
nam+0.008
mm+0.008
Neg logit contributions
seek-0.046
 habitat-0.044
growth-0.043
powers-0.042
 arrest-0.041
Token.
Token ID13
Feature activation+5.00e-03

Pos logit contributions
iring+0.070
 please+0.065
 God+0.064
 who+0.062
?"+0.059
Neg logit contributions
seek-0.198
 habitat-0.186
growth-0.180
powers-0.179
icol-0.178
Token They
Token ID1119
Feature activation-5.63e-03

Pos logit contributions
 God+0.004
 please+0.004
 who+0.004
iring+0.004
?"+0.004
Neg logit contributions
seek-0.013
 habitat-0.012
powers-0.012
 arrest-0.011
growth-0.011
Token wanted
Token ID2227
Feature activation-1.92e-02

Pos logit contributions
seek+0.013
 habitat+0.012
icol+0.012
powers+0.012
growth+0.012
Neg logit contributions
 who-0.006
 please-0.006
 God-0.005
 also-0.005
?"-0.005
Token to
Token ID284
Feature activation-1.53e-02

Pos logit contributions
seek+0.046
icol+0.043
 habitat+0.041
powers+0.041
 arrest+0.039
Neg logit contributions
 who-0.023
 God-0.021
 please-0.021
 by-0.020
?"-0.020
Token have
Token ID423
Feature activation-1.04e-02

Pos logit contributions
seek+0.035
 habitat+0.032
powers+0.031
icol+0.031
growth+0.031
Neg logit contributions
iring-0.016
 please-0.016
 God-0.015
 who-0.015
?"-0.015
Token.
Token ID13
Feature activation-2.71e-03

Pos logit contributions
seek+0.038
 habitat+0.034
powers+0.033
growth+0.032
icol+0.031
Neg logit contributions
iring-0.021
 please-0.019
 God-0.019
mm-0.018
ying-0.018
Token They
Token ID1119
Feature activation-7.56e-03

Pos logit contributions
seek+0.005
 habitat+0.005
growth+0.004
powers+0.004
icol+0.004
Neg logit contributions
iring-0.004
ying-0.004
 God-0.004
bling-0.004
nam-0.004
Token grew
Token ID6348
Feature activation-1.45e-04

Pos logit contributions
seek+0.015
 habitat+0.014
growth+0.013
powers+0.013
icol+0.013
Neg logit contributions
iring-0.010
ying-0.009
 please-0.009
 God-0.009
mm-0.008
Token many
Token ID867
Feature activation-1.31e-02

Pos logit contributions
 habitat+0.000
seek+0.000
powers+0.000
ashed+0.000
growth+0.000
Neg logit contributions
mm-0.000
 please-0.000
heart-0.000
ma-0.000
 God-0.000
Token pump
Token ID8901
Feature activation+1.61e-02

Pos logit contributions
 habitat+0.018
powers+0.018
 bumps+0.018
 bites+0.017
growth+0.016
Neg logit contributions
 please-0.021
 Amen-0.020
mallow-0.019
apping-0.019
bling-0.019
Tokenkins
Token ID5331
Feature activation+8.04e-02

Pos logit contributions
 please+0.021
 ma+0.019
 Amen+0.019
 God+0.018
 okay+0.017
Neg logit contributions
seek-0.029
keeping-0.028
 habitat-0.027
powers-0.026
water-0.025
Token on
Token ID319
Feature activation+4.93e-03

Pos logit contributions
iring+0.094
 God+0.094
heart+0.092
mm+0.090
 please+0.089
Neg logit contributions
seek-0.159
 habitat-0.144
powers-0.142
growth-0.129
water-0.125
Token their
Token ID511
Feature activation-1.06e-02

Pos logit contributions
iring+0.005
 please+0.004
ying+0.004
 God+0.004
 who+0.004
Neg logit contributions
seek-0.012
 habitat-0.011
growth-0.011
powers-0.010
 arrest-0.010
Token farm
Token ID5318
Feature activation+1.84e-02

Pos logit contributions
seek+0.021
 habitat+0.020
powers+0.019
growth+0.019
icol+0.019
Neg logit contributions
iring-0.014
 please-0.013
 God-0.013
ying-0.013
 who-0.013
Token.
Token ID13
Feature activation+1.76e-03

Pos logit contributions
iring+0.018
 please+0.017
 God+0.017
 who+0.016
ying+0.016
Neg logit contributions
seek-0.043
 habitat-0.040
powers-0.038
growth-0.038
icol-0.038
Token Tim
Token ID5045
Feature activation+1.57e-02

Pos logit contributions
iring+0.003
ying+0.002
bling+0.002
itive+0.002
nam+0.002
Neg logit contributions
seek-0.003
 habitat-0.003
growth-0.003
powers-0.003
icol-0.003
Token They
Token ID1119
Feature activation-4.47e-03

Pos logit contributions
iring+0.006
 God+0.006
 who+0.006
 please+0.006
?"+0.005
Neg logit contributions
seek-0.018
 habitat-0.016
growth-0.016
powers-0.016
icol-0.016
Token picked
Token ID6497
Feature activation+1.75e-02

Pos logit contributions
seek+0.010
 habitat+0.009
powers+0.009
growth+0.009
icol+0.009
Neg logit contributions
 who-0.005
iring-0.005
 please-0.005
 God-0.005
?"-0.005
Token up
Token ID510
Feature activation+2.51e-02

Pos logit contributions
iring+0.011
 who+0.011
 God+0.010
ying+0.010
 by+0.009
Neg logit contributions
seek-0.049
 habitat-0.046
powers-0.044
growth-0.044
icol-0.043
Token their
Token ID511
Feature activation-1.04e-02

Pos logit contributions
iring+0.019
 God+0.018
 who+0.017
 please+0.017
?"+0.016
Neg logit contributions
seek-0.065
 habitat-0.061
powers-0.059
growth-0.059
icol-0.059
Token nap
Token ID25422
Feature activation-1.46e-02

Pos logit contributions
seek+0.024
icol+0.022
growth+0.021
 habitat+0.021
 arrest+0.019
Neg logit contributions
?"-0.012
 please-0.012
iring-0.011
 who-0.011
 do-0.011
Tokenkins
Token ID5331
Feature activation+8.03e-02

Pos logit contributions
seek+0.041
 habitat+0.039
growth+0.038
powers+0.038
icol+0.037
Neg logit contributions
iring-0.008
 please-0.007
 God-0.006
 who-0.006
?"-0.006
Token and
Token ID290
Feature activation+1.35e-03

Pos logit contributions
iring+0.067
 God+0.060
 please+0.059
ying+0.053
?"+0.053
Neg logit contributions
seek-0.198
 habitat-0.186
powers-0.179
growth-0.178
icol-0.175
Token wiped
Token ID21122
Feature activation+7.03e-04

Pos logit contributions
 who+0.002
nam+0.001
 God+0.001
 please+0.001
?"+0.001
Neg logit contributions
seek-0.003
 habitat-0.003
powers-0.003
growth-0.003
icol-0.003
Token their
Token ID511
Feature activation-1.94e-03

Pos logit contributions
iring+0.001
 please+0.001
 God+0.001
 who+0.001
?"+0.001
Neg logit contributions
seek-0.002
 habitat-0.002
powers-0.001
growth-0.001
icol-0.001
Token mouths
Token ID28552
Feature activation+2.32e-02

Pos logit contributions
seek+0.004
icol+0.004
 habitat+0.003
growth+0.003
powers+0.003
Neg logit contributions
kins-0.002
 God-0.002
iring-0.002
nam-0.002
 who-0.002
Token and
Token ID290
Feature activation-1.43e-04

Pos logit contributions
iring+0.019
 please+0.018
 God+0.017
 who+0.017
?"+0.016
Neg logit contributions
seek-0.058
 habitat-0.055
growth-0.053
powers-0.053
icol-0.053
Token.
Token ID13
Feature activation+5.92e-03

Pos logit contributions
seek+0.021
 habitat+0.020
growth+0.020
icol+0.020
powers+0.019
Neg logit contributions
iring-0.005
 please-0.005
 who-0.004
 God-0.004
?"-0.004
Token They
Token ID1119
Feature activation-5.79e-03

Pos logit contributions
iring+0.004
 please+0.004
 God+0.004
 who+0.004
She+0.003
Neg logit contributions
seek-0.016
 habitat-0.015
 arrest-0.014
icol-0.014
powers-0.014
Token make
Token ID787
Feature activation+2.33e-02

Pos logit contributions
seek+0.014
 habitat+0.013
growth+0.013
icol+0.013
powers+0.013
Neg logit contributions
iring-0.005
 please-0.005
 God-0.005
 who-0.005
?"-0.004
Token the
Token ID262
Feature activation-1.64e-03

Pos logit contributions
 who+0.022
iring+0.022
 God+0.022
 please+0.020
 happy+0.020
Neg logit contributions
seek-0.058
 habitat-0.053
growth-0.051
icol-0.051
powers-0.051
Token pump
Token ID8901
Feature activation+1.59e-02

Pos logit contributions
seek+0.003
 habitat+0.003
powers+0.003
growth+0.003
icol+0.003
Neg logit contributions
iring-0.002
 please-0.002
 God-0.002
 who-0.002
ying-0.002
Tokenkins
Token ID5331
Feature activation+7.98e-02

Pos logit contributions
 please+0.016
 God+0.015
 Amen+0.015
iring+0.015
nam+0.015
Neg logit contributions
seek-0.031
 bumps-0.030
 habitat-0.028
powers-0.028
water-0.028
Token look
Token ID804
Feature activation-4.52e-03

Pos logit contributions
iring+0.107
 God+0.100
mm+0.096
ying+0.094
heart+0.093
Neg logit contributions
seek-0.144
powers-0.135
 habitat-0.131
 bumps-0.128
growth-0.126
Token very
Token ID845
Feature activation+5.77e-04

Pos logit contributions
seek+0.010
 habitat+0.009
powers+0.009
icol+0.009
growth+0.009
Neg logit contributions
iring-0.005
 God-0.005
 please-0.004
ying-0.004
 who-0.004
Token colorful
Token ID20239
Feature activation+8.08e-03

Pos logit contributions
 God+0.001
iring+0.001
 who+0.001
 please+0.001
mm+0.001
Neg logit contributions
powers-0.001
seek-0.001
 habitat-0.001
 bumps-0.001
icol-0.001
Token and
Token ID290
Feature activation+1.70e-03

Pos logit contributions
iring+0.007
 please+0.006
 God+0.006
 who+0.006
?"+0.006
Neg logit contributions
seek-0.020
 habitat-0.019
growth-0.019
powers-0.018
icol-0.018
Token bright
Token ID6016
Feature activation-1.20e-02

Pos logit contributions
iring+0.002
ying+0.002
 please+0.002
 God+0.002
 who+0.002
Neg logit contributions
seek-0.004
 habitat-0.003
powers-0.003
icol-0.003
growth-0.003
Token
Token ID26391
Feature activation-1.09e-02

Pos logit contributions
iring+0.009
 please+0.009
 God+0.009
 who+0.009
?"+0.009
Neg logit contributions
seek-0.011
 habitat-0.011
powers-0.010
growth-0.010
icol-0.010
Token
Token ID129
Feature activation+4.58e-03

Pos logit contributions
seek+0.025
 habitat+0.023
powers+0.023
growth+0.022
icol+0.022
Neg logit contributions
iring-0.011
 God-0.010
 please-0.010
 who-0.010
ying-0.010
Token
Token ID241
Feature activation-6.88e-03

Pos logit contributions
iring+0.006
 please+0.006
?"+0.006
 who+0.006
 God+0.006
Neg logit contributions
seek-0.009
 habitat-0.008
growth-0.008
powers-0.008
icol-0.008
TokenHello
Token ID15496
Feature activation+5.97e-03

Pos logit contributions
seek+0.016
 habitat+0.015
growth+0.015
icol+0.015
powers+0.015
Neg logit contributions
iring-0.006
 who-0.006
 please-0.006
 God-0.006
She-0.006
Token all
Token ID477
Feature activation+4.47e-02

Pos logit contributions
iring+0.005
nam+0.004
ying+0.004
 God+0.004
She+0.004
Neg logit contributions
seek-0.014
 habitat-0.013
powers-0.013
icol-0.013
growth-0.012
Tokenigator
Token ID23823
Feature activation+7.97e-02

Pos logit contributions
iring+0.062
 please+0.060
mm+0.059
 God+0.059
 okay+0.057
Neg logit contributions
seek-0.078
 habitat-0.070
 arrest-0.069
powers-0.069
growth-0.069
Token!
Token ID0
Feature activation+1.08e-02

Pos logit contributions
iring+0.089
nam+0.080
 God+0.080
ying+0.077
 please+0.075
Neg logit contributions
seek-0.172
 habitat-0.157
powers-0.154
icol-0.151
 arrest-0.149
Tokenâ
Token ID22940
Feature activation+2.57e-02

Pos logit contributions
iring+0.013
ying+0.012
 God+0.011
 please+0.011
ie+0.011
Neg logit contributions
seek-0.023
 habitat-0.021
growth-0.021
powers-0.021
icol-0.020
Token
Token ID26391
Feature activation-1.39e-02

Pos logit contributions
iring+0.038
 please+0.036
 God+0.035
 who+0.034
nam+0.034
Neg logit contributions
seek-0.050
 habitat-0.044
icol-0.043
growth-0.042
powers-0.041
Token.
Token ID13
Feature activation+1.44e-02

Pos logit contributions
seek+0.031
 habitat+0.029
growth+0.028
powers+0.028
icol+0.028
Neg logit contributions
iring-0.014
 please-0.013
 who-0.013
 God-0.013
?"-0.013
Token
Token ID198
Feature activation+9.29e-03

Pos logit contributions
iring+0.018
 God+0.016
 please+0.016
ying+0.016
 who+0.016
Neg logit contributions
seek-0.029
 habitat-0.027
growth-0.027
powers-0.026
icol-0.026
Token,
Token ID11
Feature activation+9.67e-03

Pos logit contributions
iring+0.014
 please+0.013
 God+0.013
 who+0.013
?"+0.012
Neg logit contributions
seek-0.044
 habitat-0.041
growth-0.040
powers-0.040
icol-0.040
Token like
Token ID588
Feature activation+8.28e-03

Pos logit contributions
iring+0.008
 please+0.008
 who+0.008
 God+0.008
?"+0.007
Neg logit contributions
seek-0.024
 habitat-0.022
icol-0.022
growth-0.022
powers-0.022
Token paper
Token ID3348
Feature activation-1.36e-02

Pos logit contributions
 who+0.009
 please+0.009
 God+0.008
 do+0.008
 if+0.008
Neg logit contributions
seek-0.020
icol-0.018
 habitat-0.018
growth-0.018
ashed-0.018
Token,
Token ID11
Feature activation-7.14e-03

Pos logit contributions
seek+0.039
 habitat+0.036
growth+0.036
icol+0.035
powers+0.035
Neg logit contributions
iring-0.007
 who-0.007
 please-0.007
?"-0.006
 God-0.006
Token nap
Token ID25422
Feature activation-7.70e-03

Pos logit contributions
seek+0.015
 habitat+0.014
icol+0.013
powers+0.013
growth+0.013
Neg logit contributions
iring-0.009
 please-0.009
 who-0.008
 God-0.008
?"-0.008
Tokenkins
Token ID5331
Feature activation+7.96e-02

Pos logit contributions
seek+0.022
 habitat+0.021
growth+0.021
powers+0.020
icol+0.020
Neg logit contributions
iring-0.003
 please-0.003
 God-0.003
 who-0.003
?"-0.002
Token,
Token ID11
Feature activation+5.06e-04

Pos logit contributions
iring+0.040
 who+0.039
 please+0.038
?"+0.035
 God+0.033
Neg logit contributions
seek-0.223
 habitat-0.214
growth-0.210
icol-0.209
powers-0.205
Token and
Token ID290
Feature activation+7.92e-03

Pos logit contributions
 who+0.000
 please+0.000
 God+0.000
iring+0.000
?"+0.000
Neg logit contributions
seek-0.001
 habitat-0.001
icol-0.001
powers-0.001
growth-0.001
Token shirts
Token ID20858
Feature activation+8.16e-03

Pos logit contributions
 who+0.009
 God+0.008
 please+0.008
 do+0.008
nam+0.007
Neg logit contributions
seek-0.019
 habitat-0.017
icol-0.017
growth-0.016
powers-0.016
Token.
Token ID13
Feature activation+4.01e-03

Pos logit contributions
 who+0.004
 please+0.004
?"+0.004
iring+0.004
 in+0.004
Neg logit contributions
seek-0.023
growth-0.022
 habitat-0.022
icol-0.022
 arrest-0.021
Token
Token ID198
Feature activation-1.21e-03

Pos logit contributions
iring+0.005
 please+0.005
ying+0.005
nam+0.005
 God+0.005
Neg logit contributions
seek-0.008
 habitat-0.007
growth-0.007
powers-0.007
icol-0.007
Token you
Token ID345
Feature activation+1.91e-02

Pos logit contributions
 arrest+0.016
seek+0.016
 habitat+0.015
 bumps+0.015
growth+0.015
Neg logit contributions
iring-0.012
ying-0.011
ble-0.010
 God-0.010
nam-0.010
Token can
Token ID460
Feature activation-1.01e-02

Pos logit contributions
iring+0.023
 God+0.022
ying+0.021
 who+0.020
 please+0.019
Neg logit contributions
seek-0.039
 habitat-0.037
powers-0.037
growth-0.035
 arrest-0.035
Token't
Token ID470
Feature activation+2.45e-02

Pos logit contributions
seek+0.021
 habitat+0.020
icol+0.019
growth+0.019
powers+0.019
Neg logit contributions
 please-0.013
iring-0.012
?"-0.012
 who-0.012
 God-0.012
Token.
Token ID13
Feature activation+8.04e-03

Pos logit contributions
iring+0.023
 please+0.022
 God+0.021
 who+0.021
?"+0.020
Neg logit contributions
seek-0.059
 habitat-0.056
growth-0.053
icol-0.053
powers-0.053
Token Nap
Token ID14332
Feature activation-8.18e-03

Pos logit contributions
iring+0.007
 God+0.007
 please+0.006
 who+0.006
?"+0.006
Neg logit contributions
seek-0.020
 habitat-0.019
powers-0.018
growth-0.018
icol-0.018
Tokenkins
Token ID5331
Feature activation+7.94e-02

Pos logit contributions
seek+0.021
 habitat+0.020
growth+0.019
powers+0.019
icol+0.019
Neg logit contributions
iring-0.006
 please-0.006
 God-0.006
 who-0.005
?"-0.005
Token are
Token ID389
Feature activation+1.45e-02

Pos logit contributions
iring+0.061
 God+0.056
nam+0.049
heart+0.042
 please+0.042
Neg logit contributions
seek-0.198
 habitat-0.184
powers-0.183
 arrest-0.178
 drowned-0.176
Token not
Token ID407
Feature activation+2.08e-02

Pos logit contributions
 please+0.011
iring+0.011
 who+0.011
?"+0.011
 God+0.011
Neg logit contributions
seek-0.038
 habitat-0.035
growth-0.035
icol-0.035
powers-0.034
Token for
Token ID329
Feature activation+1.62e-02

Pos logit contributions
iring+0.019
 God+0.017
 please+0.017
 who+0.016
ying+0.016
Neg logit contributions
seek-0.050
 habitat-0.047
powers-0.045
growth-0.045
icol-0.044
Token playing
Token ID2712
Feature activation-1.83e-02

Pos logit contributions
 God+0.014
?"+0.014
 please+0.013
 who+0.013
 you+0.011
Neg logit contributions
seek-0.043
icol-0.040
growth-0.040
 habitat-0.039
powers-0.038
Token.
Token ID13
Feature activation+4.74e-03

Pos logit contributions
seek+0.047
 habitat+0.044
growth+0.043
icol+0.043
powers+0.043
Neg logit contributions
iring-0.014
 please-0.013
 who-0.013
?"-0.013
 God-0.013
Token decided
Token ID3066
Feature activation+4.74e-04

Pos logit contributions
seek+0.009
 habitat+0.009
powers+0.009
growth+0.009
icol+0.009
Neg logit contributions
 who-0.004
 please-0.003
 God-0.003
?"-0.003
iring-0.003
Token to
Token ID284
Feature activation-1.85e-02

Pos logit contributions
 who+0.001
 please+0.001
 God+0.001
?"+0.001
iring+0.001
Neg logit contributions
seek-0.001
growth-0.001
 habitat-0.001
powers-0.001
icol-0.001
Token make
Token ID787
Feature activation+2.77e-02

Pos logit contributions
seek+0.043
powers+0.041
icol+0.040
 habitat+0.040
growth+0.038
Neg logit contributions
 please-0.020
 who-0.018
?"-0.018
 God-0.018
 do-0.018
Token more
Token ID517
Feature activation-1.73e-02

Pos logit contributions
 God+0.027
iring+0.027
 who+0.026
 please+0.025
?"+0.024
Neg logit contributions
seek-0.067
 habitat-0.062
growth-0.059
powers-0.059
icol-0.059
Token pump
Token ID8901
Feature activation+1.25e-02

Pos logit contributions
seek+0.040
 habitat+0.036
growth+0.036
icol+0.035
powers+0.034
Neg logit contributions
iring-0.018
 God-0.018
 who-0.018
 please-0.017
?"-0.017
Tokenkins
Token ID5331
Feature activation+7.93e-02

Pos logit contributions
 please+0.012
iring+0.011
 God+0.010
 ma+0.010
She+0.010
Neg logit contributions
seek-0.027
 habitat-0.025
powers-0.024
keeping-0.024
 arrest-0.023
Token like
Token ID588
Feature activation-4.78e-03

Pos logit contributions
iring+0.083
 God+0.077
 please+0.073
mm+0.070
ying+0.069
Neg logit contributions
seek-0.174
 habitat-0.161
powers-0.159
growth-0.151
icol-0.149
Token Jack
Token ID3619
Feature activation-2.93e-03

Pos logit contributions
seek+0.012
 habitat+0.010
icol+0.010
growth+0.010
powers+0.010
Neg logit contributions
 who-0.005
 God-0.005
 please-0.005
iring-0.005
?"-0.005
Token.
Token ID13
Feature activation+1.46e-03

Pos logit contributions
seek+0.007
 habitat+0.007
growth+0.007
powers+0.007
icol+0.007
Neg logit contributions
iring-0.002
 please-0.002
 God-0.002
 who-0.002
?"-0.002
Token They
Token ID1119
Feature activation-4.22e-03

Pos logit contributions
 God+0.001
 please+0.001
 who+0.001
?"+0.001
iring+0.001
Neg logit contributions
seek-0.004
 habitat-0.003
powers-0.003
growth-0.003
 arrest-0.003
Token took
Token ID1718
Feature activation+1.38e-02

Pos logit contributions
seek+0.010
 habitat+0.009
icol+0.009
powers+0.009
growth+0.009
Neg logit contributions
 who-0.005
 please-0.004
 also-0.004
?"-0.004
 God-0.004
Token you
Token ID345
Feature activation+6.13e-03

Pos logit contributions
iring+0.018
ying+0.015
 God+0.014
heart+0.013
 please+0.013
Neg logit contributions
seek-0.049
powers-0.044
 arrest-0.043
 habitat-0.042
growth-0.041
Token want
Token ID765
Feature activation+3.82e-03

Pos logit contributions
 please+0.005
?"+0.004
 do+0.004
 okay+0.003
 who+0.003
Neg logit contributions
seek-0.017
growth-0.017
 habitat-0.017
icol-0.016
 arrest-0.015
Token to
Token ID284
Feature activation-1.30e-02

Pos logit contributions
?"+0.004
 please+0.004
 your+0.004
 God+0.004
 you+0.004
Neg logit contributions
seek-0.009
icol-0.009
 habitat-0.009
growth-0.009
ashed-0.009
Token trade
Token ID3292
Feature activation+1.11e-02

Pos logit contributions
seek+0.030
 habitat+0.030
icol+0.030
ashed+0.030
powers+0.029
Neg logit contributions
 please-0.014
?"-0.014
 do-0.013
?-0.011
 God-0.011
Token nap
Token ID25422
Feature activation-7.67e-03

Pos logit contributions
iring+0.009
 please+0.009
 God+0.009
 who+0.008
?"+0.008
Neg logit contributions
seek-0.027
 habitat-0.026
growth-0.025
powers-0.025
icol-0.024
Tokenkins
Token ID5331
Feature activation+7.92e-02

Pos logit contributions
seek+0.021
 habitat+0.019
powers+0.019
growth+0.018
icol+0.018
Neg logit contributions
iring-0.005
 God-0.004
 please-0.004
ying-0.003
 who-0.003
Token with
Token ID351
Feature activation-7.88e-03

Pos logit contributions
iring+0.055
 God+0.046
ying+0.042
nam+0.041
mm+0.038
Neg logit contributions
seek-0.205
 habitat-0.189
powers-0.186
icol-0.179
growth-0.176
Token me
Token ID502
Feature activation-1.22e-02

Pos logit contributions
seek+0.021
 habitat+0.020
icol+0.020
powers+0.019
growth+0.019
Neg logit contributions
 please-0.006
?"-0.006
 God-0.006
 who-0.006
iring-0.005
Token?"
Token ID1701
Feature activation-2.74e-03

Pos logit contributions
seek+0.037
 habitat+0.035
powers+0.034
growth+0.033
icol+0.033
Neg logit contributions
iring-0.004
 God-0.003
 please-0.003
ying-0.002
 who-0.002
Token she
Token ID673
Feature activation+1.97e-05

Pos logit contributions
seek+0.006
 habitat+0.005
powers+0.005
growth+0.005
icol+0.005
Neg logit contributions
iring-0.004
ying-0.003
 God-0.003
 please-0.003
bling-0.003
Token offered
Token ID4438
Feature activation+1.25e-02

Pos logit contributions
iring+0.000
 please+0.000
 God+0.000
ying+0.000
 who+0.000
Neg logit contributions
seek-0.000
 habitat-0.000
powers-0.000
growth-0.000
icol-0.000
Token to
Token ID284
Feature activation-1.90e-02

Pos logit contributions
seek+0.001
growth+0.001
 habitat+0.001
powers+0.001
icol+0.001
Neg logit contributions
 who-0.000
 please-0.000
 God-0.000
iring-0.000
 by-0.000
Token make
Token ID787
Feature activation+2.99e-02

Pos logit contributions
seek+0.047
powers+0.043
 habitat+0.042
icol+0.042
growth+0.041
Neg logit contributions
 please-0.018
 God-0.017
 who-0.017
?"-0.016
 do-0.016
Token more
Token ID517
Feature activation-1.54e-02

Pos logit contributions
iring+0.032
 God+0.030
 please+0.030
 who+0.029
?"+0.028
Neg logit contributions
seek-0.067
 habitat-0.062
growth-0.060
powers-0.060
icol-0.059
Token pic
Token ID8301
Feature activation-4.66e-03

Pos logit contributions
seek+0.030
 habitat+0.027
growth+0.026
icol+0.026
powers+0.026
Neg logit contributions
iring-0.022
 God-0.021
 please-0.021
 who-0.020
?"-0.020
Tokenn
Token ID77
Feature activation+1.70e-02

Pos logit contributions
seek+0.007
 habitat+0.007
icol+0.007
powers+0.007
growth+0.007
Neg logit contributions
iring-0.008
 please-0.007
 God-0.007
 who-0.007
ying-0.007
Tokenics
Token ID873
Feature activation+7.90e-02

Pos logit contributions
iring+0.019
 please+0.019
 God+0.019
nam+0.018
 who+0.018
Neg logit contributions
seek-0.037
 habitat-0.033
powers-0.032
 arrest-0.031
growth-0.031
Token and
Token ID290
Feature activation-2.14e-03

Pos logit contributions
iring+0.057
 please+0.052
 God+0.052
 who+0.050
?"+0.047
Neg logit contributions
seek-0.204
 habitat-0.192
growth-0.187
powers-0.186
icol-0.185
Token looked
Token ID3114
Feature activation-1.04e-02

Pos logit contributions
seek+0.005
 habitat+0.004
growth+0.004
icol+0.004
powers+0.004
Neg logit contributions
iring-0.002
 please-0.002
 God-0.002
 who-0.002
?"-0.002
Token in
Token ID287
Feature activation-5.31e-03

Pos logit contributions
seek+0.027
 habitat+0.025
growth+0.024
powers+0.024
icol+0.024
Neg logit contributions
iring-0.008
 please-0.007
 God-0.007
 who-0.007
?"-0.007
Token the
Token ID262
Feature activation+8.90e-03

Pos logit contributions
seek+0.015
 habitat+0.013
powers+0.013
growth+0.013
icol+0.013
Neg logit contributions
 who-0.003
 God-0.003
iring-0.003
 please-0.003
?"-0.003
Token fridge
Token ID25772
Feature activation+1.35e-02

Pos logit contributions
iring+0.011
 God+0.011
 please+0.011
 who+0.011
?"+0.010
Neg logit contributions
seek-0.019
 habitat-0.016
powers-0.016
growth-0.016
icol-0.016
Token\n
Token ID198
Feature activation+3.00e-03

Pos logit contributions
iring+0.005
nam+0.005
 please+0.004
ying+0.004
 God+0.004
Neg logit contributions
seek-0.008
 habitat-0.007
powers-0.007
growth-0.007
icol-0.007
TokenThey
Token ID2990
Feature activation-4.53e-03

Pos logit contributions
iring+0.004
 God+0.004
 please+0.004
ying+0.004
nam+0.004
Neg logit contributions
seek-0.006
powers-0.005
 habitat-0.005
growth-0.005
icol-0.005
Token swapped
Token ID37245
Feature activation+9.36e-03

Pos logit contributions
seek+0.010
 habitat+0.009
powers+0.009
icol+0.009
growth+0.009
Neg logit contributions
 who-0.005
 also-0.005
 please-0.005
nam-0.005
 God-0.005
Token their
Token ID511
Feature activation-6.13e-03

Pos logit contributions
iring+0.009
 God+0.009
 who+0.009
 please+0.008
nam+0.008
Neg logit contributions
seek-0.022
 habitat-0.020
powers-0.020
growth-0.020
icol-0.020
Token nap
Token ID25422
Feature activation-9.94e-03

Pos logit contributions
seek+0.014
icol+0.012
growth+0.012
 habitat+0.012
powers+0.011
Neg logit contributions
iring-0.007
 please-0.007
 who-0.007
 do-0.007
 God-0.007
Tokenkins
Token ID5331
Feature activation+7.81e-02

Pos logit contributions
seek+0.029
 habitat+0.028
growth+0.027
icol+0.027
powers+0.027
Neg logit contributions
iring-0.004
 God-0.003
 please-0.003
 who-0.003
ying-0.003
Token and
Token ID290
Feature activation+7.44e-03

Pos logit contributions
iring+0.051
 please+0.046
 God+0.045
 who+0.043
?"+0.040
Neg logit contributions
seek-0.208
 habitat-0.196
growth-0.190
powers-0.189
icol-0.188
Token finished
Token ID5201
Feature activation-8.34e-03

Pos logit contributions
 God+0.009
 who+0.009
nam+0.009
 forth+0.008
 please+0.008
Neg logit contributions
seek-0.016
 habitat-0.015
powers-0.015
growth-0.014
icol-0.014
Token their
Token ID511
Feature activation-1.29e-02

Pos logit contributions
seek+0.021
 habitat+0.019
growth+0.019
powers+0.019
icol+0.019
Neg logit contributions
iring-0.007
 please-0.006
 God-0.006
 who-0.006
?"-0.006
Token lunch
Token ID9965
Feature activation+1.41e-02

Pos logit contributions
seek+0.023
growth+0.021
 habitat+0.020
icol+0.020
powers+0.019
Neg logit contributions
 God-0.019
 do-0.019
 please-0.019
 happy-0.019
mm-0.019
Token.
Token ID13
Feature activation+7.04e-03

Pos logit contributions
iring+0.009
 who+0.008
 please+0.008
kins+0.008
 God+0.007
Neg logit contributions
seek-0.037
 habitat-0.036
growth-0.036
icol-0.035
powers-0.034
Token you
Token ID345
Feature activation+3.23e-02

Pos logit contributions
 please+0.020
iring+0.020
 God+0.019
 who+0.018
?"+0.018
Neg logit contributions
seek-0.063
 habitat-0.059
growth-0.057
powers-0.057
icol-0.056
Token can
Token ID460
Feature activation-1.29e-03

Pos logit contributions
 please+0.028
?"+0.026
 who+0.024
 God+0.023
iring+0.023
Neg logit contributions
seek-0.085
 habitat-0.080
icol-0.078
growth-0.077
powers-0.073
Token fold
Token ID5591
Feature activation+1.70e-02

Pos logit contributions
seek+0.003
 habitat+0.002
powers+0.002
growth+0.002
 arrest+0.002
Neg logit contributions
iring-0.002
 God-0.001
nam-0.001
ying-0.001
 who-0.001
Token these
Token ID777
Feature activation-6.56e-03

Pos logit contributions
iring+0.016
 please+0.014
 God+0.014
 who+0.013
ying+0.013
Neg logit contributions
seek-0.041
 habitat-0.038
growth-0.037
powers-0.037
icol-0.037
Token nap
Token ID25422
Feature activation-7.64e-03

Pos logit contributions
seek+0.011
 habitat+0.011
powers+0.011
growth+0.010
icol+0.010
Neg logit contributions
iring-0.009
 please-0.009
 God-0.009
 who-0.009
ying-0.008
Tokenkins
Token ID5331
Feature activation+7.79e-02

Pos logit contributions
seek+0.021
 habitat+0.020
powers+0.020
growth+0.019
icol+0.019
Neg logit contributions
iring-0.004
 God-0.003
 please-0.003
 who-0.003
nam-0.003
Token and
Token ID290
Feature activation+6.65e-03

Pos logit contributions
iring+0.079
 God+0.072
 please+0.070
ying+0.065
 who+0.065
Neg logit contributions
seek-0.179
 habitat-0.166
powers-0.161
growth-0.157
icol-0.155
Token put
Token ID1234
Feature activation+9.77e-03

Pos logit contributions
 please+0.007
 God+0.007
 who+0.007
?"+0.007
iring+0.007
Neg logit contributions
seek-0.015
 habitat-0.014
icol-0.014
growth-0.013
powers-0.013
Token them
Token ID606
Feature activation+8.17e-03

Pos logit contributions
 God+0.010
 please+0.010
iring+0.010
 who+0.010
?"+0.010
Neg logit contributions
seek-0.023
 habitat-0.021
powers-0.020
icol-0.020
growth-0.020
Token on
Token ID319
Feature activation-1.21e-02

Pos logit contributions
iring+0.003
 please+0.003
 God+0.003
 who+0.003
?"+0.003
Neg logit contributions
seek-0.024
 habitat-0.023
growth-0.022
icol-0.022
powers-0.022
Token the
Token ID262
Feature activation-1.78e-02

Pos logit contributions
seek+0.035
 habitat+0.032
icol+0.032
powers+0.032
growth+0.031
Neg logit contributions
?"-0.006
 God-0.006
 who-0.006
 please-0.006
iring-0.005
Token to
Token ID284
Feature activation+1.01e-02

Pos logit contributions
seek+0.025
 habitat+0.023
growth+0.022
icol+0.022
powers+0.022
Neg logit contributions
 God-0.019
iring-0.019
 please-0.018
 who-0.018
?"-0.018
Token go
Token ID467
Feature activation+2.94e-02

Pos logit contributions
iring+0.015
 please+0.015
 God+0.014
 who+0.014
?"+0.014
Neg logit contributions
seek-0.019
 habitat-0.017
growth-0.016
powers-0.016
icol-0.016
Token on
Token ID319
Feature activation+1.06e-02

Pos logit contributions
iring+0.041
keys+0.039
nam+0.039
mm+0.038
heart+0.038
Neg logit contributions
seek-0.057
powers-0.048
 habitat-0.048
icol-0.047
 arrest-0.046
Token pic
Token ID8301
Feature activation+5.91e-05

Pos logit contributions
iring+0.018
 God+0.018
?"+0.017
 please+0.017
 who+0.017
Neg logit contributions
seek-0.017
 habitat-0.015
powers-0.014
growth-0.014
icol-0.014
Tokenn
Token ID77
Feature activation+2.06e-02

Pos logit contributions
iring+0.000
 please+0.000
 God+0.000
?"+0.000
 who+0.000
Neg logit contributions
seek-0.000
 habitat-0.000
powers-0.000
growth-0.000
icol-0.000
Tokenics
Token ID873
Feature activation+7.78e-02

Pos logit contributions
 please+0.025
iring+0.024
 God+0.024
nam+0.023
 who+0.022
Neg logit contributions
seek-0.045
 habitat-0.039
powers-0.037
growth-0.037
 arrest-0.037
Token with
Token ID351
Feature activation-3.34e-03

Pos logit contributions
iring+0.098
ying+0.090
 God+0.086
 please+0.086
heart+0.085
Neg logit contributions
seek-0.167
 habitat-0.145
powers-0.135
 arrest-0.133
growth-0.132
Token her
Token ID607
Feature activation-1.51e-02

Pos logit contributions
seek+0.008
 habitat+0.007
growth+0.007
powers+0.007
icol+0.007
Neg logit contributions
iring-0.003
ying-0.003
 please-0.002
 who-0.002
 God-0.002
Token family
Token ID1641
Feature activation-2.50e-02

Pos logit contributions
seek+0.037
 habitat+0.034
powers+0.033
growth+0.033
icol+0.033
Neg logit contributions
iring-0.013
 please-0.012
 God-0.012
 who-0.012
ying-0.012
Token.
Token ID13
Feature activation-1.84e-03

Pos logit contributions
seek+0.046
 habitat+0.041
powers+0.040
growth+0.036
ashed+0.035
Neg logit contributions
iring-0.037
mm-0.034
 please-0.033
 God-0.033
bling-0.033
Token One
Token ID1881
Feature activation+1.14e-02

Pos logit contributions
seek+0.003
 habitat+0.003
growth+0.003
icol+0.003
powers+0.003
Neg logit contributions
iring-0.003
ying-0.003
bling-0.003
ble-0.003
keys-0.003
Token Don
Token ID2094
Feature activation+2.69e-02

Pos logit contributions
iring+0.005
 please+0.004
 God+0.004
 who+0.004
?"+0.004
Neg logit contributions
seek-0.010
 habitat-0.010
powers-0.009
growth-0.009
icol-0.009
Token't
Token ID470
Feature activation+1.76e-02

Pos logit contributions
iring+0.047
nam+0.046
 God+0.045
 who+0.044
ying+0.042
Neg logit contributions
seek-0.042
growth-0.035
powers-0.035
 arrest-0.034
 habitat-0.034
Token hurt
Token ID5938
Feature activation-1.30e-02

Pos logit contributions
iring+0.023
 please+0.022
 God+0.022
 who+0.021
?"+0.021
Neg logit contributions
seek-0.036
 habitat-0.033
growth-0.032
icol-0.032
powers-0.032
Token the
Token ID262
Feature activation+4.54e-04

Pos logit contributions
seek+0.033
 habitat+0.031
growth+0.030
powers+0.030
icol+0.030
Neg logit contributions
iring-0.011
 please-0.010
 God-0.010
 who-0.009
?"-0.009
Token pump
Token ID8901
Feature activation+2.01e-02

Pos logit contributions
iring+0.000
 please+0.000
 God+0.000
 who+0.000
?"+0.000
Neg logit contributions
seek-0.001
 habitat-0.001
growth-0.001
powers-0.001
icol-0.001
Tokenkins
Token ID5331
Feature activation+7.77e-02

Pos logit contributions
 God+0.018
 please+0.018
iring+0.018
nam+0.016
She+0.016
Neg logit contributions
seek-0.040
 arrest-0.038
 habitat-0.038
keeping-0.038
 drowned-0.037
Token."
Token ID526
Feature activation-7.60e-03

Pos logit contributions
iring+0.080
 God+0.075
 please+0.071
 who+0.067
nam+0.066
Neg logit contributions
seek-0.175
 habitat-0.163
powers-0.158
growth-0.155
icol-0.151
Token
Token ID220
Feature activation+1.89e-02

Pos logit contributions
seek+0.016
 habitat+0.015
growth+0.014
powers+0.014
icol+0.014
Neg logit contributions
iring-0.009
 please-0.009
 God-0.008
 who-0.008
?"-0.008
Token\n
Token ID198
Feature activation+6.23e-03

Pos logit contributions
iring+0.021
 God+0.020
 please+0.020
nam+0.020
 who+0.019
Neg logit contributions
seek-0.040
 habitat-0.037
powers-0.036
growth-0.035
icol-0.035
Token\n
Token ID198
Feature activation+5.61e-03

Pos logit contributions
iring+0.008
 God+0.007
nam+0.007
 please+0.007
ying+0.007
Neg logit contributions
seek-0.013
 habitat-0.012
powers-0.011
growth-0.011
icol-0.011
TokenTim
Token ID14967
Feature activation-2.41e-03

Pos logit contributions
 God+0.010
iring+0.009
ying+0.009
 angel+0.008
nam+0.008
Neg logit contributions
seek-0.009
 habitat-0.008
powers-0.007
 arrest-0.007
 drowned-0.007
Token We
Token ID775
Feature activation-1.56e-02

Pos logit contributions
iring+0.013
ying+0.012
ie+0.011
 please+0.011
 God+0.011
Neg logit contributions
seek-0.025
 habitat-0.023
growth-0.022
powers-0.022
icol-0.022
Token can
Token ID460
Feature activation-1.51e-02

Pos logit contributions
seek+0.043
icol+0.041
 habitat+0.041
growth+0.040
powers+0.038
Neg logit contributions
 please-0.012
 who-0.011
?"-0.010
 do-0.010
 are-0.010
Token share
Token ID2648
Feature activation+1.66e-02

Pos logit contributions
seek+0.034
 habitat+0.031
growth+0.030
powers+0.030
icol+0.029
Neg logit contributions
iring-0.016
 God-0.015
 please-0.015
ying-0.014
 who-0.014
Token the
Token ID262
Feature activation+1.51e-03

Pos logit contributions
iring+0.019
ying+0.017
bling+0.016
itive+0.016
heart+0.016
Neg logit contributions
seek-0.034
 habitat-0.033
growth-0.031
 arrest-0.031
powers-0.031
Token cher
Token ID21382
Feature activation+1.46e-02

Pos logit contributions
iring+0.002
 God+0.002
ying+0.002
 please+0.002
bling+0.002
Neg logit contributions
seek-0.003
 habitat-0.002
powers-0.002
 arrest-0.002
 bumps-0.002
Tokenries
Token ID1678
Feature activation+7.76e-02

Pos logit contributions
 God+0.023
 please+0.021
 ma+0.020
 okay+0.020
 Amen+0.020
Neg logit contributions
seek-0.024
water-0.022
 habitat-0.021
 arrest-0.021
con-0.021
Token.
Token ID13
Feature activation+1.48e-02

Pos logit contributions
iring+0.083
 God+0.078
heart+0.070
 please+0.070
bling+0.068
Neg logit contributions
seek-0.172
 habitat-0.157
powers-0.155
 arrest-0.144
growth-0.143
Token There
Token ID1318
Feature activation-3.08e-03

Pos logit contributions
iring+0.019
ying+0.018
ie+0.018
ma+0.016
heart+0.016
Neg logit contributions
seek-0.030
 habitat-0.029
growth-0.028
powers-0.027
icol-0.027
Token are
Token ID389
Feature activation+1.79e-02

Pos logit contributions
seek+0.007
 habitat+0.007
powers+0.007
growth+0.006
 arrest+0.006
Neg logit contributions
iring-0.003
 God-0.002
nam-0.002
itive-0.002
bling-0.002
Token many
Token ID867
Feature activation-1.45e-02

Pos logit contributions
iring+0.019
 please+0.018
 God+0.018
 who+0.017
ying+0.017
Neg logit contributions
seek-0.040
 habitat-0.037
powers-0.036
growth-0.036
icol-0.035
Token cher
Token ID21382
Feature activation-1.24e-02

Pos logit contributions
seek+0.027
 habitat+0.027
powers+0.025
growth+0.024
 arrest+0.024
Neg logit contributions
iring-0.019
 God-0.018
 please-0.018
ying-0.017
bling-0.017
Token They
Token ID1119
Feature activation-6.08e-03

Pos logit contributions
iring+0.004
 please+0.003
 God+0.003
 who+0.003
ying+0.003
Neg logit contributions
seek-0.014
 habitat-0.013
growth-0.013
powers-0.013
icol-0.013
Token quickly
Token ID2952
Feature activation-9.98e-03

Pos logit contributions
seek+0.013
 habitat+0.012
powers+0.011
icol+0.011
growth+0.011
Neg logit contributions
 who-0.008
 please-0.008
iring-0.007
 God-0.007
?"-0.007
Token put
Token ID1234
Feature activation+2.05e-02

Pos logit contributions
seek+0.023
 habitat+0.022
powers+0.020
 fault+0.020
icol+0.020
Neg logit contributions
 who-0.012
 do-0.010
 at-0.010
 also-0.010
 please-0.010
Token their
Token ID511
Feature activation-9.67e-03

Pos logit contributions
iring+0.023
 please+0.020
 God+0.019
ying+0.019
 who+0.018
Neg logit contributions
seek-0.043
growth-0.042
 habitat-0.041
icol-0.040
powers-0.039
Token nap
Token ID25422
Feature activation-1.58e-02

Pos logit contributions
seek+0.019
 habitat+0.017
growth+0.016
powers+0.016
icol+0.016
Neg logit contributions
iring-0.014
 please-0.013
 God-0.013
 who-0.013
?"-0.012
Tokenkins
Token ID5331
Feature activation+7.73e-02

Pos logit contributions
seek+0.043
 habitat+0.040
growth+0.039
powers+0.039
icol+0.039
Neg logit contributions
iring-0.009
 God-0.008
 please-0.008
 who-0.007
?"-0.007
Token over
Token ID625
Feature activation+1.34e-02

Pos logit contributions
iring+0.050
 God+0.044
 please+0.044
 who+0.040
?"+0.038
Neg logit contributions
seek-0.206
 habitat-0.194
growth-0.188
powers-0.188
icol-0.186
Token their
Token ID511
Feature activation-1.09e-02

Pos logit contributions
iring+0.007
 who+0.007
 God+0.007
 please+0.006
?"+0.005
Neg logit contributions
seek-0.039
 habitat-0.036
powers-0.036
icol-0.035
growth-0.034
Token salad
Token ID20698
Feature activation+3.69e-02

Pos logit contributions
seek+0.021
 habitat+0.019
growth+0.018
powers+0.018
icol+0.018
Neg logit contributions
iring-0.015
 God-0.015
 please-0.015
 who-0.014
?"-0.014
Token.
Token ID13
Feature activation+8.96e-03

Pos logit contributions
iring+0.021
 who+0.020
 please+0.018
kins+0.017
?"+0.017
Neg logit contributions
seek-0.102
 habitat-0.095
icol-0.094
growth-0.094
powers-0.093
Token Then
Token ID3244
Feature activation-9.34e-04

Pos logit contributions
iring+0.007
 God+0.007
 please+0.007
 who+0.007
She+0.006
Neg logit contributions
seek-0.023
 habitat-0.021
powers-0.021
 arrest-0.020
growth-0.020
Token.
Token ID13
Feature activation+3.61e-03

Pos logit contributions
 who+0.004
iring+0.004
 please+0.003
?"+0.003
 God+0.003
Neg logit contributions
seek-0.015
 habitat-0.014
powers-0.014
icol-0.014
growth-0.014
Token He
Token ID679
Feature activation+3.20e-03

Pos logit contributions
iring+0.003
 God+0.003
 please+0.003
 who+0.003
?"+0.003
Neg logit contributions
seek-0.009
 habitat-0.008
growth-0.008
icol-0.008
powers-0.008
Token'd
Token ID1549
Feature activation+3.08e-03

Pos logit contributions
 who+0.004
 also+0.004
?"+0.003
 do+0.003
ie+0.003
Neg logit contributions
seek-0.008
 habitat-0.007
icol-0.007
growth-0.007
 arrest-0.007
Token fold
Token ID5591
Feature activation+4.92e-03

Pos logit contributions
 who+0.003
 please+0.003
?"+0.003
 do+0.003
 God+0.003
Neg logit contributions
seek-0.007
 habitat-0.007
powers-0.007
icol-0.006
growth-0.006
Token nap
Token ID25422
Feature activation-1.34e-02

Pos logit contributions
iring+0.004
 God+0.004
 please+0.004
?"+0.004
 who+0.004
Neg logit contributions
seek-0.012
 habitat-0.011
powers-0.011
icol-0.011
 arrest-0.011
Tokenkins
Token ID5331
Feature activation+7.71e-02

Pos logit contributions
seek+0.036
 habitat+0.033
growth+0.033
powers+0.032
icol+0.032
Neg logit contributions
iring-0.008
 God-0.008
 please-0.007
 who-0.006
ying-0.006
Token and
Token ID290
Feature activation+2.47e-03

Pos logit contributions
iring+0.059
 God+0.053
 please+0.052
 who+0.049
ying+0.047
Neg logit contributions
seek-0.197
 habitat-0.184
powers-0.179
growth-0.178
icol-0.176
Token make
Token ID787
Feature activation+2.31e-02

Pos logit contributions
 God+0.003
 who+0.003
 please+0.003
 do+0.002
?"+0.002
Neg logit contributions
seek-0.006
 habitat-0.005
powers-0.005
icol-0.005
growth-0.005
Token them
Token ID606
Feature activation+1.49e-02

Pos logit contributions
iring+0.023
 God+0.022
 who+0.022
 please+0.021
?"+0.020
Neg logit contributions
seek-0.055
 habitat-0.052
growth-0.049
icol-0.049
powers-0.048
Token into
Token ID656
Feature activation-1.19e-02

Pos logit contributions
iring+0.019
 God+0.017
 please+0.017
 who+0.017
ying+0.017
Neg logit contributions
seek-0.030
 habitat-0.028
powers-0.027
growth-0.027
icol-0.027
Token hats
Token ID23910
Feature activation+3.24e-02

Pos logit contributions
seek+0.029
 habitat+0.027
powers+0.026
growth+0.025
icol+0.025
Neg logit contributions
iring-0.011
 God-0.010
 please-0.010
 who-0.010
?"-0.010
Token.
Token ID13
Feature activation+1.80e-03

Pos logit contributions
iring+0.000
ying+0.000
 please+0.000
nam+0.000
 God+0.000
Neg logit contributions
seek-0.001
 habitat-0.001
powers-0.001
growth-0.001
 drowned-0.001
Token They
Token ID1119
Feature activation-8.40e-03

Pos logit contributions
iring+0.002
 God+0.002
 please+0.002
 who+0.002
?"+0.002
Neg logit contributions
seek-0.004
 habitat-0.004
growth-0.004
powers-0.004
icol-0.004
Token love
Token ID1842
Feature activation-2.53e-02

Pos logit contributions
seek+0.019
 habitat+0.018
growth+0.017
powers+0.017
icol+0.017
Neg logit contributions
 please-0.009
 who-0.008
 God-0.008
iring-0.008
?"-0.008
Token their
Token ID511
Feature activation-1.96e-02

Pos logit contributions
seek+0.059
 habitat+0.056
growth+0.054
powers+0.053
icol+0.053
Neg logit contributions
iring-0.025
 please-0.022
 God-0.021
 who-0.021
ying-0.021
Token pump
Token ID8901
Feature activation+1.06e-02

Pos logit contributions
seek+0.044
 habitat+0.041
powers+0.040
growth+0.039
icol+0.039
Neg logit contributions
iring-0.021
 please-0.019
 God-0.019
 who-0.019
ying-0.018
Tokenkins
Token ID5331
Feature activation+7.70e-02

Pos logit contributions
 please+0.012
iring+0.011
 God+0.010
nam+0.010
 ma+0.010
Neg logit contributions
seek-0.020
 habitat-0.018
keeping-0.018
powers-0.017
 bumps-0.017
Token.
Token ID13
Feature activation+5.57e-03

Pos logit contributions
iring+0.108
ying+0.098
 please+0.093
nam+0.092
 God+0.092
Neg logit contributions
seek-0.145
 habitat-0.131
powers-0.128
growth-0.116
icol-0.114
Token They
Token ID1119
Feature activation-4.49e-03

Pos logit contributions
iring+0.005
 God+0.005
 please+0.005
 who+0.005
?"+0.005
Neg logit contributions
seek-0.013
 habitat-0.012
growth-0.012
powers-0.012
icol-0.012
Token love
Token ID1842
Feature activation-2.27e-02

Pos logit contributions
seek+0.010
 habitat+0.010
growth+0.009
icol+0.009
powers+0.009
Neg logit contributions
 please-0.005
 who-0.005
 God-0.004
?"-0.004
iring-0.004
Token to
Token ID284
Feature activation+1.81e-02

Pos logit contributions
seek+0.053
 habitat+0.049
growth+0.047
powers+0.047
icol+0.047
Neg logit contributions
iring-0.023
 please-0.021
 God-0.020
 who-0.020
ying-0.019
Token paint
Token ID7521
Feature activation+2.02e-02

Pos logit contributions
iring+0.026
ying+0.024
nam+0.021
bling+0.021
heart+0.020
Neg logit contributions
seek-0.036
 habitat-0.032
growth-0.032
 arrest-0.031
icol-0.030
Token lately
Token ID16537
Feature activation+1.21e-02

Pos logit contributions
seek+0.018
growth+0.018
powers+0.017
 habitat+0.017
icol+0.017
Neg logit contributions
 who-0.007
iring-0.006
 please-0.006
 by-0.006
 God-0.006
Token.
Token ID13
Feature activation-6.49e-03

Pos logit contributions
 who+0.007
iring+0.007
 please+0.006
 God+0.006
?"+0.005
Neg logit contributions
seek-0.034
 habitat-0.032
powers-0.031
growth-0.031
icol-0.031
Token With
Token ID2080
Feature activation+2.38e-03

Pos logit contributions
seek+0.014
 habitat+0.013
growth+0.012
powers+0.012
icol+0.012
Neg logit contributions
iring-0.008
 please-0.007
 God-0.007
 who-0.007
ying-0.007
Token work
Token ID670
Feature activation+1.01e-02

Pos logit contributions
 God+0.002
 please+0.002
 who+0.002
iring+0.002
nam+0.002
Neg logit contributions
seek-0.006
 habitat-0.006
powers-0.005
icol-0.005
growth-0.005
Token and
Token ID290
Feature activation-3.06e-03

Pos logit contributions
iring+0.006
 please+0.005
 God+0.005
 who+0.005
ying+0.004
Neg logit contributions
seek-0.028
 habitat-0.027
growth-0.026
powers-0.026
icol-0.026
Token err
Token ID11454
Feature activation-8.90e-02

Pos logit contributions
seek+0.006
 habitat+0.006
icol+0.006
powers+0.006
growth+0.006
Neg logit contributions
 God-0.004
 who-0.004
 please-0.004
iring-0.004
 happy-0.004
Tokenands
Token ID1746
Feature activation-3.26e-02

Pos logit contributions
seek+0.101
 habitat+0.098
icol+0.095
growth+0.093
powers+0.090
Neg logit contributions
iring-0.183
 God-0.179
ying-0.178
 please-0.178
 who-0.174
Token,
Token ID11
Feature activation+2.02e-03

Pos logit contributions
seek+0.088
 habitat+0.087
growth+0.084
icol+0.084
powers+0.080
Neg logit contributions
 please-0.019
iring-0.019
 who-0.018
ying-0.018
 God-0.018
Token they
Token ID484
Feature activation+2.52e-03

Pos logit contributions
iring+0.003
ying+0.003
 please+0.003
?"+0.003
mm+0.002
Neg logit contributions
seek-0.003
growth-0.003
 habitat-0.003
icol-0.003
 arrest-0.003
Token didn
Token ID1422
Feature activation-1.85e-03

Pos logit contributions
 who+0.003
 please+0.003
iring+0.003
 God+0.003
?"+0.003
Neg logit contributions
seek-0.005
 habitat-0.005
icol-0.005
powers-0.005
growth-0.005
Token't
Token ID470
Feature activation+3.19e-03

Pos logit contributions
seek+0.004
 habitat+0.004
icol+0.003
powers+0.003
growth+0.003
Neg logit contributions
iring-0.002
 please-0.002
 God-0.002
?"-0.002
 who-0.002
Token Emma
Token ID18966
Feature activation-3.94e-03

Pos logit contributions
seek+0.065
 habitat+0.061
growth+0.058
powers+0.058
icol+0.057
Neg logit contributions
iring-0.043
 please-0.041
 God-0.041
 who-0.040
?"-0.039
Token who
Token ID508
Feature activation-2.68e-02

Pos logit contributions
seek+0.008
 habitat+0.007
growth+0.007
icol+0.007
powers+0.007
Neg logit contributions
iring-0.005
 please-0.005
ying-0.005
mm-0.005
bling-0.004
Token loved
Token ID6151
Feature activation-1.92e-02

Pos logit contributions
seek+0.065
 habitat+0.062
growth+0.060
powers+0.060
icol+0.060
Neg logit contributions
iring-0.023
 who-0.023
 God-0.022
 please-0.022
?"-0.021
Token learning
Token ID4673
Feature activation-1.92e-02

Pos logit contributions
seek+0.041
growth+0.038
 habitat+0.037
icol+0.036
powers+0.035
Neg logit contributions
 God-0.025
 who-0.024
 please-0.023
?"-0.022
nam-0.022
Token about
Token ID546
Feature activation-1.58e-02

Pos logit contributions
seek+0.045
 habitat+0.042
growth+0.040
powers+0.040
icol+0.040
Neg logit contributions
iring-0.019
 please-0.018
 God-0.018
 who-0.017
?"-0.017
Token outer
Token ID12076
Feature activation-8.80e-02

Pos logit contributions
seek+0.037
 habitat+0.034
powers+0.033
icol+0.033
growth+0.033
Neg logit contributions
iring-0.016
 God-0.015
 who-0.015
 please-0.015
?"-0.014
Token space
Token ID2272
Feature activation-2.88e-02

Pos logit contributions
seek+0.180
 habitat+0.158
icol+0.156
growth+0.154
 drowned+0.153
Neg logit contributions
iring-0.118
 God-0.115
?"-0.114
 who-0.111
 please-0.107
Token.
Token ID13
Feature activation+2.49e-03

Pos logit contributions
seek+0.074
 habitat+0.070
icol+0.069
growth+0.069
powers+0.067
Neg logit contributions
iring-0.021
 who-0.020
 God-0.020
 please-0.019
?"-0.018
Token Everyday
Token ID48154
Feature activation+4.14e-02

Pos logit contributions
iring+0.002
 please+0.002
ying+0.002
 who+0.002
 God+0.002
Neg logit contributions
seek-0.006
 habitat-0.005
growth-0.005
powers-0.005
icol-0.005
Token,
Token ID11
Feature activation+5.35e-03

Pos logit contributions
iring+0.053
ying+0.043
ble+0.042
bling+0.041
nam+0.037
Neg logit contributions
seek-0.087
 habitat-0.078
icol-0.078
growth-0.077
powers-0.074
Token she
Token ID673
Feature activation+9.66e-03

Pos logit contributions
iring+0.008
ying+0.007
ble+0.007
bling+0.007
ang+0.007
Neg logit contributions
seek-0.009
growth-0.008
 baths-0.008
powers-0.008
 arrest-0.008
Token of
Token ID286
Feature activation+5.12e-03

Pos logit contributions
seek+0.047
 habitat+0.044
growth+0.044
icol+0.043
powers+0.042
Neg logit contributions
 who-0.005
iring-0.004
 please-0.003
 by-0.003
 God-0.002
Token himself
Token ID2241
Feature activation+6.82e-03

Pos logit contributions
iring+0.006
 please+0.006
 God+0.006
 who+0.006
?"+0.006
Neg logit contributions
seek-0.011
 habitat-0.010
growth-0.009
powers-0.009
icol-0.009
Token for
Token ID329
Feature activation+1.63e-02

Pos logit contributions
iring+0.005
 please+0.004
 God+0.004
?"+0.004
nam+0.004
Neg logit contributions
seek-0.018
 habitat-0.016
powers-0.016
growth-0.016
icol-0.016
Token using
Token ID1262
Feature activation-8.54e-03

Pos logit contributions
iring+0.016
 God+0.015
 please+0.015
 who+0.015
?"+0.013
Neg logit contributions
seek-0.039
 habitat-0.036
powers-0.035
growth-0.035
icol-0.035
Token his
Token ID465
Feature activation-3.08e-02

Pos logit contributions
seek+0.027
 habitat+0.026
icol+0.025
ashed+0.025
powers+0.024
Neg logit contributions
 who-0.003
 your-0.003
 God-0.002
 best-0.002
 the-0.002
Token talent
Token ID7401
Feature activation-8.04e-02

Pos logit contributions
seek+0.072
 habitat+0.066
growth+0.064
icol+0.063
keeping+0.060
Neg logit contributions
 best-0.036
 do-0.035
 kind-0.033
 happy-0.033
 who-0.033
Token to
Token ID284
Feature activation-1.70e-02

Pos logit contributions
seek+0.208
 habitat+0.206
icol+0.201
growth+0.200
 arrest+0.194
Neg logit contributions
 who-0.062
?"-0.054
 please-0.051
?-0.051
 by-0.049
Token scare
Token ID19437
Feature activation-9.94e-03

Pos logit contributions
seek+0.040
 habitat+0.038
powers+0.037
icol+0.037
growth+0.036
Neg logit contributions
 please-0.017
 who-0.017
 God-0.016
 do-0.016
?"-0.016
Token away
Token ID1497
Feature activation-5.30e-03

Pos logit contributions
seek+0.025
 habitat+0.023
icol+0.023
powers+0.023
growth+0.022
Neg logit contributions
 who-0.009
 God-0.008
iring-0.008
 please-0.008
?"-0.008
Token the
Token ID262
Feature activation-1.95e-03

Pos logit contributions
seek+0.015
 habitat+0.014
icol+0.014
powers+0.014
growth+0.013
Neg logit contributions
 who-0.003
iring-0.003
 God-0.003
 by-0.003
 please-0.002
Token noise
Token ID7838
Feature activation+2.38e-03

Pos logit contributions
seek+0.005
growth+0.005
 habitat+0.005
icol+0.004
ashed+0.004
Neg logit contributions
 do-0.002
 who-0.002
 girl-0.002
 kind-0.002
 happy-0.002
Token her
Token ID607
Feature activation+1.14e-02

Pos logit contributions
iring+0.005
ying+0.004
bling+0.004
kins+0.004
mm+0.004
Neg logit contributions
seek-0.010
 habitat-0.010
growth-0.010
 arrest-0.009
icol-0.009
Token mom
Token ID1995
Feature activation-2.27e-03

Pos logit contributions
 who+0.007
iring+0.007
 God+0.007
 please+0.007
?"+0.006
Neg logit contributions
seek-0.032
 habitat-0.029
growth-0.028
powers-0.028
icol-0.028
Token were
Token ID547
Feature activation+3.08e-03

Pos logit contributions
seek+0.006
 habitat+0.005
growth+0.005
icol+0.005
powers+0.005
Neg logit contributions
iring-0.002
 please-0.002
 God-0.002
ying-0.001
 who-0.001
Token out
Token ID503
Feature activation-2.26e-02

Pos logit contributions
iring+0.003
 God+0.003
 please+0.003
mm+0.003
ying+0.003
Neg logit contributions
seek-0.007
 habitat-0.007
icol-0.006
powers-0.006
growth-0.006
Token running
Token ID2491
Feature activation-1.83e-02

Pos logit contributions
seek+0.054
 habitat+0.050
growth+0.048
powers+0.048
icol+0.048
Neg logit contributions
iring-0.021
 please-0.020
 God-0.019
 who-0.018
?"-0.018
Token err
Token ID11454
Feature activation-7.91e-02

Pos logit contributions
growth+0.048
seek+0.046
 habitat+0.046
icol+0.046
powers+0.045
Neg logit contributions
 in-0.014
 by-0.014
 who-0.014
 at-0.013
?"-0.011
Tokenands
Token ID1746
Feature activation-2.50e-02

Pos logit contributions
icol+0.098
seek+0.098
 habitat+0.096
growth+0.096
powers+0.090
Neg logit contributions
iring-0.152
 God-0.146
 who-0.146
ying-0.146
 please-0.144
Token.
Token ID13
Feature activation-2.52e-03

Pos logit contributions
seek+0.067
growth+0.067
 habitat+0.065
icol+0.064
powers+0.064
Neg logit contributions
 who-0.016
iring-0.013
 by-0.013
 in-0.012
 please-0.011
Token Mary
Token ID5335
Feature activation-3.54e-04

Pos logit contributions
seek+0.006
 habitat+0.005
growth+0.005
powers+0.005
icol+0.005
Neg logit contributions
iring-0.003
 please-0.002
 God-0.002
 who-0.002
?"-0.002
Token was
Token ID373
Feature activation-6.25e-03

Pos logit contributions
seek+0.001
 habitat+0.001
growth+0.001
icol+0.001
powers+0.001
Neg logit contributions
 who-0.000
?"-0.000
 please-0.000
 also-0.000
 God-0.000
Token three
Token ID1115
Feature activation-1.98e-02

Pos logit contributions
seek+0.015
 habitat+0.014
growth+0.014
powers+0.014
icol+0.014
Neg logit contributions
iring-0.006
 please-0.005
 who-0.005
 God-0.005
?"-0.005
Token by
Token ID416
Feature activation+6.79e-03

Pos logit contributions
iring+0.010
 God+0.009
 please+0.009
 who+0.008
?"+0.008
Neg logit contributions
seek-0.024
 habitat-0.022
powers-0.021
growth-0.021
icol-0.021
Token helping
Token ID5742
Feature activation+9.73e-03

Pos logit contributions
iring+0.008
 please+0.007
 God+0.007
 who+0.007
?"+0.007
Neg logit contributions
seek-0.015
 habitat-0.014
growth-0.013
powers-0.013
icol-0.013
Token her
Token ID607
Feature activation+1.26e-02

Pos logit contributions
iring+0.008
 please+0.007
 God+0.007
 who+0.007
?"+0.007
Neg logit contributions
seek-0.024
 habitat-0.023
growth-0.022
powers-0.022
icol-0.022
Token mum
Token ID25682
Feature activation+3.43e-03

Pos logit contributions
iring+0.014
 please+0.013
 God+0.013
ying+0.013
?"+0.012
Neg logit contributions
seek-0.027
 habitat-0.026
powers-0.024
growth-0.024
 arrest-0.024
Token with
Token ID351
Feature activation-1.30e-02

Pos logit contributions
iring+0.004
 God+0.003
mm+0.003
 please+0.003
nam+0.003
Neg logit contributions
seek-0.008
 habitat-0.007
growth-0.007
 arrest-0.007
powers-0.007
Token err
Token ID11454
Feature activation-7.79e-02

Pos logit contributions
seek+0.035
 habitat+0.032
icol+0.031
powers+0.031
growth+0.030
Neg logit contributions
 please-0.010
 God-0.010
 who-0.009
nam-0.009
iring-0.009
Tokenands
Token ID1746
Feature activation-3.25e-02

Pos logit contributions
seek+0.088
 habitat+0.081
icol+0.081
growth+0.077
powers+0.073
Neg logit contributions
iring-0.165
 please-0.160
 God-0.160
ying-0.159
 who-0.157
Token and
Token ID290
Feature activation-9.69e-03

Pos logit contributions
seek+0.088
 habitat+0.084
growth+0.084
icol+0.083
powers+0.081
Neg logit contributions
iring-0.019
 who-0.019
 please-0.018
ying-0.017
 God-0.016
Token doing
Token ID1804
Feature activation+7.23e-03

Pos logit contributions
seek+0.023
 habitat+0.022
icol+0.021
growth+0.021
powers+0.021
Neg logit contributions
 who-0.010
 please-0.009
 God-0.009
iring-0.009
nam-0.009
Token odd
Token ID5629
Feature activation+2.56e-04

Pos logit contributions
iring+0.007
 please+0.007
 God+0.006
 who+0.006
?"+0.006
Neg logit contributions
seek-0.017
 habitat-0.016
growth-0.015
powers-0.015
icol-0.015
Token jobs
Token ID3946
Feature activation-6.63e-04

Pos logit contributions
iring+0.000
 God+0.000
 who+0.000
 please+0.000
ying+0.000
Neg logit contributions
seek-0.001
 habitat-0.001
growth-0.001
icol-0.001
powers-0.001
Token at
Token ID379
Feature activation+1.34e-02

Pos logit contributions
 by+0.004
 who+0.004
 in+0.004
iring+0.004
 on+0.004
Neg logit contributions
growth-0.026
seek-0.026
 habitat-0.025
icol-0.024
powers-0.024
Token his
Token ID465
Feature activation-5.10e-03

Pos logit contributions
 God+0.012
iring+0.012
?"+0.011
 who+0.011
 please+0.010
Neg logit contributions
seek-0.032
 habitat-0.031
icol-0.031
powers-0.030
growth-0.029
Token salon
Token ID46608
Feature activation+3.10e-02

Pos logit contributions
seek+0.010
growth+0.010
icol+0.010
 habitat+0.009
powers+0.009
Neg logit contributions
iring-0.007
 please-0.006
 God-0.006
 who-0.006
?"-0.006
Token and
Token ID290
Feature activation-5.35e-03

Pos logit contributions
iring+0.022
 who+0.022
 please+0.021
 God+0.020
?"+0.019
Neg logit contributions
seek-0.078
 habitat-0.075
growth-0.075
icol-0.074
powers-0.073
Token running
Token ID2491
Feature activation-2.61e-02

Pos logit contributions
seek+0.012
growth+0.012
powers+0.012
icol+0.012
 habitat+0.011
Neg logit contributions
 who-0.006
 God-0.005
 please-0.005
 do-0.005
?"-0.005
Token err
Token ID11454
Feature activation-7.78e-02

Pos logit contributions
growth+0.070
icol+0.067
 habitat+0.067
seek+0.065
 arrest+0.064
Neg logit contributions
 in-0.020
 by-0.018
 who-0.017
 at-0.016
 on-0.016
Tokenands
Token ID1746
Feature activation-3.04e-02

Pos logit contributions
icol+0.086
 habitat+0.084
growth+0.082
seek+0.082
powers+0.075
Neg logit contributions
iring-0.161
ying-0.155
 God-0.154
 who-0.154
 please-0.152
Token.
Token ID13
Feature activation-3.71e-03

Pos logit contributions
growth+0.083
icol+0.082
 habitat+0.081
seek+0.080
powers+0.076
Neg logit contributions
 who-0.019
 by-0.017
 in-0.016
iring-0.015
 at-0.015
Token But
Token ID887
Feature activation-1.09e-02

Pos logit contributions
seek+0.008
 habitat+0.007
growth+0.007
powers+0.007
icol+0.007
Neg logit contributions
iring-0.004
 please-0.004
 God-0.004
 who-0.004
?"-0.004
Token every
Token ID790
Feature activation+2.62e-02

Pos logit contributions
seek+0.021
 habitat+0.020
growth+0.019
 drowned+0.018
 arrest+0.017
Neg logit contributions
iring-0.015
ying-0.013
 please-0.013
bling-0.013
kins-0.013
Token day
Token ID1110
Feature activation-1.07e-02

Pos logit contributions
 please+0.031
 God+0.031
?"+0.031
 who+0.031
iring+0.027
Neg logit contributions
seek-0.057
 habitat-0.053
icol-0.053
powers-0.051
growth-0.051
Token sle
Token ID3133
Feature activation-1.92e-03

Pos logit contributions
seek+0.006
 habitat+0.005
growth+0.005
icol+0.005
powers+0.005
Neg logit contributions
iring-0.003
 please-0.003
 God-0.003
 who-0.003
ying-0.003
Tokenigh
Token ID394
Feature activation+6.44e-03

Pos logit contributions
seek+0.004
growth+0.004
 habitat+0.004
 arrest+0.004
icol+0.004
Neg logit contributions
iring-0.002
 God-0.002
 please-0.002
 who-0.002
nam-0.002
Token with
Token ID351
Feature activation-2.74e-02

Pos logit contributions
iring+0.006
 God+0.006
 please+0.006
mm+0.006
nam+0.006
Neg logit contributions
seek-0.015
 habitat-0.013
growth-0.013
powers-0.013
icol-0.013
Token eight
Token ID3624
Feature activation-1.76e-02

Pos logit contributions
seek+0.078
 habitat+0.066
icol+0.065
powers+0.065
ro+0.064
Neg logit contributions
 who-0.022
 God-0.021
nam-0.020
iring-0.019
 happy-0.018
Token re
Token ID302
Feature activation-3.54e-02

Pos logit contributions
seek+0.036
 habitat+0.031
icol+0.030
growth+0.029
powers+0.029
Neg logit contributions
iring-0.023
 who-0.023
 please-0.023
 God-0.023
?"-0.022
Tokenind
Token ID521
Feature activation-7.75e-02

Pos logit contributions
seek+0.085
powers+0.077
 habitat+0.077
growth+0.075
icol+0.075
Neg logit contributions
iring-0.034
 who-0.031
 God-0.031
ma-0.030
el-0.030
Tokeneer
Token ID28153
Feature activation-3.13e-02

Pos logit contributions
seek+0.164
 habitat+0.150
icol+0.148
powers+0.142
growth+0.135
Neg logit contributions
 God-0.098
 who-0.094
iring-0.089
ie-0.086
?"-0.085
Token.
Token ID13
Feature activation+3.06e-03

Pos logit contributions
seek+0.079
 habitat+0.074
powers+0.073
icol+0.072
growth+0.071
Neg logit contributions
 who-0.028
iring-0.026
 please-0.023
?"-0.022
 God-0.022
Token The
Token ID383
Feature activation-5.49e-03

Pos logit contributions
iring+0.003
 God+0.003
 who+0.003
 please+0.003
?"+0.003
Neg logit contributions
seek-0.007
 habitat-0.007
powers-0.006
icol-0.006
growth-0.006
Token re
Token ID302
Feature activation-7.88e-03

Pos logit contributions
seek+0.012
 habitat+0.012
growth+0.011
powers+0.011
icol+0.011
Neg logit contributions
iring-0.006
 please-0.005
 God-0.005
ying-0.005
 who-0.005
Tokenind
Token ID521
Feature activation-4.91e-02

Pos logit contributions
seek+0.017
 habitat+0.016
powers+0.015
growth+0.015
icol+0.015
Neg logit contributions
 please-0.009
iring-0.008
 God-0.008
?"-0.008
ying-0.008
Token what
Token ID644
Feature activation-6.81e-03

Pos logit contributions
iring+0.004
 God+0.004
 who+0.004
 please+0.004
?"+0.003
Neg logit contributions
seek-0.012
 habitat-0.011
powers-0.011
icol-0.011
growth-0.011
Token the
Token ID262
Feature activation-1.05e-02

Pos logit contributions
seek+0.018
 habitat+0.017
icol+0.017
growth+0.016
powers+0.016
Neg logit contributions
 who-0.006
 God-0.005
 please-0.005
iring-0.005
?"-0.005
Token rocket
Token ID10701
Feature activation-1.30e-02

Pos logit contributions
seek+0.027
 habitat+0.024
icol+0.024
growth+0.023
ashed+0.023
Neg logit contributions
 God-0.011
 who-0.010
iring-0.009
 please-0.009
?"-0.009
Token discovered
Token ID5071
Feature activation+4.57e-03

Pos logit contributions
seek+0.029
 habitat+0.027
growth+0.026
powers+0.026
icol+0.026
Neg logit contributions
iring-0.014
 who-0.014
 please-0.013
 God-0.013
?"-0.013
Token in
Token ID287
Feature activation-7.31e-03

Pos logit contributions
iring+0.004
 please+0.004
 God+0.004
 who+0.004
?"+0.004
Neg logit contributions
seek-0.011
 habitat-0.010
powers-0.010
growth-0.010
icol-0.010
Token outer
Token ID12076
Feature activation-7.57e-02

Pos logit contributions
seek+0.022
powers+0.020
ashed+0.020
icol+0.020
 habitat+0.019
Neg logit contributions
 God-0.004
 who-0.004
?"-0.003
iring-0.003
 please-0.003
Token space
Token ID2272
Feature activation-1.96e-02

Pos logit contributions
seek+0.156
ashed+0.133
icol+0.130
powers+0.130
 habitat+0.129
Neg logit contributions
 God-0.102
?"-0.099
 who-0.098
iring-0.094
 please-0.091
Token.
Token ID13
Feature activation+9.78e-04

Pos logit contributions
seek+0.049
 habitat+0.046
powers+0.045
growth+0.045
icol+0.045
Neg logit contributions
iring-0.016
 who-0.016
 God-0.015
 please-0.015
?"-0.014
Token It
Token ID632
Feature activation-3.28e-02

Pos logit contributions
 who+0.001
 God+0.001
iring+0.001
 please+0.001
?"+0.001
Neg logit contributions
seek-0.002
 habitat-0.002
powers-0.002
icol-0.002
growth-0.002
Token was
Token ID373
Feature activation-1.80e-03

Pos logit contributions
seek+0.078
 habitat+0.073
icol+0.071
powers+0.070
growth+0.070
Neg logit contributions
 who-0.032
iring-0.031
 God-0.030
 please-0.029
?"-0.028
Token a
Token ID257
Feature activation+3.32e-03

Pos logit contributions
seek+0.005
powers+0.005
 habitat+0.004
growth+0.004
icol+0.004
Neg logit contributions
 God-0.001
iring-0.001
 who-0.001
 please-0.001
?"-0.001
Token looked
Token ID3114
Feature activation-6.89e-03

Pos logit contributions
seek+0.043
 habitat+0.041
growth+0.040
icol+0.040
 arrest+0.040
Neg logit contributions
 who-0.010
 please-0.010
 do-0.010
iring-0.010
?"-0.009
Token like
Token ID588
Feature activation-3.00e-03

Pos logit contributions
seek+0.017
 habitat+0.016
growth+0.015
powers+0.015
icol+0.015
Neg logit contributions
iring-0.006
 please-0.006
 who-0.006
 God-0.006
?"-0.006
Token it
Token ID340
Feature activation-1.32e-03

Pos logit contributions
seek+0.009
 habitat+0.008
icol+0.008
growth+0.008
powers+0.008
Neg logit contributions
 God-0.002
 please-0.002
 who-0.002
 okay-0.002
 you-0.002
Token came
Token ID1625
Feature activation-7.33e-03

Pos logit contributions
seek+0.003
 habitat+0.003
icol+0.003
growth+0.003
powers+0.003
Neg logit contributions
iring-0.001
 who-0.001
 please-0.001
?"-0.001
 God-0.001
Token from
Token ID422
Feature activation+8.09e-03

Pos logit contributions
seek+0.018
 habitat+0.017
growth+0.017
icol+0.017
powers+0.017
Neg logit contributions
iring-0.006
 please-0.006
 God-0.005
 who-0.005
?"-0.005
Token outer
Token ID12076
Feature activation-7.56e-02

Pos logit contributions
?"+0.007
 who+0.006
 God+0.006
 please+0.006
?+0.005
Neg logit contributions
seek-0.022
powers-0.020
 habitat-0.019
icol-0.019
growth-0.019
Token space
Token ID2272
Feature activation-1.54e-02

Pos logit contributions
seek+0.171
icol+0.150
ashed+0.140
 habitat+0.139
powers+0.134
Neg logit contributions
?"-0.092
iring-0.090
 God-0.089
 who-0.085
 your-0.083
Token.
Token ID13
Feature activation+9.13e-04

Pos logit contributions
seek+0.044
icol+0.042
growth+0.041
powers+0.040
 habitat+0.040
Neg logit contributions
 who-0.010
?"-0.008
 by-0.008
iring-0.007
 in-0.007
Token
Token ID198
Feature activation+4.03e-05

Pos logit contributions
iring+0.001
 God+0.001
 please+0.001
 who+0.001
ying+0.001
Neg logit contributions
seek-0.002
 habitat-0.002
powers-0.002
growth-0.002
icol-0.002
Token
Token ID198
Feature activation-9.88e-05

Pos logit contributions
nam+0.000
iring+0.000
 God+0.000
 please+0.000
itive+0.000
Neg logit contributions
seek-0.000
powers-0.000
 habitat-0.000
growth-0.000
icol-0.000
TokenL
Token ID43
Feature activation-2.14e-02

Pos logit contributions
seek+0.000
 habitat+0.000
powers+0.000
icol+0.000
growth+0.000
Neg logit contributions
 God-0.000
iring-0.000
 please-0.000
ying-0.000
nam-0.000
Token.
Token ID13
Feature activation+6.25e-03

Pos logit contributions
seek+0.033
 habitat+0.031
growth+0.031
icol+0.031
powers+0.030
Neg logit contributions
 who-0.010
iring-0.010
 please-0.009
 God-0.009
?"-0.009
Token He
Token ID679
Feature activation+8.42e-03

Pos logit contributions
iring+0.005
 please+0.005
 who+0.005
 God+0.005
?"+0.005
Neg logit contributions
seek-0.016
 habitat-0.015
growth-0.014
icol-0.014
powers-0.014
Token said
Token ID531
Feature activation+7.84e-03

Pos logit contributions
 who+0.007
 also+0.006
?"+0.005
 please+0.005
 do+0.005
Neg logit contributions
seek-0.024
 habitat-0.023
growth-0.022
icol-0.022
powers-0.021
Token they
Token ID484
Feature activation-1.71e-02

Pos logit contributions
 please+0.006
 God+0.006
 who+0.006
iring+0.005
?"+0.005
Neg logit contributions
seek-0.021
 habitat-0.020
growth-0.019
icol-0.019
powers-0.019
Token had
Token ID550
Feature activation-2.00e-02

Pos logit contributions
seek+0.043
 habitat+0.040
growth+0.039
icol+0.039
powers+0.039
Neg logit contributions
iring-0.014
 please-0.014
 who-0.014
 God-0.013
?"-0.012
Token talent
Token ID7401
Feature activation-7.41e-02

Pos logit contributions
growth+0.048
seek+0.048
icol+0.046
powers+0.044
 habitat+0.043
Neg logit contributions
 who-0.020
?"-0.020
 okay-0.020
 do-0.019
 happy-0.018
Token and
Token ID290
Feature activation-1.14e-02

Pos logit contributions
seek+0.191
 habitat+0.183
growth+0.179
icol+0.177
powers+0.173
Neg logit contributions
iring-0.054
 please-0.052
 who-0.052
 God-0.050
?"-0.048
Token potential
Token ID2785
Feature activation-8.78e-03

Pos logit contributions
seek+0.029
icol+0.028
 habitat+0.027
growth+0.027
powers+0.027
Neg logit contributions
 who-0.011
 God-0.009
 please-0.009
?"-0.009
 do-0.008
Token.
Token ID13
Feature activation+4.99e-03

Pos logit contributions
seek+0.024
icol+0.022
growth+0.022
 habitat+0.022
powers+0.021
Neg logit contributions
 who-0.007
iring-0.007
 God-0.006
 please-0.006
?"-0.006
Token
Token ID198
Feature activation+4.40e-04

Pos logit contributions
iring+0.004
 please+0.004
 God+0.004
 who+0.004
?"+0.004
Neg logit contributions
seek-0.012
 habitat-0.012
growth-0.011
icol-0.011
powers-0.011
Token
Token ID198
Feature activation-5.70e-04

Pos logit contributions
iring+0.000
 please+0.000
 God+0.000
 who+0.000
ying+0.000
Neg logit contributions
seek-0.001
 habitat-0.001
growth-0.001
powers-0.001
icol-0.001
Token.
Token ID13
Feature activation-2.29e-04

Pos logit contributions
 who+0.012
iring+0.012
 please+0.010
?"+0.009
 by+0.009
Neg logit contributions
seek-0.074
 habitat-0.072
icol-0.071
growth-0.070
powers-0.068
Token Lily
Token ID20037
Feature activation-6.11e-03

Pos logit contributions
seek+0.001
 habitat+0.001
 arrest+0.001
icol+0.001
growth+0.001
Neg logit contributions
iring-0.000
 please-0.000
 who-0.000
 God-0.000
?"-0.000
Token had
Token ID550
Feature activation-1.38e-02

Pos logit contributions
seek+0.014
 habitat+0.014
growth+0.014
icol+0.013
powers+0.013
Neg logit contributions
 who-0.006
 also-0.006
?"-0.005
 please-0.005
ma-0.005
Token a
Token ID257
Feature activation-1.64e-02

Pos logit contributions
icol+0.040
seek+0.039
 habitat+0.038
ashed+0.037
powers+0.037
Neg logit contributions
 who-0.011
 also-0.008
 in-0.006
She-0.006
 her-0.006
Token magn
Token ID7842
Feature activation-2.84e-02

Pos logit contributions
seek+0.039
 habitat+0.036
growth+0.034
powers+0.032
 arrest+0.032
Neg logit contributions
 please-0.018
 who-0.017
?"-0.017
 girl-0.016
 happy-0.016
Tokenifying
Token ID4035
Feature activation-7.41e-02

Pos logit contributions
seek+0.058
 habitat+0.053
powers+0.050
icol+0.049
growth+0.049
Neg logit contributions
iring-0.039
 please-0.037
 who-0.035
?"-0.034
 God-0.034
Token glass
Token ID5405
Feature activation-2.20e-02

Pos logit contributions
seek+0.148
 habitat+0.129
powers+0.128
icol+0.125
growth+0.120
Neg logit contributions
?"-0.106
 please-0.106
?-0.093
iring-0.092
 God-0.092
Token that
Token ID326
Feature activation+7.76e-03

Pos logit contributions
seek+0.065
icol+0.061
 habitat+0.061
growth+0.060
powers+0.059
Neg logit contributions
 who-0.010
?"-0.009
 please-0.009
iring-0.009
 in-0.008
Token she
Token ID673
Feature activation+1.01e-03

Pos logit contributions
 who+0.005
 please+0.005
iring+0.005
 also+0.005
?"+0.005
Neg logit contributions
seek-0.021
 habitat-0.020
icol-0.020
 arrest-0.020
growth-0.019
Token found
Token ID1043
Feature activation+9.08e-03

Pos logit contributions
 who+0.001
 also+0.001
?"+0.001
 do+0.001
 please+0.001
Neg logit contributions
seek-0.002
icol-0.002
 habitat-0.002
growth-0.002
 fault-0.002
Token in
Token ID287
Feature activation-1.35e-02

Pos logit contributions
 who+0.005
iring+0.005
 by+0.004
 in+0.004
ying+0.004
Neg logit contributions
seek-0.026
icol-0.025
 habitat-0.024
powers-0.024
growth-0.023
Token it
Token ID340
Feature activation-1.26e-02

Pos logit contributions
seek+0.053
 habitat+0.049
powers+0.047
growth+0.047
icol+0.047
Neg logit contributions
iring-0.021
 God-0.019
 who-0.019
 please-0.019
?"-0.018
Token for
Token ID329
Feature activation+7.69e-03

Pos logit contributions
seek+0.033
 habitat+0.031
icol+0.031
growth+0.030
powers+0.030
Neg logit contributions
iring-0.009
 who-0.009
 please-0.009
 God-0.008
?"-0.008
Token what
Token ID644
Feature activation-2.35e-02

Pos logit contributions
 God+0.007
 who+0.007
 please+0.006
iring+0.006
?"+0.006
Neg logit contributions
seek-0.019
icol-0.018
powers-0.018
 habitat-0.018
ashed-0.018
Token felt
Token ID2936
Feature activation-1.36e-02

Pos logit contributions
seek+0.060
 habitat+0.056
icol+0.055
growth+0.053
powers+0.052
Neg logit contributions
 who-0.020
 God-0.020
 please-0.020
iring-0.019
?"-0.018
Token like
Token ID588
Feature activation-3.37e-02

Pos logit contributions
seek+0.039
 habitat+0.036
icol+0.035
growth+0.035
powers+0.034
Neg logit contributions
 who-0.008
?"-0.008
iring-0.008
 please-0.008
 by-0.007
Token hours
Token ID2250
Feature activation-7.39e-02

Pos logit contributions
seek+0.091
 habitat+0.080
powers+0.079
icol+0.079
growth+0.078
Neg logit contributions
 who-0.032
 please-0.027
 God-0.027
?"-0.026
 in-0.025
Token.
Token ID13
Feature activation-9.16e-03

Pos logit contributions
seek+0.192
 habitat+0.184
icol+0.182
powers+0.180
growth+0.178
Neg logit contributions
 who-0.052
iring-0.051
 please-0.046
ying-0.045
 God-0.044
Token After
Token ID2293
Feature activation-1.20e-02

Pos logit contributions
seek+0.023
 habitat+0.021
powers+0.020
icol+0.020
growth+0.020
Neg logit contributions
 God-0.008
 who-0.008
 please-0.008
iring-0.007
She-0.007
Token a
Token ID257
Feature activation-2.52e-02

Pos logit contributions
seek+0.032
 habitat+0.031
powers+0.031
icol+0.030
ro+0.028
Neg logit contributions
 God-0.009
 who-0.009
 please-0.008
 happy-0.007
 sitting-0.007
Token while
Token ID981
Feature activation-3.00e-03

Pos logit contributions
seek+0.062
 habitat+0.060
growth+0.057
 arrest+0.054
icol+0.054
Neg logit contributions
 please-0.024
 who-0.023
 God-0.021
 happy-0.020
?"-0.019
Token,
Token ID11
Feature activation-6.63e-03

Pos logit contributions
seek+0.009
 habitat+0.009
powers+0.008
icol+0.008
growth+0.008
Neg logit contributions
 please-0.001
 God-0.001
iring-0.001
 who-0.001
?"-0.001
Token needed
Token ID2622
Feature activation-2.11e-02

Pos logit contributions
seek+0.046
 habitat+0.043
powers+0.042
growth+0.041
icol+0.041
Neg logit contributions
iring-0.023
 God-0.021
 please-0.020
ying-0.020
 who-0.019
Token to
Token ID284
Feature activation-9.63e-03

Pos logit contributions
seek+0.045
icol+0.042
 habitat+0.042
growth+0.041
powers+0.040
Neg logit contributions
 please-0.028
 God-0.027
 who-0.026
?"-0.025
 do-0.023
Token go
Token ID467
Feature activation+2.22e-02

Pos logit contributions
seek+0.022
 habitat+0.021
powers+0.020
growth+0.020
icol+0.020
Neg logit contributions
iring-0.009
 please-0.009
 God-0.009
 who-0.009
?"-0.008
Token do
Token ID466
Feature activation+2.20e-02

Pos logit contributions
iring+0.032
 God+0.029
nam+0.029
mm+0.029
ying+0.029
Neg logit contributions
seek-0.043
 habitat-0.038
icol-0.036
powers-0.036
growth-0.035
Token some
Token ID617
Feature activation+1.15e-02

Pos logit contributions
 who+0.021
?"+0.019
 please+0.019
 do+0.018
 God+0.017
Neg logit contributions
seek-0.052
 habitat-0.050
growth-0.050
icol-0.049
powers-0.047
Token err
Token ID11454
Feature activation-7.34e-02

Pos logit contributions
iring+0.017
 please+0.016
 God+0.016
 who+0.016
?"+0.016
Neg logit contributions
seek-0.021
 habitat-0.020
growth-0.019
icol-0.019
powers-0.018
Tokenands
Token ID1746
Feature activation-2.44e-02

Pos logit contributions
seek+0.095
 habitat+0.092
icol+0.089
growth+0.089
powers+0.084
Neg logit contributions
iring-0.140
 please-0.135
 God-0.135
ying-0.133
 who-0.132
Token.
Token ID13
Feature activation-8.61e-04

Pos logit contributions
seek+0.061
 habitat+0.058
growth+0.057
icol+0.056
powers+0.056
Neg logit contributions
iring-0.019
 please-0.018
 who-0.018
 God-0.017
?"-0.017
Token
Token ID198
Feature activation-3.02e-03

Pos logit contributions
seek+0.002
 habitat+0.002
growth+0.002
powers+0.002
icol+0.002
Neg logit contributions
iring-0.001
 please-0.001
 God-0.001
 who-0.001
ying-0.001
Token
Token ID198
Feature activation-2.99e-03

Pos logit contributions
seek+0.007
 habitat+0.006
powers+0.006
growth+0.006
icol+0.006
Neg logit contributions
iring-0.003
 God-0.003
ying-0.003
nam-0.003
 please-0.003
TokenMom
Token ID29252
Feature activation-1.45e-02

Pos logit contributions
seek+0.007
 habitat+0.006
powers+0.006
icol+0.006
growth+0.006
Neg logit contributions
iring-0.003
 God-0.003
ying-0.003
mm-0.002
?"-0.002
Token't
Token ID470
Feature activation-6.29e-04

Pos logit contributions
 habitat+0.072
seek+0.069
icol+0.069
powers+0.065
 habitats+0.063
Neg logit contributions
 please-0.029
?"-0.028
"-0.027
'-0.026
 who-0.024
Token been
Token ID587
Feature activation-1.99e-02

Pos logit contributions
icol+0.001
seek+0.001
 habitat+0.001
growth+0.001
powers+0.001
Neg logit contributions
?"-0.001
 who-0.001
 please-0.001
 do-0.001
?-0.001
Token used
Token ID973
Feature activation-1.42e-02

Pos logit contributions
seek+0.048
icol+0.048
growth+0.047
 habitat+0.045
powers+0.041
Neg logit contributions
 who-0.024
?"-0.022
 in-0.021
 please-0.020
 okay-0.020
Token for
Token ID329
Feature activation-1.19e-03

Pos logit contributions
seek+0.035
icol+0.034
growth+0.033
 habitat+0.032
 arrest+0.030
Neg logit contributions
?"-0.015
 by-0.015
 please-0.014
 in-0.014
 who-0.014
Token many
Token ID867
Feature activation-4.37e-02

Pos logit contributions
seek+0.003
icol+0.003
ashed+0.003
 habitat+0.003
growth+0.003
Neg logit contributions
 who-0.001
 please-0.001
 God-0.001
?"-0.001
 your-0.001
Token years
Token ID812
Feature activation-7.34e-02

Pos logit contributions
seek+0.110
icol+0.102
growth+0.097
 habitat+0.095
 arrest+0.089
Neg logit contributions
 who-0.045
?"-0.042
iring-0.042
 please-0.040
ying-0.039
Token.
Token ID13
Feature activation-6.49e-03

Pos logit contributions
seek+0.187
growth+0.171
icol+0.165
 habitat+0.164
 drowned+0.160
Neg logit contributions
?"-0.067
iring-0.066
 who-0.066
 God-0.060
 please-0.060
Token One
Token ID1881
Feature activation+4.23e-03

Pos logit contributions
seek+0.014
 habitat+0.013
growth+0.013
powers+0.013
icol+0.012
Neg logit contributions
iring-0.007
 please-0.007
 God-0.007
ying-0.007
 who-0.006
Token day
Token ID1110
Feature activation-3.24e-02

Pos logit contributions
iring+0.006
 please+0.006
nam+0.006
 God+0.006
ying+0.006
Neg logit contributions
seek-0.007
 habitat-0.007
powers-0.007
growth-0.007
icol-0.006
Token,
Token ID11
Feature activation-5.48e-03

Pos logit contributions
seek+0.098
 habitat+0.095
powers+0.091
growth+0.090
icol+0.090
Neg logit contributions
 please-0.013
 God-0.011
?"-0.011
 who-0.010
iring-0.007
Token we
Token ID356
Feature activation-1.76e-03

Pos logit contributions
seek+0.013
 habitat+0.012
powers+0.012
growth+0.012
icol+0.012
Neg logit contributions
iring-0.005
 please-0.005
 God-0.005
 who-0.005
?"-0.004
Token and
Token ID290
Feature activation+1.16e-03

Pos logit contributions
iring+0.004
 God+0.004
 please+0.004
nam+0.004
ying+0.004
Neg logit contributions
seek-0.013
 habitat-0.012
powers-0.012
icol-0.012
growth-0.012
Token asked
Token ID1965
Feature activation-2.50e-02

Pos logit contributions
iring+0.001
 please+0.001
 God+0.001
 who+0.001
?"+0.001
Neg logit contributions
seek-0.003
 habitat-0.002
growth-0.002
powers-0.002
icol-0.002
Token if
Token ID611
Feature activation+2.17e-02

Pos logit contributions
seek+0.057
 habitat+0.052
powers+0.052
icol+0.051
growth+0.051
Neg logit contributions
iring-0.025
 God-0.022
 please-0.022
mm-0.022
ying-0.022
Token Bun
Token ID28515
Feature activation-5.01e-02

Pos logit contributions
iring+0.018
 God+0.017
 please+0.017
 who+0.016
?"+0.015
Neg logit contributions
seek-0.055
 habitat-0.052
growth-0.050
powers-0.049
icol-0.049
Token-
Token ID12
Feature activation-5.07e-02

Pos logit contributions
seek+0.102
 habitat+0.100
growth+0.097
icol+0.094
 arrest+0.091
Neg logit contributions
 who-0.064
iring-0.062
 please-0.061
ie-0.060
 God-0.058
TokenB
Token ID33
Feature activation-7.31e-02

Pos logit contributions
seek+0.111
 habitat+0.106
icol+0.103
powers+0.099
growth+0.099
Neg logit contributions
iring-0.057
 God-0.056
 who-0.055
 please-0.054
?"-0.052
Tokenun
Token ID403
Feature activation-3.21e-02

Pos logit contributions
seek+0.154
growth+0.147
 habitat+0.143
icol+0.140
 arrest+0.136
Neg logit contributions
iring-0.091
 please-0.085
 who-0.082
ie-0.081
heart-0.077
Token could
Token ID714
Feature activation-3.47e-02

Pos logit contributions
seek+0.066
 habitat+0.064
icol+0.062
growth+0.061
powers+0.058
Neg logit contributions
 who-0.041
 please-0.040
iring-0.039
ie-0.038
?"-0.038
Token teach
Token ID4545
Feature activation-1.87e-02

Pos logit contributions
seek+0.057
growth+0.051
powers+0.050
 habitat+0.048
 arrest+0.048
Neg logit contributions
iring-0.058
nam-0.055
ying-0.054
mm-0.053
 God-0.052
Token it
Token ID340
Feature activation-1.20e-02

Pos logit contributions
seek+0.053
 habitat+0.050
icol+0.049
powers+0.048
growth+0.048
Neg logit contributions
 who-0.011
 God-0.010
iring-0.010
 please-0.010
 by-0.009
Token how
Token ID703
Feature activation-2.32e-02

Pos logit contributions
seek+0.027
 habitat+0.026
icol+0.026
growth+0.025
powers+0.024
Neg logit contributions
iring-0.013
 who-0.012
 please-0.012
 God-0.012
?"-0.011
Token some
Token ID617
Feature activation-1.06e-03

Pos logit contributions
seek+0.034
 habitat+0.032
powers+0.030
growth+0.030
icol+0.030
Neg logit contributions
iring-0.015
 please-0.015
 God-0.015
 who-0.014
?"-0.014
Token carrots
Token ID34397
Feature activation+2.44e-02

Pos logit contributions
seek+0.002
 habitat+0.002
powers+0.002
growth+0.002
icol+0.002
Neg logit contributions
iring-0.001
 please-0.001
 God-0.001
 who-0.001
ying-0.001
Token for
Token ID329
Feature activation+5.31e-03

Pos logit contributions
iring+0.018
 God+0.016
 please+0.015
ying+0.015
nam+0.014
Neg logit contributions
seek-0.062
 habitat-0.058
powers-0.057
growth-0.056
icol-0.055
Token his
Token ID465
Feature activation-2.81e-02

Pos logit contributions
iring+0.005
 God+0.005
 please+0.005
 who+0.005
?"+0.005
Neg logit contributions
seek-0.012
 habitat-0.011
powers-0.011
growth-0.011
icol-0.011
Token re
Token ID302
Feature activation-2.76e-02

Pos logit contributions
seek+0.068
growth+0.061
 habitat+0.061
icol+0.060
powers+0.058
Neg logit contributions
 God-0.027
 please-0.027
 who-0.026
iring-0.025
?"-0.025
Tokenind
Token ID521
Feature activation-7.30e-02

Pos logit contributions
seek+0.068
 habitat+0.063
growth+0.062
icol+0.061
powers+0.061
Neg logit contributions
iring-0.024
 God-0.022
 please-0.022
 who-0.022
?"-0.021
Tokeneer
Token ID28153
Feature activation-2.91e-02

Pos logit contributions
seek+0.144
 habitat+0.131
icol+0.131
growth+0.128
powers+0.125
Neg logit contributions
 God-0.097
iring-0.095
 who-0.093
?"-0.093
 please-0.091
Token,"
Token ID553
Feature activation-7.72e-03

Pos logit contributions
seek+0.082
 habitat+0.077
growth+0.077
icol+0.077
powers+0.073
Neg logit contributions
 please-0.014
 who-0.014
?"-0.014
iring-0.013
 God-0.013
Token Anna
Token ID11735
Feature activation-2.23e-02

Pos logit contributions
seek+0.012
growth+0.011
 habitat+0.011
powers+0.011
icol+0.011
Neg logit contributions
iring-0.014
ying-0.012
bling-0.012
ble-0.011
 please-0.011
Token says
Token ID1139
Feature activation+4.17e-04

Pos logit contributions
seek+0.056
 habitat+0.052
growth+0.050
powers+0.050
icol+0.050
Neg logit contributions
iring-0.018
 please-0.017
 God-0.017
 who-0.016
?"-0.016
Token.
Token ID13
Feature activation+1.74e-03

Pos logit contributions
iring+0.001
mm+0.000
 God+0.000
 please+0.000
?"+0.000
Neg logit contributions
seek-0.001
powers-0.001
 habitat-0.001
icol-0.001
growth-0.001
Token been
Token ID587
Feature activation-1.39e-02

Pos logit contributions
icol+0.011
seek+0.010
 habitat+0.009
igsaw+0.009
powers+0.009
Neg logit contributions
 in-0.005
 of-0.005
 who-0.005
 also-0.005
 from-0.005
Token there
Token ID612
Feature activation-7.29e-03

Pos logit contributions
seek+0.035
 habitat+0.034
icol+0.034
powers+0.033
growth+0.033
Neg logit contributions
 who-0.015
 in-0.014
 sitting-0.012
 by-0.012
iring-0.011
Token for
Token ID329
Feature activation+1.48e-02

Pos logit contributions
seek+0.018
 habitat+0.017
growth+0.017
powers+0.017
icol+0.017
Neg logit contributions
iring-0.006
 please-0.006
 who-0.006
 God-0.006
?"-0.006
Token hundreds
Token ID5179
Feature activation-2.97e-02

Pos logit contributions
 God+0.009
 who+0.008
 please+0.007
 her+0.007
iring+0.007
Neg logit contributions
icol-0.040
seek-0.040
powers-0.039
 habitat-0.039
ashed-0.037
Token of
Token ID286
Feature activation-2.91e-02

Pos logit contributions
seek+0.101
icol+0.095
 habitat+0.092
growth+0.090
powers+0.088
Neg logit contributions
iring-0.005
 who-0.003
ying-0.002
ie-0.002
 of-0.000
Token years
Token ID812
Feature activation-7.22e-02

Pos logit contributions
seek+0.078
icol+0.072
 habitat+0.065
growth+0.065
ashed+0.062
Neg logit contributions
 who-0.030
iring-0.026
 God-0.025
nam-0.025
 happy-0.025
Token!
Token ID0
Feature activation-1.13e-02

Pos logit contributions
seek+0.202
 habitat+0.178
growth+0.178
 drowned+0.174
icol+0.172
Neg logit contributions
iring-0.052
 who-0.052
?"-0.047
 God-0.047
 by-0.043
Token
Token ID198
Feature activation-4.10e-03

Pos logit contributions
seek+0.030
 habitat+0.028
growth+0.027
icol+0.027
powers+0.027
Neg logit contributions
iring-0.007
 please-0.007
 who-0.006
 God-0.006
?"-0.006
Token
Token ID198
Feature activation-3.45e-03

Pos logit contributions
seek+0.009
 habitat+0.008
powers+0.008
growth+0.008
icol+0.008
Neg logit contributions
iring-0.004
 God-0.004
 please-0.004
ying-0.004
nam-0.004
TokenP
Token ID47
Feature activation-2.94e-02

Pos logit contributions
seek+0.008
 habitat+0.008
growth+0.008
powers+0.008
icol+0.008
Neg logit contributions
iring-0.003
 please-0.003
 God-0.003
 who-0.003
?"-0.003
Tokenolly
Token ID5098
Feature activation+2.53e-03

Pos logit contributions
seek+0.066
 habitat+0.061
growth+0.059
icol+0.059
powers+0.058
Neg logit contributions
iring-0.032
 please-0.031
 God-0.028
ying-0.028
 who-0.028
Token grandma
Token ID49890
Feature activation-2.59e-02

Pos logit contributions
seek+0.054
growth+0.049
icol+0.045
powers+0.044
 habitat+0.044
Neg logit contributions
 who-0.010
 God-0.009
 happy-0.008
 best-0.008
 girl-0.007
Token had
Token ID550
Feature activation-1.15e-02

Pos logit contributions
seek+0.061
 habitat+0.057
growth+0.055
powers+0.055
icol+0.055
Neg logit contributions
 who-0.027
iring-0.026
 please-0.025
 God-0.025
?"-0.024
Token kept
Token ID4030
Feature activation+1.06e-03

Pos logit contributions
icol+0.024
seek+0.023
growth+0.022
powers+0.022
 habitat+0.021
Neg logit contributions
 who-0.016
 in-0.015
 please-0.014
?"-0.014
 by-0.014
Token for
Token ID329
Feature activation+1.58e-02

Pos logit contributions
iring+0.001
 God+0.001
 who+0.001
 please+0.001
?"+0.001
Neg logit contributions
seek-0.003
 habitat-0.003
powers-0.003
growth-0.003
icol-0.003
Token many
Token ID867
Feature activation-3.28e-02

Pos logit contributions
 God+0.013
iring+0.012
 who+0.012
 please+0.012
?"+0.010
Neg logit contributions
seek-0.041
 habitat-0.038
icol-0.038
powers-0.037
growth-0.037
Token years
Token ID812
Feature activation-7.13e-02

Pos logit contributions
seek+0.079
icol+0.074
growth+0.071
 habitat+0.070
powers+0.066
Neg logit contributions
 who-0.033
iring-0.033
 God-0.032
ie-0.029
?"-0.029
Token.
Token ID13
Feature activation-2.70e-03

Pos logit contributions
seek+0.197
growth+0.183
icol+0.177
 habitat+0.177
powers+0.172
Neg logit contributions
 who-0.048
iring-0.045
 God-0.044
?"-0.040
 please-0.038
Token They
Token ID1119
Feature activation-9.13e-03

Pos logit contributions
seek+0.007
 habitat+0.006
powers+0.006
growth+0.006
icol+0.006
Neg logit contributions
iring-0.002
 God-0.002
 please-0.002
 who-0.002
?"-0.002
Token only
Token ID691
Feature activation-5.73e-04

Pos logit contributions
seek+0.021
 habitat+0.019
powers+0.019
icol+0.019
growth+0.018
Neg logit contributions
 who-0.010
 please-0.010
 God-0.009
iring-0.009
 also-0.009
Token knew
Token ID2993
Feature activation+3.84e-04

Pos logit contributions
seek+0.001
icol+0.001
 habitat+0.001
powers+0.001
growth+0.001
Neg logit contributions
 who-0.001
 please-0.001
 also-0.001
 God-0.001
 do-0.001
Token that
Token ID326
Feature activation+5.20e-03

Pos logit contributions
iring+0.000
 who+0.000
 God+0.000
 please+0.000
?"+0.000
Neg logit contributions
seek-0.001
 habitat-0.001
icol-0.001
powers-0.001
growth-0.001
Token girl
Token ID2576
Feature activation+5.05e-04

Pos logit contributions
iring+0.003
 please+0.003
 God+0.003
ying+0.003
mm+0.003
Neg logit contributions
seek-0.008
 habitat-0.008
powers-0.008
icol-0.008
growth-0.008
Token who
Token ID508
Feature activation-3.25e-02

Pos logit contributions
iring+0.001
 please+0.000
 God+0.000
ying+0.000
mm+0.000
Neg logit contributions
seek-0.001
 habitat-0.001
growth-0.001
icol-0.001
powers-0.001
Token was
Token ID373
Feature activation-6.97e-03

Pos logit contributions
seek+0.073
 habitat+0.068
powers+0.066
growth+0.065
icol+0.065
Neg logit contributions
iring-0.036
 who-0.034
 God-0.034
 please-0.034
?"-0.032
Token about
Token ID546
Feature activation-2.34e-02

Pos logit contributions
seek+0.015
 habitat+0.014
growth+0.013
powers+0.013
icol+0.013
Neg logit contributions
iring-0.008
 please-0.008
 God-0.008
 who-0.007
ying-0.007
Token three
Token ID1115
Feature activation-2.68e-02

Pos logit contributions
seek+0.053
 habitat+0.048
icol+0.048
powers+0.047
growth+0.047
Neg logit contributions
 who-0.029
 God-0.028
 please-0.027
?"-0.026
 by-0.025
Token years
Token ID812
Feature activation-7.12e-02

Pos logit contributions
seek+0.064
 habitat+0.059
growth+0.058
icol+0.057
powers+0.057
Neg logit contributions
iring-0.026
 God-0.023
 please-0.023
 who-0.023
?"-0.022
Token old
Token ID1468
Feature activation-1.10e-02

Pos logit contributions
seek+0.199
 habitat+0.181
growth+0.179
icol+0.174
powers+0.171
Neg logit contributions
iring-0.049
 God-0.047
 who-0.043
 please-0.040
?"-0.039
Token.
Token ID13
Feature activation-3.73e-03

Pos logit contributions
seek+0.027
 habitat+0.026
growth+0.025
powers+0.025
icol+0.025
Neg logit contributions
iring-0.009
 who-0.009
 God-0.009
 please-0.009
?"-0.008
Token She
Token ID1375
Feature activation-2.09e-03

Pos logit contributions
seek+0.009
 habitat+0.009
growth+0.008
powers+0.008
icol+0.008
Neg logit contributions
iring-0.003
 please-0.003
 God-0.003
 who-0.003
ying-0.003
Token asked
Token ID1965
Feature activation-1.91e-02

Pos logit contributions
seek+0.005
 habitat+0.004
powers+0.004
growth+0.004
icol+0.004
Neg logit contributions
iring-0.002
 who-0.002
 please-0.002
 God-0.002
?"-0.002
Token him
Token ID683
Feature activation-1.32e-02

Pos logit contributions
seek+0.030
growth+0.028
 habitat+0.027
 arrest+0.026
 bites+0.024
Neg logit contributions
iring-0.030
mm-0.028
nam-0.027
ang-0.026
ble-0.026
Token,
Token ID11
Feature activation+4.38e-03

Pos logit contributions
iring+0.006
 God+0.005
 please+0.005
 who+0.005
ying+0.004
Neg logit contributions
seek-0.027
 habitat-0.025
powers-0.024
growth-0.024
icol-0.024
Token it
Token ID340
Feature activation+1.61e-03

Pos logit contributions
iring+0.004
 please+0.003
 God+0.003
ying+0.003
 who+0.003
Neg logit contributions
seek-0.011
 habitat-0.010
growth-0.010
powers-0.010
icol-0.010
Token kept
Token ID4030
Feature activation+9.94e-03

Pos logit contributions
 who+0.002
iring+0.002
 also+0.002
 please+0.002
ie+0.002
Neg logit contributions
seek-0.004
 habitat-0.003
powers-0.003
icol-0.003
growth-0.003
Token on
Token ID319
Feature activation+8.15e-03

Pos logit contributions
iring+0.010
 God+0.009
 please+0.009
 who+0.009
nam+0.008
Neg logit contributions
seek-0.024
powers-0.022
 habitat-0.022
growth-0.022
icol-0.021
Token flying
Token ID7348
Feature activation+1.00e-02

Pos logit contributions
 by+0.007
 God+0.007
 who+0.007
 happy+0.007
iring+0.007
Neg logit contributions
seek-0.020
powers-0.018
 habitat-0.017
growth-0.017
icol-0.017
Token err
Token ID11454
Feature activation-7.12e-02

Pos logit contributions
iring+0.007
 who+0.007
 by+0.006
 please+0.006
?"+0.006
Neg logit contributions
seek-0.026
 habitat-0.025
powers-0.024
growth-0.024
icol-0.024
Tokenatically
Token ID4142
Feature activation+5.48e-03

Pos logit contributions
seek+0.080
icol+0.078
 habitat+0.078
growth+0.072
powers+0.070
Neg logit contributions
iring-0.147
ying-0.139
 God-0.138
 who-0.137
 please-0.136
Token with
Token ID351
Feature activation-8.77e-03

Pos logit contributions
 who+0.003
iring+0.003
 by+0.003
 please+0.003
 God+0.003
Neg logit contributions
seek-0.015
growth-0.014
icol-0.014
 habitat-0.014
powers-0.014
Token inexplicable
Token ID49602
Feature activation+4.88e-03

Pos logit contributions
seek+0.026
icol+0.023
 habitat+0.023
powers+0.023
ro+0.021
Neg logit contributions
 who-0.005
 God-0.005
 happy-0.005
nam-0.004
iring-0.004
Token speed
Token ID2866
Feature activation+6.35e-03

Pos logit contributions
 please+0.006
iring+0.006
 who+0.006
 do+0.006
 happy+0.005
Neg logit contributions
seek-0.010
growth-0.009
 habitat-0.009
icol-0.009
ashed-0.008
Token.
Token ID13
Feature activation-2.07e-03

Pos logit contributions
 who+0.004
 by+0.003
iring+0.003
 in+0.003
ie+0.003
Neg logit contributions
seek-0.017
growth-0.017
 habitat-0.017
icol-0.017
powers-0.016
Token Tony
Token ID8832
Feature activation+6.86e-03

Pos logit contributions
iring+0.012
 please+0.011
 God+0.011
 who+0.010
?"+0.010
Neg logit contributions
seek-0.032
 habitat-0.030
powers-0.029
growth-0.029
icol-0.029
Token and
Token ID290
Feature activation-1.45e-04

Pos logit contributions
 who+0.006
?"+0.005
 God+0.005
 please+0.005
iring+0.005
Neg logit contributions
seek-0.017
 habitat-0.016
growth-0.016
icol-0.016
powers-0.016
Token Tommy
Token ID19919
Feature activation-3.14e-03

Pos logit contributions
seek+0.000
 habitat+0.000
powers+0.000
icol+0.000
growth+0.000
Neg logit contributions
 God-0.000
 who-0.000
iring-0.000
 please-0.000
nam-0.000
Token were
Token ID547
Feature activation+1.10e-02

Pos logit contributions
seek+0.007
 habitat+0.007
powers+0.007
growth+0.007
icol+0.007
Neg logit contributions
 who-0.003
iring-0.003
 please-0.003
 God-0.003
?"-0.003
Token the
Token ID262
Feature activation+1.27e-03

Pos logit contributions
iring+0.009
 please+0.008
 God+0.008
 who+0.008
?"+0.008
Neg logit contributions
seek-0.027
 habitat-0.026
growth-0.025
powers-0.025
icol-0.025
Token best
Token ID1266
Feature activation+3.87e-02

Pos logit contributions
iring+0.000
 please+0.000
 God+0.000
 who+0.000
?"+0.000
Neg logit contributions
seek-0.004
 habitat-0.004
growth-0.004
icol-0.004
powers-0.004
Token of
Token ID286
Feature activation+2.05e-02

Pos logit contributions
iring+0.009
 who+0.009
 God+0.006
 please+0.006
ying+0.004
Neg logit contributions
seek-0.125
 habitat-0.120
icol-0.115
growth-0.115
powers-0.111
Token friends
Token ID2460
Feature activation+1.21e-02

Pos logit contributions
 please+0.011
iring+0.011
 who+0.011
 God+0.010
?"+0.009
Neg logit contributions
seek-0.058
 habitat-0.055
icol-0.055
growth-0.054
powers-0.052
Token!
Token ID0
Feature activation+3.54e-03

Pos logit contributions
iring+0.010
 please+0.009
 God+0.009
?"+0.008
ying+0.008
Neg logit contributions
seek-0.030
 habitat-0.028
growth-0.027
powers-0.027
icol-0.027
Token<|endoftext|>
Token ID50256
Feature activation-3.12e-02

Pos logit contributions
iring+0.003
 God+0.003
 who+0.003
 please+0.003
?"+0.002
Neg logit contributions
seek-0.009
 habitat-0.008
powers-0.008
icol-0.008
growth-0.008
TokenOnce
Token ID7454
Feature activation+7.26e-03

Pos logit contributions
seek+0.073
 habitat+0.069
powers+0.066
icol+0.066
growth+0.065
Neg logit contributions
iring-0.030
 God-0.028
 who-0.028
 please-0.028
?"-0.027
Token He
Token ID679
Feature activation+3.47e-03

Pos logit contributions
 please+0.001
 who+0.001
 God+0.001
iring+0.001
?"+0.001
Neg logit contributions
seek-0.004
 habitat-0.004
icol-0.004
 drowned-0.004
 arrest-0.004
Token suddenly
Token ID6451
Feature activation-5.98e-03

Pos logit contributions
 who+0.004
 also+0.004
 please+0.003
 happy+0.003
?"+0.003
Neg logit contributions
seek-0.009
 habitat-0.008
growth-0.008
icol-0.007
powers-0.007
Token got
Token ID1392
Feature activation-4.38e-03

Pos logit contributions
seek+0.014
 habitat+0.013
icol+0.012
growth+0.012
powers+0.012
Neg logit contributions
 who-0.007
?"-0.006
 happy-0.006
 at-0.006
 in-0.006
Token a
Token ID257
Feature activation+2.07e-03

Pos logit contributions
seek+0.012
 habitat+0.012
icol+0.011
growth+0.011
 arrest+0.011
Neg logit contributions
 please-0.003
 in-0.003
 by-0.003
 happy-0.003
iring-0.003
Token brilliant
Token ID10457
Feature activation+2.19e-02

Pos logit contributions
 please+0.002
 happy+0.002
 do+0.002
 who+0.002
 girl+0.002
Neg logit contributions
seek-0.005
 habitat-0.005
growth-0.005
 arrest-0.004
keeping-0.004
Token idea
Token ID2126
Feature activation+4.59e-02

Pos logit contributions
 please+0.027
 do+0.025
?"+0.025
 girl+0.024
 happy+0.024
Neg logit contributions
seek-0.051
 habitat-0.047
growth-0.046
 arrest-0.043
powers-0.042
Token;
Token ID26
Feature activation-6.53e-03

Pos logit contributions
 who+0.036
 please+0.033
?"+0.028
 by+0.027
iring+0.026
Neg logit contributions
seek-0.122
 habitat-0.120
powers-0.115
growth-0.114
icol-0.113
Token he
Token ID339
Feature activation-8.07e-04

Pos logit contributions
seek+0.017
 habitat+0.016
growth+0.015
powers+0.015
icol+0.015
Neg logit contributions
 please-0.006
 who-0.005
 God-0.005
iring-0.005
?"-0.005
Token used
Token ID973
Feature activation-1.78e-03

Pos logit contributions
seek+0.002
 habitat+0.002
growth+0.002
 arrest+0.002
icol+0.002
Neg logit contributions
 who-0.001
 please-0.001
?"-0.001
 also-0.001
ie-0.001
Token the
Token ID262
Feature activation-1.67e-02

Pos logit contributions
seek+0.006
 habitat+0.006
icol+0.006
 habitats+0.005
powers+0.005
Neg logit contributions
 the-0.001
 her-0.001
 in-0.001
 please-0.000
 of-0.000
Token soap
Token ID19533
Feature activation-2.23e-02

Pos logit contributions
seek+0.035
 habitat+0.032
growth+0.030
 arrest+0.029
icol+0.029
Neg logit contributions
 please-0.025
 do-0.024
 best-0.024
?"-0.023
 girl-0.023
Token Mom
Token ID11254
Feature activation+2.08e-02

Pos logit contributions
iring+0.011
ying+0.010
bling+0.010
ap+0.010
kins+0.009
Neg logit contributions
powers-0.023
 habitat-0.023
seek-0.023
 arrest-0.022
growth-0.022
Tokenmy
Token ID1820
Feature activation+2.66e-02

Pos logit contributions
iring+0.023
nam+0.021
rac+0.021
bling+0.020
keys+0.020
Neg logit contributions
seek-0.043
 habitat-0.038
 fault-0.036
powers-0.036
icol-0.035
Token.
Token ID13
Feature activation+7.52e-03

Pos logit contributions
iring+0.024
 please+0.022
 God+0.022
 who+0.021
?"+0.021
Neg logit contributions
seek-0.065
 habitat-0.061
growth-0.059
powers-0.059
icol-0.058
Token Let
Token ID3914
Feature activation+1.59e-02

Pos logit contributions
iring+0.006
ying+0.005
 God+0.005
 please+0.005
 who+0.005
Neg logit contributions
seek-0.018
 habitat-0.017
powers-0.017
growth-0.017
icol-0.017
Token's
Token ID338
Feature activation+1.95e-02

Pos logit contributions
iring+0.015
 God+0.013
nam+0.013
 please+0.012
 who+0.012
Neg logit contributions
seek-0.038
 habitat-0.035
growth-0.034
powers-0.034
icol-0.034
Token build
Token ID1382
Feature activation+4.24e-02

Pos logit contributions
iring+0.021
 God+0.019
 please+0.019
ying+0.019
 who+0.018
Neg logit contributions
seek-0.043
 habitat-0.041
growth-0.039
powers-0.039
icol-0.039
Token a
Token ID257
Feature activation+1.40e-02

Pos logit contributions
iring+0.043
 God+0.042
 please+0.042
 who+0.041
?"+0.040
Neg logit contributions
seek-0.098
 habitat-0.091
powers-0.087
growth-0.086
icol-0.086
Token grocery
Token ID16918
Feature activation+3.40e-02

Pos logit contributions
iring+0.016
 please+0.016
 God+0.015
 who+0.015
?"+0.015
Neg logit contributions
seek-0.030
 habitat-0.028
growth-0.027
powers-0.027
icol-0.027
Token store
Token ID3650
Feature activation+2.98e-02

Pos logit contributions
iring+0.043
 God+0.040
 please+0.038
mm+0.037
ying+0.037
Neg logit contributions
seek-0.070
 habitat-0.064
powers-0.062
 arrest-0.059
growth-0.059
Token together
Token ID1978
Feature activation+9.85e-03

Pos logit contributions
iring+0.024
 please+0.022
 God+0.022
 who+0.021
?"+0.020
Neg logit contributions
seek-0.075
 habitat-0.070
growth-0.068
powers-0.068
icol-0.067
Token!"
Token ID2474
Feature activation-1.18e-03

Pos logit contributions
iring+0.009
 please+0.009
 who+0.009
?"+0.009
 God+0.009
Neg logit contributions
seek-0.023
 habitat-0.022
growth-0.021
icol-0.021
powers-0.021
Token day
Token ID1110
Feature activation-3.43e-02

Pos logit contributions
seek+0.015
 habitat+0.014
powers+0.013
growth+0.013
icol+0.012
Neg logit contributions
iring-0.013
nam-0.012
 please-0.012
 God-0.012
ying-0.012
Token,
Token ID11
Feature activation-6.72e-05

Pos logit contributions
seek+0.108
 habitat+0.107
powers+0.101
growth+0.100
icol+0.100
Neg logit contributions
 please-0.009
 who-0.006
 God-0.006
?"-0.005
iring-0.004
Token Lily
Token ID20037
Feature activation-4.87e-03

Pos logit contributions
seek+0.000
 habitat+0.000
powers+0.000
growth+0.000
icol+0.000
Neg logit contributions
iring-0.000
 please-0.000
 who-0.000
 God-0.000
nam-0.000
Token got
Token ID1392
Feature activation-1.91e-03

Pos logit contributions
seek+0.014
powers+0.013
 habitat+0.013
growth+0.013
icol+0.012
Neg logit contributions
?"-0.003
 who-0.003
ie-0.003
?-0.002
 also-0.002
Token an
Token ID281
Feature activation+7.27e-03

Pos logit contributions
seek+0.005
 habitat+0.005
growth+0.005
icol+0.005
powers+0.004
Neg logit contributions
 please-0.002
nam-0.002
iring-0.002
?"-0.002
 who-0.001
Token idea
Token ID2126
Feature activation+4.37e-02

Pos logit contributions
iring+0.008
 God+0.007
 who+0.007
 please+0.007
ying+0.007
Neg logit contributions
seek-0.015
 habitat-0.014
powers-0.014
growth-0.014
icol-0.014
Token.
Token ID13
Feature activation-2.92e-03

Pos logit contributions
 please+0.036
 who+0.035
iring+0.035
?"+0.033
 God+0.033
Neg logit contributions
seek-0.109
 habitat-0.104
growth-0.100
powers-0.100
icol-0.099
Token She
Token ID1375
Feature activation-3.28e-03

Pos logit contributions
seek+0.009
 habitat+0.008
growth+0.008
icol+0.008
 arrest+0.008
Neg logit contributions
 please-0.001
iring-0.001
 God-0.001
 who-0.001
She-0.001
Token decided
Token ID3066
Feature activation-6.52e-06

Pos logit contributions
seek+0.008
 habitat+0.007
growth+0.007
 arrest+0.007
icol+0.007
Neg logit contributions
 who-0.004
?"-0.004
 please-0.003
 also-0.003
ie-0.003
Token to
Token ID284
Feature activation-1.84e-02

Feature not active on prev token; no predictions were affected.
Token find
Token ID1064
Feature activation+1.15e-02

Pos logit contributions
seek+0.046
 habitat+0.042
icol+0.042
powers+0.041
ashed+0.039
Neg logit contributions
 please-0.019
?"-0.018
 do-0.017
 who-0.017
 God-0.017
Token to
Token ID284
Feature activation-1.89e-02

Pos logit contributions
 please+0.004
 who+0.004
?"+0.003
 by+0.003
 God+0.003
Neg logit contributions
seek-0.007
growth-0.006
 habitat-0.006
icol-0.006
powers-0.006
Token share
Token ID2648
Feature activation+1.33e-02

Pos logit contributions
seek+0.045
 habitat+0.041
powers+0.041
icol+0.040
growth+0.040
Neg logit contributions
 please-0.020
 God-0.018
?"-0.018
 who-0.018
 do-0.018
Token his
Token ID465
Feature activation+1.53e-02

Pos logit contributions
iring+0.013
 please+0.012
 God+0.012
 who+0.012
?"+0.012
Neg logit contributions
seek-0.031
 habitat-0.029
growth-0.028
powers-0.028
icol-0.028
Token sa
Token ID473
Feature activation+2.29e-02

Pos logit contributions
iring+0.020
 please+0.019
 God+0.019
 who+0.019
?"+0.019
Neg logit contributions
seek-0.031
 habitat-0.028
growth-0.028
icol-0.027
powers-0.027
Tokenus
Token ID385
Feature activation+1.09e-03

Pos logit contributions
 God+0.038
 please+0.038
iring+0.037
 who+0.036
?"+0.036
Neg logit contributions
seek-0.037
 habitat-0.033
powers-0.032
growth-0.030
icol-0.029
Tokenages
Token ID1095
Feature activation+4.45e-02

Pos logit contributions
iring+0.002
 please+0.002
 God+0.002
 who+0.002
?"+0.002
Neg logit contributions
seek-0.002
 habitat-0.001
powers-0.001
growth-0.001
icol-0.001
Token with
Token ID351
Feature activation+3.41e-03

Pos logit contributions
iring+0.030
 please+0.027
 who+0.026
 God+0.026
?"+0.024
Neg logit contributions
seek-0.118
 habitat-0.112
growth-0.109
icol-0.108
powers-0.108
Token his
Token ID465
Feature activation-1.22e-02

Pos logit contributions
 who+0.003
 God+0.002
 please+0.002
iring+0.002
?"+0.002
Neg logit contributions
seek-0.009
icol-0.009
 habitat-0.009
powers-0.008
growth-0.008
Token brother
Token ID3956
Feature activation-1.13e-02

Pos logit contributions
seek+0.031
growth+0.029
 habitat+0.028
icol+0.028
powers+0.027
Neg logit contributions
iring-0.010
 please-0.010
 God-0.009
 who-0.009
?"-0.009
Token.
Token ID13
Feature activation+1.27e-03

Pos logit contributions
seek+0.031
 habitat+0.029
growth+0.029
icol+0.029
powers+0.029
Neg logit contributions
 who-0.008
 please-0.006
 God-0.006
?"-0.006
iring-0.006
Token He
Token ID679
Feature activation+4.20e-03

Pos logit contributions
 God+0.001
 please+0.001
iring+0.001
 who+0.001
She+0.001
Neg logit contributions
seek-0.003
 habitat-0.003
icol-0.003
powers-0.003
growth-0.003
Token really
Token ID1107
Feature activation-1.62e-02

Pos logit contributions
 who+0.003
iring+0.003
 please+0.003
 God+0.003
?"+0.003
Neg logit contributions
seek-0.008
 habitat-0.007
icol-0.007
growth-0.007
powers-0.007
Token wanted
Token ID2227
Feature activation-1.48e-02

Pos logit contributions
seek+0.036
 habitat+0.033
growth+0.032
icol+0.032
powers+0.031
Neg logit contributions
iring-0.018
 please-0.018
 God-0.017
 who-0.017
?"-0.017
Token to
Token ID284
Feature activation-1.21e-02

Pos logit contributions
seek+0.038
icol+0.038
 habitat+0.037
powers+0.035
growth+0.035
Neg logit contributions
 who-0.014
 by-0.014
 please-0.013
 her-0.013
 in-0.013
Token play
Token ID711
Feature activation-3.51e-03

Pos logit contributions
seek+0.029
 habitat+0.028
icol+0.027
powers+0.027
growth+0.026
Neg logit contributions
 please-0.013
 do-0.012
?"-0.012
 who-0.011
?-0.010
Token in
Token ID287
Feature activation+8.61e-03

Pos logit contributions
seek+0.009
 habitat+0.008
growth+0.008
powers+0.008
icol+0.008
Neg logit contributions
iring-0.003
 God-0.002
 please-0.002
 who-0.002
ying-0.002
Token the
Token ID262
Feature activation+1.16e-02

Pos logit contributions
 please+0.005
 who+0.005
iring+0.005
?"+0.005
 God+0.004
Neg logit contributions
seek-0.024
 habitat-0.023
powers-0.022
icol-0.022
growth-0.022
Token mud
Token ID17492
Feature activation-1.24e-02

Pos logit contributions
iring+0.017
 please+0.016
?"+0.016
She+0.015
 who+0.015
Neg logit contributions
seek-0.023
growth-0.021
 habitat-0.020
icol-0.020
powers-0.020
Token,
Token ID11
Feature activation+2.99e-03

Pos logit contributions
seek+0.033
growth+0.032
icol+0.032
powers+0.032
 habitat+0.032
Neg logit contributions
 please-0.007
iring-0.007
 who-0.007
?"-0.007
ie-0.006
Token so
Token ID523
Feature activation+2.54e-02

Pos logit contributions
iring+0.001
 please+0.001
 God+0.001
 who+0.001
?"+0.001
Neg logit contributions
seek-0.009
 habitat-0.008
growth-0.008
powers-0.008
icol-0.008
Token she
Token ID673
Feature activation+1.01e-02

Pos logit contributions
iring+0.015
 please+0.013
 God+0.013
 who+0.012
?"+0.011
Neg logit contributions
seek-0.070
 habitat-0.066
growth-0.064
powers-0.064
icol-0.064
Token decided
Token ID3066
Feature activation+1.02e-03

Pos logit contributions
 who+0.013
 please+0.012
 also+0.012
ie+0.012
?"+0.011
Neg logit contributions
seek-0.023
 habitat-0.022
growth-0.021
icol-0.020
 arrest-0.020
Token to
Token ID284
Feature activation-1.69e-02

Pos logit contributions
seek+0.007
icol+0.007
growth+0.007
ro+0.006
 habitat+0.006
Neg logit contributions
 who-0.003
?"-0.003
 please-0.003
 your-0.003
 okay-0.003
Token look
Token ID804
Feature activation-1.53e-02

Pos logit contributions
seek+0.040
 habitat+0.037
powers+0.037
icol+0.036
growth+0.036
Neg logit contributions
 please-0.017
 God-0.016
?"-0.016
 who-0.016
iring-0.015
Token out
Token ID503
Feature activation-9.59e-03

Pos logit contributions
seek+0.041
 habitat+0.039
growth+0.038
powers+0.037
icol+0.037
Neg logit contributions
iring-0.010
 please-0.009
 God-0.009
 who-0.009
?"-0.008
Token of
Token ID286
Feature activation+4.29e-03

Pos logit contributions
seek+0.024
 habitat+0.023
powers+0.022
growth+0.022
icol+0.022
Neg logit contributions
iring-0.007
 God-0.006
 please-0.006
 who-0.006
ying-0.006
Token the
Token ID262
Feature activation-9.71e-03

Pos logit contributions
?"+0.002
 your+0.002
 God+0.002
 who+0.002
?+0.001
Neg logit contributions
seek-0.014
ashed-0.013
powers-0.012
 habitat-0.012
icol-0.012
Token window
Token ID4324
Feature activation-1.51e-02

Pos logit contributions
seek+0.024
growth+0.020
powers+0.020
 habitat+0.020
icol+0.020
Neg logit contributions
iring-0.011
 God-0.010
 please-0.010
 who-0.010
 girl-0.010
Token."
Token ID526
Feature activation-1.76e-02

Pos logit contributions
seek+0.034
 habitat+0.033
powers+0.031
growth+0.031
icol+0.030
Neg logit contributions
iring-0.015
 God-0.014
 please-0.013
nam-0.013
ying-0.013
Token
Token ID198
Feature activation-6.40e-05

Pos logit contributions
seek+0.041
 habitat+0.038
growth+0.037
icol+0.037
powers+0.037
Neg logit contributions
iring-0.017
 who-0.017
 please-0.017
 God-0.016
?"-0.016
Token
Token ID198
Feature activation-1.20e-04

Pos logit contributions
seek+0.000
 habitat+0.000
growth+0.000
icol+0.000
powers+0.000
Neg logit contributions
iring-0.000
 please-0.000
 God-0.000
 who-0.000
?"-0.000
TokenJimmy
Token ID40335
Feature activation-1.47e-02

Pos logit contributions
seek+0.000
 habitat+0.000
powers+0.000
growth+0.000
icol+0.000
Neg logit contributions
iring-0.000
 God-0.000
 please-0.000
?"-0.000
ying-0.000
Token smiled
Token ID13541
Feature activation+6.95e-04

Pos logit contributions
seek+0.034
 habitat+0.032
powers+0.031
growth+0.031
icol+0.031
Neg logit contributions
 who-0.016
 please-0.014
 God-0.014
 also-0.014
iring-0.014
Token gone
Token ID3750
Feature activation+9.06e-03

Pos logit contributions
seek+0.009
 habitat+0.009
growth+0.009
icol+0.009
powers+0.008
Neg logit contributions
iring-0.003
 who-0.003
 please-0.003
 God-0.003
?"-0.002
Token.
Token ID13
Feature activation+1.20e-03

Pos logit contributions
 who+0.005
iring+0.005
?"+0.005
 please+0.005
 by+0.004
Neg logit contributions
seek-0.025
growth-0.024
icol-0.024
 habitat-0.024
powers-0.023
Token Lily
Token ID20037
Feature activation-5.02e-03

Pos logit contributions
iring+0.001
 please+0.001
 God+0.001
 who+0.001
?"+0.001
Neg logit contributions
seek-0.003
 habitat-0.003
icol-0.003
growth-0.003
powers-0.003
Token cries
Token ID24691
Feature activation-2.97e-02

Pos logit contributions
seek+0.014
 habitat+0.013
growth+0.012
icol+0.012
powers+0.012
Neg logit contributions
 who-0.004
?"-0.003
iring-0.003
 please-0.003
 also-0.003
Token.
Token ID13
Feature activation-6.10e-03

Pos logit contributions
seek+0.073
 habitat+0.069
growth+0.066
powers+0.066
icol+0.066
Neg logit contributions
iring-0.026
 please-0.024
 God-0.024
 who-0.023
?"-0.022
Token She
Token ID1375
Feature activation-6.48e-03

Pos logit contributions
seek+0.016
 habitat+0.015
growth+0.014
powers+0.014
icol+0.014
Neg logit contributions
iring-0.004
 please-0.004
 God-0.004
 who-0.004
?"-0.004
Token is
Token ID318
Feature activation-8.73e-03

Pos logit contributions
seek+0.016
 habitat+0.014
icol+0.013
growth+0.013
powers+0.013
Neg logit contributions
 who-0.007
?"-0.007
iring-0.007
 please-0.006
 do-0.006
Token alone
Token ID3436
Feature activation-6.13e-04

Pos logit contributions
seek+0.022
 habitat+0.021
growth+0.020
powers+0.020
icol+0.020
Neg logit contributions
iring-0.007
 please-0.006
 God-0.006
 who-0.006
?"-0.006
Token.
Token ID13
Feature activation+2.15e-03

Pos logit contributions
seek+0.002
 habitat+0.002
icol+0.002
growth+0.002
powers+0.002
Neg logit contributions
 who-0.000
iring-0.000
?"-0.000
 please-0.000
 God-0.000
Token She
Token ID1375
Feature activation+5.60e-04

Pos logit contributions
iring+0.002
 please+0.002
 God+0.002
 who+0.002
?"+0.001
Neg logit contributions
seek-0.005
 habitat-0.005
growth-0.005
powers-0.005
icol-0.005
Token is
Token ID318
Feature activation-1.76e-03

Pos logit contributions
?"+0.000
 who+0.000
 do+0.000
 also+0.000
 are+0.000
Neg logit contributions
seek-0.001
 habitat-0.001
icol-0.001
growth-0.001
powers-0.001
Token dog
Token ID3290
Feature activation+2.06e-03

Pos logit contributions
seek+0.008
 habitat+0.007
growth+0.007
icol+0.007
powers+0.007
Neg logit contributions
 who-0.003
 please-0.003
iring-0.003
 God-0.003
 girl-0.003
Token lots
Token ID6041
Feature activation-4.13e-03

Pos logit contributions
 who+0.002
 God+0.002
 by+0.002
 please+0.002
 kind+0.002
Neg logit contributions
seek-0.005
icol-0.005
 habitat-0.005
growth-0.005
powers-0.005
Token of
Token ID286
Feature activation+1.15e-03

Pos logit contributions
seek+0.014
 habitat+0.013
icol+0.013
growth+0.013
powers+0.012
Neg logit contributions
iring-0.000
 who-0.000
 please-0.000
 God-0.000
 of-0.000
Token c
Token ID269
Feature activation-3.50e-02

Pos logit contributions
 God+0.002
 please+0.001
 who+0.001
?"+0.001
 happy+0.001
Neg logit contributions
seek-0.002
icol-0.002
 habitat-0.002
growth-0.002
 arrest-0.002
Tokenudd
Token ID4185
Feature activation-1.56e-02

Pos logit contributions
seek+0.067
growth+0.062
 habitat+0.058
icol+0.056
 arrest+0.056
Neg logit contributions
iring-0.055
mm-0.046
ying-0.046
 please-0.045
ie-0.045
Tokenles
Token ID829
Feature activation+6.42e-03

Pos logit contributions
seek+0.023
 habitat+0.020
powers+0.019
growth+0.019
icol+0.019
Neg logit contributions
iring-0.029
 please-0.028
 God-0.028
 who-0.027
?"-0.027
Token and
Token ID290
Feature activation-8.83e-03

Pos logit contributions
 who+0.003
 in+0.002
 please+0.002
 by+0.002
iring+0.002
Neg logit contributions
seek-0.020
 habitat-0.018
icol-0.018
growth-0.018
powers-0.018
Token took
Token ID1718
Feature activation+1.78e-03

Pos logit contributions
seek+0.022
 habitat+0.020
growth+0.020
powers+0.019
icol+0.019
Neg logit contributions
 who-0.009
 God-0.009
 happy-0.008
 please-0.008
 kind-0.007
Token her
Token ID607
Feature activation-9.01e-03

Pos logit contributions
 God+0.002
 in+0.001
 by+0.001
 please+0.001
 who+0.001
Neg logit contributions
seek-0.005
 habitat-0.004
icol-0.004
powers-0.004
 fault-0.004
Token for
Token ID329
Feature activation+1.43e-02

Pos logit contributions
seek+0.022
icol+0.020
 habitat+0.019
growth+0.019
 arrest+0.019
Neg logit contributions
 who-0.010
 in-0.009
 by-0.009
 please-0.009
?"-0.009
Token walks
Token ID11114
Feature activation-2.04e-02

Pos logit contributions
 God+0.011
 who+0.010
 please+0.010
 happy+0.009
 okay+0.008
Neg logit contributions
seek-0.038
 habitat-0.036
powers-0.036
ashed-0.036
icol-0.036
Token big
Token ID1263
Feature activation+7.02e-04

Pos logit contributions
seek+0.012
icol+0.012
powers+0.012
 habitat+0.012
growth+0.011
Neg logit contributions
ying-0.010
iring-0.010
mm-0.009
 God-0.009
rac-0.009
Token flower
Token ID15061
Feature activation-1.53e-02

Pos logit contributions
iring+0.001
ying+0.001
nam+0.001
 God+0.001
She+0.001
Neg logit contributions
icol-0.001
 arrest-0.001
seek-0.001
 habitat-0.001
 bumps-0.001
Token with
Token ID351
Feature activation-2.05e-02

Pos logit contributions
seek+0.036
 habitat+0.035
growth+0.033
powers+0.033
icol+0.032
Neg logit contributions
 God-0.013
iring-0.013
 please-0.012
ying-0.012
mm-0.012
Token lots
Token ID6041
Feature activation-2.36e-02

Pos logit contributions
seek+0.055
 habitat+0.050
icol+0.049
powers+0.047
ro+0.046
Neg logit contributions
iring-0.016
nam-0.015
 who-0.015
She-0.015
 God-0.014
Token of
Token ID286
Feature activation-1.50e-02

Pos logit contributions
seek+0.078
icol+0.073
 habitat+0.073
growth+0.070
powers+0.068
Neg logit contributions
iring-0.005
 who-0.002
 please-0.001
?"-0.001
ying-0.001
Token honey
Token ID12498
Feature activation+1.96e-03

Pos logit contributions
seek+0.035
 habitat+0.032
icol+0.031
growth+0.030
powers+0.029
Neg logit contributions
iring-0.017
 God-0.015
 who-0.015
 please-0.015
?"-0.015
Token in
Token ID287
Feature activation-7.19e-03

Pos logit contributions
iring+0.002
 God+0.002
 please+0.002
 who+0.001
?"+0.001
Neg logit contributions
seek-0.005
 habitat-0.005
growth-0.004
powers-0.004
icol-0.004
Token it
Token ID340
Feature activation+1.23e-02

Pos logit contributions
seek+0.020
 habitat+0.018
icol+0.018
powers+0.018
ashed+0.017
Neg logit contributions
?"-0.005
iring-0.005
 who-0.004
She-0.004
 please-0.004
Token.
Token ID13
Feature activation+6.54e-04

Pos logit contributions
iring+0.011
 please+0.010
 who+0.010
 God+0.009
?"+0.009
Neg logit contributions
seek-0.030
 habitat-0.028
growth-0.028
icol-0.027
powers-0.027
Token Tweet
Token ID18752
Feature activation+1.65e-02

Pos logit contributions
iring+0.001
 God+0.001
ying+0.001
 please+0.001
ma+0.001
Neg logit contributions
seek-0.001
 habitat-0.001
powers-0.001
growth-0.001
icol-0.001
Token wanted
Token ID2227
Feature activation-2.07e-02

Pos logit contributions
iring+0.008
 please+0.007
 God+0.007
nam+0.006
ying+0.005
Neg logit contributions
seek-0.047
 habitat-0.044
powers-0.043
growth-0.042
icol-0.042
Token The
Token ID383
Feature activation+2.62e-03

Pos logit contributions
seek+0.001
 habitat+0.001
powers+0.001
icol+0.001
growth+0.001
Neg logit contributions
iring-0.000
 God-0.000
 who-0.000
 please-0.000
She-0.000
Token girl
Token ID2576
Feature activation-5.16e-04

Pos logit contributions
iring+0.002
 God+0.002
 please+0.002
 who+0.001
?"+0.001
Neg logit contributions
seek-0.007
 habitat-0.007
powers-0.006
growth-0.006
icol-0.006
Token never
Token ID1239
Feature activation+5.02e-03

Pos logit contributions
seek+0.001
 habitat+0.001
icol+0.001
growth+0.001
powers+0.001
Neg logit contributions
 who-0.001
iring-0.000
 please-0.000
 God-0.000
?"-0.000
Token saw
Token ID2497
Feature activation+1.46e-03

Pos logit contributions
 who+0.006
 please+0.006
ie+0.006
 do+0.006
 happy+0.006
Neg logit contributions
growth-0.011
seek-0.010
icol-0.010
 habitat-0.010
powers-0.010
Token her
Token ID607
Feature activation-7.94e-03

Pos logit contributions
 God+0.001
 who+0.001
iring+0.001
 please+0.001
ying+0.001
Neg logit contributions
seek-0.004
 habitat-0.004
powers-0.004
icol-0.004
growth-0.004
Token home
Token ID1363
Feature activation-3.00e-02

Pos logit contributions
seek+0.019
 habitat+0.017
powers+0.017
icol+0.017
growth+0.017
Neg logit contributions
iring-0.008
 who-0.008
 God-0.008
 please-0.007
?"-0.007
Token again
Token ID757
Feature activation+4.06e-03

Pos logit contributions
seek+0.062
 habitat+0.057
growth+0.055
powers+0.055
icol+0.054
Neg logit contributions
iring-0.038
 please-0.035
 God-0.035
 who-0.034
?"-0.033
Token.
Token ID13
Feature activation+3.06e-03

Pos logit contributions
iring+0.003
 please+0.003
 who+0.003
 God+0.003
?"+0.003
Neg logit contributions
seek-0.010
 habitat-0.010
powers-0.010
growth-0.009
icol-0.009
Token<|endoftext|>
Token ID50256
Feature activation-3.43e-02

Pos logit contributions
iring+0.003
 please+0.003
 God+0.003
 who+0.003
?"+0.002
Neg logit contributions
seek-0.007
 habitat-0.007
growth-0.007
powers-0.007
icol-0.006
TokenOnce
Token ID7454
Feature activation+6.27e-03

Pos logit contributions
seek+0.081
 habitat+0.077
powers+0.074
icol+0.074
growth+0.073
Neg logit contributions
iring-0.032
 please-0.029
 God-0.029
 who-0.029
?"-0.029
Token upon
Token ID2402
Feature activation+9.36e-03

Pos logit contributions
iring+0.007
keys+0.007
mm+0.007
bling+0.007
nam+0.007
Neg logit contributions
seek-0.011
powers-0.011
 habitat-0.010
 fault-0.010
ashed-0.010
Token a
Token ID257
Feature activation-1.23e-03

Pos logit contributions
 please+0.003
iring+0.003
 who+0.003
 happy+0.003
 God+0.002
Neg logit contributions
seek-0.012
 habitat-0.012
icol-0.011
growth-0.011
powers-0.011
Token sense
Token ID2565
Feature activation+3.78e-03

Pos logit contributions
seek+0.003
 habitat+0.002
growth+0.002
powers+0.002
icol+0.002
Neg logit contributions
iring-0.002
 please-0.001
 God-0.001
 who-0.001
?"-0.001
Token of
Token ID286
Feature activation+6.11e-03

Pos logit contributions
iring+0.002
 please+0.001
 who+0.001
 God+0.001
ying+0.001
Neg logit contributions
seek-0.011
 habitat-0.011
growth-0.010
powers-0.010
icol-0.010
Token accomplishment
Token ID29340
Feature activation+3.11e-02

Pos logit contributions
 happy+0.008
 God+0.008
 please+0.008
 do+0.008
 kind+0.008
Neg logit contributions
seek-0.013
 habitat-0.012
icol-0.011
powers-0.011
growth-0.011
Token and
Token ID290
Feature activation+2.65e-04

Pos logit contributions
iring+0.026
 please+0.024
 God+0.024
 who+0.024
ying+0.022
Neg logit contributions
seek-0.077
 habitat-0.073
growth-0.070
powers-0.070
icol-0.069
Token pride
Token ID11293
Feature activation-2.02e-02

Pos logit contributions
 who+0.000
 God+0.000
 happy+0.000
iring+0.000
 please+0.000
Neg logit contributions
seek-0.001
 habitat-0.001
powers-0.001
growth-0.001
icol-0.001
Token.
Token ID13
Feature activation-1.51e-03

Pos logit contributions
seek+0.050
 habitat+0.049
growth+0.047
powers+0.047
icol+0.046
Neg logit contributions
iring-0.017
 who-0.016
 please-0.015
 God-0.015
ying-0.014
Token She
Token ID1375
Feature activation-3.45e-03

Pos logit contributions
seek+0.004
 habitat+0.004
growth+0.004
 arrest+0.004
powers+0.004
Neg logit contributions
iring-0.001
 God-0.001
 please-0.001
 who-0.001
She-0.001
Token learned
Token ID4499
Feature activation-1.28e-02

Pos logit contributions
seek+0.008
 habitat+0.008
growth+0.007
powers+0.007
icol+0.007
Neg logit contributions
 who-0.004
 also-0.003
 please-0.003
?"-0.003
 happy-0.003
Token that
Token ID326
Feature activation+6.03e-03

Pos logit contributions
seek+0.033
 habitat+0.031
icol+0.031
powers+0.030
growth+0.030
Neg logit contributions
iring-0.010
 who-0.010
 God-0.010
 please-0.010
?"-0.009
Token had
Token ID550
Feature activation-7.06e-03

Pos logit contributions
iring+0.006
 please+0.006
 God+0.006
 who+0.005
?"+0.005
Neg logit contributions
seek-0.014
 habitat-0.013
growth-0.013
powers-0.013
icol-0.012
Token he
Token ID339
Feature activation-1.61e-02

Pos logit contributions
 God+0.015
iring+0.014
 please+0.014
 who+0.014
?"+0.013
Neg logit contributions
seek-0.052
 habitat-0.049
icol-0.047
powers-0.047
growth-0.047
Token saw
Token ID2497
Feature activation-1.19e-02

Pos logit contributions
seek+0.036
icol+0.033
 habitat+0.032
powers+0.031
growth+0.031
Neg logit contributions
?"-0.020
 who-0.019
 do-0.018
?-0.017
 please-0.017
Token one
Token ID530
Feature activation-1.36e-02

Pos logit contributions
seek+0.032
powers+0.029
icol+0.029
 habitat+0.029
growth+0.028
Neg logit contributions
 who-0.009
 God-0.009
?"-0.008
iring-0.008
 please-0.007
Token.
Token ID13
Feature activation-2.11e-03

Pos logit contributions
seek+0.032
 habitat+0.030
powers+0.029
growth+0.029
icol+0.029
Neg logit contributions
iring-0.012
 God-0.011
 please-0.011
 who-0.010
ying-0.010
Token One
Token ID1881
Feature activation-5.70e-05

Pos logit contributions
seek+0.005
 habitat+0.005
powers+0.005
icol+0.005
 arrest+0.004
Neg logit contributions
 God-0.002
 who-0.002
 please-0.002
?"-0.002
iring-0.002
Token day
Token ID1110
Feature activation-3.42e-02

Pos logit contributions
seek+0.000
 habitat+0.000
powers+0.000
growth+0.000
icol+0.000
Neg logit contributions
iring-0.000
 please-0.000
 God-0.000
ying-0.000
 who-0.000
Token,
Token ID11
Feature activation-5.93e-03

Pos logit contributions
seek+0.103
 habitat+0.102
icol+0.096
powers+0.096
growth+0.094
Neg logit contributions
 please-0.014
 God-0.011
 who-0.011
?"-0.009
 by-0.008
Token his
Token ID465
Feature activation-4.94e-04

Pos logit contributions
seek+0.015
 habitat+0.014
powers+0.014
icol+0.014
growth+0.013
Neg logit contributions
 who-0.005
iring-0.005
 God-0.005
 please-0.005
nam-0.004
Token mom
Token ID1995
Feature activation-5.94e-03

Pos logit contributions
seek+0.002
growth+0.001
 habitat+0.001
icol+0.001
powers+0.001
Neg logit contributions
 God-0.000
 best-0.000
 who-0.000
iring-0.000
 please-0.000
Token took
Token ID1718
Feature activation+7.65e-03

Pos logit contributions
seek+0.016
 habitat+0.015
powers+0.015
growth+0.015
icol+0.014
Neg logit contributions
iring-0.004
 who-0.004
 please-0.004
?"-0.004
 God-0.004
Token him
Token ID683
Feature activation-6.07e-03

Pos logit contributions
 God+0.006
 please+0.006
 who+0.005
 by+0.005
iring+0.005
Neg logit contributions
seek-0.020
 habitat-0.019
powers-0.019
icol-0.018
growth-0.018
Token and
Token ID290
Feature activation+1.10e-02

Pos logit contributions
iring+0.009
 please+0.008
 God+0.008
ying+0.008
?"+0.008
Neg logit contributions
seek-0.024
 habitat-0.023
growth-0.022
powers-0.022
icol-0.022
Token saw
Token ID2497
Feature activation-2.93e-03

Pos logit contributions
iring+0.012
 God+0.012
 who+0.012
 please+0.012
?"+0.011
Neg logit contributions
seek-0.025
 habitat-0.023
powers-0.022
icol-0.022
growth-0.022
Token a
Token ID257
Feature activation-5.32e-03

Pos logit contributions
seek+0.008
 habitat+0.007
icol+0.007
powers+0.007
 arrest+0.006
Neg logit contributions
 who-0.003
iring-0.003
 God-0.003
?"-0.002
 please-0.002
Token big
Token ID1263
Feature activation+1.02e-03

Pos logit contributions
seek+0.012
 habitat+0.012
growth+0.011
powers+0.011
icol+0.011
Neg logit contributions
iring-0.005
 please-0.005
 God-0.005
 who-0.005
?"-0.005
Token,
Token ID11
Feature activation+8.76e-03

Pos logit contributions
iring+0.001
ying+0.001
 God+0.001
 please+0.001
 who+0.001
Neg logit contributions
seek-0.002
 habitat-0.002
icol-0.002
growth-0.002
powers-0.002
Token black
Token ID2042
Feature activation-2.35e-02

Pos logit contributions
iring+0.011
ying+0.010
 please+0.010
 God+0.010
mm+0.010
Neg logit contributions
seek-0.017
 habitat-0.016
icol-0.016
powers-0.016
growth-0.016
Token cow
Token ID9875
Feature activation+3.27e-03

Pos logit contributions
seek+0.051
 habitat+0.047
powers+0.045
growth+0.045
icol+0.045
Neg logit contributions
iring-0.027
 please-0.026
 God-0.025
 who-0.025
?"-0.024
Token.
Token ID13
Feature activation+1.61e-03

Pos logit contributions
iring+0.003
 please+0.003
 who+0.003
 God+0.003
?"+0.002
Neg logit contributions
seek-0.008
 habitat-0.007
powers-0.007
growth-0.007
icol-0.007
Token Farmer
Token ID37755
Feature activation+8.82e-03

Pos logit contributions
iring+0.002
ying+0.002
 please+0.002
 God+0.002
 who+0.002
Neg logit contributions
seek-0.003
 habitat-0.003
growth-0.003
icol-0.003
powers-0.003
Token David
Token ID3271
Feature activation+6.09e-03

Pos logit contributions
iring+0.012
 please+0.012
 God+0.011
 who+0.011
ying+0.011
Neg logit contributions
seek-0.017
 habitat-0.016
growth-0.015
powers-0.015
icol-0.015
Token was
Token ID373
Feature activation-4.56e-04

Pos logit contributions
iring+0.006
 who+0.006
 please+0.006
 God+0.006
?"+0.006
Neg logit contributions
seek-0.014
 habitat-0.013
powers-0.012
growth-0.012
icol-0.012
Token looks
Token ID3073
Feature activation-6.20e-03

Pos logit contributions
 also+0.004
?"+0.004
 who+0.003
 do+0.003
 at+0.003
Neg logit contributions
seek-0.007
keeping-0.006
 habitat-0.006
growth-0.006
icol-0.006
Token around
Token ID1088
Feature activation+3.44e-03

Pos logit contributions
seek+0.017
 habitat+0.016
growth+0.015
powers+0.015
icol+0.015
Neg logit contributions
iring-0.004
 please-0.004
 who-0.004
 God-0.004
?"-0.004
Token the
Token ID262
Feature activation-7.57e-03

Pos logit contributions
iring+0.002
 God+0.002
 please+0.002
 who+0.002
?"+0.002
Neg logit contributions
seek-0.009
 habitat-0.009
powers-0.009
growth-0.008
icol-0.008
Token room
Token ID2119
Feature activation-4.36e-03

Pos logit contributions
seek+0.018
 habitat+0.017
growth+0.017
powers+0.017
icol+0.017
Neg logit contributions
 who-0.007
iring-0.007
 girl-0.006
She-0.006
ma-0.006
Token and
Token ID290
Feature activation+4.60e-03

Pos logit contributions
seek+0.012
 habitat+0.012
icol+0.011
growth+0.011
powers+0.011
Neg logit contributions
 who-0.002
iring-0.002
 please-0.002
 God-0.002
 by-0.002
Token tries
Token ID8404
Feature activation-2.40e-02

Pos logit contributions
 who+0.005
 God+0.005
 please+0.005
iring+0.005
?"+0.005
Neg logit contributions
seek-0.010
 habitat-0.009
powers-0.009
growth-0.009
icol-0.009
Token to
Token ID284
Feature activation-3.92e-02

Pos logit contributions
icol+0.053
 habitat+0.052
growth+0.052
seek+0.050
 arrest+0.049
Neg logit contributions
 who-0.027
 by-0.026
 please-0.025
iring-0.025
 God-0.024
Token find
Token ID1064
Feature activation-6.97e-04

Pos logit contributions
seek+0.082
 habitat+0.077
icol+0.074
powers+0.074
growth+0.073
Neg logit contributions
iring-0.047
 please-0.046
 God-0.045
 who-0.045
?"-0.043
Token Mia
Token ID32189
Feature activation-2.84e-03

Pos logit contributions
seek+0.002
 habitat+0.002
powers+0.001
icol+0.001
growth+0.001
Neg logit contributions
iring-0.001
 God-0.001
 please-0.001
 who-0.001
?"-0.001
Token.
Token ID13
Feature activation-5.77e-03

Pos logit contributions
seek+0.007
 habitat+0.007
icol+0.007
growth+0.007
powers+0.007
Neg logit contributions
 who-0.002
iring-0.002
 please-0.002
 God-0.002
?"-0.002
Token He
Token ID679
Feature activation-2.59e-03

Pos logit contributions
seek+0.015
 habitat+0.015
icol+0.014
 arrest+0.014
growth+0.014
Neg logit contributions
 please-0.004
iring-0.004
 She-0.003
She-0.003
 who-0.003
Token at
Token ID379
Feature activation+1.34e-02

Pos logit contributions
 by+0.004
 who+0.004
 in+0.004
iring+0.004
 on+0.004
Neg logit contributions
growth-0.026
seek-0.026
 habitat-0.025
icol-0.024
powers-0.024
Token his
Token ID465
Feature activation-5.10e-03

Pos logit contributions
 God+0.012
iring+0.012
?"+0.011
 who+0.011
 please+0.010
Neg logit contributions
seek-0.032
 habitat-0.031
icol-0.031
powers-0.030
growth-0.029
Token salon
Token ID46608
Feature activation+3.10e-02

Pos logit contributions
seek+0.010
growth+0.010
icol+0.010
 habitat+0.009
powers+0.009
Neg logit contributions
iring-0.007
 please-0.006
 God-0.006
 who-0.006
?"-0.006
Token and
Token ID290
Feature activation-5.35e-03

Pos logit contributions
iring+0.022
 who+0.022
 please+0.021
 God+0.020
?"+0.019
Neg logit contributions
seek-0.078
 habitat-0.075
growth-0.075
icol-0.074
powers-0.073
Token running
Token ID2491
Feature activation-2.61e-02

Pos logit contributions
seek+0.012
growth+0.012
powers+0.012
icol+0.012
 habitat+0.011
Neg logit contributions
 who-0.006
 God-0.005
 please-0.005
 do-0.005
?"-0.005
Token err
Token ID11454
Feature activation-7.78e-02

Pos logit contributions
growth+0.070
icol+0.067
 habitat+0.067
seek+0.065
 arrest+0.064
Neg logit contributions
 in-0.020
 by-0.018
 who-0.017
 at-0.016
 on-0.016
Tokenands
Token ID1746
Feature activation-3.04e-02

Pos logit contributions
icol+0.086
 habitat+0.084
growth+0.082
seek+0.082
powers+0.075
Neg logit contributions
iring-0.161
ying-0.155
 God-0.154
 who-0.154
 please-0.152
Token.
Token ID13
Feature activation-3.71e-03

Pos logit contributions
growth+0.083
icol+0.082
 habitat+0.081
seek+0.080
powers+0.076
Neg logit contributions
 who-0.019
 by-0.017
 in-0.016
iring-0.015
 at-0.015
Token But
Token ID887
Feature activation-1.09e-02

Pos logit contributions
seek+0.008
 habitat+0.007
growth+0.007
powers+0.007
icol+0.007
Neg logit contributions
iring-0.004
 please-0.004
 God-0.004
 who-0.004
?"-0.004
Token every
Token ID790
Feature activation+2.62e-02

Pos logit contributions
seek+0.021
 habitat+0.020
growth+0.019
 drowned+0.018
 arrest+0.017
Neg logit contributions
iring-0.015
ying-0.013
 please-0.013
bling-0.013
kins-0.013
Token day
Token ID1110
Feature activation-1.07e-02

Pos logit contributions
 please+0.031
 God+0.031
?"+0.031
 who+0.031
iring+0.027
Neg logit contributions
seek-0.057
 habitat-0.053
icol-0.053
powers-0.051
growth-0.051
Token needed
Token ID2622
Feature activation-2.11e-02

Pos logit contributions
seek+0.046
 habitat+0.043
powers+0.042
growth+0.041
icol+0.041
Neg logit contributions
iring-0.023
 God-0.021
 please-0.020
ying-0.020
 who-0.019
Token to
Token ID284
Feature activation-9.63e-03

Pos logit contributions
seek+0.045
icol+0.042
 habitat+0.042
growth+0.041
powers+0.040
Neg logit contributions
 please-0.028
 God-0.027
 who-0.026
?"-0.025
 do-0.023
Token go
Token ID467
Feature activation+2.22e-02

Pos logit contributions
seek+0.022
 habitat+0.021
powers+0.020
growth+0.020
icol+0.020
Neg logit contributions
iring-0.009
 please-0.009
 God-0.009
 who-0.009
?"-0.008
Token do
Token ID466
Feature activation+2.20e-02

Pos logit contributions
iring+0.032
 God+0.029
nam+0.029
mm+0.029
ying+0.029
Neg logit contributions
seek-0.043
 habitat-0.038
icol-0.036
powers-0.036
growth-0.035
Token some
Token ID617
Feature activation+1.15e-02

Pos logit contributions
 who+0.021
?"+0.019
 please+0.019
 do+0.018
 God+0.017
Neg logit contributions
seek-0.052
 habitat-0.050
growth-0.050
icol-0.049
powers-0.047
Token err
Token ID11454
Feature activation-7.34e-02

Pos logit contributions
iring+0.017
 please+0.016
 God+0.016
 who+0.016
?"+0.016
Neg logit contributions
seek-0.021
 habitat-0.020
growth-0.019
icol-0.019
powers-0.018
Tokenands
Token ID1746
Feature activation-2.44e-02

Pos logit contributions
seek+0.095
 habitat+0.092
icol+0.089
growth+0.089
powers+0.084
Neg logit contributions
iring-0.140
 please-0.135
 God-0.135
ying-0.133
 who-0.132
Token.
Token ID13
Feature activation-8.61e-04

Pos logit contributions
seek+0.061
 habitat+0.058
growth+0.057
icol+0.056
powers+0.056
Neg logit contributions
iring-0.019
 please-0.018
 who-0.018
 God-0.017
?"-0.017
Token\n
Token ID198
Feature activation-3.02e-03

Pos logit contributions
seek+0.002
 habitat+0.002
growth+0.002
powers+0.002
icol+0.002
Neg logit contributions
iring-0.001
 please-0.001
 God-0.001
 who-0.001
ying-0.001
Token\n
Token ID198
Feature activation-2.99e-03

Pos logit contributions
seek+0.007
 habitat+0.006
powers+0.006
growth+0.006
icol+0.006
Neg logit contributions
iring-0.003
 God-0.003
ying-0.003
nam-0.003
 please-0.003
TokenMom
Token ID29252
Feature activation-1.45e-02

Pos logit contributions
seek+0.007
 habitat+0.006
powers+0.006
icol+0.006
growth+0.006
Neg logit contributions
iring-0.003
 God-0.003
ying-0.003
mm-0.002
?"-0.002
Token girl
Token ID2576
Feature activation+5.05e-04

Pos logit contributions
iring+0.003
 please+0.003
 God+0.003
ying+0.003
mm+0.003
Neg logit contributions
seek-0.008
 habitat-0.008
powers-0.008
icol-0.008
growth-0.008
Token who
Token ID508
Feature activation-3.25e-02

Pos logit contributions
iring+0.001
 please+0.000
 God+0.000
ying+0.000
mm+0.000
Neg logit contributions
seek-0.001
 habitat-0.001
growth-0.001
icol-0.001
powers-0.001
Token was
Token ID373
Feature activation-6.97e-03

Pos logit contributions
seek+0.073
 habitat+0.068
powers+0.066
growth+0.065
icol+0.065
Neg logit contributions
iring-0.036
 who-0.034
 God-0.034
 please-0.034
?"-0.032
Token about
Token ID546
Feature activation-2.34e-02

Pos logit contributions
seek+0.015
 habitat+0.014
growth+0.013
powers+0.013
icol+0.013
Neg logit contributions
iring-0.008
 please-0.008
 God-0.008
 who-0.007
ying-0.007
Token three
Token ID1115
Feature activation-2.68e-02

Pos logit contributions
seek+0.053
 habitat+0.048
icol+0.048
powers+0.047
growth+0.047
Neg logit contributions
 who-0.029
 God-0.028
 please-0.027
?"-0.026
 by-0.025
Token years
Token ID812
Feature activation-7.12e-02

Pos logit contributions
seek+0.064
 habitat+0.059
growth+0.058
icol+0.057
powers+0.057
Neg logit contributions
iring-0.026
 God-0.023
 please-0.023
 who-0.023
?"-0.022
Token old
Token ID1468
Feature activation-1.10e-02

Pos logit contributions
seek+0.199
 habitat+0.181
growth+0.179
icol+0.174
powers+0.171
Neg logit contributions
iring-0.049
 God-0.047
 who-0.043
 please-0.040
?"-0.039
Token.
Token ID13
Feature activation-3.73e-03

Pos logit contributions
seek+0.027
 habitat+0.026
growth+0.025
powers+0.025
icol+0.025
Neg logit contributions
iring-0.009
 who-0.009
 God-0.009
 please-0.009
?"-0.008
Token She
Token ID1375
Feature activation-2.09e-03

Pos logit contributions
seek+0.009
 habitat+0.009
growth+0.008
powers+0.008
icol+0.008
Neg logit contributions
iring-0.003
 please-0.003
 God-0.003
 who-0.003
ying-0.003
Token asked
Token ID1965
Feature activation-1.91e-02

Pos logit contributions
seek+0.005
 habitat+0.004
powers+0.004
growth+0.004
icol+0.004
Neg logit contributions
iring-0.002
 who-0.002
 please-0.002
 God-0.002
?"-0.002
Token him
Token ID683
Feature activation-1.32e-02

Pos logit contributions
seek+0.030
growth+0.028
 habitat+0.027
 arrest+0.026
 bites+0.024
Neg logit contributions
iring-0.030
mm-0.028
nam-0.027
ang-0.026
ble-0.026
Token doctor
Token ID6253
Feature activation+4.09e-03

Pos logit contributions
 who+0.014
 God+0.014
iring+0.013
 please+0.013
?"+0.012
Neg logit contributions
seek-0.027
 habitat-0.025
powers-0.025
growth-0.025
icol-0.025
Token,
Token ID11
Feature activation+1.90e-03

Pos logit contributions
iring+0.004
 God+0.003
 please+0.003
ying+0.003
 who+0.003
Neg logit contributions
seek-0.010
 habitat-0.009
growth-0.009
powers-0.009
icol-0.009
Token which
Token ID543
Feature activation+1.16e-02

Pos logit contributions
 who+0.001
iring+0.001
 please+0.001
 God+0.001
?"+0.001
Neg logit contributions
seek-0.005
 habitat-0.005
powers-0.005
growth-0.005
icol-0.005
Token was
Token ID373
Feature activation-5.73e-03

Pos logit contributions
iring+0.012
 God+0.011
 please+0.011
 who+0.011
?"+0.011
Neg logit contributions
seek-0.027
 habitat-0.025
icol-0.024
growth-0.024
powers-0.024
Token an
Token ID281
Feature activation+1.52e-03

Pos logit contributions
seek+0.015
 habitat+0.014
growth+0.014
powers+0.014
icol+0.014
Neg logit contributions
 please-0.005
iring-0.004
 who-0.004
 God-0.004
?"-0.004
Token err
Token ID11454
Feature activation-6.90e-02

Pos logit contributions
iring+0.002
 please+0.002
nam+0.002
kins+0.002
She+0.002
Neg logit contributions
seek-0.003
 habitat-0.003
powers-0.003
 arrest-0.003
growth-0.003
Tokenand
Token ID392
Feature activation-1.47e-02

Pos logit contributions
seek+0.110
 habitat+0.101
growth+0.096
icol+0.094
powers+0.094
Neg logit contributions
iring-0.118
 please-0.113
 God-0.113
 who-0.110
ying-0.108
Token he
Token ID339
Feature activation+1.20e-03

Pos logit contributions
seek+0.035
 habitat+0.033
growth+0.032
icol+0.031
powers+0.031
Neg logit contributions
iring-0.014
 please-0.013
 God-0.013
 who-0.013
ying-0.012
Token didn
Token ID1422
Feature activation-6.56e-03

Pos logit contributions
 who+0.001
 please+0.001
?"+0.001
 do+0.001
 God+0.001
Neg logit contributions
seek-0.003
growth-0.003
icol-0.003
 habitat-0.003
powers-0.002
Token't
Token ID470
Feature activation-5.29e-03

Pos logit contributions
seek+0.013
 habitat+0.012
icol+0.012
growth+0.012
powers+0.012
Neg logit contributions
iring-0.009
 please-0.008
 God-0.008
?"-0.008
 who-0.008
Token understand
Token ID1833
Feature activation+6.12e-03

Pos logit contributions
seek+0.012
 habitat+0.011
growth+0.010
icol+0.010
powers+0.010
Neg logit contributions
iring-0.006
 please-0.006
 God-0.006
 who-0.006
?"-0.005
Token girl
Token ID2576
Feature activation+8.75e-03

Pos logit contributions
iring+0.012
ying+0.010
 please+0.010
 God+0.010
mm+0.009
Neg logit contributions
seek-0.026
 habitat-0.024
powers-0.024
icol-0.023
growth-0.023
Token who
Token ID508
Feature activation-2.06e-02

Pos logit contributions
iring+0.012
 please+0.010
ying+0.010
mm+0.010
 God+0.010
Neg logit contributions
seek-0.018
 habitat-0.016
growth-0.016
icol-0.016
powers-0.015
Token was
Token ID373
Feature activation+2.17e-04

Pos logit contributions
seek+0.051
 habitat+0.048
growth+0.047
icol+0.046
powers+0.046
Neg logit contributions
 who-0.017
 God-0.017
iring-0.017
 please-0.016
?"-0.016
Token turning
Token ID6225
Feature activation-2.93e-02

Pos logit contributions
iring+0.000
ying+0.000
mm+0.000
 please+0.000
 God+0.000
Neg logit contributions
seek-0.001
 habitat-0.000
powers-0.000
growth-0.000
icol-0.000
Token three
Token ID1115
Feature activation-2.51e-02

Pos logit contributions
seek+0.071
 habitat+0.065
icol+0.064
growth+0.064
powers+0.061
Neg logit contributions
iring-0.028
 God-0.027
 who-0.027
 please-0.026
nam-0.025
Token years
Token ID812
Feature activation-6.89e-02

Pos logit contributions
seek+0.058
 habitat+0.054
growth+0.052
icol+0.051
powers+0.051
Neg logit contributions
iring-0.026
 please-0.024
 God-0.024
 who-0.023
?"-0.022
Token old
Token ID1468
Feature activation-8.96e-03

Pos logit contributions
seek+0.192
growth+0.176
 habitat+0.175
icol+0.171
 arrest+0.165
Neg logit contributions
 God-0.046
iring-0.046
 who-0.040
 please-0.039
?"-0.039
Token.
Token ID13
Feature activation-5.27e-03

Pos logit contributions
seek+0.020
 habitat+0.019
powers+0.018
growth+0.018
icol+0.018
Neg logit contributions
iring-0.010
 please-0.009
 God-0.009
ying-0.009
?"-0.008
Token She
Token ID1375
Feature activation-4.69e-03

Pos logit contributions
seek+0.013
 habitat+0.013
growth+0.012
powers+0.012
icol+0.012
Neg logit contributions
iring-0.004
 please-0.004
ying-0.004
 God-0.004
 who-0.003
Token was
Token ID373
Feature activation-4.92e-03

Pos logit contributions
seek+0.013
 habitat+0.012
icol+0.012
growth+0.012
powers+0.012
Neg logit contributions
 who-0.003
?"-0.003
 also-0.003
 do-0.003
 God-0.003
Token so
Token ID523
Feature activation+1.83e-02

Pos logit contributions
seek+0.011
 habitat+0.010
powers+0.010
icol+0.009
growth+0.009
Neg logit contributions
iring-0.005
ying-0.005
mm-0.005
 please-0.004
kins-0.004