TOP ACTIVATIONS
MAX = 5.669

wasn 't fair for the rac coon to take all the
. The rac coon tried asking the people
not want to see the rac coon again . <|endoftext|> L
car that he didn 't play with anymore . His mom
!" As the rac coon crawled to explore the
a toy motorcycle that he played with every day . One
comfortable inside ." Ol iver thanked the polar bear
sad because he couldn 't play with his car anymore .
chy so he liked to bat he in a river .
many toy automobiles that he played with every day . He
a toy car that he played with every day . One
a toy motorcycle that he played with every day . One
" Hello !" the rac coon said .
and sunny day and the rac coon decided to explore the
first !" The rac coon thought about it for
and comfortable . Ol iver was excited and said
a toy truck that he played with every day . One
it tastes ." L ula could barely contain her
knew that accidents happen . Fel icia learned to be more
to me ." Ol iver was surprised and asked

MIN ACTIVATIONS
MIN = -5.447

als and bloom ed in bright colors - red , yellow
made the gloomy sky look much nicer .
beautiful flower bloom ed with bright colors . The flower 's
the sky was filled with bright yellow and orange h ues
and the sky filled with lovely colors . Ally kept tw
big bush with lots of pretty flowers .
Mom smiled and pointed to the land . "
purple bird with lots of colourful feathers . He watched as
" Yes , sorry , Mom . We will
the sky lit up with pretty colours . Max stayed and
in a beautiful garden with bright flowers , gentle bree zes
she found big flowers with colorful pet als and birds picking
beautiful plant with lots of pretty leaves . The
was a tall tree with bright orange and yellow leaves .
flower garden . There were pretty pink roses and yellow sun
hall together and it looked much better . They both felt
stood side by side and pointed up at the sky .
at Sarah with its be ady eyes . Sarah
and saw some big , pretty butterflies flying around . She
. She thought they looked much better . She

INTERVAL 2.890 - 5.669
CONTAINS 0.614%

" O uch !" the oct opus said . It spoke
can 't play in the soil !" She lifted the cat
play with the ball , but I 'm too shy to
showing Max her toys , but she did not let him
the orange bag . T eddy felt so sad and

INTERVAL 0.111 - 2.890
CONTAINS 45.025%

escape ! <|endoftext|> Once upon a time , there was a
ily was so happy that her bag was fixed . She
get burned ." After lunch , Tim my
it . James was so disappointed because he could not watch
wash it . Day after day , Joey kept washing the

INTERVAL -2.668 - 0.111
CONTAINS 53.575%

to listen , and sure enough , there it was again
the birds were safe and even the tigers were safe .
was so original that it had colorful stars and moons on
Maybe it 's hiding somewhere . Let 's play a game
ara takes Tom 's hand . " Don 't worry ,

INTERVAL -5.447 - -2.668
CONTAINS 0.786%

and fresh . Do you want to try it ?"
, too . Do you want to help me get ready
big noise ! It was much too loud , so the
frog was green and had big eyes . He liked to
have fun . Do you want to play with me ?"
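
NOTE ON THE INTERVALS
The four INTERVAL ranges above are consistent with splitting the span from MIN to MAX into four equal-width bins. The sketch below reproduces the listed boundaries from the rounded MIN and MAX values. It is an illustration under that assumption, not the tool's actual code: the function name `equal_width_intervals` and the bin count of 4 are hypothetical, and the true boundaries may differ in later decimals because the displayed values are rounded.

```python
def equal_width_intervals(lo, hi, n_bins=4):
    """Split [lo, hi] into n_bins equal-width ranges, highest range first.

    Hypothetical helper: assumes the dashboard uses equal-width bins.
    """
    width = (hi - lo) / n_bins
    edges = [lo + i * width for i in range(n_bins + 1)]
    edges[-1] = hi  # pin the top edge to avoid floating-point drift
    return [(edges[i], edges[i + 1]) for i in reversed(range(n_bins))]


# MIN and MAX as displayed in the dump (rounded to three decimals).
intervals = equal_width_intervals(-5.447, 5.669)

# Shares of activations per interval, copied from the CONTAINS lines.
shares = [0.614, 45.025, 53.575, 0.786]

for (lo, hi), share in zip(intervals, shares):
    print(f"INTERVAL {lo:.3f} - {hi:.3f}  CONTAINS {share:.3f}%")
```

The two middle bins together hold about 98.6% of activations, so the snippets under TOP ACTIVATIONS and MIN ACTIVATIONS come from the thin tails of the distribution.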
Token wasn
Token ID2492
Feature activation+2.63e+00

Pos logit contributions
lf+0.010
 rarely+0.010
 hallway+0.010
é+0.010
 tv+0.010
Neg logit contributions
endant-0.008
als-0.008
 seeds-0.008
 half-0.008
 enough-0.007
Token't
Token ID470
Feature activation-3.31e-02

Pos logit contributions
lf+0.032
 rarely+0.032
 hallway+0.029
é+0.028
 tv+0.027
Neg logit contributions
endant-0.085
als-0.081
 seeds-0.079
Sur-0.079
 half-0.076
Token fair
Token ID3148
Feature activation-1.03e+00

Pos logit contributions
endant+0.001
als+0.001
Sur+0.001
 seeds+0.001
elt+0.001
Neg logit contributions
 rarely-0.001
lf-0.001
é-0.001
 tv-0.001
 hallway-0.001
Token for
Token ID329
Feature activation+2.80e+00

Pos logit contributions
endant+0.017
als+0.015
 seeds+0.014
Sur+0.014
 da+0.012
Neg logit contributions
 rarely-0.029
lf-0.029
é-0.028
 hallway-0.027
 tv-0.027
Token the
Token ID262
Feature activation+1.45e+00

Pos logit contributions
lf+0.061
 rarely+0.061
é+0.057
 tv+0.056
 hallway+0.056
Neg logit contributions
endant-0.065
als-0.061
 seeds-0.058
Sur-0.057
 da-0.052
Token rac
Token ID3444
Feature activation+5.67e+00

Pos logit contributions
lf+0.037
 rarely+0.037
é+0.036
loe+0.034
 hallway+0.034
Neg logit contributions
 seeds-0.027
endant-0.026
als-0.026
Sur-0.024
 da-0.024
Tokencoon
Token ID20912
Feature activation+1.91e+00

Pos logit contributions
 rarely+0.116
 hallway+0.111
 tv+0.105
 Chess+0.105
 seeker+0.103
Neg logit contributions
als-0.121
Sur-0.119
 seeds-0.116
aw-0.115
of-0.113
Token to
Token ID284
Feature activation+1.12e+00

Pos logit contributions
lf+0.058
 rarely+0.058
é+0.056
 hallway+0.055
 tv+0.054
Neg logit contributions
endant-0.027
als-0.025
 seeds-0.023
Sur-0.022
 half-0.020
Token take
Token ID1011
Feature activation+1.04e+00

Pos logit contributions
 rarely+0.026
lf+0.026
é+0.025
 hallway+0.024
 tv+0.024
Neg logit contributions
endant-0.024
als-0.022
 seeds-0.022
Sur-0.021
 half-0.020
Token all
Token ID477
Feature activation-3.86e-01

Pos logit contributions
 rarely+0.026
lf+0.026
é+0.025
 hallway+0.025
 tv+0.025
Neg logit contributions
endant-0.021
als-0.019
 seeds-0.018
Sur-0.017
 da-0.016
Token the
Token ID262
Feature activation-2.41e-01

Pos logit contributions
endant+0.008
als+0.007
 seeds+0.007
Sur+0.007
 da+0.006
Neg logit contributions
 rarely-0.009
lf-0.009
é-0.009
 hallway-0.009
 tv-0.009
Token.
Token ID13
Feature activation+2.31e-01

Pos logit contributions
lf+0.036
 rarely+0.036
é+0.035
 hallway+0.035
 tv+0.035
Neg logit contributions
endant-0.014
als-0.013
 seeds-0.011
Sur-0.010
 half-0.010
Token (space)
Token ID220
Feature activation+1.71e+00

Pos logit contributions
 rarely+0.005
lf+0.005
é+0.005
 hallway+0.005
 tv+0.005
Neg logit contributions
endant-0.005
als-0.005
 seeds-0.004
Sur-0.004
 half-0.004
Token (newline)
Token ID198
Feature activation-4.69e-01

Pos logit contributions
lf+0.058
 rarely+0.057
 hallway+0.055
 tv+0.054
é+0.053
Neg logit contributions
endant-0.019
Sur-0.016
als-0.015
 half-0.014
 seeds-0.013
Token (newline)
Token ID198
Feature activation+5.54e-02

Pos logit contributions
endant+0.006
Sur+0.005
als+0.005
 seeds+0.005
of+0.004
Neg logit contributions
 rarely-0.015
lf-0.015
 tv-0.014
é-0.014
 hallway-0.014
TokenThe
Token ID464
Feature activation+2.07e+00

Pos logit contributions
 rarely+0.001
lf+0.001
é+0.001
 hallway+0.001
 tv+0.001
Neg logit contributions
endant-0.001
als-0.001
 seeds-0.001
Sur-0.001
 half-0.001
Token rac
Token ID3444
Feature activation+5.48e+00

Pos logit contributions
 rarely+0.048
é+0.046
lf+0.045
 tv+0.043
icia+0.042
Neg logit contributions
endant-0.045
 seeds-0.042
als-0.042
Sur-0.041
 da-0.039
Tokencoon
Token ID20912
Feature activation+1.00e+00

Pos logit contributions
 rarely+0.124
 Chess+0.117
 tv+0.115
 hardly+0.110
 hallway+0.110
Neg logit contributions
Sur-0.113
aw-0.106
als-0.104
of-0.098
endant-0.098
Token tried
Token ID3088
Feature activation-8.84e-01

Pos logit contributions
 rarely+0.024
lf+0.023
 tv+0.023
é+0.023
 hallway+0.022
Neg logit contributions
endant-0.020
als-0.020
 seeds-0.019
Sur-0.018
 half-0.017
Token asking
Token ID4737
Feature activation+2.83e-01

Pos logit contributions
endant+0.009
als+0.008
 seeds+0.007
Sur+0.007
 half+0.005
Neg logit contributions
 rarely-0.031
lf-0.031
é-0.030
 hallway-0.029
 tv-0.029
Token the
Token ID262
Feature activation+2.71e-01

Pos logit contributions
lf+0.008
 rarely+0.008
 hallway+0.008
é+0.007
 tv+0.007
Neg logit contributions
als-0.005
 seeds-0.004
endant-0.004
 half-0.004
Sur-0.004
Token people
Token ID661
Feature activation-3.83e-01

Pos logit contributions
 rarely+0.006
lf+0.006
é+0.006
 hallway+0.006
 tv+0.006
Neg logit contributions
endant-0.006
als-0.006
 seeds-0.005
Sur-0.005
 da-0.005
Token not
Token ID407
Feature activation-8.40e-01

Pos logit contributions
é+0.014
lf+0.013
 hallway+0.013
 rarely+0.013
 tv+0.012
Neg logit contributions
endant-0.014
 seeds-0.013
 half-0.013
 enough-0.013
 sandwiches-0.012
Token want
Token ID765
Feature activation-1.25e+00

Pos logit contributions
endant+0.019
als+0.018
 seeds+0.017
Sur+0.016
 half+0.015
Neg logit contributions
lf-0.019
 rarely-0.019
é-0.018
 hallway-0.018
 tv-0.017
Token to
Token ID284
Feature activation+2.95e-01

Pos logit contributions
endant+0.021
als+0.017
Sur+0.016
 seeds+0.015
lets+0.014
Neg logit contributions
 rarely-0.037
 tv-0.035
 especially-0.035
é-0.034
 Chess-0.034
Token see
Token ID766
Feature activation+6.09e-01

Pos logit contributions
lf+0.007
 rarely+0.007
é+0.007
 hallway+0.007
 tv+0.007
Neg logit contributions
endant-0.006
als-0.005
 seeds-0.005
Sur-0.005
 half-0.005
Token the
Token ID262
Feature activation+1.06e+00

Pos logit contributions
lf+0.016
 rarely+0.016
é+0.015
 hallway+0.015
 tv+0.015
Neg logit contributions
endant-0.012
als-0.011
 seeds-0.010
Sur-0.010
 da-0.009
Token rac
Token ID3444
Feature activation+5.47e+00

Pos logit contributions
 rarely+0.025
lf+0.025
é+0.024
loe+0.022
 tv+0.022
Neg logit contributions
endant-0.023
 seeds-0.022
als-0.021
Sur-0.020
 da-0.020
Tokencoon
Token ID20912
Feature activation+9.02e-01

Pos logit contributions
 rarely+0.127
 hardly+0.105
 seeker+0.103
 hallway+0.103
 tv+0.102
Neg logit contributions
Sur-0.111
als-0.109
aw-0.108
 seeds-0.105
of-0.103
Token again
Token ID757
Feature activation-1.78e-01

Pos logit contributions
 rarely+0.026
lf+0.026
é+0.025
 hallway+0.025
 tv+0.024
Neg logit contributions
endant-0.014
als-0.013
 seeds-0.013
Sur-0.011
 half-0.011
Token.
Token ID13
Feature activation+1.17e-02

Pos logit contributions
endant+0.003
als+0.003
 seeds+0.003
Sur+0.003
elt+0.002
Neg logit contributions
 rarely-0.005
lf-0.005
é-0.004
 hallway-0.004
 tv-0.004
Token<|endoftext|>
Token ID50256
Feature activation-9.56e-01

Pos logit contributions
 rarely+0.000
lf+0.000
é+0.000
 hallway+0.000
 tv+0.000
Neg logit contributions
endant-0.000
als-0.000
 seeds-0.000
Sur-0.000
 da-0.000
TokenL
Token ID43
Feature activation+2.15e+00

Pos logit contributions
endant+0.015
als+0.014
 seeds+0.013
Sur+0.012
 da+0.011
Neg logit contributions
 rarely-0.027
lf-0.027
é-0.026
 hallway-0.026
 tv-0.026
Token car
Token ID1097
Feature activation+2.52e+00

Pos logit contributions
 rarely+0.029
lf+0.028
é+0.028
 hallway+0.027
icia+0.027
Neg logit contributions
endant-0.022
als-0.021
 seeds-0.020
 da-0.019
 bread-0.018
Token that
Token ID326
Feature activation+1.56e+00

Pos logit contributions
 rarely+0.093
é+0.092
icia+0.092
loe+0.091
lf+0.091
Neg logit contributions
endant-0.017
als-0.014
 seeds-0.011
 enough-0.008
aw-0.008
Token he
Token ID339
Feature activation+2.45e+00

Pos logit contributions
é+0.041
icia+0.041
loe+0.040
 tv+0.040
oving+0.040
Neg logit contributions
endant-0.026
als-0.025
 seeds-0.024
 half-0.021
Sur-0.021
Token didn
Token ID1422
Feature activation+4.84e+00

Pos logit contributions
 tv+0.065
 hallway+0.063
loe+0.060
 stair+0.060
lf+0.059
Neg logit contributions
endant-0.043
 seeds-0.037
s-0.036
als-0.036
Sur-0.035
Token't
Token ID470
Feature activation+1.75e+00

Pos logit contributions
 rarely+0.089
 hallway+0.088
 seeker+0.086
icia+0.084
lf+0.083
Neg logit contributions
endant-0.111
Sur-0.111
 half-0.110
s-0.108
of-0.108
Token play
Token ID711
Feature activation+5.40e+00

Pos logit contributions
lf+0.042
 hallway+0.040
 tv+0.040
é+0.040
 rarely+0.039
Neg logit contributions
endant-0.036
 seeds-0.032
als-0.032
Sur-0.030
 half-0.030
Token with
Token ID351
Feature activation+2.28e+00

Pos logit contributions
icia+0.176
é+0.176
loe+0.175
 hallway+0.174
 rarely+0.171
Neg logit contributions
 to-0.052
 too-0.046
 that-0.046
 candy-0.045
 more-0.045
Token anymore
Token ID7471
Feature activation+2.01e+00

Pos logit contributions
 hallway+0.079
é+0.077
lf+0.077
 rarely+0.077
loe+0.076
Neg logit contributions
als-0.019
endant-0.018
 seeds-0.017
 enough-0.016
 nuts-0.015
Token.
Token ID13
Feature activation+1.59e+00

Pos logit contributions
 rarely+0.066
é+0.066
 hallway+0.066
lf+0.064
icia+0.063
Neg logit contributions
endant-0.020
als-0.019
 seeds-0.018
Sur-0.015
 half-0.014
Token His
Token ID2399
Feature activation+2.29e+00

Pos logit contributions
 rarely+0.040
lf+0.040
icia+0.038
é+0.038
 hallway+0.037
Neg logit contributions
endant-0.031
als-0.027
 seeds-0.027
Sur-0.026
 half-0.026
Token mom
Token ID1995
Feature activation-2.63e-01

Pos logit contributions
é+0.066
 rarely+0.065
icia+0.065
lf+0.065
loe+0.062
Neg logit contributions
endant-0.037
 eyes-0.032
 half-0.030
 seeds-0.029
 feathers-0.028
Token!"
Token ID2474
Feature activation+1.93e-01

Pos logit contributions
endant+0.028
als+0.026
 seeds+0.025
Sur+0.023
 da+0.022
Neg logit contributions
 rarely-0.043
lf-0.042
é-0.041
 hallway-0.040
 tv-0.039
Token (newline)
Token ID198
Feature activation-7.98e-01

Pos logit contributions
 rarely+0.005
lf+0.005
é+0.005
 hallway+0.005
 tv+0.005
Neg logit contributions
endant-0.003
als-0.003
 seeds-0.003
Sur-0.003
 da-0.003
Token (newline)
Token ID198
Feature activation-3.25e-01

Pos logit contributions
endant+0.010
Sur+0.009
als+0.009
 seeds+0.009
 half+0.008
Neg logit contributions
 rarely-0.025
lf-0.024
é-0.024
 hallway-0.024
 tv-0.024
TokenAs
Token ID1722
Feature activation+1.10e+00

Pos logit contributions
endant+0.006
als+0.005
 seeds+0.005
Sur+0.005
 da+0.005
Neg logit contributions
lf-0.009
 rarely-0.009
é-0.008
 hallway-0.008
 tv-0.008
Token the
Token ID262
Feature activation+1.46e+00

Pos logit contributions
idget+0.035
é+0.034
lf+0.033
 hallway+0.033
loe+0.033
Neg logit contributions
 seeds-0.013
 enough-0.012
als-0.012
 da-0.012
 half-0.011
Token rac
Token ID3444
Feature activation+5.33e+00

Pos logit contributions
 rarely+0.034
lf+0.034
é+0.033
 hallway+0.032
 tv+0.032
Neg logit contributions
endant-0.031
als-0.030
 seeds-0.029
Sur-0.028
 da-0.026
Tokencoon
Token ID20912
Feature activation+5.52e-01

Pos logit contributions
 rarely+0.113
 tv+0.105
 hallway+0.105
 Chess+0.103
 hardly+0.098
Neg logit contributions
als-0.118
Sur-0.112
 seeds-0.111
aw-0.110
endant-0.101
Token crawled
Token ID45668
Feature activation-2.23e-01

Pos logit contributions
 rarely+0.014
lf+0.014
é+0.013
 hallway+0.013
 tv+0.012
Neg logit contributions
endant-0.011
als-0.010
 seeds-0.010
Sur-0.010
 half-0.009
Token to
Token ID284
Feature activation+1.23e-01

Pos logit contributions
endant+0.005
als+0.005
 seeds+0.004
Sur+0.004
 da+0.004
Neg logit contributions
lf-0.005
 rarely-0.005
é-0.005
 hallway-0.005
 tv-0.005
Token explore
Token ID7301
Feature activation+1.86e-01

Pos logit contributions
 rarely+0.003
lf+0.003
é+0.003
 hallway+0.003
 tv+0.003
Neg logit contributions
endant-0.002
 seeds-0.002
als-0.002
 half-0.002
 da-0.002
Token the
Token ID262
Feature activation-3.30e-01

Pos logit contributions
 rarely+0.005
lf+0.005
é+0.005
 hallway+0.005
 tv+0.005
Neg logit contributions
endant-0.003
als-0.003
 seeds-0.003
Sur-0.002
 da-0.002
Token a
Token ID257
Feature activation+1.65e+00

Pos logit contributions
endant+0.013
als+0.012
 seeds+0.011
Sur+0.010
elt+0.009
Neg logit contributions
 rarely-0.020
lf-0.019
é-0.019
 hallway-0.018
 tv-0.018
Token toy
Token ID13373
Feature activation+1.61e+00

Pos logit contributions
lf+0.048
é+0.047
 rarely+0.045
loe+0.045
 hallway+0.045
Neg logit contributions
endant-0.026
 seeds-0.022
Sur-0.022
 da-0.021
 half-0.020
Token motorcycle
Token ID18757
Feature activation+2.08e+00

Pos logit contributions
 rarely+0.037
lf+0.037
 hallway+0.035
é+0.035
 tv+0.034
Neg logit contributions
endant-0.035
als-0.033
 seeds-0.031
Sur-0.029
 da-0.028
Token that
Token ID326
Feature activation+1.76e+00

Pos logit contributions
 rarely+0.067
lf+0.066
 hallway+0.064
é+0.064
 tv+0.063
Neg logit contributions
endant-0.027
als-0.024
 seeds-0.022
Sur-0.019
 half-0.018
Token he
Token ID339
Feature activation+2.53e+00

Pos logit contributions
 hallway+0.046
 tv+0.046
loe+0.046
icia+0.045
oving+0.045
Neg logit contributions
als-0.028
endant-0.028
 seeds-0.027
 half-0.023
 nuts-0.023
Token played
Token ID2826
Feature activation+5.33e+00

Pos logit contributions
 hallway+0.068
 tv+0.066
loe+0.063
 stair+0.061
 seeker+0.059
Neg logit contributions
endant-0.043
 seeds-0.039
lets-0.039
als-0.039
s-0.039
Token with
Token ID351
Feature activation+1.55e+00

Pos logit contributions
é+0.182
loe+0.181
idget+0.177
 hallway+0.175
icia+0.173
Neg logit contributions
 to-0.046
 some-0.046
 that-0.040
 fair-0.039
 more-0.038
Token every
Token ID790
Feature activation+1.15e+00

Pos logit contributions
loe+0.052
 hallway+0.052
é+0.049
idget+0.048
inge+0.048
Neg logit contributions
als-0.015
 to-0.015
 seeds-0.015
aw-0.014
s-0.013
Token day
Token ID1110
Feature activation+1.32e+00

Pos logit contributions
 rarely+0.023
é+0.022
lf+0.021
icia+0.021
loe+0.021
Neg logit contributions
endant-0.026
als-0.025
 seeds-0.025
 half-0.024
Sur-0.024
Token.
Token ID13
Feature activation+1.12e+00

Pos logit contributions
 rarely+0.043
lf+0.042
é+0.042
 hallway+0.041
loe+0.041
Neg logit contributions
endant-0.015
als-0.014
 seeds-0.013
Sur-0.011
 bread-0.011
Token One
Token ID1881
Feature activation+9.56e-01

Pos logit contributions
lf+0.032
 rarely+0.032
é+0.031
icia+0.031
 hallway+0.031
Neg logit contributions
endant-0.017
als-0.014
 seeds-0.014
Sur-0.013
 half-0.013
Token comfortable
Token ID6792
Feature activation+7.78e-01

Pos logit contributions
endant+0.010
als+0.009
 seeds+0.009
Sur+0.008
 da+0.008
Neg logit contributions
lf-0.012
 rarely-0.012
é-0.012
 hallway-0.012
 tv-0.011
Token inside
Token ID2641
Feature activation+1.71e+00

Pos logit contributions
 rarely+0.021
lf+0.021
é+0.020
 hallway+0.020
 tv+0.019
Neg logit contributions
endant-0.013
als-0.013
 seeds-0.012
 enough-0.011
Sur-0.011
Token."
Token ID526
Feature activation+6.25e-01

Pos logit contributions
 rarely+0.050
lf+0.049
é+0.048
 hallway+0.048
 tv+0.045
Neg logit contributions
endant-0.025
als-0.025
 seeds-0.024
 half-0.020
 bread-0.020
Token (newline)
Token ID198
Feature activation-2.45e-01

Pos logit contributions
lf+0.017
 rarely+0.017
é+0.016
 hallway+0.016
 tv+0.015
Neg logit contributions
endant-0.011
als-0.011
 seeds-0.010
Sur-0.010
 da-0.009
Token (newline)
Token ID198
Feature activation+5.35e-01

Pos logit contributions
endant+0.003
als+0.003
Sur+0.003
 seeds+0.003
ossible+0.003
Neg logit contributions
 rarely-0.008
lf-0.007
 hallway-0.007
é-0.007
 tv-0.007
TokenOl
Token ID30098
Feature activation+5.33e+00

Pos logit contributions
 rarely+0.013
lf+0.013
é+0.012
 hallway+0.012
 tv+0.012
Neg logit contributions
endant-0.011
als-0.010
 seeds-0.010
Sur-0.010
 da-0.009
Tokeniver
Token ID1428
Feature activation-2.49e-01

Pos logit contributions
lf+0.091
 rarely+0.091
 hallway+0.082
é+0.082
 tv+0.081
Neg logit contributions
endant-0.145
Sur-0.136
als-0.135
 seeds-0.133
 half-0.127
Token thanked
Token ID26280
Feature activation+2.64e+00

Pos logit contributions
endant+0.005
als+0.005
 seeds+0.005
Sur+0.005
 da+0.004
Neg logit contributions
 rarely-0.006
lf-0.006
é-0.006
 hallway-0.005
 tv-0.005
Token the
Token ID262
Feature activation+1.82e+00

Pos logit contributions
 rarely+0.085
 hallway+0.085
 seeker+0.080
lf+0.079
oving+0.079
Neg logit contributions
 seeds-0.031
als-0.031
 da-0.030
of-0.028
 half-0.028
Token polar
Token ID13559
Feature activation+1.62e+00

Pos logit contributions
 rarely+0.024
lf+0.024
 hallway+0.022
é+0.022
 tv+0.021
Neg logit contributions
endant-0.058
als-0.056
 seeds-0.053
Sur-0.052
elt-0.049
Token bear
Token ID6842
Feature activation-7.61e-01

Pos logit contributions
 rarely+0.031
lf+0.030
 hallway+0.029
é+0.028
 tv+0.028
Neg logit contributions
endant-0.041
als-0.040
 seeds-0.039
Sur-0.036
 da-0.036
Token sad
Token ID6507
Feature activation-4.88e-01

Pos logit contributions
lf+0.029
 tv+0.028
é+0.028
 hallway+0.027
loe+0.027
Neg logit contributions
endant-0.025
 seeds-0.023
Sur-0.023
elt-0.023
 bread-0.022
Token because
Token ID780
Feature activation+3.92e+00

Pos logit contributions
endant+0.009
als+0.008
 seeds+0.008
Sur+0.008
 enough+0.007
Neg logit contributions
lf-0.013
 rarely-0.012
é-0.012
 hallway-0.012
 tv-0.012
Token he
Token ID339
Feature activation+2.11e+00

Pos logit contributions
icia+0.137
idget+0.128
oving+0.128
é+0.127
ussy+0.126
Neg logit contributions
 half-0.037
 that-0.035
 some-0.034
 enough-0.032
 seeds-0.032
Token couldn
Token ID3521
Feature activation+2.99e+00

Pos logit contributions
 tv+0.055
loe+0.055
lf+0.053
é+0.052
 hallway+0.052
Neg logit contributions
endant-0.036
s-0.032
als-0.030
elt-0.030
 seeds-0.029
Token't
Token ID470
Feature activation+2.19e+00

Pos logit contributions
 rarely+0.062
 seeker+0.058
icia+0.057
lf+0.056
 hallway+0.056
Neg logit contributions
s-0.065
 half-0.063
Sur-0.063
endant-0.060
als-0.060
Token play
Token ID711
Feature activation+5.25e+00

Pos logit contributions
 rarely+0.060
é+0.058
 hallway+0.058
icia+0.058
loe+0.057
Neg logit contributions
endant-0.029
of-0.029
als-0.028
 half-0.027
 seeds-0.027
Token with
Token ID351
Feature activation+3.54e+00

Pos logit contributions
icia+0.163
 rarely+0.161
é+0.159
loe+0.158
 hallway+0.156
Neg logit contributions
 to-0.060
 candy-0.059
 that-0.057
 some-0.055
 more-0.055
Token his
Token ID465
Feature activation+2.39e+00

Pos logit contributions
 rarely+0.082
lf+0.082
 hallway+0.081
é+0.079
 tv+0.076
Neg logit contributions
endant-0.074
 seeds-0.069
als-0.068
 half-0.063
Sur-0.063
Token car
Token ID1097
Feature activation+2.61e+00

Pos logit contributions
 rarely+0.062
lf+0.060
é+0.060
 hallway+0.058
 tv+0.057
Neg logit contributions
endant-0.046
 seeds-0.041
als-0.039
 half-0.038
 bread-0.037
Token anymore
Token ID7471
Feature activation+2.29e+00

Pos logit contributions
loe+0.078
é+0.077
 rarely+0.076
 hallway+0.076
icia+0.075
Neg logit contributions
 enough-0.035
endant-0.031
als-0.031
 half-0.030
 seeds-0.028
Token.
Token ID13
Feature activation+8.02e-01

Pos logit contributions
é+0.078
 rarely+0.077
 hallway+0.076
icia+0.075
loe+0.074
Neg logit contributions
endant-0.022
als-0.017
 seeds-0.017
 half-0.015
Sur-0.015
Tokenchy
Token ID29658
Feature activation+2.34e+00

Pos logit contributions
lf+0.020
 rarely+0.019
é+0.019
 hallway+0.018
loe+0.018
Neg logit contributions
endant-0.025
als-0.025
 seeds-0.024
 half-0.022
elt-0.022
Token so
Token ID523
Feature activation+2.53e+00

Pos logit contributions
é+0.075
lf+0.075
 rarely+0.074
 hallway+0.072
loe+0.071
Neg logit contributions
als-0.026
 seeds-0.025
 enough-0.024
endant-0.023
 eyes-0.021
Token he
Token ID339
Feature activation+2.57e+00

Pos logit contributions
 hallway+0.065
lf+0.064
é+0.064
 rarely+0.063
 tv+0.062
Neg logit contributions
endant-0.046
als-0.043
 seeds-0.043
Sur-0.039
 bread-0.037
Token liked
Token ID8288
Feature activation+1.08e+00

Pos logit contributions
 hallway+0.075
 tv+0.074
loe+0.073
é+0.071
idget+0.070
Neg logit contributions
 seeds-0.039
als-0.037
endant-0.034
 bread-0.032
 nuts-0.032
Token to
Token ID284
Feature activation+2.03e+00

Pos logit contributions
lf+0.040
 rarely+0.040
é+0.040
 hallway+0.039
icia+0.039
Neg logit contributions
endant-0.007
 seeds-0.006
als-0.006
Sur-0.004
 bread-0.004
Token bat
Token ID7365
Feature activation+5.25e+00

Pos logit contributions
é+0.051
lf+0.050
 rarely+0.050
icia+0.049
issa+0.049
Neg logit contributions
 seeds-0.035
endant-0.034
als-0.033
 bread-0.032
 feathers-0.032
Tokenhe
Token ID258
Feature activation+4.75e+00

Pos logit contributions
 rarely+0.095
 hallway+0.087
é+0.081
lf+0.080
 pretending+0.079
Neg logit contributions
als-0.136
s-0.135
of-0.131
aw-0.131
 seeds-0.122
Token in
Token ID287
Feature activation+2.39e+00

Pos logit contributions
idget+0.167
é+0.158
 hallway+0.157
 seeker+0.155
loe+0.152
Neg logit contributions
 seeds-0.041
 bread-0.035
 sandwiches-0.033
 more-0.032
als-0.032
Token a
Token ID257
Feature activation+1.17e+00

Pos logit contributions
 rarely+0.065
lf+0.064
é+0.063
 hallway+0.060
icia+0.060
Neg logit contributions
endant-0.039
 seeds-0.038
als-0.037
 half-0.036
Sur-0.035
Token river
Token ID7850
Feature activation+2.02e+00

Pos logit contributions
 rarely+0.030
lf+0.030
é+0.029
 hallway+0.028
 tv+0.028
Neg logit contributions
endant-0.022
als-0.020
 seeds-0.020
Sur-0.019
 da-0.017
Token.
Token ID13
Feature activation+3.57e-01

Pos logit contributions
 rarely+0.061
lf+0.061
é+0.059
 hallway+0.058
 tv+0.057
Neg logit contributions
endant-0.029
als-0.026
 seeds-0.026
Sur-0.023
 bread-0.021
Token many
Token ID867
Feature activation+1.23e+00

Pos logit contributions
 rarely+0.008
lf+0.008
é+0.008
 hallway+0.008
 tv+0.008
Neg logit contributions
endant-0.005
als-0.005
 seeds-0.005
Sur-0.004
 da-0.004
Token toy
Token ID13373
Feature activation+1.40e+00

Pos logit contributions
 rarely+0.032
lf+0.032
é+0.031
 hallway+0.030
 tv+0.030
Neg logit contributions
endant-0.023
als-0.021
 seeds-0.021
Sur-0.019
 half-0.018
Token automobiles
Token ID44820
Feature activation+2.70e+00

Pos logit contributions
 rarely+0.027
lf+0.027
é+0.025
 hallway+0.025
 tv+0.025
Neg logit contributions
endant-0.036
als-0.034
 seeds-0.033
Sur-0.032
 da-0.030
Token that
Token ID326
Feature activation+1.64e+00

Pos logit contributions
lf+0.075
 rarely+0.075
é+0.072
 hallway+0.071
 tv+0.070
Neg logit contributions
endant-0.046
als-0.042
 seeds-0.040
Sur-0.039
 da-0.035
Token he
Token ID339
Feature activation+2.59e+00

Pos logit contributions
lf+0.040
 rarely+0.040
é+0.040
 tv+0.040
 hallway+0.040
Neg logit contributions
endant-0.030
als-0.030
 seeds-0.028
Sur-0.026
 bread-0.024
Token played
Token ID2826
Feature activation+5.22e+00

Pos logit contributions
 tv+0.071
 hallway+0.069
loe+0.067
 seeker+0.065
é+0.064
Neg logit contributions
endant-0.043
als-0.038
 seeds-0.037
lets-0.035
s-0.035
Token with
Token ID351
Feature activation+1.29e+00

Pos logit contributions
 rarely+0.152
icia+0.149
é+0.148
 hallway+0.146
loe+0.144
Neg logit contributions
 seeds-0.066
als-0.065
Sur-0.058
 half-0.058
endant-0.057
Token every
Token ID790
Feature activation+1.50e+00

Pos logit contributions
 hallway+0.043
 rarely+0.043
loe+0.043
é+0.042
lf+0.042
Neg logit contributions
als-0.012
 seeds-0.012
endant-0.010
 half-0.009
 enough-0.009
Token day
Token ID1110
Feature activation+1.38e+00

Pos logit contributions
 rarely+0.030
lf+0.029
icia+0.029
idget+0.028
é+0.028
Neg logit contributions
als-0.033
endant-0.033
 seeds-0.032
 half-0.030
 bread-0.029
Token.
Token ID13
Feature activation+1.24e+00

Pos logit contributions
 rarely+0.042
lf+0.042
é+0.041
 hallway+0.040
 tv+0.040
Neg logit contributions
endant-0.019
als-0.017
 seeds-0.016
Sur-0.015
 half-0.014
Token He
Token ID679
Feature activation+1.57e+00

Pos logit contributions
lf+0.031
 rarely+0.031
é+0.029
 hallway+0.029
 tv+0.028
Neg logit contributions
endant-0.025
als-0.023
 seeds-0.022
Sur-0.021
 half-0.020
Token a
Token ID257
Feature activation+1.33e+00

Pos logit contributions
endant+0.005
als+0.004
 seeds+0.004
Sur+0.004
 da+0.003
Neg logit contributions
 rarely-0.008
lf-0.008
é-0.007
 hallway-0.007
 tv-0.007
Token toy
Token ID13373
Feature activation+1.35e+00

Pos logit contributions
lf+0.043
é+0.042
loe+0.042
idget+0.041
icia+0.041
Neg logit contributions
endant-0.015
 seeds-0.012
 half-0.012
 da-0.012
Sur-0.012
Token car
Token ID1097
Feature activation+2.02e+00

Pos logit contributions
 rarely+0.034
lf+0.033
icia+0.032
 hallway+0.031
é+0.031
Neg logit contributions
endant-0.026
als-0.026
 seeds-0.024
Sur-0.022
 da-0.022
Token that
Token ID326
Feature activation+1.34e+00

Pos logit contributions
 rarely+0.069
lf+0.068
é+0.066
 hallway+0.066
 tv+0.065
Neg logit contributions
endant-0.021
als-0.019
 seeds-0.017
Sur-0.015
elt-0.013
Token he
Token ID339
Feature activation+2.59e+00

Pos logit contributions
icia+0.035
 tv+0.034
 hallway+0.034
lf+0.034
 rarely+0.034
Neg logit contributions
als-0.022
endant-0.022
 seeds-0.021
 glitter-0.018
Sur-0.018
Token played
Token ID2826
Feature activation+5.21e+00

Pos logit contributions
 tv+0.069
 hallway+0.068
loe+0.066
 stair+0.064
 seeker+0.064
Neg logit contributions
endant-0.042
s-0.038
 seeds-0.037
als-0.037
lets-0.036
Token with
Token ID351
Feature activation+1.74e+00

Pos logit contributions
icia+0.177
loe+0.175
 rarely+0.172
é+0.172
idget+0.172
Neg logit contributions
 some-0.047
 to-0.044
 that-0.041
 fair-0.039
 seeds-0.037
Token every
Token ID790
Feature activation+1.86e+00

Pos logit contributions
loe+0.058
 hallway+0.057
icia+0.056
lf+0.055
inge+0.055
Neg logit contributions
als-0.018
 seeds-0.017
s-0.017
 to-0.016
aw-0.016
Token day
Token ID1110
Feature activation+1.73e+00

Pos logit contributions
icia+0.042
 rarely+0.041
idget+0.039
oving+0.039
loe+0.038
Neg logit contributions
 half-0.035
endant-0.034
als-0.034
 seeds-0.034
 bread-0.033
Token.
Token ID13
Feature activation+9.96e-01

Pos logit contributions
 rarely+0.057
lf+0.056
é+0.055
 hallway+0.055
loe+0.054
Neg logit contributions
endant-0.018
als-0.018
 seeds-0.016
 bread-0.013
 enough-0.013
Token One
Token ID1881
Feature activation+1.07e+00

Pos logit contributions
lf+0.030
 rarely+0.030
icia+0.029
 hallway+0.029
é+0.029
Neg logit contributions
endant-0.014
als-0.011
 seeds-0.011
Sur-0.011
 half-0.010
Token a
Token ID257
Feature activation+1.76e+00

Pos logit contributions
endant+0.000
als+0.000
 seeds+0.000
Sur+0.000
elt+0.000
Neg logit contributions
 rarely-0.000
lf-0.000
é-0.000
 hallway-0.000
 especially-0.000
Token toy
Token ID13373
Feature activation+1.40e+00

Pos logit contributions
é+0.055
lf+0.054
loe+0.053
 rarely+0.053
idget+0.052
Neg logit contributions
endant-0.023
 seeds-0.020
Sur-0.019
 da-0.018
 bread-0.018
Token motorcycle
Token ID18757
Feature activation+2.02e+00

Pos logit contributions
 rarely+0.033
lf+0.032
 hallway+0.031
é+0.031
icia+0.030
Neg logit contributions
endant-0.030
als-0.028
 seeds-0.027
Sur-0.025
 bread-0.024
Token that
Token ID326
Feature activation+1.40e+00

Pos logit contributions
 rarely+0.070
lf+0.068
 hallway+0.068
é+0.068
icia+0.068
Neg logit contributions
endant-0.020
als-0.018
 seeds-0.015
Sur-0.012
 enough-0.012
Token he
Token ID339
Feature activation+2.87e+00

Pos logit contributions
 hallway+0.036
icia+0.036
 tv+0.036
loe+0.036
é+0.035
Neg logit contributions
als-0.023
endant-0.023
 seeds-0.022
 nuts-0.019
 half-0.019
Token played
Token ID2826
Feature activation+5.20e+00

Pos logit contributions
 hallway+0.077
 tv+0.077
loe+0.075
 stair+0.071
idget+0.070
Neg logit contributions
endant-0.047
 seeds-0.044
s-0.043
als-0.043
 coins-0.042
Token with
Token ID351
Feature activation+1.46e+00

Pos logit contributions
loe+0.172
idget+0.172
é+0.171
 hallway+0.167
icia+0.165
Neg logit contributions
 some-0.048
 to-0.047
 seeds-0.044
 fair-0.043
 that-0.043
Token every
Token ID790
Feature activation+1.33e+00

Pos logit contributions
 hallway+0.048
loe+0.047
idget+0.044
inge+0.044
icia+0.044
Neg logit contributions
als-0.017
 seeds-0.016
aw-0.016
s-0.015
 to-0.015
Token day
Token ID1110
Feature activation+1.70e+00

Pos logit contributions
 rarely+0.027
é+0.026
idget+0.026
icia+0.026
loe+0.026
Neg logit contributions
endant-0.029
als-0.028
 seeds-0.028
 half-0.027
 bread-0.027
Token.
Token ID13
Feature activation+8.55e-01

Pos logit contributions
 rarely+0.056
lf+0.055
é+0.055
 hallway+0.054
loe+0.053
Neg logit contributions
endant-0.018
als-0.017
 seeds-0.016
Sur-0.013
 bread-0.013
Token One
Token ID1881
Feature activation+1.16e+00

Pos logit contributions
lf+0.025
 rarely+0.025
icia+0.024
 hallway+0.024
é+0.024
Neg logit contributions
endant-0.013
als-0.011
 seeds-0.011
Sur-0.010
 half-0.010
Token (newline)
Token ID198
Feature activation-1.97e-01

Pos logit contributions
Sur+0.005
endant+0.005
als+0.005
of+0.004
 seeds+0.004
Neg logit contributions
 rarely-0.018
lf-0.018
 tv-0.017
é-0.017
 hallway-0.017
Token"
Token ID1
Feature activation-2.53e+00

Pos logit contributions
endant+0.003
als+0.002
 seeds+0.002
Sur+0.002
 half+0.002
Neg logit contributions
 rarely-0.006
lf-0.006
é-0.006
 hallway-0.006
 tv-0.006
TokenHello
Token ID15496
Feature activation-1.75e+00

Pos logit contributions
endant+0.058
 seeds+0.049
als+0.048
 sandwiches+0.046
 eyes+0.046
Neg logit contributions
 rarely-0.061
lf-0.058
é-0.057
 hallway-0.055
 especially-0.054
Token!"
Token ID2474
Feature activation+6.61e-02

Pos logit contributions
endant+0.034
als+0.030
 seeds+0.029
Sur+0.028
elt+0.026
Neg logit contributions
lf-0.045
 rarely-0.044
é-0.042
 tv-0.042
 hallway-0.041
Token the
Token ID262
Feature activation+1.65e+00

Pos logit contributions
 rarely+0.002
lf+0.002
é+0.002
 hallway+0.002
 tv+0.002
Neg logit contributions
endant-0.001
als-0.001
 seeds-0.001
Sur-0.001
 half-0.001
Token rac
Token ID3444
Feature activation+5.20e+00

Pos logit contributions
 rarely+0.042
lf+0.042
é+0.040
 hallway+0.040
 tv+0.039
Neg logit contributions
endant-0.032
als-0.030
 seeds-0.029
Sur-0.028
 da-0.025
Tokencoon
Token ID20912
Feature activation-7.93e-01

Pos logit contributions
 rarely+0.119
 hallway+0.108
 tv+0.107
 Chess+0.106
 hardly+0.101
Neg logit contributions
Sur-0.108
als-0.107
aw-0.101
of-0.100
 seeds-0.100
Token said
Token ID531
Feature activation-1.01e+00

Pos logit contributions
endant+0.014
als+0.013
 seeds+0.012
Sur+0.012
 bread+0.011
Neg logit contributions
lf-0.022
 rarely-0.021
é-0.021
 hallway-0.020
 tv-0.020
Token.
Token ID13
Feature activation+1.32e-01

Pos logit contributions
endant+0.013
als+0.012
 seeds+0.011
Sur+0.011
 half+0.010
Neg logit contributions
 rarely-0.032
lf-0.032
 hallway-0.030
é-0.030
 tv-0.029
Token
Token ID220
Feature activation+8.52e-01

Pos logit contributions
 rarely+0.003
lf+0.003
é+0.003
 hallway+0.003
 tv+0.003
Neg logit contributions
endant-0.003
als-0.003
 seeds-0.003
Sur-0.003
 da-0.002
Token
Token ID198
Feature activation-9.85e-01

Pos logit contributions
lf+0.027
 rarely+0.027
 hallway+0.026
é+0.025
 tv+0.025
Neg logit contributions
endant-0.011
als-0.010
Sur-0.010
 seeds-0.009
 half-0.009
Token and
Token ID290
Feature activation+1.91e+00

Pos logit contributions
 rarely+0.038
lf+0.037
é+0.036
icia+0.035
loe+0.035
Neg logit contributions
endant-0.019
 seeds-0.017
als-0.016
 da-0.016
 eyes-0.014
Token sunny
Token ID27737
Feature activation-3.87e-01

Pos logit contributions
 rarely+0.050
lf+0.049
é+0.049
 tv+0.047
 hallway+0.047
Neg logit contributions
endant-0.035
 seeds-0.032
als-0.032
Sur-0.029
 da-0.028
Token day
Token ID1110
Feature activation-4.47e-02

Pos logit contributions
endant+0.011
als+0.010
 seeds+0.010
Sur+0.010
 da+0.009
Neg logit contributions
 rarely-0.006
lf-0.006
é-0.006
 tv-0.005
icia-0.005
Token and
Token ID290
Feature activation+2.14e+00

Pos logit contributions
endant+0.001
als+0.001
 seeds+0.001
Sur+0.001
 fiber+0.001
Neg logit contributions
 rarely-0.001
lf-0.001
 hallway-0.001
é-0.001
 tv-0.001
Token the
Token ID262
Feature activation+1.57e+00

Pos logit contributions
é+0.057
idget+0.056
lf+0.055
loe+0.054
 tv+0.054
Neg logit contributions
 seeds-0.035
als-0.035
endant-0.032
 da-0.030
 bread-0.029
Token rac
Token ID3444
Feature activation+5.18e+00

Pos logit contributions
lf+0.039
 rarely+0.039
é+0.038
 tv+0.036
loe+0.035
Neg logit contributions
endant-0.030
als-0.030
 seeds-0.030
Sur-0.028
 da-0.027
Tokencoon
Token ID20912
Feature activation+8.45e-01

Pos logit contributions
 rarely+0.116
 tv+0.107
 Chess+0.106
irl+0.099
 hallway+0.099
Neg logit contributions
aw-0.107
als-0.100
 seeds-0.096
Sur-0.094
of-0.087
Token decided
Token ID3066
Feature activation-2.00e+00

Pos logit contributions
lf+0.018
 tv+0.017
é+0.017
 hallway+0.017
 rarely+0.017
Neg logit contributions
endant-0.019
als-0.019
 seeds-0.018
Sur-0.016
 da-0.016
Token to
Token ID284
Feature activation-9.16e-01

Pos logit contributions
endant+0.029
als+0.025
 seeds+0.022
Sur+0.022
 half+0.019
Neg logit contributions
 rarely-0.064
lf-0.062
 hallway-0.059
 tv-0.058
é-0.058
Token explore
Token ID7301
Feature activation-2.04e-01

Pos logit contributions
endant+0.018
als+0.017
 seeds+0.017
Sur+0.015
 bread+0.015
Neg logit contributions
 rarely-0.022
lf-0.022
é-0.022
 hallway-0.021
 tv-0.020
Token the
Token ID262
Feature activation-9.40e-01

Pos logit contributions
endant+0.004
als+0.004
Sur+0.004
 seeds+0.004
elt+0.003
Neg logit contributions
lf-0.005
 rarely-0.005
é-0.005
 tv-0.005
 hallway-0.005
Token first
Token ID717
Feature activation+4.62e-01

Pos logit contributions
endant+0.009
als+0.008
 seeds+0.008
Sur+0.008
 da+0.007
Neg logit contributions
 rarely-0.011
lf-0.011
é-0.010
 hallway-0.010
 tv-0.010
Token!"
Token ID2474
Feature activation+7.99e-01

Pos logit contributions
 rarely+0.012
lf+0.012
é+0.012
 hallway+0.012
 tv+0.011
Neg logit contributions
endant-0.008
als-0.008
 seeds-0.007
Sur-0.007
 da-0.006
Token
Token ID198
Feature activation+2.91e-01

Pos logit contributions
lf+0.024
 rarely+0.024
é+0.022
 hallway+0.022
 tv+0.022
Neg logit contributions
endant-0.012
als-0.011
 seeds-0.011
Sur-0.011
elt-0.009
Token
Token ID198
Feature activation+6.84e-01

Pos logit contributions
 rarely+0.010
lf+0.009
 hallway+0.009
 seeker+0.009
é+0.009
Neg logit contributions
endant-0.003
Sur-0.003
als-0.003
 seeds-0.003
of-0.002
TokenThe
Token ID464
Feature activation+2.44e+00

Pos logit contributions
 rarely+0.018
lf+0.018
é+0.018
 hallway+0.017
 tv+0.017
Neg logit contributions
endant-0.012
Sur-0.011
als-0.011
 seeds-0.011
 half-0.010
Token rac
Token ID3444
Feature activation+5.16e+00

Pos logit contributions
 rarely+0.057
lf+0.056
é+0.055
 tv+0.053
 hallway+0.052
Neg logit contributions
endant-0.049
 seeds-0.049
als-0.047
Sur-0.046
 da-0.045
Tokencoon
Token ID20912
Feature activation+9.35e-01

Pos logit contributions
 rarely+0.119
 Chess+0.112
 hallway+0.109
 tv+0.106
 hardly+0.105
Neg logit contributions
Sur-0.102
als-0.100
aw-0.098
 seeds-0.094
of-0.090
Token thought
Token ID1807
Feature activation+3.29e-01

Pos logit contributions
 rarely+0.023
lf+0.023
 tv+0.022
 hallway+0.022
é+0.022
Neg logit contributions
endant-0.018
als-0.017
 seeds-0.017
Sur-0.016
 half-0.015
Token about
Token ID546
Feature activation+1.92e-01

Pos logit contributions
 hallway+0.010
 rarely+0.010
lf+0.010
é+0.010
 seeker+0.010
Neg logit contributions
endant-0.004
 seeds-0.004
als-0.004
 enough-0.004
 da-0.004
Token it
Token ID340
Feature activation-2.72e-01

Pos logit contributions
 rarely+0.006
lf+0.006
é+0.005
 hallway+0.005
 tv+0.005
Neg logit contributions
endant-0.003
als-0.003
 seeds-0.003
Sur-0.002
 da-0.002
Token for
Token ID329
Feature activation+2.50e+00

Pos logit contributions
endant+0.004
als+0.004
 seeds+0.004
 enough+0.003
 half+0.003
Neg logit contributions
 rarely-0.008
lf-0.008
 hallway-0.008
é-0.008
 tv-0.008
Token and
Token ID290
Feature activation-3.42e-01

Pos logit contributions
endant+0.042
Sur+0.037
als+0.036
 seeds+0.036
 fiber+0.034
Neg logit contributions
lf-0.038
 rarely-0.036
 hallway-0.035
 tv-0.035
é-0.034
Token comfortable
Token ID6792
Feature activation-4.01e-01

Pos logit contributions
endant+0.007
als+0.006
 seeds+0.006
Sur+0.006
 half+0.005
Neg logit contributions
 rarely-0.009
lf-0.009
é-0.008
 hallway-0.008
 tv-0.008
Token.
Token ID13
Feature activation+1.27e+00

Pos logit contributions
endant+0.007
als+0.006
 seeds+0.006
Sur+0.006
 da+0.005
Neg logit contributions
 rarely-0.011
lf-0.011
é-0.011
 hallway-0.010
 tv-0.010
Token
Token ID198
Feature activation+6.65e-01

Pos logit contributions
lf+0.029
 rarely+0.028
é+0.027
 hallway+0.026
 tv+0.025
Neg logit contributions
als-0.027
endant-0.027
Sur-0.026
 seeds-0.025
elt-0.024
Token
Token ID198
Feature activation+1.22e+00

Pos logit contributions
 rarely+0.024
lf+0.023
 hallway+0.023
é+0.023
 tv+0.022
Neg logit contributions
Sur-0.005
endant-0.005
of-0.005
ossible-0.004
s-0.004
TokenOl
Token ID30098
Feature activation+5.16e+00

Pos logit contributions
 rarely+0.030
lf+0.029
é+0.029
 hallway+0.028
 tv+0.027
Neg logit contributions
endant-0.023
Sur-0.023
als-0.023
 seeds-0.022
 half-0.020
Tokeniver
Token ID1428
Feature activation+3.39e-03

Pos logit contributions
 rarely+0.091
lf+0.089
é+0.083
 hallway+0.080
 seeker+0.079
Neg logit contributions
endant-0.134
Sur-0.129
als-0.125
 seeds-0.121
of-0.120
Token was
Token ID373
Feature activation+1.71e+00

Pos logit contributions
 rarely+0.000
lf+0.000
é+0.000
 hallway+0.000
 tv+0.000
Neg logit contributions
endant-0.000
als-0.000
 seeds-0.000
Sur-0.000
 da-0.000
Token excited
Token ID6568
Feature activation-2.46e+00

Pos logit contributions
lf+0.042
é+0.039
 rarely+0.038
 tv+0.037
icia+0.037
Neg logit contributions
endant-0.034
 seeds-0.031
 eyes-0.030
 enough-0.030
elt-0.030
Token and
Token ID290
Feature activation+3.51e-01

Pos logit contributions
endant+0.054
als+0.047
Sur+0.046
 seeds+0.045
 fiber+0.042
Neg logit contributions
lf-0.060
 rarely-0.058
 tv-0.056
 especially-0.056
 hallway-0.055
Token said
Token ID531
Feature activation-1.49e+00

Pos logit contributions
lf+0.008
é+0.008
 rarely+0.008
 hallway+0.008
 tv+0.008
Neg logit contributions
endant-0.007
als-0.007
 seeds-0.007
Sur-0.006
 da-0.006
Token a
Token ID257
Feature activation+1.45e+00

Pos logit contributions
endant+0.005
als+0.004
 seeds+0.004
Sur+0.004
 da+0.003
Neg logit contributions
 rarely-0.008
lf-0.008
é-0.008
 hallway-0.008
 tv-0.008
Token toy
Token ID13373
Feature activation+1.51e+00

Pos logit contributions
é+0.046
 rarely+0.046
lf+0.046
loe+0.045
idget+0.044
Neg logit contributions
endant-0.018
 seeds-0.016
 da-0.014
 bread-0.014
Sur-0.014
Token truck
Token ID7779
Feature activation+1.72e+00

Pos logit contributions
 rarely+0.038
lf+0.037
 hallway+0.035
é+0.035
icia+0.035
Neg logit contributions
endant-0.030
als-0.029
 seeds-0.028
 da-0.025
 bread-0.025
Token that
Token ID326
Feature activation+1.30e+00

Pos logit contributions
 rarely+0.054
lf+0.052
é+0.051
 hallway+0.051
 tv+0.050
Neg logit contributions
endant-0.024
als-0.022
 seeds-0.020
Sur-0.018
 half-0.017
Token he
Token ID339
Feature activation+2.44e+00

Pos logit contributions
 rarely+0.032
 tv+0.032
lf+0.032
icia+0.032
 hallway+0.032
Neg logit contributions
endant-0.023
 seeds-0.023
als-0.023
 bread-0.019
Sur-0.019
Token played
Token ID2826
Feature activation+5.12e+00

Pos logit contributions
 tv+0.062
 hallway+0.062
loe+0.061
 seeker+0.059
 stair+0.058
Neg logit contributions
 seeds-0.041
endant-0.040
als-0.037
 nuts-0.036
 cones-0.036
Token with
Token ID351
Feature activation+1.41e+00

Pos logit contributions
loe+0.173
 rarely+0.172
idget+0.170
icia+0.169
é+0.169
Neg logit contributions
 seeds-0.051
 some-0.047
 to-0.045
 bread-0.045
 sandwiches-0.044
Token every
Token ID790
Feature activation+1.60e+00

Pos logit contributions
loe+0.046
 hallway+0.046
inge+0.043
idget+0.043
 rarely+0.043
Neg logit contributions
 seeds-0.017
als-0.016
 to-0.015
s-0.015
 bread-0.015
Token day
Token ID1110
Feature activation+1.64e+00

Pos logit contributions
 rarely+0.035
icia+0.033
loe+0.032
idget+0.032
é+0.032
Neg logit contributions
 seeds-0.034
als-0.033
 bread-0.032
 half-0.032
endant-0.032
Token.
Token ID13
Feature activation+9.61e-01

Pos logit contributions
 rarely+0.055
lf+0.054
é+0.053
 hallway+0.053
loe+0.052
Neg logit contributions
endant-0.017
als-0.016
 seeds-0.015
 bread-0.012
 enough-0.012
Token One
Token ID1881
Feature activation+1.23e+00

Pos logit contributions
lf+0.028
 rarely+0.028
é+0.026
icia+0.026
 hallway+0.026
Neg logit contributions
endant-0.015
als-0.013
 seeds-0.013
Sur-0.012
 half-0.012
Token it
Token ID340
Feature activation+2.23e-01

Pos logit contributions
endant+0.023
als+0.021
Sur+0.020
 seeds+0.019
 da+0.018
Neg logit contributions
 rarely-0.026
lf-0.024
é-0.024
 tv-0.023
 especially-0.023
Token tastes
Token ID18221
Feature activation-3.67e-01

Pos logit contributions
 rarely+0.006
lf+0.006
é+0.006
 hallway+0.006
 tv+0.006
Neg logit contributions
endant-0.004
als-0.004
 seeds-0.003
Sur-0.003
 da-0.003
Token."
Token ID526
Feature activation+7.01e-01

Pos logit contributions
endant+0.009
als+0.008
Sur+0.008
 seeds+0.008
elt+0.007
Neg logit contributions
 rarely-0.008
lf-0.007
é-0.007
 especially-0.007
 tv-0.007
Token
Token ID198
Feature activation-1.31e-01

Pos logit contributions
lf+0.022
 rarely+0.022
 hallway+0.020
é+0.020
 tv+0.020
Neg logit contributions
endant-0.010
als-0.010
Sur-0.009
 seeds-0.008
of-0.008
Token
Token ID198
Feature activation+3.01e-01

Pos logit contributions
endant+0.001
Sur+0.001
als+0.001
of+0.001
ossible+0.001
Neg logit contributions
 rarely-0.004
lf-0.004
 hallway-0.004
é-0.004
 tv-0.004
TokenL
Token ID43
Feature activation+5.11e+00

Pos logit contributions
lf+0.008
 rarely+0.008
é+0.008
 tv+0.008
 hallway+0.008
Neg logit contributions
endant-0.005
Sur-0.005
als-0.005
 seeds-0.005
 half-0.004
Tokenula
Token ID4712
Feature activation+1.24e+00

Pos logit contributions
 rarely+0.102
lf+0.099
 hallway+0.091
 tv+0.086
é+0.084
Neg logit contributions
endant-0.130
als-0.129
Sur-0.129
 half-0.122
 seeds-0.116
Token could
Token ID714
Feature activation+2.20e+00

Pos logit contributions
 rarely+0.030
é+0.029
lf+0.028
icia+0.027
 hallway+0.027
Neg logit contributions
endant-0.026
 seeds-0.022
Sur-0.021
als-0.021
 sandwiches-0.019
Token barely
Token ID8523
Feature activation+1.51e+00

Pos logit contributions
lf+0.047
 hallway+0.046
é+0.046
 rarely+0.044
 tv+0.043
Neg logit contributions
endant-0.049
 seeds-0.046
als-0.046
 bread-0.044
 half-0.043
Token contain
Token ID3994
Feature activation+1.26e+00

Pos logit contributions
lf+0.035
 rarely+0.035
é+0.034
 hallway+0.033
 tv+0.033
Neg logit contributions
endant-0.033
als-0.030
 seeds-0.029
Sur-0.028
 half-0.026
Token her
Token ID607
Feature activation+4.05e-02

Pos logit contributions
 rarely+0.031
lf+0.031
é+0.030
 hallway+0.029
 tv+0.029
Neg logit contributions
endant-0.025
als-0.023
 seeds-0.023
Sur-0.022
 half-0.020
Token knew
Token ID2993
Feature activation-6.25e-02

Pos logit contributions
endant+0.002
als+0.002
 seeds+0.002
Sur+0.002
 da+0.002
Neg logit contributions
 rarely-0.002
lf-0.002
é-0.002
 hallway-0.002
 tv-0.002
Token that
Token ID326
Feature activation+3.60e-01

Pos logit contributions
endant+0.001
als+0.001
 seeds+0.001
Sur+0.001
 da+0.001
Neg logit contributions
lf-0.002
 rarely-0.002
é-0.002
 hallway-0.002
 tv-0.002
Token accidents
Token ID17390
Feature activation+1.09e+00

Pos logit contributions
lf+0.009
 rarely+0.009
é+0.009
 hallway+0.009
 tv+0.009
Neg logit contributions
endant-0.006
 seeds-0.006
als-0.006
Sur-0.006
 half-0.005
Token happen
Token ID1645
Feature activation+6.77e-01

Pos logit contributions
 rarely+0.023
lf+0.023
é+0.022
 hallway+0.021
 tv+0.021
Neg logit contributions
endant-0.026
als-0.024
 seeds-0.023
Sur-0.023
 da-0.021
Token.
Token ID13
Feature activation+4.18e-02

Pos logit contributions
 rarely+0.018
lf+0.018
é+0.018
 hallway+0.017
 tv+0.017
Neg logit contributions
endant-0.012
als-0.011
 seeds-0.010
Sur-0.010
 da-0.009
Token Fel
Token ID13937
Feature activation+5.08e+00

Pos logit contributions
lf+0.001
 rarely+0.001
é+0.001
 hallway+0.001
 tv+0.001
Neg logit contributions
endant-0.001
als-0.001
 seeds-0.001
Sur-0.001
 half-0.001
Tokenicia
Token ID33577
Feature activation+1.40e+00

Pos logit contributions
 rarely+0.048
 hallway+0.042
é+0.041
lf+0.041
 ramp+0.036
Neg logit contributions
endant-0.167
als-0.165
 seeds-0.157
Sur-0.157
 enough-0.153
Token learned
Token ID4499
Feature activation-1.46e+00

Pos logit contributions
lf+0.033
 rarely+0.032
é+0.031
 tv+0.031
 hallway+0.031
Neg logit contributions
endant-0.029
als-0.028
 seeds-0.027
Sur-0.025
 half-0.024
Token to
Token ID284
Feature activation+1.01e+00

Pos logit contributions
endant+0.027
als+0.026
Sur+0.022
 seeds+0.022
elt+0.020
Neg logit contributions
 rarely-0.040
lf-0.039
 tv-0.037
é-0.037
 hallway-0.036
Token be
Token ID307
Feature activation-1.45e-01

Pos logit contributions
lf+0.024
 rarely+0.024
é+0.023
 hallway+0.023
 tv+0.023
Neg logit contributions
endant-0.021
als-0.019
 seeds-0.019
Sur-0.018
 half-0.017
Token more
Token ID517
Feature activation-3.35e-01

Pos logit contributions
endant+0.003
als+0.003
 seeds+0.002
Sur+0.002
of+0.002
Neg logit contributions
 rarely-0.004
lf-0.003
é-0.003
 hallway-0.003
 especially-0.003
Token to
Token ID284
Feature activation+1.24e+00

Pos logit contributions
endant+0.028
als+0.025
Sur+0.023
 seeds+0.023
elt+0.020
Neg logit contributions
 rarely-0.048
lf-0.047
é-0.045
 hallway-0.045
 tv-0.045
Token me
Token ID502
Feature activation+5.96e-01

Pos logit contributions
 rarely+0.037
lf+0.037
é+0.036
 hallway+0.035
 tv+0.035
Neg logit contributions
endant-0.018
als-0.017
 seeds-0.016
Sur-0.015
 half-0.013
Token."
Token ID526
Feature activation+1.46e+00

Pos logit contributions
lf+0.017
 rarely+0.016
é+0.016
 hallway+0.016
 tv+0.015
Neg logit contributions
endant-0.010
als-0.009
 seeds-0.009
Sur-0.008
 half-0.008
Token
Token ID198
Feature activation+7.72e-01

Pos logit contributions
lf+0.037
 rarely+0.037
 hallway+0.035
é+0.034
 tv+0.033
Neg logit contributions
endant-0.028
als-0.027
 seeds-0.025
Sur-0.025
elt-0.023
Token
Token ID198
Feature activation+1.10e+00

Pos logit contributions
 rarely+0.026
lf+0.025
 hallway+0.024
é+0.024
 tv+0.024
Neg logit contributions
endant-0.008
Sur-0.008
als-0.008
 seeds-0.007
ossible-0.007
TokenOl
Token ID30098
Feature activation+5.07e+00

Pos logit contributions
 rarely+0.027
lf+0.026
é+0.026
 hallway+0.025
 tv+0.025
Neg logit contributions
endant-0.022
als-0.021
Sur-0.020
 seeds-0.020
 half-0.018
Tokeniver
Token ID1428
Feature activation+2.05e-01

Pos logit contributions
 rarely+0.088
lf+0.087
 hallway+0.080
é+0.078
 tv+0.078
Neg logit contributions
endant-0.136
Sur-0.128
als-0.125
 seeds-0.124
of-0.120
Token was
Token ID373
Feature activation+1.41e+00

Pos logit contributions
 rarely+0.005
lf+0.005
é+0.005
 hallway+0.005
 tv+0.005
Neg logit contributions
endant-0.004
als-0.004
 seeds-0.004
Sur-0.004
 half-0.003
Token surprised
Token ID6655
Feature activation-2.01e+00

Pos logit contributions
lf+0.033
 rarely+0.032
é+0.031
 hallway+0.031
 tv+0.030
Neg logit contributions
endant-0.030
als-0.027
 seeds-0.027
Sur-0.026
 half-0.025
Token and
Token ID290
Feature activation+4.43e-01

Pos logit contributions
endant+0.051
als+0.046
 seeds+0.045
Sur+0.043
 bread+0.040
Neg logit contributions
 rarely-0.041
lf-0.040
 especially-0.039
 tv-0.038
é-0.037
Token asked
Token ID1965
Feature activation+1.25e-01

Pos logit contributions
lf+0.011
 rarely+0.011
é+0.011
 hallway+0.010
 tv+0.010
Neg logit contributions
endant-0.009
als-0.008
 seeds-0.008
Sur-0.008
 da-0.007
Tokenals
Token ID874
Feature activation-2.41e+00

Pos logit contributions
endant+0.065
 fiber+0.054
ossible+0.043
Sur+0.041
aid+0.041
Neg logit contributions
 tv-0.130
 hallway-0.129
é-0.125
lf-0.120
 carpet-0.118
Token and
Token ID290
Feature activation-2.32e+00

Pos logit contributions
endant+0.066
 fiber+0.063
Sur+0.061
ossible+0.054
ww+0.054
Neg logit contributions
 hallway-0.039
lf-0.038
 rarely-0.037
 especially-0.036
é-0.036
Token bloom
Token ID29955
Feature activation-2.34e+00

Pos logit contributions
endant+0.053
Sur+0.045
als+0.043
 seeds+0.042
 fiber+0.041
Neg logit contributions
 rarely-0.053
lf-0.052
é-0.049
 hallway-0.049
 tv-0.047
Tokened
Token ID276
Feature activation-2.38e+00

Pos logit contributions
endant+0.054
ossible+0.048
 fiber+0.048
 cones+0.044
 nuts+0.044
Neg logit contributions
 rarely-0.047
lf-0.046
 hallway-0.045
é-0.045
 especially-0.044
Token in
Token ID287
Feature activation-1.19e+00

Pos logit contributions
endant+0.058
Sur+0.049
als+0.048
 seeds+0.047
 fiber+0.046
Neg logit contributions
 rarely-0.050
lf-0.047
é-0.047
 tv-0.047
 hallway-0.045
Token bright
Token ID6016
Feature activation-5.45e+00

Pos logit contributions
endant+0.035
elt+0.029
als+0.029
Sur+0.027
 fiber+0.027
Neg logit contributions
 tv-0.021
 hallway-0.020
 rarely-0.020
 especially-0.020
 Chess-0.019
Token colors
Token ID7577
Feature activation-1.18e+00

Pos logit contributions
endant+0.139
asure+0.132
nuts+0.127
 fiber+0.125
ray+0.121
Neg logit contributions
 hallway-0.091
 tv-0.086
 cartoons-0.085
lf-0.085
 especially-0.084
Token -
Token ID532
Feature activation-5.35e-01

Pos logit contributions
endant+0.026
Sur+0.024
ossible+0.023
 fiber+0.023
 seeds+0.022
Neg logit contributions
lf-0.026
é-0.026
 rarely-0.025
 hallway-0.024
 tv-0.024
Token red
Token ID2266
Feature activation-3.65e+00

Pos logit contributions
endant+0.010
als+0.009
 seeds+0.009
Sur+0.008
 half+0.008
Neg logit contributions
 rarely-0.014
lf-0.014
é-0.013
 hallway-0.013
 tv-0.013
Token,
Token ID11
Feature activation-8.67e-01

Pos logit contributions
endant+0.080
ossible+0.075
Sur+0.071
 fiber+0.065
prise+0.063
Neg logit contributions
 tv-0.074
lf-0.074
 hallway-0.073
 cartoons-0.071
é-0.070
Token yellow
Token ID7872
Feature activation-3.43e+00

Pos logit contributions
endant+0.014
als+0.013
 seeds+0.012
Sur+0.012
 half+0.010
Neg logit contributions
 rarely-0.025
lf-0.025
é-0.024
 hallway-0.024
 tv-0.023
Token made
Token ID925
Feature activation+2.00e+00

Pos logit contributions
endant+0.023
als+0.021
 seeds+0.020
Sur+0.020
elt+0.018
Neg logit contributions
 rarely-0.024
lf-0.024
é-0.022
 hallway-0.022
 tv-0.021
Token the
Token ID262
Feature activation-1.78e-01

Pos logit contributions
 rarely+0.046
lf+0.045
é+0.044
 tv+0.043
 hallway+0.042
Neg logit contributions
endant-0.044
als-0.038
 seeds-0.037
Sur-0.036
elt-0.033
Token gloomy
Token ID46400
Feature activation-1.19e+00

Pos logit contributions
endant+0.004
Sur+0.003
als+0.003
elt+0.003
of+0.003
Neg logit contributions
 rarely-0.004
lf-0.004
 hallway-0.004
é-0.004
icia-0.004
Token sky
Token ID6766
Feature activation-5.37e-01

Pos logit contributions
endant+0.025
Sur+0.022
als+0.022
 seeds+0.020
ossible+0.020
Neg logit contributions
lf-0.028
 rarely-0.028
 hallway-0.028
é-0.026
 tv-0.026
Token look
Token ID804
Feature activation-1.06e+00

Pos logit contributions
endant+0.012
Sur+0.011
als+0.011
 seeds+0.010
ossible+0.010
Neg logit contributions
 rarely-0.012
lf-0.011
é-0.011
 hallway-0.011
 tv-0.011
Token much
Token ID881
Feature activation-5.16e+00

Pos logit contributions
endant+0.022
als+0.021
 seeds+0.020
Sur+0.020
elt+0.018
Neg logit contributions
 rarely-0.026
lf-0.024
é-0.023
 tv-0.023
 especially-0.023
Token nicer
Token ID36597
Feature activation-2.38e+00

Pos logit contributions
ossible+0.137
als+0.135
endant+0.135
Sur+0.131
of+0.129
Neg logit contributions
 especially-0.075
 rarely-0.073
 cartoons-0.071
 tv-0.069
 unhappy-0.065
Token.
Token ID13
Feature activation-1.10e+00

Pos logit contributions
endant+0.055
Sur+0.053
ossible+0.048
als+0.047
 fiber+0.047
Neg logit contributions
 rarely-0.046
lf-0.045
 especially-0.043
 hallway-0.042
é-0.042
Token
Token ID220
Feature activation+6.46e-01

Pos logit contributions
endant+0.028
 seeds+0.024
Sur+0.023
 bread+0.023
als+0.022
Neg logit contributions
 rarely-0.021
é-0.019
 hallway-0.019
 Chess-0.019
 tv-0.018
Token
Token ID198
Feature activation-1.44e+00

Pos logit contributions
 rarely+0.018
lf+0.018
é+0.017
 hallway+0.017
 tv+0.017
Neg logit contributions
endant-0.011
als-0.010
 seeds-0.010
Sur-0.009
 half-0.008
Token
Token ID198
Feature activation-8.95e-01

Pos logit contributions
endant+0.030
als+0.027
 seeds+0.026
Sur+0.024
 half+0.023
Neg logit contributions
 rarely-0.035
lf-0.034
é-0.033
 hallway-0.033
 tv-0.032
Token beautiful
Token ID4950
Feature activation-2.23e+00

Pos logit contributions
endant+0.032
als+0.029
 seeds+0.027
Sur+0.026
elt+0.025
Neg logit contributions
lf-0.038
 rarely-0.038
 hallway-0.036
é-0.036
 tv-0.035
Token flower
Token ID15061
Feature activation-1.79e+00

Pos logit contributions
endant+0.046
als+0.043
Sur+0.041
 seeds+0.038
elt+0.038
Neg logit contributions
lf-0.055
 rarely-0.053
 hallway-0.052
 tv-0.050
é-0.049
Token bloom
Token ID29955
Feature activation-2.01e+00

Pos logit contributions
endant+0.038
Sur+0.033
als+0.032
 seeds+0.032
 fiber+0.031
Neg logit contributions
lf-0.043
 rarely-0.042
é-0.040
 hallway-0.040
 tv-0.038
Tokened
Token ID276
Feature activation-2.10e+00

Pos logit contributions
endant+0.044
 fiber+0.039
ossible+0.038
Sur+0.037
 nuts+0.036
Neg logit contributions
lf-0.044
 rarely-0.043
é-0.043
 hallway-0.041
 especially-0.039
Token with
Token ID351
Feature activation-2.32e+00

Pos logit contributions
endant+0.051
Sur+0.044
als+0.043
 fiber+0.042
 seeds+0.042
Neg logit contributions
 rarely-0.044
lf-0.042
é-0.041
 tv-0.040
 hallway-0.040
Token bright
Token ID6016
Feature activation-5.13e+00

Pos logit contributions
endant+0.057
Sur+0.048
elt+0.047
lets+0.046
als+0.045
Neg logit contributions
 rarely-0.046
 especially-0.042
 tv-0.041
lf-0.041
 hardly-0.039
Token colors
Token ID7577
Feature activation-1.75e+00

Pos logit contributions
endant+0.111
Sur+0.105
of+0.100
ossible+0.099
Nic+0.097
Neg logit contributions
 tv-0.097
 rarely-0.097
lf-0.095
é-0.095
 hallway-0.093
Token.
Token ID13
Feature activation-3.71e-01

Pos logit contributions
endant+0.039
Sur+0.038
ossible+0.037
 fiber+0.036
 seeds+0.033
Neg logit contributions
lf-0.035
é-0.035
 rarely-0.034
 tv-0.033
 hallway-0.033
Token The
Token ID383
Feature activation+5.63e-02

Pos logit contributions
endant+0.008
als+0.008
 seeds+0.008
Sur+0.007
 half+0.007
Neg logit contributions
 rarely-0.008
lf-0.008
é-0.008
 hallway-0.008
 tv-0.008
Token flower
Token ID15061
Feature activation-1.49e+00

Pos logit contributions
lf+0.001
 rarely+0.001
 hallway+0.001
é+0.001
 tv+0.001
Neg logit contributions
endant-0.001
als-0.001
Sur-0.001
of-0.001
 fiber-0.001
Token's
Token ID338
Feature activation-4.05e-01

Pos logit contributions
endant+0.034
Sur+0.031
als+0.030
 seeds+0.029
elt+0.028
Neg logit contributions
 rarely-0.033
lf-0.033
é-0.031
 hallway-0.030
 especially-0.029
Token the
Token ID262
Feature activation-2.03e+00

Pos logit contributions
endant+0.010
 seeds+0.008
als+0.008
Sur+0.008
 fiber+0.007
Neg logit contributions
lf-0.010
 rarely-0.010
 hallway-0.010
 tv-0.010
é-0.009
Token sky
Token ID6766
Feature activation-1.51e+00

Pos logit contributions
endant+0.039
als+0.034
Sur+0.032
 seeds+0.031
of+0.029
Neg logit contributions
lf-0.052
 rarely-0.052
 hallway-0.051
é-0.049
 tv-0.049
Token was
Token ID373
Feature activation-4.33e-01

Pos logit contributions
endant+0.037
Sur+0.032
als+0.031
 seeds+0.030
 sandwiches+0.029
Neg logit contributions
 rarely-0.033
é-0.030
lf-0.030
 hallway-0.029
 especially-0.028
Token filled
Token ID5901
Feature activation+2.09e-01

Pos logit contributions
endant+0.010
als+0.009
Sur+0.009
 seeds+0.009
of+0.008
Neg logit contributions
 rarely-0.010
lf-0.010
 hallway-0.009
é-0.009
 tv-0.009
Token with
Token ID351
Feature activation-3.40e+00

Pos logit contributions
 rarely+0.005
lf+0.005
é+0.005
 hallway+0.005
 tv+0.005
Neg logit contributions
endant-0.004
 seeds-0.004
als-0.004
Sur-0.004
 bread-0.004
Token bright
Token ID6016
Feature activation-5.06e+00

Pos logit contributions
endant+0.078
als+0.064
of+0.061
Sur+0.061
lets+0.059
Neg logit contributions
 rarely-0.077
 especially-0.076
 tv-0.072
 hallway-0.071
é-0.069
Token yellow
Token ID7872
Feature activation-4.10e+00

Pos logit contributions
endant+0.119
ossible+0.104
 fiber+0.104
of+0.104
Sur+0.103
Neg logit contributions
 cartoons-0.096
 tv-0.095
é-0.095
 hallway-0.093
 especially-0.093
Token and
Token ID290
Feature activation-1.64e+00

Pos logit contributions
endant+0.098
ossible+0.092
of+0.087
Sur+0.087
 fiber+0.087
Neg logit contributions
 hallway-0.073
 cartoons-0.071
é-0.070
 especially-0.070
 tv-0.070
Token orange
Token ID10912
Feature activation-2.83e+00

Pos logit contributions
endant+0.031
als+0.027
Sur+0.025
 seeds+0.025
of+0.023
Neg logit contributions
 rarely-0.044
lf-0.044
 hallway-0.042
é-0.041
 tv-0.040
Token h
Token ID289
Feature activation+3.32e-01

Pos logit contributions
endant+0.061
ossible+0.057
Sur+0.056
of+0.055
 fiber+0.052
Neg logit contributions
lf-0.060
 hallway-0.059
 rarely-0.058
é-0.057
 tv-0.056
Tokenues
Token ID947
Feature activation-7.51e-01

Pos logit contributions
 rarely+0.006
lf+0.006
é+0.006
 hallway+0.006
 tv+0.006
Neg logit contributions
endant-0.008
 seeds-0.008
als-0.008
Sur-0.008
 half-0.007
Token and
Token ID290
Feature activation-1.10e+00

Pos logit contributions
endant+0.037
 seeds+0.032
als+0.031
Sur+0.031
 fiber+0.029
Neg logit contributions
 rarely-0.039
lf-0.038
 hallway-0.038
é-0.037
 tv-0.036
Token the
Token ID262
Feature activation-1.36e+00

Pos logit contributions
endant+0.023
als+0.021
 seeds+0.021
Sur+0.020
 da+0.018
Neg logit contributions
 rarely-0.027
lf-0.027
é-0.025
 hallway-0.025
 tv-0.025
Token sky
Token ID6766
Feature activation-1.00e+00

Pos logit contributions
endant+0.025
als+0.023
 seeds+0.022
Sur+0.021
 half+0.019
Neg logit contributions
 rarely-0.036
lf-0.036
é-0.035
 hallway-0.034
 tv-0.034
Token filled
Token ID5901
Feature activation+6.15e-01

Pos logit contributions
endant+0.025
als+0.022
 seeds+0.022
Sur+0.021
 bread+0.020
Neg logit contributions
 rarely-0.021
lf-0.020
é-0.019
 hallway-0.019
 tv-0.018
Token with
Token ID351
Feature activation-2.77e+00

Pos logit contributions
 rarely+0.016
lf+0.016
é+0.015
 hallway+0.015
 drummer+0.015
Neg logit contributions
als-0.011
 seeds-0.010
endant-0.010
 half-0.010
Sur-0.010
Token lovely
Token ID14081
Feature activation-4.93e+00

Pos logit contributions
endant+0.060
als+0.049
Sur+0.047
 fiber+0.047
ossible+0.046
Neg logit contributions
 rarely-0.065
 hallway-0.061
 tv-0.061
 especially-0.060
lf-0.059
Token colors
Token ID7577
Feature activation-8.95e-01

Pos logit contributions
endant+0.104
ossible+0.091
 fiber+0.090
als+0.089
Sur+0.088
Neg logit contributions
 hallway-0.106
lf-0.105
 tv-0.102
 rarely-0.101
 cartoons-0.100
Token.
Token ID13
Feature activation-4.37e-01

Pos logit contributions
endant+0.019
Sur+0.019
 fiber+0.018
ossible+0.018
 seeds+0.017
Neg logit contributions
lf-0.019
 rarely-0.018
é-0.018
 hallway-0.018
 especially-0.018
Token Ally
Token ID43039
Feature activation+7.62e-01

Pos logit contributions
endant+0.008
als+0.008
 seeds+0.007
Sur+0.007
 da+0.007
Neg logit contributions
lf-0.011
 rarely-0.011
é-0.011
 hallway-0.011
 tv-0.010
Token kept
Token ID4030
Feature activation-6.00e-01

Pos logit contributions
lf+0.018
 rarely+0.018
 tv+0.017
 hallway+0.017
é+0.017
Neg logit contributions
endant-0.016
als-0.015
 seeds-0.014
Sur-0.014
 da-0.013
Token tw
Token ID665
Feature activation+3.80e-01

Pos logit contributions
endant+0.016
als+0.015
 seeds+0.015
Sur+0.014
 da+0.014
Neg logit contributions
 rarely-0.011
lf-0.011
é-0.010
 hallway-0.010
 tv-0.010
Token big
Token ID1263
Feature activation-1.95e+00

Pos logit contributions
endant+0.034
als+0.031
 seeds+0.030
elt+0.029
 fiber+0.029
Neg logit contributions
 hallway-0.032
lf-0.032
 rarely-0.030
é-0.028
 tv-0.028
Token bush
Token ID24484
Feature activation-8.04e-01

Pos logit contributions
endant+0.046
als+0.041
 seeds+0.039
Sur+0.039
elt+0.037
Neg logit contributions
lf-0.044
 hallway-0.043
 rarely-0.042
é-0.041
 tv-0.039
Token with
Token ID351
Feature activation-2.99e+00

Pos logit contributions
endant+0.018
Sur+0.016
als+0.016
 fiber+0.016
 seeds+0.015
Neg logit contributions
 hallway-0.019
lf-0.019
 rarely-0.018
é-0.017
 especially-0.017
Token lots
Token ID6041
Feature activation-3.18e+00

Pos logit contributions
endant+0.067
als+0.056
ossible+0.053
 fiber+0.052
Sur+0.052
Neg logit contributions
 rarely-0.066
 hallway-0.063
 especially-0.062
lf-0.061
 tv-0.061
Token of
Token ID286
Feature activation-2.77e+00

Pos logit contributions
endant+0.090
ossible+0.086
als+0.082
Sur+0.082
 fiber+0.081
Neg logit contributions
lf-0.046
é-0.042
 especially-0.040
 rarely-0.040
 hallway-0.039
Token pretty
Token ID2495
Feature activation-4.90e+00

Pos logit contributions
endant+0.049
als+0.043
Sur+0.039
 seeds+0.038
 half+0.035
Neg logit contributions
 rarely-0.075
lf-0.073
 hallway-0.072
é-0.071
 tv-0.070
Token flowers
Token ID12734
Feature activation-1.14e+00

Pos logit contributions
endant+0.104
ossible+0.098
Sur+0.090
of+0.089
 half+0.087
Neg logit contributions
lf-0.102
 hallway-0.096
 rarely-0.092
 pretending-0.087
 cartoons-0.086
Token.
Token ID13
Feature activation-6.38e-01

Pos logit contributions
Sur+0.028
endant+0.027
 fiber+0.027
ossible+0.026
of+0.024
Neg logit contributions
lf-0.020
 pretending-0.020
 hallway-0.020
é-0.019
 having-0.019
Token
Token ID220
Feature activation+7.35e-01

Pos logit contributions
endant+0.011
als+0.010
 seeds+0.010
Sur+0.009
 half+0.008
Neg logit contributions
lf-0.018
 rarely-0.018
é-0.017
 hallway-0.016
 tv-0.016
Token
Token ID198
Feature activation-8.73e-01

Pos logit contributions
lf+0.022
 rarely+0.022
 hallway+0.021
é+0.021
 tv+0.021
Neg logit contributions
endant-0.011
als-0.010
 seeds-0.009
Sur-0.009
 half-0.008
Token
Token ID198
Feature activation-5.94e-01

Pos logit contributions
endant+0.010
Sur+0.009
als+0.009
 seeds+0.008
of+0.008
Neg logit contributions
 rarely-0.028
lf-0.028
é-0.027
 tv-0.027
 hallway-0.026
Token
Token ID198
Feature activation-4.18e-01

Pos logit contributions
lf+0.019
 rarely+0.019
é+0.018
 hallway+0.018
 tv+0.018
Neg logit contributions
endant-0.010
als-0.009
 seeds-0.009
Sur-0.009
 half-0.008
Token
Token ID198
Feature activation-3.08e-01

Pos logit contributions
endant+0.008
als+0.008
 seeds+0.007
Sur+0.007
 da+0.006
Neg logit contributions
lf-0.011
 rarely-0.011
é-0.010
 hallway-0.010
 tv-0.010
TokenMom
Token ID29252
Feature activation-1.94e+00

Pos logit contributions
endant+0.007
als+0.007
 seeds+0.006
 berries+0.006
elt+0.006
Neg logit contributions
lf-0.008
 rarely-0.007
 hallway-0.007
é-0.006
loe-0.006
Token smiled
Token ID13541
Feature activation-2.42e+00

Pos logit contributions
endant+0.042
 seeds+0.039
als+0.038
 fiber+0.035
Sur+0.035
Neg logit contributions
lf-0.049
é-0.044
 rarely-0.044
 hallway-0.043
 tv-0.041
Token and
Token ID290
Feature activation-2.62e+00

Pos logit contributions
endant+0.069
 fiber+0.063
als+0.061
 seeds+0.061
Sur+0.059
Neg logit contributions
 especially-0.037
 rarely-0.037
lf-0.035
 tv-0.035
é-0.035
Token pointed
Token ID6235
Feature activation-4.87e+00

Pos logit contributions
endant+0.061
 fiber+0.056
 seeds+0.054
als+0.052
ossible+0.052
Neg logit contributions
lf-0.055
 rarely-0.051
é-0.047
 especially-0.046
issa-0.045
Token to
Token ID284
Feature activation-6.66e-01

Pos logit contributions
endant+0.110
 fiber+0.092
Sur+0.091
 seeds+0.084
als+0.079
Neg logit contributions
é-0.108
 tv-0.103
 rarely-0.102
 anx-0.101
 Chess-0.099
Token the
Token ID262
Feature activation-1.47e+00

Pos logit contributions
endant+0.015
als+0.013
 seeds+0.013
Sur+0.012
elt+0.011
Neg logit contributions
 rarely-0.015
lf-0.015
é-0.015
 hallway-0.015
 tv-0.015
Token land
Token ID1956
Feature activation-1.05e+00

Pos logit contributions
endant+0.034
als+0.031
Sur+0.031
elt+0.031
lets+0.029
Neg logit contributions
 hallway-0.032
lf-0.031
 rarely-0.031
é-0.029
 tv-0.029
Token.
Token ID13
Feature activation+2.37e-01

Pos logit contributions
endant+0.019
als+0.017
Sur+0.017
 seeds+0.016
 fiber+0.014
Neg logit contributions
lf-0.028
 rarely-0.028
é-0.027
 hallway-0.027
 tv-0.026
Token "
Token ID366
Feature activation-1.75e+00

Pos logit contributions
 rarely+0.005
lf+0.005
 hallway+0.004
 tv+0.004
é+0.004
Neg logit contributions
endant-0.006
als-0.006
 seeds-0.005
Sur-0.005
of-0.005
Token purple
Token ID14032
Feature activation-2.06e+00

Pos logit contributions
endant+0.004
 seeds+0.003
als+0.003
Sur+0.003
 da+0.003
Neg logit contributions
 rarely-0.006
lf-0.005
é-0.005
loe-0.005
 tv-0.005
Token bird
Token ID6512
Feature activation-6.47e-01

Pos logit contributions
endant+0.035
als+0.032
 seeds+0.031
Sur+0.029
 da+0.027
Neg logit contributions
 rarely-0.058
lf-0.056
é-0.055
 hallway-0.054
 tv-0.053
Token with
Token ID351
Feature activation-2.68e+00

Pos logit contributions
endant+0.014
als+0.013
 seeds+0.012
Sur+0.012
 half+0.011
Neg logit contributions
lf-0.015
 rarely-0.015
é-0.014
 hallway-0.014
 tv-0.014
Token lots
Token ID6041
Feature activation-2.90e+00

Pos logit contributions
endant+0.056
als+0.049
 fiber+0.047
of+0.046
ossible+0.044
Neg logit contributions
 rarely-0.062
lf-0.057
é-0.055
 especially-0.054
 hallway-0.054
Token of
Token ID286
Feature activation-2.75e+00

Pos logit contributions
ossible+0.075
endant+0.074
als+0.071
Sur+0.071
 fiber+0.069
Neg logit contributions
lf-0.043
é-0.042
 rarely-0.040
 tv-0.040
 especially-0.039
Token colourful
Token ID44799
Feature activation-4.83e+00

Pos logit contributions
endant+0.043
als+0.039
Sur+0.036
of+0.033
 seeds+0.032
Neg logit contributions
 rarely-0.077
lf-0.075
é-0.074
 hallway-0.073
 tv-0.072
Token feathers
Token ID27400
Feature activation-8.20e-01

Pos logit contributions
endant+0.074
of+0.068
als+0.061
ossible+0.060
ocular+0.060
Neg logit contributions
lf-0.132
 pretending-0.118
 hallway-0.118
 tv-0.115
 cartoons-0.114
Token.
Token ID13
Feature activation+6.84e-01

Pos logit contributions
Sur+0.019
endant+0.018
 fiber+0.017
als+0.016
ossible+0.016
Neg logit contributions
é-0.016
lf-0.016
 rarely-0.015
 hallway-0.014
 especially-0.014
Token He
Token ID679
Feature activation+1.39e-03

Pos logit contributions
 rarely+0.015
lf+0.015
é+0.014
 hallway+0.014
 tv+0.013
Neg logit contributions
endant-0.016
als-0.015
 seeds-0.014
Sur-0.014
 half-0.013
Token watched
Token ID7342
Feature activation+7.38e-01

Pos logit contributions
 rarely+0.000
lf+0.000
é+0.000
 hallway+0.000
 tv+0.000
Neg logit contributions
endant-0.000
als-0.000
 seeds-0.000
Sur-0.000
 bread-0.000
Token as
Token ID355
Feature activation+1.33e+00

Pos logit contributions
 rarely+0.020
lf+0.019
 hallway+0.019
é+0.019
 tv+0.018
Neg logit contributions
endant-0.013
als-0.013
 seeds-0.012
Sur-0.011
 da-0.011
Token
Token ID198
Feature activation-4.04e-01

Pos logit contributions
 rarely+0.042
lf+0.041
é+0.040
 hallway+0.039
 tv+0.039
Neg logit contributions
endant-0.027
als-0.024
 seeds-0.023
Sur-0.023
 half-0.020
Token
Token ID198
Feature activation-4.32e-01

Pos logit contributions
endant+0.008
als+0.007
 seeds+0.007
Sur+0.006
 da+0.006
Neg logit contributions
lf-0.011
 rarely-0.011
é-0.010
 hallway-0.010
 tv-0.010
Token"
Token ID1
Feature activation-1.99e+00

Pos logit contributions
endant+0.009
als+0.008
 seeds+0.008
 da+0.007
Sur+0.007
Neg logit contributions
lf-0.011
 rarely-0.011
 hallway-0.010
é-0.010
 tv-0.010
TokenYes
Token ID5297
Feature activation-1.38e+00

Pos logit contributions
endant+0.049
als+0.045
 seeds+0.043
 da+0.039
 enough+0.039
Neg logit contributions
 rarely-0.043
lf-0.042
é-0.039
 hallway-0.039
 tv-0.038
Token,
Token ID11
Feature activation+1.31e-02

Pos logit contributions
endant+0.030
als+0.027
 seeds+0.026
Sur+0.025
elt+0.024
Neg logit contributions
lf-0.034
 rarely-0.033
é-0.031
 tv-0.031
loe-0.030
Token sorry
Token ID7926
Feature activation-4.78e+00

Pos logit contributions
lf+0.000
 rarely+0.000
 tv+0.000
é+0.000
 especially+0.000
Neg logit contributions
endant-0.000
als-0.000
 seeds-0.000
Sur-0.000
elt-0.000
Token,
Token ID11
Feature activation-1.47e-01

Pos logit contributions
endant+0.138
als+0.115
Sur+0.114
 seeds+0.109
elt+0.104
Neg logit contributions
 Chess-0.077
 cartoons-0.077
 tv-0.076
lf-0.072
 seeker-0.066
Token Mom
Token ID11254
Feature activation-1.74e+00

Pos logit contributions
endant+0.003
als+0.003
 seeds+0.003
Sur+0.003
elt+0.003
Neg logit contributions
lf-0.004
 rarely-0.004
 tv-0.003
 especially-0.003
é-0.003
Token.
Token ID13
Feature activation-1.47e+00

Pos logit contributions
endant+0.036
Sur+0.034
als+0.034
 seeds+0.031
 da+0.029
Neg logit contributions
lf-0.040
é-0.039
 rarely-0.038
 hallway-0.036
 tv-0.036
Token We
Token ID775
Feature activation-1.27e+00

Pos logit contributions
endant+0.033
 seeds+0.029
als+0.029
Sur+0.028
 da+0.026
Neg logit contributions
 rarely-0.033
lf-0.032
é-0.032
 tv-0.031
 hallway-0.031
Token will
Token ID481
Feature activation-1.04e+00

Pos logit contributions
endant+0.024
als+0.022
 seeds+0.021
Sur+0.020
 da+0.018
Neg logit contributions
 rarely-0.033
lf-0.032
é-0.032
 hallway-0.031
 tv-0.031
Token the
Token ID262
Feature activation-4.60e-01

Pos logit contributions
 rarely+0.003
é+0.003
lf+0.003
 hallway+0.003
loe+0.003
Neg logit contributions
endant-0.003
als-0.003
 seeds-0.003
Sur-0.003
 half-0.003
Token sky
Token ID6766
Feature activation-1.00e+00

Pos logit contributions
endant+0.009
als+0.008
Sur+0.007
 seeds+0.007
of+0.007
Neg logit contributions
lf-0.012
 rarely-0.012
 hallway-0.012
é-0.011
 tv-0.011
Token lit
Token ID6578
Feature activation-1.02e+00

Pos logit contributions
endant+0.022
als+0.020
 seeds+0.020
Sur+0.019
 da+0.018
Neg logit contributions
 rarely-0.023
lf-0.023
é-0.022
 hallway-0.022
 tv-0.021
Token up
Token ID510
Feature activation-6.65e-01

Pos logit contributions
endant+0.027
als+0.026
 seeds+0.025
Sur+0.024
 half+0.023
Neg logit contributions
lf-0.018
 rarely-0.018
é-0.017
 hallway-0.017
 tv-0.016
Token with
Token ID351
Feature activation-1.09e+00

Pos logit contributions
endant+0.011
als+0.010
 seeds+0.010
Sur+0.009
 da+0.008
Neg logit contributions
 rarely-0.019
lf-0.019
é-0.018
 hallway-0.018
 tv-0.018
Token pretty
Token ID2495
Feature activation-4.76e+00

Pos logit contributions
endant+0.025
als+0.022
Sur+0.020
 seeds+0.019
elt+0.019
Neg logit contributions
 rarely-0.025
lf-0.024
 tv-0.024
 hallway-0.024
é-0.023
Token colours
Token ID18915
Feature activation-1.75e+00

Pos logit contributions
endant+0.101
ossible+0.098
Sur+0.090
 fiber+0.087
asure+0.085
Neg logit contributions
 hallway-0.098
 cartoons-0.098
 tv-0.096
 rarely-0.095
lf-0.093
Token.
Token ID13
Feature activation+1.34e-01

Pos logit contributions
endant+0.038
Sur+0.036
ossible+0.035
 fiber+0.035
als+0.033
Neg logit contributions
lf-0.037
é-0.036
 rarely-0.036
 tv-0.035
 hallway-0.035
Token Max
Token ID5436
Feature activation+1.33e-01

Pos logit contributions
 rarely+0.003
lf+0.003
é+0.003
 hallway+0.002
 seeker+0.002
Neg logit contributions
endant-0.003
als-0.003
Sur-0.003
 seeds-0.003
 eyes-0.003
Token stayed
Token ID9658
Feature activation-6.43e-02

Pos logit contributions
 rarely+0.003
lf+0.003
é+0.003
 hallway+0.003
 tv+0.003
Neg logit contributions
endant-0.003
als-0.003
 seeds-0.003
Sur-0.002
 half-0.002
Token and
Token ID290
Feature activation-1.41e+00

Pos logit contributions
endant+0.001
als+0.001
 seeds+0.001
Sur+0.001
 da+0.001
Neg logit contributions
 rarely-0.001
lf-0.001
é-0.001
 hallway-0.001
 tv-0.001
Token in
Token ID287
Feature activation+1.45e-01

Pos logit contributions
lf+0.025
é+0.023
 rarely+0.023
loe+0.023
 tv+0.023
Neg logit contributions
endant-0.017
 seeds-0.015
als-0.015
 enough-0.015
 da-0.014
Token a
Token ID257
Feature activation-3.65e-01

Pos logit contributions
 rarely+0.004
lf+0.004
é+0.004
 hallway+0.004
 tv+0.004
Neg logit contributions
endant-0.003
als-0.002
 seeds-0.002
Sur-0.002
 half-0.002
Token beautiful
Token ID4950
Feature activation-1.82e+00

Pos logit contributions
endant+0.008
als+0.008
 seeds+0.007
elt+0.007
Sur+0.007
Neg logit contributions
 hallway-0.008
lf-0.008
 rarely-0.008
 tv-0.008
é-0.007
Token garden
Token ID11376
Feature activation-4.17e-01

Pos logit contributions
endant+0.043
als+0.042
Sur+0.040
elt+0.039
 fiber+0.038
Neg logit contributions
lf-0.041
 hallway-0.039
 rarely-0.036
 tv-0.035
 especially-0.032
Token with
Token ID351
Feature activation-1.33e+00

Pos logit contributions
endant+0.009
als+0.008
Sur+0.008
 seeds+0.008
 fiber+0.007
Neg logit contributions
lf-0.010
 rarely-0.010
 hallway-0.009
é-0.009
 tv-0.009
Token bright
Token ID6016
Feature activation-4.76e+00

Pos logit contributions
endant+0.030
als+0.027
Sur+0.026
of+0.024
lets+0.024
Neg logit contributions
 rarely-0.029
lf-0.028
 hallway-0.027
 tv-0.026
 especially-0.026
Token flowers
Token ID12734
Feature activation-1.12e+00

Pos logit contributions
endant+0.097
Sur+0.091
als+0.084
ossible+0.084
of+0.084
Neg logit contributions
lf-0.102
 hallway-0.101
 tv-0.100
 rarely-0.098
é-0.098
Token,
Token ID11
Feature activation-1.14e+00

Pos logit contributions
Sur+0.028
endant+0.027
 fiber+0.026
ossible+0.025
 half+0.024
Neg logit contributions
lf-0.020
 hallway-0.020
 pretending-0.019
 especially-0.019
é-0.019
Token gentle
Token ID10296
Feature activation-4.09e+00

Pos logit contributions
endant+0.020
als+0.018
 seeds+0.017
Sur+0.016
 half+0.015
Neg logit contributions
 rarely-0.032
lf-0.031
é-0.030
 hallway-0.030
 tv-0.029
Token bree
Token ID20780
Feature activation-1.14e+00

Pos logit contributions
endant+0.098
ossible+0.085
Sur+0.076
 half+0.076
prise+0.075
Neg logit contributions
 hallway-0.083
 pretending-0.077
 rarely-0.076
 tv-0.074
lf-0.071
Tokenzes
Token ID12271
Feature activation-1.52e+00

Pos logit contributions
endant+0.035
als+0.034
Sur+0.033
 seeds+0.032
 half+0.031
Neg logit contributions
lf-0.017
 rarely-0.016
é-0.014
 hallway-0.014
 tv-0.014
Token she
Token ID673
Feature activation+2.67e-01

Pos logit contributions
lf+0.023
é+0.023
 rarely+0.022
idget+0.022
 tv+0.021
Neg logit contributions
als-0.014
 seeds-0.012
endant-0.012
Sur-0.012
 enough-0.011
Token found
Token ID1043
Feature activation-1.51e+00

Pos logit contributions
 rarely+0.007
lf+0.007
é+0.006
 hallway+0.006
 tv+0.006
Neg logit contributions
endant-0.005
als-0.005
 seeds-0.005
Sur-0.005
 half-0.004
Token big
Token ID1263
Feature activation-2.89e+00

Pos logit contributions
endant+0.028
Sur+0.023
als+0.023
 fiber+0.021
 seeds+0.021
Neg logit contributions
 rarely-0.040
é-0.038
lf-0.038
 hallway-0.037
 tv-0.036
Token flowers
Token ID12734
Feature activation-6.74e-01

Pos logit contributions
endant+0.049
als+0.042
Sur+0.042
 seeds+0.040
 half+0.036
Neg logit contributions
 rarely-0.080
lf-0.079
 hallway-0.077
é-0.077
 tv-0.076
Token with
Token ID351
Feature activation-2.66e+00

Pos logit contributions
endant+0.017
Sur+0.017
 fiber+0.016
als+0.015
ossible+0.015
Neg logit contributions
lf-0.012
 hallway-0.012
 especially-0.012
 rarely-0.012
é-0.011
Token colorful
Token ID20239
Feature activation-4.76e+00

Pos logit contributions
endant+0.057
Sur+0.048
als+0.046
ossible+0.044
elt+0.043
Neg logit contributions
 rarely-0.062
 especially-0.060
 hallway-0.058
lf-0.058
 tv-0.057
Token pet
Token ID4273
Feature activation-3.83e+00

Pos logit contributions
endant+0.099
ossible+0.091
ray+0.087
Sur+0.085
 half+0.081
Neg logit contributions
lf-0.099
 especially-0.098
 hallway-0.096
 cartoons-0.094
 tv-0.093
Tokenals
Token ID874
Feature activation-1.80e+00

Pos logit contributions
endant+0.048
ossible+0.036
 fiber+0.033
Sur+0.032
 half+0.031
Neg logit contributions
é-0.120
 tv-0.114
 especially-0.109
 hallway-0.108
 cartoons-0.108
Token and
Token ID290
Feature activation-1.40e+00

Pos logit contributions
Sur+0.042
endant+0.042
 fiber+0.038
ossible+0.036
 half+0.035
Neg logit contributions
lf-0.034
é-0.033
 especially-0.033
 rarely-0.033
 hallway-0.032
Token birds
Token ID10087
Feature activation-1.07e+00

Pos logit contributions
endant+0.026
als+0.022
Sur+0.022
 seeds+0.021
 half+0.019
Neg logit contributions
 rarely-0.038
lf-0.037
 hallway-0.035
é-0.035
 tv-0.034
Token picking
Token ID10868
Feature activation-2.32e+00

Pos logit contributions
endant+0.023
Sur+0.023
 fiber+0.020
als+0.020
 seeds+0.020
Neg logit contributions
lf-0.023
 rarely-0.023
é-0.023
 hallway-0.022
 tv-0.021
Token beautiful
Token ID4950
Feature activation-1.60e+00

Pos logit contributions
 rarely+0.002
lf+0.002
é+0.002
 hallway+0.002
 tv+0.002
Neg logit contributions
endant-0.001
 seeds-0.001
als-0.001
Sur-0.001
 da-0.001
Token plant
Token ID4618
Feature activation-1.20e+00

Pos logit contributions
endant+0.030
als+0.027
Sur+0.026
 seeds+0.024
elt+0.023
Neg logit contributions
lf-0.042
 rarely-0.041
 hallway-0.040
 tv-0.040
é-0.039
Token with
Token ID351
Feature activation-2.03e+00

Pos logit contributions
endant+0.025
Sur+0.022
als+0.020
 fiber+0.019
ossible+0.019
Neg logit contributions
lf-0.029
 rarely-0.028
é-0.027
 hallway-0.027
 especially-0.026
Token lots
Token ID6041
Feature activation-3.12e+00

Pos logit contributions
endant+0.047
Sur+0.041
als+0.041
ari+0.040
ossible+0.039
Neg logit contributions
 rarely-0.043
 especially-0.041
 hardly-0.038
 tv-0.038
lf-0.036
Token of
Token ID286
Feature activation-2.32e+00

Pos logit contributions
endant+0.090
ossible+0.089
Sur+0.083
als+0.082
Nic+0.081
Neg logit contributions
 especially-0.038
 rarely-0.037
lf-0.037
 tv-0.036
é-0.034
Token pretty
Token ID2495
Feature activation-4.73e+00

Pos logit contributions
endant+0.042
als+0.038
Sur+0.036
ossible+0.033
elt+0.032
Neg logit contributions
 rarely-0.061
lf-0.057
 tv-0.057
 especially-0.057
é-0.056
Token leaves
Token ID5667
Feature activation-5.87e-01

Pos logit contributions
endant+0.097
ossible+0.087
ocular+0.086
ari+0.083
Sur+0.082
Neg logit contributions
lf-0.103
 rarely-0.097
 tv-0.097
 hallway-0.093
 especially-0.093
Token.
Token ID13
Feature activation-5.40e-01

Pos logit contributions
Sur+0.014
endant+0.013
ossible+0.012
 fiber+0.012
elt+0.011
Neg logit contributions
lf-0.011
é-0.011
 rarely-0.011
 especially-0.011
 hallway-0.010
Token
Token ID198
Feature activation-6.84e-01

Pos logit contributions
endant+0.013
als+0.012
 seeds+0.011
Sur+0.011
 half+0.010
Neg logit contributions
lf-0.012
 rarely-0.012
é-0.011
 hallway-0.011
 tv-0.011
Token
Token ID198
Feature activation-1.66e-03

Pos logit contributions
endant+0.006
Sur+0.006
ossible+0.005
of+0.005
 seeds+0.004
Neg logit contributions
 rarely-0.024
lf-0.023
 seeker-0.023
 tv-0.023
é-0.023
TokenThe
Token ID464
Feature activation+1.15e+00

Pos logit contributions
Sur+0.000
endant+0.000
 seeds+0.000
als+0.000
 half+0.000
Neg logit contributions
é-0.000
 rarely-0.000
lf-0.000
 seeker-0.000
 hallway-0.000
Token was
Token ID373
Feature activation+5.25e-01

Pos logit contributions
endant+0.025
als+0.024
 seeds+0.023
Sur+0.022
 half+0.021
Neg logit contributions
 rarely-0.024
lf-0.023
é-0.023
 tv-0.022
 hallway-0.022
Token a
Token ID257
Feature activation-1.47e-01

Pos logit contributions
 rarely+0.016
lf+0.016
é+0.016
 hallway+0.016
 tv+0.015
Neg logit contributions
endant-0.007
 seeds-0.006
als-0.006
Sur-0.005
 half-0.005
Token tall
Token ID7331
Feature activation-7.87e-01

Pos logit contributions
endant+0.003
als+0.002
 seeds+0.002
Sur+0.002
 da+0.002
Neg logit contributions
 rarely-0.004
lf-0.004
é-0.004
 hallway-0.004
 tv-0.004
Token tree
Token ID5509
Feature activation+2.60e-01

Pos logit contributions
endant+0.014
als+0.013
 seeds+0.013
Sur+0.012
 da+0.012
Neg logit contributions
 rarely-0.021
é-0.020
lf-0.020
 tv-0.019
 hallway-0.019
Token with
Token ID351
Feature activation-2.68e+00

Pos logit contributions
 rarely+0.007
lf+0.007
é+0.006
 hallway+0.006
 tv+0.006
Neg logit contributions
endant-0.005
als-0.005
 seeds-0.004
Sur-0.004
 da-0.004
Token bright
Token ID6016
Feature activation-4.72e+00

Pos logit contributions
endant+0.059
ossible+0.049
als+0.048
Sur+0.047
 fiber+0.046
Neg logit contributions
 rarely-0.061
 hallway-0.058
lf-0.057
 especially-0.055
é-0.055
Token orange
Token ID10912
Feature activation-3.32e+00

Pos logit contributions
endant+0.096
ossible+0.088
Sur+0.084
of+0.084
 fiber+0.082
Neg logit contributions
lf-0.105
 hallway-0.103
é-0.099
 tv-0.099
 rarely-0.098
Token and
Token ID290
Feature activation-6.96e-01

Pos logit contributions
endant+0.061
ossible+0.055
of+0.050
Sur+0.050
 fiber+0.048
Neg logit contributions
lf-0.085
 hallway-0.083
 rarely-0.078
 tv-0.078
é-0.077
Token yellow
Token ID7872
Feature activation-3.69e+00

Pos logit contributions
endant+0.013
als+0.012
 seeds+0.011
Sur+0.011
ossible+0.010
Neg logit contributions
lf-0.018
 rarely-0.018
 hallway-0.017
é-0.017
 tv-0.017
Token leaves
Token ID5667
Feature activation+1.96e-01

Pos logit contributions
endant+0.068
ossible+0.066
of+0.057
prise+0.057
Sur+0.057
Neg logit contributions
 hallway-0.087
lf-0.084
 tv-0.080
 rarely-0.079
é-0.078
Token.
Token ID13
Feature activation-1.44e-01

Pos logit contributions
lf+0.004
 rarely+0.004
é+0.004
 hallway+0.004
 tv+0.004
Neg logit contributions
endant-0.004
Sur-0.004
als-0.004
 fiber-0.004
 seeds-0.004
Token flower
Token ID15061
Feature activation-2.12e+00

Pos logit contributions
endant+0.042
als+0.039
 seeds+0.037
Sur+0.036
 half+0.032
Neg logit contributions
lf-0.059
 rarely-0.059
é-0.056
 hallway-0.056
 tv-0.055
Token garden
Token ID11376
Feature activation-1.63e+00

Pos logit contributions
endant+0.038
Sur+0.036
als+0.034
 seeds+0.033
 fiber+0.032
Neg logit contributions
lf-0.055
 hallway-0.052
 rarely-0.052
é-0.051
 tv-0.049
Token.
Token ID13
Feature activation-9.00e-01

Pos logit contributions
endant+0.032
Sur+0.031
 fiber+0.029
als+0.028
 seeds+0.027
Neg logit contributions
lf-0.039
 hallway-0.038
é-0.037
 rarely-0.036
 tv-0.035
Token There
Token ID1318
Feature activation-1.24e+00

Pos logit contributions
endant+0.021
als+0.019
 seeds+0.019
Sur+0.018
 half+0.017
Neg logit contributions
 rarely-0.020
lf-0.019
é-0.019
 hallway-0.019
 tv-0.018
Token were
Token ID547
Feature activation-1.94e+00

Pos logit contributions
endant+0.029
als+0.026
 seeds+0.025
Sur+0.025
elt+0.023
Neg logit contributions
 rarely-0.027
lf-0.027
é-0.026
 hallway-0.025
 tv-0.025
Token pretty
Token ID2495
Feature activation-4.71e+00

Pos logit contributions
endant+0.040
als+0.034
Sur+0.032
 seeds+0.031
of+0.031
Neg logit contributions
 rarely-0.048
lf-0.045
 hallway-0.045
 especially-0.044
é-0.043
Token pink
Token ID11398
Feature activation-3.36e+00

Pos logit contributions
endant+0.086
ossible+0.083
Sur+0.082
forts+0.073
elt+0.073
Neg logit contributions
lf-0.113
 hallway-0.105
 rarely-0.103
é-0.101
 especially-0.101
Token roses
Token ID42152
Feature activation-4.55e-01

Pos logit contributions
endant+0.058
Sur+0.055
ossible+0.055
 fiber+0.048
of+0.047
Neg logit contributions
lf-0.086
 hallway-0.080
 rarely-0.079
é-0.078
 especially-0.077
Token and
Token ID290
Feature activation-2.84e-01

Pos logit contributions
endant+0.009
Sur+0.009
 fiber+0.008
ossible+0.008
 seeds+0.008
Neg logit contributions
lf-0.010
é-0.010
 hallway-0.010
 rarely-0.010
 especially-0.009
Token yellow
Token ID7872
Feature activation-3.62e+00

Pos logit contributions
endant+0.005
als+0.005
Sur+0.004
 seeds+0.004
elt+0.004
Neg logit contributions
 rarely-0.008
lf-0.008
é-0.007
 hallway-0.007
 tv-0.007
Token sun
Token ID4252
Feature activation-2.07e+00

Pos logit contributions
endant+0.067
Sur+0.065
ossible+0.064
off+0.057
 half+0.055
Neg logit contributions
 especially-0.078
 rarely-0.078
é-0.076
lf-0.076
 hallway-0.075
Token hall
Token ID6899
Feature activation-2.33e-01

Pos logit contributions
 rarely+0.007
é+0.006
lf+0.006
 tv+0.005
icia+0.005
Neg logit contributions
endant-0.010
als-0.009
 seeds-0.009
Sur-0.009
 da-0.009
Token together
Token ID1978
Feature activation-1.50e+00

Pos logit contributions
endant+0.004
als+0.004
 seeds+0.004
Sur+0.004
 half+0.003
Neg logit contributions
 rarely-0.006
lf-0.006
é-0.006
 hallway-0.006
 tv-0.006
Token and
Token ID290
Feature activation-1.41e+00

Pos logit contributions
endant+0.029
als+0.025
 seeds+0.024
Sur+0.024
 da+0.021
Neg logit contributions
lf-0.040
 rarely-0.040
é-0.038
 hallway-0.038
 tv-0.037
Token it
Token ID340
Feature activation-1.67e+00

Pos logit contributions
als+0.023
endant+0.023
 seeds+0.022
 half+0.021
Sur+0.020
Neg logit contributions
 rarely-0.038
é-0.038
lf-0.037
 tv-0.036
 hallway-0.036
Token looked
Token ID3114
Feature activation-1.41e+00

Pos logit contributions
endant+0.036
als+0.034
 seeds+0.033
Sur+0.032
 half+0.031
Neg logit contributions
 rarely-0.038
lf-0.037
é-0.036
 tv-0.036
 hallway-0.035
Token much
Token ID881
Feature activation-4.71e+00

Pos logit contributions
endant+0.026
als+0.024
 seeds+0.023
Sur+0.022
 half+0.020
Neg logit contributions
lf-0.037
 rarely-0.037
é-0.036
 hallway-0.035
 tv-0.035
Token better
Token ID1365
Feature activation-1.57e+00

Pos logit contributions
als+0.132
endant+0.129
lets+0.120
Sur+0.119
elt+0.116
Neg logit contributions
 especially-0.067
 rarely-0.063
 cartoons-0.062
 tv-0.058
lf-0.058
Token.
Token ID13
Feature activation-1.16e+00

Pos logit contributions
endant+0.036
Sur+0.033
als+0.032
 seeds+0.030
elt+0.028
Neg logit contributions
 rarely-0.035
lf-0.035
 especially-0.032
 seeker-0.032
 hallway-0.032
Token They
Token ID1119
Feature activation-1.61e+00

Pos logit contributions
endant+0.025
als+0.023
 seeds+0.023
Sur+0.022
 da+0.020
Neg logit contributions
 rarely-0.027
lf-0.026
é-0.025
 hallway-0.025
 tv-0.025
Token both
Token ID1111
Feature activation-1.71e+00

Pos logit contributions
endant+0.033
als+0.031
 seeds+0.030
Sur+0.028
 da+0.026
Neg logit contributions
 rarely-0.039
lf-0.039
é-0.037
 hallway-0.037
 tv-0.037
Token felt
Token ID2936
Feature activation-3.51e-01

Pos logit contributions
endant+0.039
als+0.036
 seeds+0.035
Sur+0.034
 bread+0.031
Neg logit contributions
 rarely-0.038
lf-0.038
é-0.036
 hallway-0.036
 tv-0.034
Token stood
Token ID6204
Feature activation-8.32e-01

Pos logit contributions
endant+0.058
als+0.051
Sur+0.051
 seeds+0.051
 fiber+0.046
Neg logit contributions
lf-0.057
 rarely-0.056
é-0.056
 hallway-0.053
 tv-0.051
Token side
Token ID1735
Feature activation-8.07e-01

Pos logit contributions
endant+0.019
als+0.018
 seeds+0.018
 da+0.017
Sur+0.016
Neg logit contributions
 rarely-0.019
lf-0.019
é-0.017
 hallway-0.017
 tv-0.016
Token by
Token ID416
Feature activation+6.74e-01

Pos logit contributions
endant+0.013
als+0.012
 seeds+0.012
Sur+0.011
 half+0.010
Neg logit contributions
 rarely-0.023
lf-0.023
é-0.021
 hallway-0.021
 tv-0.021
Token side
Token ID1735
Feature activation-9.01e-01

Pos logit contributions
 rarely+0.017
lf+0.017
é+0.016
 hallway+0.016
 tv+0.015
Neg logit contributions
endant-0.014
als-0.012
 seeds-0.012
Sur-0.012
 half-0.011
Token and
Token ID290
Feature activation-1.39e+00

Pos logit contributions
endant+0.015
als+0.013
 seeds+0.013
Sur+0.012
 half+0.011
Neg logit contributions
 rarely-0.026
lf-0.025
é-0.024
 hallway-0.024
 tv-0.024
Token pointed
Token ID6235
Feature activation-4.70e+00

Pos logit contributions
endant+0.028
als+0.026
 seeds+0.025
Sur+0.024
 da+0.022
Neg logit contributions
lf-0.034
 rarely-0.034
é-0.033
 hallway-0.032
 tv-0.032
Token up
Token ID510
Feature activation-3.71e+00

Pos logit contributions
endant+0.113
Sur+0.096
 fiber+0.096
 seeds+0.094
als+0.088
Neg logit contributions
é-0.094
 tv-0.093
 rarely-0.091
 especially-0.089
 anx-0.086
Token at
Token ID379
Feature activation-1.89e-01

Pos logit contributions
endant+0.084
Sur+0.071
 fiber+0.071
 seeds+0.070
als+0.069
Neg logit contributions
 rarely-0.081
é-0.081
 tv-0.077
 hallway-0.076
lf-0.076
Token the
Token ID262
Feature activation-8.89e-01

Pos logit contributions
endant+0.004
als+0.004
Sur+0.004
 seeds+0.004
elt+0.003
Neg logit contributions
 rarely-0.004
é-0.004
lf-0.004
 tv-0.004
 hallway-0.004
Token sky
Token ID6766
Feature activation-2.34e+00

Pos logit contributions
endant+0.020
 seeds+0.018
als+0.018
elt+0.018
Sur+0.018
Neg logit contributions
lf-0.019
 hallway-0.018
 rarely-0.018
é-0.017
 tv-0.017
Token.
Token ID13
Feature activation-8.76e-01

Pos logit contributions
endant+0.052
Sur+0.045
 seeds+0.044
 fiber+0.043
ossible+0.042
Neg logit contributions
lf-0.053
é-0.052
 rarely-0.051
 hallway-0.051
 tv-0.051
Token at
Token ID379
Feature activation+5.24e-01

Pos logit contributions
endant+0.053
als+0.049
 seeds+0.048
Sur+0.044
 da+0.043
Neg logit contributions
lf-0.059
 rarely-0.058
é-0.056
 hallway-0.055
 tv-0.053
Token Sarah
Token ID10490
Feature activation-1.23e+00

Pos logit contributions
 rarely+0.013
lf+0.013
é+0.012
 hallway+0.012
 tv+0.012
Neg logit contributions
endant-0.011
als-0.010
 seeds-0.009
Sur-0.009
 half-0.008
Token with
Token ID351
Feature activation-1.83e+00

Pos logit contributions
endant+0.023
als+0.021
Sur+0.020
 seeds+0.020
elt+0.017
Neg logit contributions
lf-0.032
 rarely-0.032
é-0.031
 hallway-0.031
 tv-0.030
Token its
Token ID663
Feature activation-1.05e+00

Pos logit contributions
endant+0.049
als+0.047
Sur+0.044
elt+0.043
land+0.043
Neg logit contributions
 rarely-0.033
 especially-0.028
 hardly-0.026
irl-0.026
 tv-0.026
Token be
Token ID307
Feature activation-1.53e-01

Pos logit contributions
als+0.022
endant+0.022
Sur+0.021
aw+0.019
of+0.019
Neg logit contributions
 rarely-0.024
lf-0.022
 hallway-0.022
 hardly-0.022
 especially-0.021
Tokenady
Token ID4597
Feature activation-4.70e+00

Pos logit contributions
als+0.003
endant+0.003
Sur+0.003
 fiber+0.003
 seeds+0.003
Neg logit contributions
lf-0.004
 rarely-0.003
 especially-0.003
 hallway-0.003
ined-0.003
Token eyes
Token ID2951
Feature activation-2.03e+00

Pos logit contributions
endant+0.068
Sur+0.062
 fiber+0.059
of+0.051
forts+0.050
Neg logit contributions
lf-0.124
 tv-0.123
 pretending-0.120
sey-0.120
é-0.117
Token.
Token ID13
Feature activation-2.49e-01

Pos logit contributions
endant+0.049
 fiber+0.047
Sur+0.046
ossible+0.045
 seeds+0.044
Neg logit contributions
lf-0.038
 especially-0.037
 rarely-0.036
 pretending-0.036
 hallway-0.036
Token
Token ID198
Feature activation-1.18e+00

Pos logit contributions
endant+0.005
als+0.005
Sur+0.005
 seeds+0.005
elt+0.004
Neg logit contributions
 rarely-0.006
lf-0.006
é-0.006
 hallway-0.005
 tv-0.005
Token
Token ID198
Feature activation-8.60e-01

Pos logit contributions
endant+0.015
Sur+0.014
als+0.013
 seeds+0.012
 half+0.011
Neg logit contributions
 rarely-0.037
lf-0.036
é-0.036
 tv-0.035
 hallway-0.035
TokenSarah
Token ID29284
Feature activation-3.14e-01

Pos logit contributions
endant+0.015
als+0.014
 seeds+0.013
Sur+0.012
 bread+0.011
Neg logit contributions
lf-0.024
 rarely-0.024
 hallway-0.023
é-0.023
 tv-0.022
Token and
Token ID290
Feature activation-1.47e+00

Pos logit contributions
endant+0.046
Sur+0.042
als+0.041
 fiber+0.039
 seeds+0.039
Neg logit contributions
 hallway-0.039
lf-0.039
 rarely-0.037
 tv-0.036
é-0.035
Token saw
Token ID2497
Feature activation-2.73e+00

Pos logit contributions
endant+0.028
als+0.025
 seeds+0.024
Sur+0.023
 half+0.021
Neg logit contributions
 rarely-0.038
lf-0.038
é-0.037
 hallway-0.036
 tv-0.036
Token some
Token ID617
Feature activation-3.01e+00

Pos logit contributions
endant+0.055
Sur+0.049
als+0.048
 seeds+0.046
 fiber+0.043
Neg logit contributions
 rarely-0.066
lf-0.066
é-0.064
 tv-0.063
 hallway-0.062
Token big
Token ID1263
Feature activation-3.72e+00

Pos logit contributions
endant+0.049
als+0.044
 seeds+0.041
Sur+0.040
 half+0.036
Neg logit contributions
 rarely-0.085
lf-0.085
é-0.082
 hallway-0.081
 tv-0.080
Token,
Token ID11
Feature activation-3.30e-01

Pos logit contributions
endant+0.069
Sur+0.061
als+0.060
 seeds+0.056
of+0.054
Neg logit contributions
lf-0.096
 rarely-0.093
 hallway-0.093
é-0.092
 tv-0.092
Token pretty
Token ID2495
Feature activation-4.69e+00

Pos logit contributions
endant+0.005
 seeds+0.004
als+0.004
 da+0.004
Sur+0.004
Neg logit contributions
 rarely-0.010
lf-0.010
é-0.010
loe-0.010
icia-0.009
Token butterflies
Token ID44571
Feature activation-1.28e+00

Pos logit contributions
endant+0.081
Sur+0.073
ossible+0.071
 fiber+0.068
forts+0.065
Neg logit contributions
lf-0.124
 hallway-0.119
 tv-0.117
é-0.116
 rarely-0.112
Token flying
Token ID7348
Feature activation-1.30e+00

Pos logit contributions
endant+0.028
Sur+0.026
 fiber+0.024
als+0.024
 seeds+0.023
Neg logit contributions
lf-0.029
é-0.028
 rarely-0.028
 hallway-0.028
 tv-0.026
Token around
Token ID1088
Feature activation+2.51e-02

Pos logit contributions
endant+0.038
Sur+0.034
als+0.034
 seeds+0.033
 fiber+0.032
Neg logit contributions
 rarely-0.020
lf-0.020
é-0.020
 hallway-0.019
 tv-0.018
Token.
Token ID13
Feature activation-9.80e-01

Pos logit contributions
 rarely+0.001
lf+0.001
é+0.001
 hallway+0.001
 tv+0.001
Neg logit contributions
endant-0.000
als-0.000
 seeds-0.000
Sur-0.000
 da-0.000
Token She
Token ID1375
Feature activation-5.81e-01

Pos logit contributions
endant+0.023
 seeds+0.020
als+0.020
Sur+0.019
 bread+0.018
Neg logit contributions
 rarely-0.022
lf-0.021
é-0.021
 hallway-0.021
 tv-0.020
Token.
Token ID13
Feature activation+8.48e-02

Pos logit contributions
endant+0.011
Sur+0.010
 seeds+0.009
als+0.009
 fiber+0.009
Neg logit contributions
é-0.009
lf-0.009
 rarely-0.009
 tv-0.009
 especially-0.009
Token She
Token ID1375
Feature activation-9.06e-01

Pos logit contributions
lf+0.002
 rarely+0.002
é+0.002
 hallway+0.002
 tv+0.002
Neg logit contributions
endant-0.002
als-0.001
 seeds-0.001
Sur-0.001
 half-0.001
Token thought
Token ID1807
Feature activation+1.16e-01

Pos logit contributions
endant+0.019
als+0.018
 seeds+0.017
Sur+0.016
 half+0.015
Neg logit contributions
 rarely-0.022
lf-0.021
é-0.021
 tv-0.021
 hallway-0.020
Token they
Token ID484
Feature activation-6.40e-01

Pos logit contributions
 rarely+0.003
 hallway+0.003
é+0.003
lf+0.003
 seeker+0.003
Neg logit contributions
endant-0.002
als-0.002
 seeds-0.002
Sur-0.002
 da-0.002
Token looked
Token ID3114
Feature activation-1.75e+00

Pos logit contributions
endant+0.013
als+0.012
 seeds+0.012
Sur+0.011
 da+0.010
Neg logit contributions
 rarely-0.016
lf-0.016
é-0.015
 hallway-0.015
 tv-0.014
Token much
Token ID881
Feature activation-4.67e+00

Pos logit contributions
endant+0.041
als+0.038
 seeds+0.036
Sur+0.036
elt+0.034
Neg logit contributions
 rarely-0.038
lf-0.035
 tv-0.034
 especially-0.034
é-0.033
Token better
Token ID1365
Feature activation-1.33e+00

Pos logit contributions
endant+0.118
als+0.118
of+0.109
upp+0.109
Sur+0.108
Neg logit contributions
 cartoons-0.074
 especially-0.072
 rarely-0.069
 drummer-0.067
 Chess-0.064
Token.
Token ID13
Feature activation-2.91e-01

Pos logit contributions
endant+0.027
als+0.024
Sur+0.023
 seeds+0.023
 da+0.021
Neg logit contributions
 rarely-0.033
lf-0.033
é-0.031
 hallway-0.030
 tv-0.030
Token
Token ID198
Feature activation-9.46e-01

Pos logit contributions
endant+0.006
als+0.005
 seeds+0.005
Sur+0.005
 da+0.004
Neg logit contributions
 rarely-0.007
lf-0.007
é-0.007
 hallway-0.007
 tv-0.007
Token
Token ID198
Feature activation-5.27e-01

Pos logit contributions
endant+0.016
als+0.014
 seeds+0.013
Sur+0.013
 half+0.011
Neg logit contributions
 rarely-0.027
lf-0.027
é-0.026
 hallway-0.026
 tv-0.025
TokenShe
Token ID3347
Feature activation-9.27e-01

Pos logit contributions
endant+0.012
als+0.011
 seeds+0.011
 da+0.009
 eyes+0.009
Neg logit contributions
lf-0.013
 rarely-0.012
 hallway-0.012
é-0.011
 tv-0.011
Token"
Token ID1
Feature activation-1.73e+00

Pos logit contributions
endant+0.004
als+0.004
Sur+0.004
 seeds+0.004
 half+0.003
Neg logit contributions
 rarely-0.008
lf-0.008
é-0.007
 hallway-0.007
 tv-0.007
TokenO
Token ID46
Feature activation+1.29e+00

Pos logit contributions
endant+0.041
als+0.036
 seeds+0.035
Sur+0.032
 eyes+0.031
Neg logit contributions
 rarely-0.038
lf-0.038
é-0.036
 hallway-0.036
 tv-0.035
Tokenuch
Token ID794
Feature activation-1.39e-01

Pos logit contributions
 rarely+0.031
lf+0.031
é+0.030
 hallway+0.029
 tv+0.029
Neg logit contributions
endant-0.027
als-0.025
 seeds-0.024
Sur-0.023
 da-0.021
Token!"
Token ID2474
Feature activation+8.31e-01

Pos logit contributions
endant+0.002
als+0.002
 seeds+0.002
Sur+0.002
 half+0.001
Neg logit contributions
 rarely-0.004
lf-0.004
 hallway-0.004
é-0.004
 tv-0.004
Token the
Token ID262
Feature activation+1.55e+00

Pos logit contributions
 rarely+0.021
lf+0.021
é+0.020
 hallway+0.020
 tv+0.019
Neg logit contributions
endant-0.016
als-0.015
 seeds-0.015
Sur-0.014
 bread-0.013
Token oct
Token ID19318
Feature activation+2.99e+00

Pos logit contributions
 rarely+0.036
lf+0.034
é+0.033
 hallway+0.032
 tv+0.032
Neg logit contributions
endant-0.034
als-0.033
 seeds-0.032
Sur-0.030
 da-0.029
Tokenopus
Token ID25790
Feature activation+8.24e-01

Pos logit contributions
 rarely+0.079
 hallway+0.074
 tv+0.071
lf+0.071
é+0.070
Neg logit contributions
endant-0.059
als-0.055
Sur-0.052
elt-0.047
of-0.047
Token said
Token ID531
Feature activation-7.39e-01

Pos logit contributions
 rarely+0.022
lf+0.022
é+0.021
 hallway+0.020
 tv+0.020
Neg logit contributions
endant-0.016
als-0.014
 seeds-0.013
Sur-0.013
 da-0.012
Token.
Token ID13
Feature activation+1.39e+00

Pos logit contributions
endant+0.011
als+0.010
 seeds+0.009
Sur+0.009
 half+0.008
Neg logit contributions
 rarely-0.022
lf-0.022
 hallway-0.022
é-0.021
 tv-0.021
Token It
Token ID632
Feature activation+5.21e-01

Pos logit contributions
 rarely+0.034
lf+0.034
é+0.033
 hallway+0.032
 tv+0.032
Neg logit contributions
endant-0.028
als-0.026
 seeds-0.025
Sur-0.024
 half-0.022
Token spoke
Token ID5158
Feature activation-2.18e+00

Pos logit contributions
lf+0.013
 rarely+0.013
é+0.013
 hallway+0.013
 tv+0.012
Neg logit contributions
endant-0.010
als-0.010
 seeds-0.009
Sur-0.009
 half-0.008
Token can
Token ID460
Feature activation+6.65e-01

Pos logit contributions
endant+0.007
als+0.006
 seeds+0.006
Sur+0.006
elt+0.006
Neg logit contributions
 rarely-0.009
lf-0.009
é-0.009
 hallway-0.008
 tv-0.008
Token't
Token ID470
Feature activation-6.60e-02

Pos logit contributions
lf+0.015
 hallway+0.015
 rarely+0.014
 tv+0.014
icia+0.014
Neg logit contributions
endant-0.014
als-0.013
of-0.013
 seeds-0.012
 half-0.012
Token play
Token ID711
Feature activation+3.31e+00

Pos logit contributions
endant+0.001
als+0.001
 seeds+0.001
Sur+0.001
of+0.001
Neg logit contributions
lf-0.002
 rarely-0.002
 hallway-0.002
 tv-0.002
é-0.001
Token in
Token ID287
Feature activation+1.61e+00

Pos logit contributions
 rarely+0.094
é+0.092
 hallway+0.091
icia+0.089
idget+0.088
Neg logit contributions
 seeds-0.050
endant-0.047
als-0.047
 nuts-0.046
Sur-0.044
Token the
Token ID262
Feature activation-1.93e-01

Pos logit contributions
 rarely+0.042
lf+0.042
é+0.040
 hallway+0.040
 tv+0.039
Neg logit contributions
endant-0.030
als-0.028
 seeds-0.027
Sur-0.026
 half-0.024
Token soil
Token ID9260
Feature activation+2.91e+00

Pos logit contributions
endant+0.004
als+0.004
Sur+0.004
of+0.003
lets+0.003
Neg logit contributions
 hallway-0.005
 rarely-0.004
lf-0.004
 tv-0.004
é-0.004
Token!"
Token ID2474
Feature activation+4.31e-01

Pos logit contributions
 rarely+0.085
lf+0.084
é+0.081
 hallway+0.080
 tv+0.079
Neg logit contributions
endant-0.043
als-0.043
 seeds-0.040
 enough-0.036
 half-0.035
Token She
Token ID1375
Feature activation-3.64e-01

Pos logit contributions
lf+0.012
 rarely+0.012
 hallway+0.012
é+0.012
 tv+0.011
Neg logit contributions
endant-0.007
als-0.007
Sur-0.006
 seeds-0.006
elt-0.005
Token lifted
Token ID13663
Feature activation+2.93e-01

Pos logit contributions
endant+0.007
als+0.007
 seeds+0.006
Sur+0.006
 da+0.006
Neg logit contributions
 rarely-0.009
lf-0.009
é-0.009
 hallway-0.009
 tv-0.009
Token the
Token ID262
Feature activation-1.15e-01

Pos logit contributions
lf+0.009
 rarely+0.008
 hallway+0.008
é+0.007
 tv+0.007
Neg logit contributions
als-0.005
 seeds-0.004
endant-0.004
 half-0.004
 enough-0.004
Token cat
Token ID3797
Feature activation-8.83e-01

Pos logit contributions
endant+0.002
als+0.002
Sur+0.002
 seeds+0.002
of+0.002
Neg logit contributions
 hallway-0.003
lf-0.003
 rarely-0.003
é-0.003
 tv-0.003
Token play
Token ID711
Feature activation+2.17e+00

Pos logit contributions
 rarely+0.000
lf+0.000
é+0.000
 hallway+0.000
 tv+0.000
Neg logit contributions
endant-0.000
als-0.000
 seeds-0.000
Sur-0.000
 da-0.000
Token with
Token ID351
Feature activation+4.76e-01

Pos logit contributions
 rarely+0.059
é+0.056
lf+0.056
 hallway+0.056
icia+0.053
Neg logit contributions
endant-0.037
 seeds-0.036
als-0.036
Sur-0.034
 half-0.033
Token the
Token ID262
Feature activation+4.85e-03

Pos logit contributions
lf+0.013
 rarely+0.013
é+0.012
 tv+0.012
 hallway+0.012
Neg logit contributions
endant-0.009
als-0.008
 seeds-0.007
Sur-0.007
elt-0.007
Token ball
Token ID2613
Feature activation+1.04e+00

Pos logit contributions
lf+0.000
 rarely+0.000
é+0.000
 hallway+0.000
 tv+0.000
Neg logit contributions
endant-0.000
 seeds-0.000
als-0.000
Sur-0.000
 bread-0.000
Token,
Token ID11
Feature activation+1.37e+00

Pos logit contributions
 rarely+0.032
lf+0.032
é+0.030
 hallway+0.030
 tv+0.030
Neg logit contributions
endant-0.015
als-0.014
 seeds-0.013
Sur-0.011
 half-0.011
Token but
Token ID475
Feature activation+3.03e+00

Pos logit contributions
lf+0.033
 rarely+0.032
é+0.032
 hallway+0.031
icia+0.030
Neg logit contributions
endant-0.028
als-0.026
 seeds-0.026
Sur-0.024
 da-0.024
Token I
Token ID314
Feature activation+7.68e-01

Pos logit contributions
 hallway+0.090
lf+0.090
 rarely+0.090
é+0.089
 tv+0.087
Neg logit contributions
endant-0.042
als-0.039
 seeds-0.039
 half-0.034
Sur-0.032
Token'm
Token ID1101
Feature activation+2.22e+00

Pos logit contributions
lf+0.022
 hallway+0.020
 tv+0.020
é+0.020
 rarely+0.020
Neg logit contributions
endant-0.014
als-0.012
 seeds-0.012
 half-0.011
Sur-0.011
Token too
Token ID1165
Feature activation-2.24e-01

Pos logit contributions
lf+0.056
 hallway+0.054
 rarely+0.052
 tv+0.051
é+0.051
Neg logit contributions
endant-0.044
 seeds-0.039
 enough-0.037
als-0.037
 half-0.036
Token shy
Token ID15800
Feature activation-1.43e+00

Pos logit contributions
endant+0.005
 seeds+0.005
als+0.005
Sur+0.005
 half+0.004
Neg logit contributions
lf-0.005
 rarely-0.005
é-0.005
 hallway-0.004
 tv-0.004
Token to
Token ID284
Feature activation+5.40e-01

Pos logit contributions
endant+0.021
als+0.019
 seeds+0.018
Sur+0.018
 da+0.015
Neg logit contributions
 rarely-0.043
lf-0.042
é-0.041
 tv-0.041
 hallway-0.041
Token showing
Token ID4478
Feature activation-1.84e+00

Pos logit contributions
 rarely+0.018
lf+0.018
 hallway+0.018
é+0.017
 tv+0.017
Neg logit contributions
endant-0.016
 seeds-0.014
 da-0.014
als-0.014
Sur-0.013
Token Max
Token ID5436
Feature activation-6.37e-01

Pos logit contributions
endant+0.036
 seeds+0.030
als+0.030
Sur+0.028
 da+0.026
Neg logit contributions
 tv-0.047
é-0.046
 rarely-0.046
lf-0.046
 Chess-0.044
Token her
Token ID607
Feature activation+9.55e-02

Pos logit contributions
endant+0.012
als+0.011
Sur+0.010
 seeds+0.010
elt+0.009
Neg logit contributions
lf-0.016
 rarely-0.016
é-0.016
 tv-0.016
 hallway-0.015
Token toys
Token ID14958
Feature activation+7.81e-01

Pos logit contributions
lf+0.002
 rarely+0.002
é+0.002
 hallway+0.002
 tv+0.002
Neg logit contributions
endant-0.002
als-0.002
Sur-0.002
 seeds-0.002
elt-0.002
Token,
Token ID11
Feature activation+2.16e+00

Pos logit contributions
lf+0.018
 rarely+0.017
é+0.017
 especially+0.016
 hallway+0.016
Neg logit contributions
endant-0.016
als-0.016
Sur-0.015
 seeds-0.014
 da-0.013
Token but
Token ID475
Feature activation+3.00e+00

Pos logit contributions
é+0.052
 rarely+0.051
 hallway+0.050
lf+0.050
icia+0.048
Neg logit contributions
endant-0.042
 seeds-0.041
als-0.039
 bread-0.038
 eyes-0.038
Token she
Token ID673
Feature activation+1.06e+00

Pos logit contributions
 rarely+0.081
é+0.081
lf+0.079
 hallway+0.079
 tv+0.076
Neg logit contributions
endant-0.051
 seeds-0.047
als-0.047
 half-0.042
Sur-0.042
Token did
Token ID750
Feature activation+1.47e+00

Pos logit contributions
 hallway+0.027
 rarely+0.027
 tv+0.027
lf+0.027
é+0.026
Neg logit contributions
endant-0.020
als-0.018
 seeds-0.018
 half-0.015
Sur-0.015
Token not
Token ID407
Feature activation-1.01e+00

Pos logit contributions
 rarely+0.027
lf+0.027
é+0.026
 hallway+0.026
 tv+0.025
Neg logit contributions
endant-0.039
 seeds-0.036
als-0.036
Sur-0.034
 half-0.034
Token let
Token ID1309
Feature activation+1.48e+00

Pos logit contributions
endant+0.025
als+0.023
 seeds+0.021
Sur+0.021
elt+0.020
Neg logit contributions
 rarely-0.022
lf-0.021
é-0.020
 tv-0.020
 hallway-0.019
Token him
Token ID683
Feature activation+5.11e-02

Pos logit contributions
 rarely+0.041
lf+0.041
 hallway+0.040
é+0.039
 tv+0.037
Neg logit contributions
als-0.023
endant-0.023
 seeds-0.022
Sur-0.021
 half-0.020
Token the
Token ID262
Feature activation+7.21e-01

Pos logit contributions
 rarely+0.018
lf+0.018
é+0.017
 hallway+0.017
 tv+0.016
Neg logit contributions
endant-0.016
als-0.015
 seeds-0.015
Sur-0.014
 da-0.013
Token orange
Token ID10912
Feature activation+9.92e-01

Pos logit contributions
lf+0.020
 hallway+0.020
 rarely+0.020
 tv+0.019
é+0.019
Neg logit contributions
endant-0.012
als-0.011
Sur-0.010
 seeds-0.010
elt-0.009
Token bag
Token ID6131
Feature activation+2.16e+00

Pos logit contributions
lf+0.026
 rarely+0.025
é+0.024
 hallway+0.024
 tv+0.024
Neg logit contributions
endant-0.019
als-0.017
 seeds-0.016
Sur-0.016
elt-0.014
Token.
Token ID13
Feature activation+1.05e+00

Pos logit contributions
 rarely+0.063
lf+0.060
é+0.059
 tv+0.058
 hallway+0.058
Neg logit contributions
endant-0.034
als-0.033
 seeds-0.030
 half-0.030
Sur-0.028
Token
Token ID198
Feature activation+5.22e-01

Pos logit contributions
 rarely+0.024
lf+0.024
é+0.023
 hallway+0.022
 seeker+0.021
Neg logit contributions
endant-0.023
als-0.022
Sur-0.021
 seeds-0.020
elt-0.019
TokenT
Token ID51
Feature activation+3.84e+00

Pos logit contributions
 rarely+0.015
é+0.015
lf+0.014
 hallway+0.014
 seeker+0.014
Neg logit contributions
Sur-0.008
endant-0.008
als-0.007
ossible-0.007
of-0.007
Tokeneddy
Token ID21874
Feature activation+1.78e+00

Pos logit contributions
 rarely+0.097
 hallway+0.093
lf+0.090
 tv+0.087
é+0.085
Neg logit contributions
endant-0.080
Sur-0.071
als-0.070
 half-0.067
 seeds-0.067
Token felt
Token ID2936
Feature activation+2.00e+00

Pos logit contributions
 hallway+0.044
 tv+0.044
 rarely+0.044
é+0.042
lf+0.042
Neg logit contributions
als-0.034
 seeds-0.032
endant-0.032
 half-0.030
 da-0.029
Token so
Token ID523
Feature activation+6.03e-01

Pos logit contributions
lf+0.039
 rarely+0.038
é+0.036
 hallway+0.036
 tv+0.034
Neg logit contributions
endant-0.052
als-0.048
 seeds-0.048
Sur-0.046
 half-0.044
Token sad
Token ID6507
Feature activation-2.63e-01

Pos logit contributions
 rarely+0.008
lf+0.008
é+0.007
 hallway+0.007
 tv+0.007
Neg logit contributions
endant-0.019
als-0.019
 seeds-0.018
Sur-0.017
 da-0.017
Token and
Token ID290
Feature activation+1.40e+00

Pos logit contributions
endant+0.004
als+0.004
 seeds+0.004
Sur+0.003
 da+0.003
Neg logit contributions
 rarely-0.008
lf-0.008
é-0.007
 hallway-0.007
 tv-0.007
Token escape
Token ID6654
Feature activation+5.57e-01

Pos logit contributions
lf+0.015
 rarely+0.015
é+0.014
 hallway+0.014
 tv+0.013
Neg logit contributions
endant-0.018
als-0.017
 seeds-0.016
Sur-0.016
 half-0.015
Token!
Token ID0
Feature activation-7.72e-01

Pos logit contributions
 rarely+0.016
lf+0.016
é+0.015
 hallway+0.015
 tv+0.015
Neg logit contributions
endant-0.009
als-0.008
 seeds-0.008
Sur-0.008
 da-0.007
Token<|endoftext|>
Token ID50256
Feature activation-2.55e-01

Pos logit contributions
endant+0.016
als+0.014
 seeds+0.014
Sur+0.013
 da+0.012
Neg logit contributions
 rarely-0.019
lf-0.019
é-0.019
 hallway-0.018
 tv-0.018
TokenOnce
Token ID7454
Feature activation+3.78e-01

Pos logit contributions
endant+0.003
als+0.003
Sur+0.003
 seeds+0.003
 half+0.002
Neg logit contributions
 rarely-0.008
lf-0.008
é-0.008
 tv-0.008
 hallway-0.008
Token upon
Token ID2402
Feature activation+4.52e-01

Pos logit contributions
 rarely+0.010
lf+0.010
 hallway+0.010
 tv+0.010
é+0.010
Neg logit contributions
endant-0.007
als-0.006
 seeds-0.005
Sur-0.005
 half-0.005
Token a
Token ID257
Feature activation+6.55e-01

Pos logit contributions
 rarely+0.014
lf+0.014
é+0.014
 hallway+0.013
 tv+0.013
Neg logit contributions
endant-0.006
als-0.005
 seeds-0.005
Sur-0.005
 half-0.004
Token time
Token ID640
Feature activation+4.08e-01

Pos logit contributions
lf+0.013
 rarely+0.012
é+0.012
 tv+0.012
 hallway+0.012
Neg logit contributions
endant-0.016
als-0.015
 seeds-0.015
Sur-0.015
 half-0.014
Token,
Token ID11
Feature activation+1.15e+00

Pos logit contributions
lf+0.011
 rarely+0.011
é+0.011
 hallway+0.011
 tv+0.010
Neg logit contributions
endant-0.007
als-0.007
 seeds-0.006
Sur-0.006
 bread-0.005
Token there
Token ID612
Feature activation+3.71e-01

Pos logit contributions
lf+0.038
 rarely+0.038
 tv+0.037
 hallway+0.036
é+0.036
Neg logit contributions
endant-0.013
 half-0.011
als-0.011
Sur-0.010
 seeds-0.010
Token was
Token ID373
Feature activation+4.42e-01

Pos logit contributions
 rarely+0.010
 tv+0.010
lf+0.009
é+0.009
 drummer+0.009
Neg logit contributions
endant-0.006
 half-0.006
als-0.006
Sur-0.006
 seeds-0.005
Token a
Token ID257
Feature activation+1.75e+00

Pos logit contributions
lf+0.015
 rarely+0.015
é+0.014
 hallway+0.014
 tv+0.014
Neg logit contributions
endant-0.005
 seeds-0.005
als-0.004
Sur-0.004
 half-0.004
Tokenily
Token ID813
Feature activation+5.20e-01

Pos logit contributions
 rarely+0.078
 hallway+0.075
lf+0.074
 tv+0.074
é+0.073
Neg logit contributions
endant-0.066
Sur-0.057
 seeds-0.056
als-0.055
 half-0.055
Token was
Token ID373
Feature activation+1.38e+00

Pos logit contributions
 rarely+0.014
lf+0.013
 tv+0.013
é+0.013
 hallway+0.013
Neg logit contributions
endant-0.009
als-0.009
 seeds-0.008
Sur-0.008
 half-0.008
Token so
Token ID523
Feature activation-5.33e-01

Pos logit contributions
lf+0.040
é+0.039
 tv+0.039
 hallway+0.039
loe+0.037
Neg logit contributions
endant-0.020
 half-0.019
 enough-0.019
Sur-0.017
 seeds-0.017
Token happy
Token ID3772
Feature activation-1.92e+00

Pos logit contributions
endant+0.012
Sur+0.011
 seeds+0.011
als+0.011
 half+0.010
Neg logit contributions
lf-0.012
 rarely-0.012
 hallway-0.012
é-0.012
 tv-0.011
Token that
Token ID326
Feature activation+8.49e-01

Pos logit contributions
endant+0.035
als+0.031
 seeds+0.029
Sur+0.028
 half+0.025
Neg logit contributions
lf-0.053
 rarely-0.052
é-0.050
 hallway-0.049
 tv-0.049
Token her
Token ID607
Feature activation+1.82e+00

Pos logit contributions
é+0.028
oving+0.027
 rarely+0.027
lf+0.027
 hallway+0.027
Neg logit contributions
 half-0.008
 seeds-0.008
als-0.008
endant-0.008
 enough-0.008
Token bag
Token ID6131
Feature activation+8.20e-01

Pos logit contributions
 rarely+0.042
lf+0.041
é+0.040
 hallway+0.039
 tv+0.039
Neg logit contributions
endant-0.041
 seeds-0.037
als-0.037
Sur-0.034
 half-0.033
Token was
Token ID373
Feature activation+1.54e+00

Pos logit contributions
 rarely+0.019
lf+0.018
 tv+0.018
é+0.018
 hallway+0.017
Neg logit contributions
endant-0.018
als-0.017
 seeds-0.016
Sur-0.015
 half-0.015
Token fixed
Token ID5969
Feature activation+1.33e+00

Pos logit contributions
 rarely+0.034
lf+0.033
é+0.031
 hallway+0.031
 tv+0.030
Neg logit contributions
endant-0.036
als-0.035
 seeds-0.032
Sur-0.032
 da-0.030
Token.
Token ID13
Feature activation-1.73e-01

Pos logit contributions
 rarely+0.046
lf+0.044
 hallway+0.044
é+0.044
 tv+0.043
Neg logit contributions
 enough-0.012
als-0.012
 seeds-0.012
endant-0.011
 half-0.011
Token She
Token ID1375
Feature activation-3.33e-01

Pos logit contributions
endant+0.003
als+0.003
Sur+0.003
 seeds+0.003
elt+0.002
Neg logit contributions
lf-0.005
 rarely-0.005
é-0.005
 hallway-0.004
 tv-0.004
Token get
Token ID651
Feature activation-8.47e-01

Pos logit contributions
 rarely+0.008
lf+0.008
é+0.008
 hallway+0.008
 tv+0.008
Neg logit contributions
endant-0.007
 seeds-0.006
als-0.006
 half-0.006
 bread-0.005
Token burned
Token ID11544
Feature activation+4.33e-01

Pos logit contributions
endant+0.020
als+0.019
Sur+0.018
of+0.016
 seeds+0.016
Neg logit contributions
 rarely-0.018
é-0.017
 tv-0.017
lf-0.017
 especially-0.016
Token."
Token ID526
Feature activation+6.32e-01

Pos logit contributions
 rarely+0.012
lf+0.012
é+0.012
 hallway+0.012
 tv+0.011
Neg logit contributions
endant-0.007
als-0.007
 seeds-0.006
Sur-0.006
 half-0.006
Token
Token ID220
Feature activation+1.38e+00

Pos logit contributions
 rarely+0.016
lf+0.016
é+0.016
 hallway+0.015
 tv+0.015
Neg logit contributions
endant-0.012
als-0.011
 seeds-0.010
Sur-0.010
 half-0.009
Token
Token ID198
Feature activation+2.18e-01

Pos logit contributions
lf+0.043
 rarely+0.043
 hallway+0.042
é+0.041
 tv+0.041
Neg logit contributions
endant-0.019
als-0.016
Sur-0.015
 seeds-0.015
 half-0.014
Token
Token ID198
Feature activation+4.16e-01

Pos logit contributions
 rarely+0.007
lf+0.007
é+0.006
 hallway+0.006
 tv+0.006
Neg logit contributions
endant-0.003
Sur-0.003
als-0.003
 seeds-0.003
 half-0.002
TokenAfter
Token ID3260
Feature activation+6.43e-01

Pos logit contributions
é+0.010
 rarely+0.010
 tv+0.010
icia+0.009
 hallway+0.009
Neg logit contributions
endant-0.008
Sur-0.008
 seeds-0.007
 half-0.007
als-0.007
Token lunch
Token ID9965
Feature activation+7.60e-01

Pos logit contributions
lf+0.016
 rarely+0.016
 tv+0.015
 hallway+0.015
é+0.015
Neg logit contributions
endant-0.013
als-0.012
 seeds-0.011
Sur-0.011
of-0.010
Token,
Token ID11
Feature activation+1.63e+00

Pos logit contributions
 rarely+0.024
lf+0.023
é+0.023
 hallway+0.022
 tv+0.022
Neg logit contributions
endant-0.009
als-0.009
Sur-0.009
 seeds-0.008
 half-0.007
Token Tim
Token ID5045
Feature activation+2.01e+00

Pos logit contributions
é+0.041
icia+0.041
 hallway+0.041
 rarely+0.040
loe+0.039
Neg logit contributions
 seeds-0.029
 half-0.027
als-0.027
endant-0.027
 nuts-0.026
Tokenmy
Token ID1820
Feature activation-7.89e-02

Pos logit contributions
 rarely+0.057
lf+0.055
 tv+0.054
é+0.054
 hallway+0.053
Neg logit contributions
endant-0.033
 half-0.031
 seeds-0.029
als-0.029
ossible-0.028
Token it
Token ID340
Feature activation+9.78e-01

Pos logit contributions
 rarely+0.062
lf+0.060
é+0.059
 hallway+0.059
 seeker+0.056
Neg logit contributions
endant-0.025
als-0.025
 seeds-0.024
 half-0.021
Sur-0.020
Token.
Token ID13
Feature activation+3.12e-01

Pos logit contributions
 rarely+0.028
lf+0.028
é+0.027
 hallway+0.026
 tv+0.026
Neg logit contributions
endant-0.016
als-0.015
 seeds-0.014
Sur-0.013
 da-0.012
Token James
Token ID3700
Feature activation+1.79e+00

Pos logit contributions
lf+0.009
 rarely+0.008
 hallway+0.008
é+0.008
 tv+0.008
Neg logit contributions
endant-0.005
als-0.005
Sur-0.005
 seeds-0.004
 half-0.004
Token was
Token ID373
Feature activation+2.43e+00

Pos logit contributions
lf+0.046
 tv+0.045
é+0.044
 rarely+0.044
 hallway+0.044
Neg logit contributions
als-0.031
endant-0.031
 seeds-0.028
Sur-0.028
 half-0.027
Token so
Token ID523
Feature activation+3.82e-01

Pos logit contributions
lf+0.058
é+0.054
 tv+0.054
 hallway+0.053
 rarely+0.053
Neg logit contributions
endant-0.051
Sur-0.044
 half-0.044
 enough-0.044
 seeds-0.044
Token disappointed
Token ID11679
Feature activation+1.45e-01

Pos logit contributions
lf+0.008
 hallway+0.007
 tv+0.007
é+0.007
 rarely+0.007
Neg logit contributions
endant-0.010
Sur-0.009
als-0.008
 seeds-0.008
 half-0.008
Token because
Token ID780
Feature activation+3.52e+00

Pos logit contributions
lf+0.004
 rarely+0.004
é+0.004
 hallway+0.004
 tv+0.004
Neg logit contributions
endant-0.002
als-0.002
 seeds-0.002
Sur-0.002
 half-0.002
Token he
Token ID339
Feature activation+2.23e+00

Pos logit contributions
icia+0.107
idget+0.105
é+0.104
oving+0.102
ussy+0.100
Neg logit contributions
endant-0.041
 half-0.040
 enough-0.038
 seeds-0.036
 eyes-0.036
Token could
Token ID714
Feature activation+2.43e+00

Pos logit contributions
 tv+0.059
é+0.057
 hallway+0.056
 Chess+0.056
 seeker+0.055
Neg logit contributions
endant-0.038
als-0.035
Sur-0.034
 seeds-0.031
s-0.029
Token not
Token ID407
Feature activation+9.85e-01

Pos logit contributions
é+0.056
 hallway+0.055
lf+0.055
 tv+0.055
icia+0.053
Neg logit contributions
endant-0.046
 half-0.042
 seeds-0.040
 respect-0.040
ossible-0.039
Token watch
Token ID2342
Feature activation+3.38e+00

Pos logit contributions
lf+0.023
é+0.023
 rarely+0.023
 hallway+0.023
 tv+0.023
Neg logit contributions
endant-0.019
als-0.017
 seeds-0.017
Sur-0.016
 half-0.016
Token wash
Token ID13502
Feature activation+1.69e+00

Pos logit contributions
lf+0.006
é+0.006
 rarely+0.006
icia+0.005
 hallway+0.005
Neg logit contributions
endant-0.004
 seeds-0.004
 half-0.004
als-0.004
 thumbs-0.004
Token it
Token ID340
Feature activation-8.15e-01

Pos logit contributions
 rarely+0.048
lf+0.047
 hallway+0.046
é+0.045
 seeker+0.044
Neg logit contributions
 seeds-0.028
als-0.027
endant-0.025
 half-0.025
 bread-0.023
Token.
Token ID13
Feature activation-7.31e-01

Pos logit contributions
endant+0.019
als+0.017
 seeds+0.017
Sur+0.016
 half+0.015
Neg logit contributions
 rarely-0.018
lf-0.018
é-0.017
 hallway-0.017
 tv-0.016
Token Day
Token ID3596
Feature activation-7.19e-01

Pos logit contributions
endant+0.021
als+0.019
 seeds+0.018
Sur+0.018
 da+0.017
Neg logit contributions
 rarely-0.013
é-0.012
 tv-0.012
lf-0.012
 hallway-0.012
Token after
Token ID706
Feature activation+2.80e+00

Pos logit contributions
endant+0.019
als+0.016
 seeds+0.015
Sur+0.014
 fiber+0.014
Neg logit contributions
 rarely-0.015
lf-0.014
é-0.014
 especially-0.013
 hallway-0.013
Token day
Token ID1110
Feature activation+2.19e-01

Pos logit contributions
 rarely+0.082
icia+0.079
idget+0.079
é+0.079
lf+0.078
Neg logit contributions
endant-0.038
 seeds-0.038
 half-0.034
Sur-0.033
 enough-0.032
Token,
Token ID11
Feature activation+1.44e+00

Pos logit contributions
 rarely+0.006
lf+0.006
é+0.006
 hallway+0.006
 tv+0.006
Neg logit contributions
endant-0.004
als-0.003
 seeds-0.003
Sur-0.003
 da-0.003
Token Joey
Token ID26154
Feature activation+1.20e+00

Pos logit contributions
 rarely+0.030
lf+0.030
é+0.028
 hallway+0.028
 tv+0.027
Neg logit contributions
endant-0.035
als-0.033
 seeds-0.032
Sur-0.031
 da-0.029
Token kept
Token ID4030
Feature activation+9.24e-01

Pos logit contributions
 tv+0.034
lf+0.034
 rarely+0.033
loe+0.033
é+0.033
Neg logit contributions
als-0.018
 seeds-0.017
endant-0.016
 da-0.014
 enough-0.014
Token washing
Token ID20518
Feature activation+2.84e+00

Pos logit contributions
 rarely+0.021
lf+0.021
é+0.021
 hallway+0.020
loe+0.019
Neg logit contributions
endant-0.019
 seeds-0.018
 half-0.017
 enough-0.017
Sur-0.017
Token the
Token ID262
Feature activation+1.23e+00

Pos logit contributions
 rarely+0.086
lf+0.082
é+0.081
 hallway+0.079
 seeker+0.079
Neg logit contributions
 seeds-0.040
als-0.038
 enough-0.036
 luck-0.035
 half-0.034
Token to
Token ID284
Feature activation+1.11e-02

Pos logit contributions
lf+0.013
 rarely+0.013
icia+0.012
 hallway+0.012
loe+0.012
Neg logit contributions
endant-0.005
 seeds-0.005
elt-0.005
 da-0.005
als-0.005
Token listen
Token ID6004
Feature activation-1.99e+00

Pos logit contributions
 rarely+0.000
lf+0.000
é+0.000
 hallway+0.000
 tv+0.000
Neg logit contributions
endant-0.000
als-0.000
 seeds-0.000
Sur-0.000
 half-0.000
Token,
Token ID11
Feature activation+1.30e-01

Pos logit contributions
endant+0.036
als+0.032
 seeds+0.030
Sur+0.029
 da+0.025
Neg logit contributions
lf-0.055
 rarely-0.055
é-0.053
 tv-0.052
 hallway-0.052
Token and
Token ID290
Feature activation-7.63e-01

Pos logit contributions
lf+0.003
 rarely+0.003
é+0.003
 hallway+0.003
 tv+0.003
Neg logit contributions
endant-0.003
als-0.003
 seeds-0.003
Sur-0.003
 da-0.002
Token sure
Token ID1654
Feature activation-2.06e+00

Pos logit contributions
endant+0.016
als+0.014
 seeds+0.014
Sur+0.013
 half+0.012
Neg logit contributions
 rarely-0.019
lf-0.019
é-0.018
 hallway-0.018
 tv-0.017
Token enough
Token ID1576
Feature activation-1.73e+00

Pos logit contributions
endant+0.042
als+0.038
 fiber+0.037
Sur+0.036
elt+0.035
Neg logit contributions
lf-0.054
 rarely-0.051
é-0.044
 Maxim-0.043
 drummer-0.043
Token,
Token ID11
Feature activation+1.49e+00

Pos logit contributions
endant+0.039
als+0.035
Sur+0.034
 seeds+0.032
elt+0.031
Neg logit contributions
lf-0.042
 rarely-0.040
 tv-0.037
 hallway-0.037
é-0.036
Token there
Token ID612
Feature activation-8.89e-02

Pos logit contributions
lf+0.038
 rarely+0.038
é+0.038
 hallway+0.037
loe+0.036
Neg logit contributions
endant-0.027
als-0.026
 seeds-0.026
Sur-0.024
 da-0.023
Token it
Token ID340
Feature activation-8.70e-01

Pos logit contributions
endant+0.002
als+0.002
 seeds+0.002
Sur+0.002
 half+0.002
Neg logit contributions
 rarely-0.002
lf-0.002
é-0.002
 hallway-0.002
 tv-0.002
Token was
Token ID373
Feature activation+1.63e-01

Pos logit contributions
endant+0.021
als+0.019
 seeds+0.019
Sur+0.018
 da+0.017
Neg logit contributions
 rarely-0.019
lf-0.018
é-0.018
 hallway-0.017
 tv-0.017
Token again
Token ID757
Feature activation-1.13e+00

Pos logit contributions
lf+0.005
 rarely+0.005
é+0.005
 hallway+0.005
 tv+0.005
Neg logit contributions
endant-0.002
als-0.002
 seeds-0.002
Sur-0.002
 half-0.002
Token the
Token ID262
Feature activation+9.81e-03

Pos logit contributions
lf+0.038
 rarely+0.037
é+0.036
 hallway+0.035
 tv+0.035
Neg logit contributions
endant-0.032
als-0.029
 seeds-0.029
Sur-0.027
 da-0.026
Token birds
Token ID10087
Feature activation+2.33e-01

Pos logit contributions
lf+0.000
 rarely+0.000
é+0.000
 hallway+0.000
 tv+0.000
Neg logit contributions
endant-0.000
als-0.000
 seeds-0.000
Sur-0.000
 da-0.000
Token were
Token ID547
Feature activation+6.36e-01

Pos logit contributions
 rarely+0.005
lf+0.005
é+0.005
 hallway+0.005
 tv+0.005
Neg logit contributions
endant-0.005
Sur-0.005
als-0.004
 seeds-0.004
 half-0.004
Token safe
Token ID3338
Feature activation-7.35e-01

Pos logit contributions
 rarely+0.014
lf+0.013
é+0.012
 hallway+0.012
 especially+0.012
Neg logit contributions
endant-0.016
als-0.015
Sur-0.014
 seeds-0.013
elt-0.013
Token and
Token ID290
Feature activation-2.99e-01

Pos logit contributions
endant+0.015
als+0.013
 seeds+0.013
Sur+0.013
 da+0.011
Neg logit contributions
 rarely-0.018
lf-0.018
é-0.017
 hallway-0.017
 tv-0.017
Token even
Token ID772
Feature activation-1.14e+00

Pos logit contributions
endant+0.007
als+0.006
Sur+0.006
 seeds+0.006
elt+0.005
Neg logit contributions
 rarely-0.007
lf-0.007
 hallway-0.006
é-0.006
 tv-0.006
Token the
Token ID262
Feature activation-3.86e-01

Pos logit contributions
endant+0.022
als+0.020
 seeds+0.019
Sur+0.018
 da+0.016
Neg logit contributions
 rarely-0.030
lf-0.029
é-0.028
 hallway-0.028
 tv-0.028
Token tigers
Token ID43114
Feature activation+9.65e-01

Pos logit contributions
endant+0.007
als+0.007
 seeds+0.007
Sur+0.006
 da+0.006
Neg logit contributions
lf-0.010
 rarely-0.010
é-0.010
 hallway-0.009
 tv-0.009
Token were
Token ID547
Feature activation+9.43e-01

Pos logit contributions
 rarely+0.024
lf+0.024
é+0.023
 hallway+0.022
 tv+0.022
Neg logit contributions
endant-0.020
als-0.018
 seeds-0.017
Sur-0.017
 da-0.015
Token safe
Token ID3338
Feature activation-5.38e-01

Pos logit contributions
 rarely+0.021
lf+0.020
é+0.019
 hallway+0.018
 tv+0.018
Neg logit contributions
endant-0.023
als-0.022
Sur-0.020
 seeds-0.020
elt-0.018
Token.
Token ID13
Feature activation+1.89e-01

Pos logit contributions
endant+0.010
als+0.009
Sur+0.008
 seeds+0.008
 da+0.007
Neg logit contributions
 rarely-0.014
lf-0.014
é-0.014
 hallway-0.014
 tv-0.013
Token was
Token ID373
Feature activation+1.54e+00

Pos logit contributions
 tv+0.002
 rarely+0.002
é+0.002
lf+0.002
idget+0.001
Neg logit contributions
endant-0.001
als-0.001
 seeds-0.001
 da-0.001
 bread-0.001
Token so
Token ID523
Feature activation-3.70e-03

Pos logit contributions
é+0.043
lf+0.043
 rarely+0.042
 tv+0.042
icia+0.041
Neg logit contributions
endant-0.025
 seeds-0.023
 da-0.020
 half-0.020
 bread-0.020
Token original
Token ID2656
Feature activation-1.10e+00

Pos logit contributions
endant+0.000
als+0.000
 seeds+0.000
Sur+0.000
 half+0.000
Neg logit contributions
 rarely-0.000
lf-0.000
é-0.000
 hallway-0.000
 tv-0.000
Token that
Token ID326
Feature activation+1.14e+00

Pos logit contributions
endant+0.019
als+0.017
 seeds+0.016
Sur+0.015
 da+0.014
Neg logit contributions
 rarely-0.031
lf-0.031
é-0.030
 hallway-0.029
 tv-0.029
Token it
Token ID340
Feature activation+7.36e-01

Pos logit contributions
lf+0.030
 rarely+0.029
é+0.029
 hallway+0.029
 tv+0.029
Neg logit contributions
als-0.020
endant-0.020
 seeds-0.019
 da-0.017
 half-0.017
Token had
Token ID550
Feature activation-1.00e+00

Pos logit contributions
 tv+0.019
lf+0.019
 hallway+0.019
loe+0.019
 rarely+0.018
Neg logit contributions
als-0.013
endant-0.012
 seeds-0.012
 da-0.010
Sur-0.010
Token colorful
Token ID20239
Feature activation-1.41e+00

Pos logit contributions
endant+0.021
als+0.018
ossible+0.018
Sur+0.018
of+0.017
Neg logit contributions
 rarely-0.024
 hallway-0.023
lf-0.022
 hardly-0.022
 pretending-0.021
Token stars
Token ID5788
Feature activation+1.04e+00

Pos logit contributions
endant+0.028
als+0.025
Sur+0.025
 seeds+0.024
of+0.022
Neg logit contributions
lf-0.036
 rarely-0.034
 hallway-0.034
 tv-0.034
é-0.033
Token and
Token ID290
Feature activation-2.35e-01

Pos logit contributions
lf+0.021
 rarely+0.020
é+0.019
 hallway+0.019
 tv+0.019
Neg logit contributions
endant-0.026
als-0.024
Sur-0.023
 seeds-0.023
 half-0.021
Token moons
Token ID31093
Feature activation+1.18e+00

Pos logit contributions
endant+0.004
als+0.004
 seeds+0.004
Sur+0.003
elt+0.003
Neg logit contributions
lf-0.006
 rarely-0.006
 hallway-0.006
é-0.006
 tv-0.006
Token on
Token ID319
Feature activation+1.50e+00

Pos logit contributions
lf+0.026
 rarely+0.026
é+0.024
 hallway+0.024
 tv+0.023
Neg logit contributions
endant-0.027
Sur-0.025
als-0.024
 seeds-0.023
 fiber-0.023
Token Maybe
Token ID6674
Feature activation-3.74e-01

Pos logit contributions
endant+0.020
als+0.019
 seeds+0.018
Sur+0.017
 half+0.016
Neg logit contributions
lf-0.024
 rarely-0.024
é-0.022
 hallway-0.022
 tv-0.022
Token it
Token ID340
Feature activation-1.16e+00

Pos logit contributions
endant+0.008
als+0.007
Sur+0.007
 seeds+0.006
elt+0.006
Neg logit contributions
lf-0.009
 rarely-0.009
 especially-0.008
 tv-0.008
é-0.008
Token's
Token ID338
Feature activation+8.07e-01

Pos logit contributions
endant+0.024
als+0.022
 seeds+0.021
Sur+0.021
 bread+0.019
Neg logit contributions
 rarely-0.028
lf-0.027
é-0.027
 hallway-0.026
 tv-0.025
Token hiding
Token ID11816
Feature activation+5.19e-01

Pos logit contributions
 rarely+0.020
lf+0.020
é+0.019
 hallway+0.018
 tv+0.018
Neg logit contributions
endant-0.017
als-0.016
Sur-0.014
 seeds-0.014
elt-0.013
Token somewhere
Token ID7382
Feature activation-1.52e+00

Pos logit contributions
 rarely+0.013
lf+0.012
é+0.012
 hallway+0.012
 tv+0.012
Neg logit contributions
endant-0.011
als-0.010
 seeds-0.010
Sur-0.009
 half-0.009
Token.
Token ID13
Feature activation-8.50e-01

Pos logit contributions
endant+0.029
als+0.027
Sur+0.026
 seeds+0.024
elt+0.024
Neg logit contributions
lf-0.038
 rarely-0.038
 especially-0.036
 tv-0.036
 hallway-0.035
Token Let
Token ID3914
Feature activation-1.21e+00

Pos logit contributions
endant+0.019
als+0.017
 seeds+0.017
Sur+0.016
 half+0.015
Neg logit contributions
 rarely-0.019
lf-0.019
é-0.018
 hallway-0.018
 tv-0.018
Token's
Token ID338
Feature activation-4.77e-01

Pos logit contributions
endant+0.022
als+0.017
 seeds+0.016
Sur+0.016
elt+0.015
Neg logit contributions
 rarely-0.034
lf-0.034
é-0.033
icia-0.032
 tv-0.032
Token play
Token ID711
Feature activation+1.45e+00

Pos logit contributions
endant+0.009
als+0.008
 seeds+0.008
 half+0.007
 bread+0.007
Neg logit contributions
lf-0.012
 rarely-0.012
é-0.012
 hallway-0.012
 tv-0.011
Token a
Token ID257
Feature activation+8.63e-01

Pos logit contributions
 rarely+0.039
é+0.038
lf+0.037
 hallway+0.036
 tv+0.035
Neg logit contributions
endant-0.026
als-0.025
 seeds-0.024
Sur-0.023
 half-0.022
Token game
Token ID983
Feature activation+3.34e-01

Pos logit contributions
 rarely+0.021
é+0.021
lf+0.020
 hallway+0.019
 tv+0.019
Neg logit contributions
endant-0.017
 seeds-0.016
als-0.015
Sur-0.015
 da-0.015
Tokenara
Token ID3301
Feature activation+6.07e-01

Pos logit contributions
endant+0.036
 seeds+0.034
als+0.034
 wings+0.033
Sur+0.032
Neg logit contributions
loe-0.024
lf-0.024
irl-0.024
é-0.024
he-0.023
Token takes
Token ID2753
Feature activation-9.93e-01

Pos logit contributions
 rarely+0.017
lf+0.017
é+0.016
 hallway+0.016
 tv+0.016
Neg logit contributions
endant-0.010
als-0.009
 seeds-0.009
Sur-0.009
 half-0.008
Token Tom
Token ID4186
Feature activation-1.63e+00

Pos logit contributions
endant+0.019
als+0.018
 seeds+0.017
Sur+0.016
 da+0.015
Neg logit contributions
 rarely-0.025
lf-0.025
é-0.024
 hallway-0.024
 tv-0.024
Token's
Token ID338
Feature activation-7.83e-01

Pos logit contributions
endant+0.036
als+0.032
Sur+0.032
 seeds+0.030
 fiber+0.029
Neg logit contributions
é-0.036
 hallway-0.036
 rarely-0.036
lf-0.035
 pretending-0.034
Token hand
Token ID1021
Feature activation-2.23e+00

Pos logit contributions
endant+0.017
als+0.016
Sur+0.015
 seeds+0.015
elt+0.013
Neg logit contributions
 rarely-0.017
 hallway-0.017
lf-0.017
é-0.017
 tv-0.017
Token.
Token ID13
Feature activation-3.57e-02

Pos logit contributions
endant+0.049
Sur+0.042
als+0.042
 seeds+0.041
 fiber+0.038
Neg logit contributions
lf-0.051
 rarely-0.050
é-0.050
 hallway-0.049
 tv-0.048
Token "
Token ID366
Feature activation-6.41e-01

Pos logit contributions
endant+0.001
als+0.001
 seeds+0.001
Sur+0.001
elt+0.001
Neg logit contributions
lf-0.001
 rarely-0.001
é-0.001
 hallway-0.001
 tv-0.001
TokenDon
Token ID3987
Feature activation+2.96e-01

Pos logit contributions
endant+0.015
als+0.014
 seeds+0.013
Sur+0.013
 da+0.012
Neg logit contributions
 rarely-0.014
lf-0.014
é-0.013
 hallway-0.013
 tv-0.013
Token't
Token ID470
Feature activation-1.42e+00

Pos logit contributions
 rarely+0.004
lf+0.004
 hallway+0.004
 tv+0.003
é+0.003
Neg logit contributions
endant-0.009
als-0.009
Sur-0.009
 seeds-0.009
 half-0.008
Token worry
Token ID5490
Feature activation-1.84e+00

Pos logit contributions
endant+0.039
als+0.034
 seeds+0.032
Sur+0.030
elt+0.030
Neg logit contributions
 rarely-0.031
lf-0.028
 especially-0.026
 hallway-0.026
 seeker-0.025
Token,
Token ID11
Feature activation+2.24e-01

Pos logit contributions
endant+0.041
als+0.036
 seeds+0.035
Sur+0.034
elt+0.032
Neg logit contributions
lf-0.044
 rarely-0.043
é-0.041
 tv-0.040
loe-0.038
Token and
Token ID290
Feature activation-3.29e-01

Pos logit contributions
endant+0.063
Sur+0.055
als+0.055
 seeds+0.052
 fiber+0.047
Neg logit contributions
lf-0.063
 rarely-0.061
 tv-0.058
 hallway-0.058
é-0.058
Token fresh
Token ID4713
Feature activation-1.78e+00

Pos logit contributions
endant+0.006
als+0.006
Sur+0.006
aw+0.006
elt+0.006
Neg logit contributions
 rarely-0.008
lf-0.008
 especially-0.007
 Chess-0.007
 hallway-0.007
Token.
Token ID13
Feature activation-1.33e-01

Pos logit contributions
endant+0.042
Sur+0.037
upp+0.034
ossible+0.034
of+0.034
Neg logit contributions
lf-0.035
 tv-0.034
 hallway-0.034
é-0.033
 seeker-0.033
Token Do
Token ID2141
Feature activation-6.19e-01

Pos logit contributions
endant+0.003
als+0.003
 seeds+0.003
Sur+0.003
 da+0.003
Neg logit contributions
 rarely-0.003
lf-0.003
é-0.003
 tv-0.003
 hallway-0.003
Token you
Token ID345
Feature activation-8.07e-01

Pos logit contributions
endant+0.013
als+0.012
elt+0.011
Sur+0.011
 seeds+0.011
Neg logit contributions
lf-0.016
 rarely-0.015
 especially-0.014
 Chess-0.013
 hardly-0.013
Token want
Token ID765
Feature activation-3.22e+00

Pos logit contributions
endant+0.017
als+0.015
 seeds+0.015
Sur+0.015
 half+0.014
Neg logit contributions
lf-0.020
 rarely-0.019
é-0.019
 hallway-0.018
 tv-0.018
Token to
Token ID284
Feature activation-3.64e-01

Pos logit contributions
endant+0.068
Sur+0.056
als+0.055
lets+0.052
aw+0.049
Neg logit contributions
 especially-0.077
lf-0.074
 Chess-0.074
 tv-0.073
 cartoons-0.069
Token try
Token ID1949
Feature activation-1.17e+00

Pos logit contributions
endant+0.009
als+0.008
Sur+0.008
 seeds+0.008
elt+0.007
Neg logit contributions
 rarely-0.008
lf-0.008
é-0.007
 tv-0.007
 hallway-0.007
Token it
Token ID340
Feature activation-2.26e-01

Pos logit contributions
endant+0.026
als+0.025
Sur+0.024
 seeds+0.023
elt+0.021
Neg logit contributions
lf-0.026
 rarely-0.025
é-0.024
 tv-0.024
 especially-0.024
Token?"
Token ID1701
Feature activation+7.72e-01

Pos logit contributions
endant+0.006
als+0.006
 seeds+0.005
Sur+0.005
 da+0.005
Neg logit contributions
lf-0.004
 rarely-0.004
é-0.004
 hallway-0.004
 tv-0.004
Token\n
Token ID198
Feature activation-6.39e-01

Pos logit contributions
lf+0.024
 rarely+0.024
 hallway+0.023
é+0.023
 tv+0.022
Neg logit contributions
endant-0.011
als-0.010
Sur-0.009
 seeds-0.009
elt-0.008
Token,
Token ID11
Feature activation+1.07e+00

Pos logit contributions
lf+0.002
 hallway+0.002
é+0.002
 especially+0.002
 rarely+0.002
Neg logit contributions
endant-0.003
Sur-0.003
als-0.002
 seeds-0.002
elt-0.002
Token too
Token ID1165
Feature activation-1.11e+00

Pos logit contributions
 rarely+0.028
lf+0.028
é+0.026
 hallway+0.026
 tv+0.026
Neg logit contributions
endant-0.020
als-0.018
 seeds-0.017
Sur-0.017
elt-0.015
Token.
Token ID13
Feature activation-6.72e-01

Pos logit contributions
als+0.027
endant+0.027
Sur+0.024
 seeds+0.023
elt+0.023
Neg logit contributions
lf-0.023
 rarely-0.023
 hallway-0.022
 tv-0.021
 seeker-0.021
Token Do
Token ID2141
Feature activation-1.20e+00

Pos logit contributions
endant+0.016
als+0.015
 seeds+0.014
Sur+0.014
 da+0.013
Neg logit contributions
 rarely-0.014
lf-0.014
é-0.014
 hallway-0.013
 tv-0.013
Token you
Token ID345
Feature activation-7.64e-01

Pos logit contributions
endant+0.029
als+0.026
elt+0.026
 seeds+0.024
Sur+0.023
Neg logit contributions
lf-0.029
 rarely-0.027
irl-0.025
 especially-0.024
 hardly-0.024
Token want
Token ID765
Feature activation-3.29e+00

Pos logit contributions
endant+0.015
 seeds+0.013
als+0.013
 half+0.013
ossible+0.013
Neg logit contributions
lf-0.020
é-0.019
 hallway-0.019
 rarely-0.019
 tv-0.019
Token to
Token ID284
Feature activation-4.48e-02

Pos logit contributions
endant+0.074
Sur+0.061
als+0.060
lets+0.060
ige+0.055
Neg logit contributions
 especially-0.078
 tv-0.077
 cartoons-0.072
 Chess-0.072
lf-0.072
Token help
Token ID1037
Feature activation+1.01e+00

Pos logit contributions
endant+0.001
als+0.001
Sur+0.001
 seeds+0.001
elt+0.001
Neg logit contributions
 rarely-0.001
lf-0.001
é-0.001
 tv-0.001
 hallway-0.001
Token me
Token ID502
Feature activation+2.34e-01

Pos logit contributions
lf+0.027
 rarely+0.027
é+0.026
 hallway+0.025
 tv+0.025
Neg logit contributions
endant-0.019
als-0.017
 seeds-0.016
Sur-0.016
 da-0.014
Token get
Token ID651
Feature activation-1.59e+00

Pos logit contributions
lf+0.004
 rarely+0.004
é+0.004
 hallway+0.004
 tv+0.004
Neg logit contributions
endant-0.006
als-0.006
 seeds-0.006
Sur-0.006
elt-0.005
Token ready
Token ID3492
Feature activation-1.67e-01

Pos logit contributions
endant+0.032
Sur+0.030
als+0.030
of+0.027
lets+0.026
Neg logit contributions
 especially-0.037
 tv-0.036
é-0.035
lf-0.035
irl-0.035
Token big
Token ID1263
Feature activation-1.48e+00

Pos logit contributions
endant+0.025
als+0.023
 seeds+0.022
Sur+0.021
elt+0.020
Neg logit contributions
lf-0.017
 rarely-0.017
 hallway-0.017
 tv-0.016
é-0.016
Token noise
Token ID7838
Feature activation+3.15e-01

Pos logit contributions
endant+0.035
als+0.033
 seeds+0.031
Sur+0.030
elt+0.029
Neg logit contributions
lf-0.033
 rarely-0.031
 hallway-0.030
é-0.030
 tv-0.030
Token!
Token ID0
Feature activation+1.56e-01

Pos logit contributions
lf+0.008
 rarely+0.008
é+0.008
 hallway+0.008
 tv+0.007
Neg logit contributions
endant-0.006
als-0.006
 seeds-0.005
Sur-0.005
 half-0.005
Token It
Token ID632
Feature activation+4.34e-02

Pos logit contributions
 rarely+0.004
lf+0.004
é+0.004
 hallway+0.003
 tv+0.003
Neg logit contributions
endant-0.003
als-0.003
 seeds-0.003
Sur-0.003
 half-0.003
Token was
Token ID373
Feature activation+4.98e-01

Pos logit contributions
 rarely+0.001
lf+0.001
é+0.001
 hallway+0.001
 tv+0.001
Neg logit contributions
endant-0.001
als-0.001
 seeds-0.001
Sur-0.001
 da-0.001
Token much
Token ID881
Feature activation-3.44e+00

Pos logit contributions
 rarely+0.014
lf+0.014
é+0.013
 hallway+0.013
 tv+0.013
Neg logit contributions
endant-0.009
als-0.008
 seeds-0.008
Sur-0.007
 half-0.007
Token too
Token ID1165
Feature activation-5.26e-01

Pos logit contributions
endant+0.084
als+0.081
Sur+0.070
 seeds+0.070
of+0.070
Neg logit contributions
 rarely-0.070
 hallway-0.069
lf-0.069
 tv-0.067
é-0.064
Token loud
Token ID7812
Feature activation-5.51e-01

Pos logit contributions
endant+0.012
als+0.011
 seeds+0.011
Sur+0.011
 da+0.010
Neg logit contributions
lf-0.012
 rarely-0.012
é-0.011
 hallway-0.011
 tv-0.011
Token,
Token ID11
Feature activation+2.25e+00

Pos logit contributions
endant+0.011
als+0.010
 seeds+0.010
Sur+0.010
 da+0.009
Neg logit contributions
lf-0.014
 rarely-0.013
é-0.013
 tv-0.013
 hallway-0.013
Token so
Token ID523
Feature activation+1.30e+00

Pos logit contributions
lf+0.059
é+0.058
 rarely+0.057
loe+0.055
icia+0.054
Neg logit contributions
endant-0.040
 seeds-0.039
als-0.038
Sur-0.036
 bread-0.034
Token the
Token ID262
Feature activation+1.17e+00

Pos logit contributions
é+0.036
lf+0.035
idget+0.035
 hallway+0.035
 rarely+0.034
Neg logit contributions
endant-0.020
 seeds-0.020
als-0.020
Sur-0.018
 nuts-0.018
Token frog
Token ID21264
Feature activation+2.79e-01

Pos logit contributions
 rarely+0.006
lf+0.006
 hallway+0.006
é+0.006
 tv+0.006
Neg logit contributions
endant-0.006
als-0.005
Sur-0.005
 seeds-0.005
elt-0.005
Token was
Token ID373
Feature activation+1.84e+00

Pos logit contributions
lf+0.005
 rarely+0.005
é+0.005
 tv+0.005
 hallway+0.005
Neg logit contributions
endant-0.007
als-0.007
 seeds-0.007
Sur-0.006
 da-0.006
Token green
Token ID4077
Feature activation-9.60e-01

Pos logit contributions
lf+0.044
 tv+0.042
 hallway+0.041
é+0.041
 rarely+0.041
Neg logit contributions
endant-0.039
 seeds-0.037
 da-0.033
als-0.033
 bread-0.033
Token and
Token ID290
Feature activation+2.20e+00

Pos logit contributions
endant+0.020
als+0.018
Sur+0.018
 seeds+0.017
 half+0.016
Neg logit contributions
 rarely-0.023
lf-0.023
é-0.022
 hallway-0.021
 tv-0.021
Token had
Token ID550
Feature activation-2.21e+00

Pos logit contributions
lf+0.054
 rarely+0.053
é+0.053
 tv+0.053
 hallway+0.053
Neg logit contributions
endant-0.042
 seeds-0.039
als-0.039
 da-0.035
Sur-0.035
Token big
Token ID1263
Feature activation-2.76e+00

Pos logit contributions
endant+0.055
als+0.048
of+0.045
use+0.044
forts+0.044
Neg logit contributions
 rarely-0.048
 especially-0.044
 hardly-0.040
 hallway-0.039
 barely-0.038
Token eyes
Token ID2951
Feature activation+7.28e-02

Pos logit contributions
endant+0.046
Sur+0.043
als+0.042
of+0.038
 half+0.035
Neg logit contributions
 rarely-0.075
lf-0.071
 hallway-0.070
 seeker-0.069
 especially-0.069
Token.
Token ID13
Feature activation-1.29e-01

Pos logit contributions
lf+0.002
 rarely+0.002
é+0.002
 hallway+0.002
 tv+0.002
Neg logit contributions
endant-0.001
Sur-0.001
 seeds-0.001
als-0.001
ossible-0.001
Token He
Token ID679
Feature activation+6.81e-01

Pos logit contributions
endant+0.003
als+0.002
 seeds+0.002
Sur+0.002
 da+0.002
Neg logit contributions
 rarely-0.003
lf-0.003
é-0.003
 hallway-0.003
 tv-0.003
Token liked
Token ID8288
Feature activation-9.10e-01

Pos logit contributions
lf+0.014
 tv+0.014
é+0.014
 hallway+0.014
 rarely+0.014
Neg logit contributions
endant-0.016
als-0.015
 seeds-0.015
Sur-0.014
 da-0.014
Token to
Token ID284
Feature activation+9.61e-01

Pos logit contributions
endant+0.012
als+0.010
Sur+0.009
 seeds+0.008
 half+0.007
Neg logit contributions
 rarely-0.030
lf-0.029
é-0.029
 hallway-0.028
 tv-0.028
Token have
Token ID423
Feature activation-2.39e+00

Pos logit contributions
endant+0.000
als+0.000
 seeds+0.000
Sur+0.000
 da+0.000
Neg logit contributions
 rarely-0.000
lf-0.000
é-0.000
 hallway-0.000
 tv-0.000
Token fun
Token ID1257
Feature activation+1.67e-01

Pos logit contributions
endant+0.059
als+0.052
ocular+0.047
elt+0.046
lets+0.044
Neg logit contributions
 especially-0.050
lf-0.049
 rarely-0.048
 tv-0.047
 pretending-0.046
Token.
Token ID13
Feature activation-3.37e-01

Pos logit contributions
 rarely+0.005
lf+0.005
é+0.004
icia+0.004
 hallway+0.004
Neg logit contributions
endant-0.003
als-0.003
 seeds-0.003
Sur-0.002
 da-0.002
Token Do
Token ID2141
Feature activation-4.91e-01

Pos logit contributions
endant+0.005
als+0.005
Sur+0.005
 seeds+0.005
elt+0.004
Neg logit contributions
lf-0.010
 rarely-0.009
 hallway-0.009
é-0.009
icia-0.009
Token you
Token ID345
Feature activation-6.05e-01

Pos logit contributions
endant+0.009
als+0.008
 seeds+0.008
elt+0.007
Sur+0.007
Neg logit contributions
 rarely-0.014
lf-0.014
 especially-0.012
 hardly-0.012
é-0.012
Token want
Token ID765
Feature activation-3.02e+00

Pos logit contributions
endant+0.011
als+0.010
 seeds+0.010
 half+0.009
of+0.009
Neg logit contributions
lf-0.016
 hallway-0.015
 rarely-0.015
é-0.015
 tv-0.015
Token to
Token ID284
Feature activation-3.35e-01

Pos logit contributions
endant+0.063
als+0.051
Sur+0.048
lets+0.044
aw+0.044
Neg logit contributions
 tv-0.076
 especially-0.076
lf-0.075
 Chess-0.073
 cartoons-0.070
Token play
Token ID711
Feature activation+2.63e+00

Pos logit contributions
endant+0.008
als+0.007
 seeds+0.007
Sur+0.007
elt+0.006
Neg logit contributions
 rarely-0.008
lf-0.008
é-0.007
 hallway-0.007
 tv-0.007
Token with
Token ID351
Feature activation-1.17e-01

Pos logit contributions
 rarely+0.067
é+0.063
 hallway+0.062
lf+0.061
icia+0.059
Neg logit contributions
endant-0.051
als-0.049
 seeds-0.048
Sur-0.046
 half-0.044
Token me
Token ID502
Feature activation+6.86e-01

Pos logit contributions
endant+0.002
als+0.001
Sur+0.001
 seeds+0.001
elt+0.001
Neg logit contributions
lf-0.004
 rarely-0.004
é-0.004
 tv-0.003
 hallway-0.003
Token?"
Token ID1701
Feature activation+1.15e+00

Pos logit contributions
 rarely+0.014
lf+0.014
é+0.013
 hallway+0.013
 tv+0.013
Neg logit contributions
endant-0.017
als-0.016
 seeds-0.015
Sur-0.014
 half-0.014