TOP ACTIVATIONS
MAX = 3.480

realised it wasn 't her fault after all . <|endoftext|> Once
that it wasn 't her fault . Finally , the family
. The cater pillar happily mun ched on the mushroom ,
to sail on the high seas . Lily was happy that
knew it was his own fault . He would have to
knew it was her own fault . The girl
" It 's not my fault . I can 't help
! It mun ched and mun ched until the cereal was
looked so happy ! It mun ched and mun ched until
it was Tim my 's fault but Tim my felt guilty
awn â s fault . The
said it was not his fault . She said the bird
felt like it was her fault , like she had done
! Little John mun ched away happily , welcoming
W OO F ! W OO F !" Ben
mouse watched as she happily mun ched away on the y
bad car was not his fault . She had to make
The bunny happily mun ched on his carrot and
said it was not their fault . She said the truck
in his hands . He mun ched on it happily .

MIN ACTIVATIONS
MIN = -3.079

and her mom sorted the av oc ados into piles according
and went to apologize to Rabb ie . Fox y said
You have a rabbit . Rabb its are for babies .
" I 'm sorry , Rabb ie . I didn 't
and try to grab the av oc ados . But the
It turned out that the av oc ados were bad and
make it whole again . Amen ." Then ,
the dolls and the nap kins inside . She realized that
so happy and brought the av oc ados home to share
set the plates and nap kins . They waited for the
and pick up their nap kins . They said that nap
. She used a m itt to take out the pan
came to the garden . Rabb its , birds and butterflies
, " Thank you . Rabb its are calm creatures ,
Thank you , God . Amen ." He opened
They get a m itt and a spat ula .
us safe and happy . Amen ." " A
, and all the other av oc ados were very happy
house and our family . Amen ." Mom smiled
to hop inside , a gru ff voice shouted " NO

INTERVAL 1.840 - 3.480
CONTAINS 0.049%

mom if she could at least polish it . Her mom
's mom said , " Oh no ! It looks like
pretending to sail the high seas and search for treasure .
chuckled and said , " Oh no , no , no
dog was still w agging its tail . He came to

INTERVAL 0.201 - 1.840
CONTAINS 30.492%

had caused a lot of trouble . <|endoftext|> Once there was
our heroes too . You are very brave and kind ."
he could not believe it ! Spot put the
?" " Yes , we did !" Lily and
It was so much fun ! <|endoftext|> Once upon a time

INTERVAL -1.439 - 0.201
CONTAINS 69.364%

and fresh air . As they were riding ,
was a little girl , who was very sad . She
dolls . She had many dolls and gave them names and
they saw a man acting funny . He was juggling and
But then , Tim my had an idea . He said

INTERVAL -3.079 - -1.439
CONTAINS 0.094%

home , they saw a gr umpy man . He was
is not good for pump kins ," Pam says .
said that sharing and being kind is better than fighting and
shell that was rare and very pretty . Every day she
collect or nam ents . Everywhere she went she searched for
Token realised
Token ID19169
Feature activation+5.67e-01

Pos logit contributions
 neighbourhood+0.002
iet+0.002
 world+0.002
 neighborhood+0.002
ling+0.002
Neg logit contributions
 seventy-0.002
 eighty-0.002
 ninety-0.002
 fourteen-0.002
 forty-0.002
Token it
Token ID340
Feature activation+1.84e-01

Pos logit contributions
 seventy+0.024
 eighty+0.024
 ninety+0.023
 fourteen+0.020
 forty+0.018
Neg logit contributions
 neighbourhood-0.025
 world-0.025
ini-0.025
 neighborhood-0.025
ag-0.025
Token wasn
Token ID2492
Feature activation+1.54e+00

Pos logit contributions
 eighty+0.008
 seventy+0.008
 ninety+0.008
 fourteen+0.007
 forty+0.006
Neg logit contributions
 world-0.008
 neighbourhood-0.008
ini-0.008
 neighborhood-0.008
ling-0.008
Token't
Token ID470
Feature activation-2.58e-01

Pos logit contributions
 eighty+0.077
 seventy+0.077
 ninety+0.075
 pret+0.069
 fourteen+0.065
Neg logit contributions
ag-0.053
 world-0.052
ini-0.051
 neighbourhood-0.050
iet-0.049
Token her
Token ID607
Feature activation+6.83e-01

Pos logit contributions
 world+0.011
ag+0.010
ling+0.010
 neighbourhood+0.010
 neighborhood+0.010
Neg logit contributions
 seventy-0.012
 eighty-0.012
 ninety-0.011
 fourteen-0.011
 pret-0.010
Token fault
Token ID8046
Feature activation+3.36e+00

Pos logit contributions
 seventy+0.037
 eighty+0.037
 ninety+0.036
 fourteen+0.033
 forty+0.031
Neg logit contributions
 world-0.024
 neighbourhood-0.023
 neighborhood-0.023
ini-0.022
ag-0.021
Token after
Token ID706
Feature activation+6.19e-01

Pos logit contributions
 eighty+0.168
 seventy+0.167
 ninety+0.161
 fourteen+0.145
 pret+0.136
Neg logit contributions
 world-0.126
ini-0.124
 neighbourhood-0.124
 neighborhood-0.122
ag-0.122
Token all
Token ID477
Feature activation+1.47e+00

Pos logit contributions
 eighty+0.034
 seventy+0.034
 ninety+0.033
 fourteen+0.029
 forty+0.028
Neg logit contributions
 world-0.021
 neighbourhood-0.020
ag-0.020
 neighborhood-0.020
 outdoors-0.019
Token.
Token ID13
Feature activation+4.23e-01

Pos logit contributions
 seventy+0.071
 eighty+0.070
 ninety+0.068
 enough+0.066
 fourteen+0.063
Neg logit contributions
ini-0.055
iet-0.054
ag-0.053
 neighbourhood-0.052
 neighborhood-0.051
Token<|endoftext|>
Token ID50256
Feature activation+1.21e-01

Pos logit contributions
 seventy+0.023
 eighty+0.023
 ninety+0.022
 fourteen+0.020
 Fl+0.019
Neg logit contributions
ag-0.014
ini-0.014
iet-0.014
ling-0.013
 neighbourhood-0.013
TokenOnce
Token ID7454
Feature activation+6.98e-01

Pos logit contributions
 seventy+0.005
 eighty+0.005
 ninety+0.005
 Fl+0.004
 fourteen+0.004
Neg logit contributions
ini-0.005
ag-0.005
iet-0.005
ling-0.005
 neighbourhood-0.005
Token that
Token ID326
Feature activation+1.02e-01

Pos logit contributions
 eighty+0.041
 seventy+0.041
 ninety+0.040
 fourteen+0.036
 forty+0.033
Neg logit contributions
 world-0.039
 neighbourhood-0.039
 neighborhood-0.038
ag-0.038
ini-0.038
Token it
Token ID340
Feature activation-3.48e-02

Pos logit contributions
 eighty+0.005
 seventy+0.005
 ninety+0.004
 fourteen+0.004
 forty+0.004
Neg logit contributions
 world-0.004
 neighbourhood-0.004
 neighborhood-0.004
ini-0.004
ag-0.004
Token wasn
Token ID2492
Feature activation+1.36e+00

Pos logit contributions
 neighbourhood+0.002
 world+0.002
ini+0.002
 neighborhood+0.002
ag+0.002
Neg logit contributions
 eighty-0.001
 seventy-0.001
 ninety-0.001
 fourteen-0.001
 forty-0.001
Token't
Token ID470
Feature activation-3.46e-01

Pos logit contributions
 eighty+0.068
 seventy+0.068
 ninety+0.066
 pret+0.059
 fourteen+0.057
Neg logit contributions
ag-0.047
 world-0.046
 neighbourhood-0.045
ini-0.045
ling-0.043
Token her
Token ID607
Feature activation+5.51e-01

Pos logit contributions
 world+0.014
ag+0.014
 neighbourhood+0.013
ling+0.013
 outdoors+0.013
Neg logit contributions
 seventy-0.016
 eighty-0.016
 ninety-0.015
 fourteen-0.014
 pret-0.013
Token fault
Token ID8046
Feature activation+3.07e+00

Pos logit contributions
 seventy+0.028
 eighty+0.028
 ninety+0.027
 fourteen+0.025
 forty+0.023
Neg logit contributions
 world-0.020
 neighbourhood-0.020
 neighborhood-0.020
ini-0.019
ag-0.019
Token.
Token ID13
Feature activation+2.29e-01

Pos logit contributions
 eighty+0.166
 seventy+0.166
 ninety+0.159
 fourteen+0.145
 pret+0.137
Neg logit contributions
 world-0.101
 neighbourhood-0.100
ini-0.100
ag-0.098
 neighborhood-0.098
Token Finally
Token ID9461
Feature activation+1.13e-01

Pos logit contributions
 seventy+0.013
 eighty+0.013
 ninety+0.012
 fourteen+0.011
 enough+0.011
Neg logit contributions
ag-0.007
ini-0.007
ling-0.007
iet-0.007
 neighbourhood-0.007
Token,
Token ID11
Feature activation-1.92e-01

Pos logit contributions
 eighty+0.006
 seventy+0.006
 ninety+0.006
 fourteen+0.005
 pret+0.005
Neg logit contributions
 neighbourhood-0.004
 world-0.004
iet-0.004
 neighborhood-0.004
ling-0.003
Token the
Token ID262
Feature activation-1.32e-01

Pos logit contributions
 world+0.007
 neighbourhood+0.007
 neighborhood+0.007
ini+0.007
iet+0.007
Neg logit contributions
 seventy-0.010
 eighty-0.010
 ninety-0.010
 fourteen-0.008
 pret-0.008
Token family
Token ID1641
Feature activation-5.09e-01

Pos logit contributions
ini+0.005
ag+0.005
 neighbourhood+0.004
iet+0.004
 neighborhood+0.004
Neg logit contributions
 eighty-0.007
 seventy-0.007
 ninety-0.007
 fourteen-0.006
 enough-0.006
Token.
Token ID13
Feature activation+2.16e-01

Pos logit contributions
 neighbourhood+0.013
ini+0.013
 world+0.013
ag+0.013
 neighborhood+0.013
Neg logit contributions
 eighty-0.026
 seventy-0.026
 ninety-0.025
 fourteen-0.024
 enough-0.022
Token The
Token ID383
Feature activation-2.46e-01

Pos logit contributions
 eighty+0.012
 seventy+0.012
 ninety+0.012
 fourteen+0.011
 pret+0.010
Neg logit contributions
 neighbourhood-0.007
 world-0.006
 neighborhood-0.006
ini-0.006
ag-0.006
Token cater
Token ID20825
Feature activation-6.07e-01

Pos logit contributions
ag+0.010
 neighbourhood+0.010
ini+0.010
iet+0.010
 neighborhood+0.010
Neg logit contributions
 seventy-0.011
 eighty-0.011
 ninety-0.011
 fourteen-0.010
 Fl-0.009
Tokenpillar
Token ID41643
Feature activation-3.39e-01

Pos logit contributions
 neighborhood+0.022
 neighbourhood+0.022
ag+0.022
 world+0.022
iet+0.021
Neg logit contributions
 seventy-0.029
 eighty-0.029
 ninety-0.027
 fourteen-0.026
 Fl-0.024
Token happily
Token ID18177
Feature activation+4.73e-01

Pos logit contributions
ling+0.013
 neighbourhood+0.013
 neighborhood+0.012
 world+0.012
iet+0.012
Neg logit contributions
 eighty-0.017
 seventy-0.016
 ninety-0.016
 fourteen-0.015
 forty-0.014
Token mun
Token ID29856
Feature activation+2.91e+00

Pos logit contributions
 eighty+0.021
 seventy+0.021
 ninety+0.021
 fourteen+0.020
 forty+0.018
Neg logit contributions
ling-0.019
 neighbourhood-0.017
ag-0.017
 neighborhood-0.017
iet-0.016
Tokenched
Token ID1740
Feature activation+1.46e+00

Pos logit contributions
 eighty+0.127
 seventy+0.126
 ninety+0.124
 pret+0.108
 Rob+0.106
Neg logit contributions
ag-0.128
ling-0.125
agging-0.118
iet-0.117
 world-0.111
Token on
Token ID319
Feature activation+8.69e-01

Pos logit contributions
 seventy+0.082
 eighty+0.082
 ninety+0.078
 fourteen+0.072
 forty+0.068
Neg logit contributions
ling-0.047
ag-0.047
 world-0.046
 neighbourhood-0.044
iet-0.042
Token the
Token ID262
Feature activation+5.61e-01

Pos logit contributions
 eighty+0.041
 seventy+0.040
 ninety+0.038
 fourteen+0.034
 pret+0.033
Neg logit contributions
 world-0.034
ag-0.034
ling-0.033
 neighbourhood-0.032
agging-0.031
Token mushroom
Token ID28520
Feature activation-1.31e-01

Pos logit contributions
 seventy+0.029
 eighty+0.029
 ninety+0.027
 fourteen+0.025
 Fl+0.024
Neg logit contributions
 neighbourhood-0.020
 neighborhood-0.019
ini-0.019
ag-0.019
iet-0.019
Token,
Token ID11
Feature activation+4.31e-01

Pos logit contributions
 world+0.005
 neighbourhood+0.005
ling+0.005
 neighborhood+0.005
ag+0.005
Neg logit contributions
 eighty-0.006
 seventy-0.006
 ninety-0.006
 fourteen-0.006
 pret-0.005
Token to
Token ID284
Feature activation+2.25e-01

Pos logit contributions
 world+0.001
 neighbourhood+0.001
ag+0.001
 neighborhood+0.001
ling+0.001
Neg logit contributions
 seventy-0.001
 eighty-0.001
 ninety-0.001
 fourteen-0.001
 forty-0.001
Token sail
Token ID14936
Feature activation+1.41e+00

Pos logit contributions
 seventy+0.012
 eighty+0.012
 ninety+0.011
 fourteen+0.010
 enough+0.010
Neg logit contributions
 world-0.008
ag-0.008
 neighbourhood-0.008
 neighborhood-0.008
ini-0.008
Token on
Token ID319
Feature activation-1.03e-01

Pos logit contributions
 eighty+0.075
 seventy+0.075
 ninety+0.072
 fourteen+0.066
 enough+0.063
Neg logit contributions
 neighbourhood-0.047
ini-0.047
 world-0.047
ag-0.046
 neighborhood-0.046
Token the
Token ID262
Feature activation+5.76e-01

Pos logit contributions
 world+0.005
 neighbourhood+0.004
 neighborhood+0.004
ag+0.004
ini+0.004
Neg logit contributions
 eighty-0.005
 seventy-0.005
 ninety-0.004
 fourteen-0.004
 forty-0.003
Token high
Token ID1029
Feature activation+4.99e-01

Pos logit contributions
 seventy+0.031
 eighty+0.031
 ninety+0.029
 fourteen+0.027
 enough+0.026
Neg logit contributions
 neighbourhood-0.019
 world-0.019
ini-0.019
 neighborhood-0.019
ag-0.019
Token seas
Token ID21547
Feature activation+2.88e+00

Pos logit contributions
 eighty+0.027
 seventy+0.027
 ninety+0.026
 fourteen+0.024
 enough+0.023
Neg logit contributions
ini-0.016
 neighbourhood-0.016
ag-0.016
 neighborhood-0.016
 world-0.016
Token.
Token ID13
Feature activation+7.64e-02

Pos logit contributions
 eighty+0.146
 seventy+0.146
 ninety+0.140
 fourteen+0.128
 enough+0.122
Neg logit contributions
 neighbourhood-0.103
ini-0.102
 neighborhood-0.101
 world-0.100
ag-0.099
Token Lily
Token ID20037
Feature activation+2.46e-01

Pos logit contributions
 seventy+0.003
 eighty+0.003
 ninety+0.003
 fourteen+0.003
 Fl+0.003
Neg logit contributions
ag-0.003
ini-0.003
ling-0.003
iet-0.003
 neighbourhood-0.003
Token was
Token ID373
Feature activation+3.34e-01

Pos logit contributions
 seventy+0.013
 eighty+0.013
 ninety+0.013
 fourteen+0.012
 um+0.011
Neg logit contributions
 neighbourhood-0.009
 neighborhood-0.009
ling-0.008
 world-0.008
ini-0.008
Token happy
Token ID3772
Feature activation+7.91e-01

Pos logit contributions
 seventy+0.016
 ninety+0.016
 eighty+0.016
 fourteen+0.015
 um+0.014
Neg logit contributions
 neighbourhood-0.013
 world-0.013
 neighborhood-0.013
 outdoors-0.012
ag-0.012
Token that
Token ID326
Feature activation+2.55e-01

Pos logit contributions
 eighty+0.044
 seventy+0.043
 ninety+0.042
 fourteen+0.037
 forty+0.035
Neg logit contributions
 world-0.029
 neighbourhood-0.028
 neighborhood-0.028
ling-0.025
 outdoors-0.025
Token knew
Token ID2993
Feature activation+1.78e-01

Pos logit contributions
 neighbourhood+0.007
 neighborhood+0.006
 world+0.006
ling+0.006
iet+0.006
Neg logit contributions
 eighty-0.008
 seventy-0.008
 ninety-0.008
 fourteen-0.007
 pret-0.006
Token it
Token ID340
Feature activation+5.17e-01

Pos logit contributions
 seventy+0.007
 eighty+0.007
 ninety+0.006
 fourteen+0.006
 forty+0.005
Neg logit contributions
 neighbourhood-0.009
 world-0.009
ini-0.009
ag-0.009
 neighborhood-0.009
Token was
Token ID373
Feature activation+4.04e-01

Pos logit contributions
 eighty+0.025
 seventy+0.025
 ninety+0.024
 fourteen+0.022
 forty+0.020
Neg logit contributions
 world-0.021
 neighbourhood-0.021
 neighborhood-0.020
ling-0.020
ini-0.020
Token his
Token ID465
Feature activation+5.05e-01

Pos logit contributions
 seventy+0.020
 eighty+0.020
 ninety+0.019
 fourteen+0.018
 pret+0.016
Neg logit contributions
 world-0.015
 neighbourhood-0.015
ag-0.015
 neighborhood-0.015
ini-0.015
Token own
Token ID898
Feature activation-2.02e-01

Pos logit contributions
 seventy+0.029
 eighty+0.028
 ninety+0.027
 fourteen+0.026
 forty+0.023
Neg logit contributions
 world-0.018
 neighbourhood-0.017
 neighborhood-0.017
ini-0.015
ag-0.015
Token fault
Token ID8046
Feature activation+2.88e+00

Pos logit contributions
 world+0.007
 neighbourhood+0.007
 neighborhood+0.007
ini+0.006
ag+0.006
Neg logit contributions
 eighty-0.011
 seventy-0.011
 ninety-0.011
 fourteen-0.010
 forty-0.010
Token.
Token ID13
Feature activation+3.58e-01

Pos logit contributions
 seventy+0.168
 eighty+0.168
 ninety+0.162
 fourteen+0.150
 pret+0.142
Neg logit contributions
 world-0.082
 neighbourhood-0.081
ini-0.081
ag-0.080
 neighborhood-0.078
Token He
Token ID679
Feature activation+4.47e-01

Pos logit contributions
 seventy+0.017
 eighty+0.017
 ninety+0.016
 fourteen+0.015
 enough+0.014
Neg logit contributions
ag-0.014
ini-0.014
 neighbourhood-0.014
 world-0.014
 neighborhood-0.014
Token would
Token ID561
Feature activation-1.89e-01

Pos logit contributions
 eighty+0.022
 seventy+0.022
 ninety+0.021
 fourteen+0.020
 forty+0.018
Neg logit contributions
 neighbourhood-0.017
 neighborhood-0.016
ling-0.016
iet-0.016
 world-0.016
Token have
Token ID423
Feature activation+2.76e-01

Pos logit contributions
 neighbourhood+0.008
 world+0.008
 neighborhood+0.008
ag+0.008
ini+0.008
Neg logit contributions
 eighty-0.008
 seventy-0.008
 ninety-0.007
 fourteen-0.007
 enough-0.006
Token to
Token ID284
Feature activation+5.96e-02

Pos logit contributions
 seventy+0.013
 eighty+0.013
 ninety+0.012
 fourteen+0.011
six+0.010
Neg logit contributions
 neighbourhood-0.012
 neighborhood-0.012
 world-0.011
ag-0.010
 town-0.010
Token knew
Token ID2993
Feature activation+1.47e-01

Pos logit contributions
 neighbourhood+0.028
 world+0.028
 neighborhood+0.028
ling+0.027
iet+0.027
Neg logit contributions
 seventy-0.034
 eighty-0.034
 ninety-0.033
 fourteen-0.030
 forty-0.028
Token it
Token ID340
Feature activation+8.13e-01

Pos logit contributions
 eighty+0.007
 seventy+0.007
 ninety+0.006
 fourteen+0.006
 forty+0.005
Neg logit contributions
 world-0.006
 neighbourhood-0.006
 neighborhood-0.006
ag-0.006
ini-0.006
Token was
Token ID373
Feature activation+6.65e-01

Pos logit contributions
 eighty+0.043
 seventy+0.043
 ninety+0.041
 fourteen+0.038
 forty+0.035
Neg logit contributions
 world-0.031
 neighbourhood-0.029
ling-0.029
 neighborhood-0.029
ag-0.027
Token her
Token ID607
Feature activation+1.43e-01

Pos logit contributions
 eighty+0.032
 seventy+0.032
 ninety+0.031
 fourteen+0.028
 pret+0.026
Neg logit contributions
 world-0.026
 neighbourhood-0.026
ag-0.025
 neighborhood-0.025
ini-0.025
Token own
Token ID898
Feature activation-1.63e-01

Pos logit contributions
 seventy+0.008
 eighty+0.008
 ninety+0.007
 fourteen+0.007
 forty+0.006
Neg logit contributions
 world-0.005
 neighbourhood-0.005
 neighborhood-0.005
ini-0.005
ag-0.004
Token fault
Token ID8046
Feature activation+2.87e+00

Pos logit contributions
 world+0.006
 neighbourhood+0.006
 neighborhood+0.005
ini+0.005
ag+0.005
Neg logit contributions
 seventy-0.009
 eighty-0.009
 ninety-0.008
 fourteen-0.008
 forty-0.007
Token.
Token ID13
Feature activation+1.97e-01

Pos logit contributions
 seventy+0.171
 eighty+0.171
 ninety+0.167
 fourteen+0.154
 forty+0.148
Neg logit contributions
 world-0.080
ag-0.076
 neighbourhood-0.076
ling-0.075
ini-0.075
Token
Token ID198
Feature activation-1.47e-01

Pos logit contributions
 eighty+0.011
 seventy+0.011
 ninety+0.010
 fourteen+0.010
 enough+0.009
Neg logit contributions
 neighbourhood-0.006
ini-0.006
 world-0.006
 neighborhood-0.006
ag-0.006
Token
Token ID198
Feature activation-1.26e-01

Pos logit contributions
ini+0.005
ling+0.005
ag+0.005
 world+0.005
 neighbourhood+0.005
Neg logit contributions
 eighty-0.008
 seventy-0.008
 ninety-0.008
 Fl-0.007
 fourteen-0.007
TokenThe
Token ID464
Feature activation-1.34e-01

Pos logit contributions
ini+0.004
ag+0.004
ling+0.004
 world+0.004
 neighbourhood+0.004
Neg logit contributions
 eighty-0.007
 seventy-0.007
 ninety-0.006
 fourteen-0.006
 Fl-0.005
Token girl
Token ID2576
Feature activation+2.24e-01

Pos logit contributions
ag+0.005
ini+0.005
 neighbourhood+0.005
 neighborhood+0.005
iet+0.005
Neg logit contributions
 eighty-0.007
 seventy-0.007
 ninety-0.007
 fourteen-0.006
 enough-0.006
Token "
Token ID366
Feature activation-1.99e-01

Pos logit contributions
 eighty+0.020
 seventy+0.020
 ninety+0.020
 fourteen+0.018
 enough+0.017
Neg logit contributions
 world-0.012
ling-0.011
 neighborhood-0.011
 neighbourhood-0.011
ag-0.011
TokenIt
Token ID1026
Feature activation+6.42e-01

Pos logit contributions
ini+0.007
 world+0.007
 neighbourhood+0.007
ag+0.006
 neighborhood+0.006
Neg logit contributions
 eighty-0.011
 seventy-0.011
 ninety-0.010
 fourteen-0.009
 enough-0.009
Token's
Token ID338
Feature activation+1.86e-01

Pos logit contributions
 eighty+0.030
 seventy+0.030
 ninety+0.029
 fourteen+0.026
 forty+0.024
Neg logit contributions
ini-0.026
 world-0.025
 neighbourhood-0.025
ag-0.025
ling-0.025
Token not
Token ID407
Feature activation-4.22e-01

Pos logit contributions
 eighty+0.009
 seventy+0.009
 ninety+0.008
 fourteen+0.007
 pret+0.007
Neg logit contributions
 world-0.008
ag-0.008
 neighbourhood-0.007
ini-0.007
 neighborhood-0.007
Token my
Token ID616
Feature activation+1.25e+00

Pos logit contributions
 world+0.014
ag+0.014
 outdoors+0.013
 nature+0.012
ling+0.012
Neg logit contributions
 seventy-0.024
 eighty-0.023
 ninety-0.023
 fourteen-0.022
six-0.020
Token fault
Token ID8046
Feature activation+2.86e+00

Pos logit contributions
 seventy+0.061
 eighty+0.061
 ninety+0.058
 fourteen+0.053
 forty+0.049
Neg logit contributions
 world-0.050
 neighbourhood-0.050
 neighborhood-0.049
ini-0.049
ag-0.047
Token.
Token ID13
Feature activation-9.33e-03

Pos logit contributions
 eighty+0.164
 seventy+0.162
 ninety+0.158
 fourteen+0.141
 forty+0.138
Neg logit contributions
 world-0.089
ling-0.086
ini-0.084
ag-0.082
agging-0.082
Token I
Token ID314
Feature activation+2.29e-02

Pos logit contributions
ling+0.000
iet+0.000
ag+0.000
 neighbourhood+0.000
 neighborhood+0.000
Neg logit contributions
 seventy-0.000
 eighty-0.000
 ninety-0.000
 Fl-0.000
 fourteen-0.000
Token can
Token ID460
Feature activation-2.49e-01

Pos logit contributions
 eighty+0.001
 seventy+0.001
 ninety+0.001
 fourteen+0.001
 enough+0.001
Neg logit contributions
ini-0.001
 neighbourhood-0.001
 world-0.001
ag-0.001
 neighborhood-0.001
Token't
Token ID470
Feature activation-1.02e+00

Pos logit contributions
 neighbourhood+0.012
ini+0.012
 world+0.012
 neighborhood+0.012
iet+0.012
Neg logit contributions
 seventy-0.010
 eighty-0.010
 ninety-0.009
 fourteen-0.008
 Fl-0.008
Token help
Token ID1037
Feature activation-4.85e-01

Pos logit contributions
 neighbourhood+0.046
ini+0.046
 neighborhood+0.045
iet+0.045
 world+0.044
Neg logit contributions
 seventy-0.040
 eighty-0.040
 ninety-0.037
 fourteen-0.034
 Fl-0.034
Token!
Token ID0
Feature activation+4.47e-01

Pos logit contributions
 world+0.002
 neighbourhood+0.002
 neighborhood+0.002
ling+0.002
 outdoors+0.002
Neg logit contributions
 eighty-0.004
 seventy-0.004
 ninety-0.004
 fourteen-0.003
 forty-0.003
Token It
Token ID632
Feature activation+4.69e-01

Pos logit contributions
 seventy+0.022
 eighty+0.022
 ninety+0.021
 fourteen+0.020
 enough+0.019
Neg logit contributions
ag-0.017
ini-0.016
ling-0.016
iet-0.016
agging-0.016
Token mun
Token ID29856
Feature activation+2.84e+00

Pos logit contributions
 eighty+0.024
 seventy+0.024
 ninety+0.024
 fourteen+0.023
 forty+0.020
Neg logit contributions
ling-0.016
 neighbourhood-0.016
 world-0.016
 neighborhood-0.016
ini-0.016
Tokenched
Token ID1740
Feature activation+1.39e+00

Pos logit contributions
 eighty+0.132
 ninety+0.130
 seventy+0.129
 Rob+0.116
 pret+0.115
Neg logit contributions
ling-0.118
ag-0.117
agging-0.109
iet-0.106
ilt-0.100
Token and
Token ID290
Feature activation+1.09e+00

Pos logit contributions
 eighty+0.075
 seventy+0.075
 ninety+0.072
 fourteen+0.065
 forty+0.061
Neg logit contributions
 world-0.046
ling-0.046
ag-0.046
 neighbourhood-0.045
ini-0.044
Token mun
Token ID29856
Feature activation+2.85e+00

Pos logit contributions
 eighty+0.051
 ninety+0.050
 seventy+0.048
 fault+0.043
 forty+0.043
Neg logit contributions
ling-0.038
 calm-0.034
 around-0.033
 neighbourhood-0.033
 garden-0.033
Tokenched
Token ID1740
Feature activation+1.16e+00

Pos logit contributions
 eighty+0.130
 seventy+0.128
 ninety+0.127
 Rob+0.111
 pret+0.110
Neg logit contributions
ag-0.120
ling-0.118
agging-0.111
iet-0.108
ilt-0.101
Token until
Token ID1566
Feature activation+5.03e-01

Pos logit contributions
 eighty+0.061
 seventy+0.061
 ninety+0.058
 fourteen+0.052
 forty+0.049
Neg logit contributions
ling-0.041
ag-0.041
 world-0.041
 neighbourhood-0.039
iet-0.038
Token the
Token ID262
Feature activation+8.23e-02

Pos logit contributions
 eighty+0.020
 seventy+0.020
 ninety+0.019
 fourteen+0.017
 enough+0.016
Neg logit contributions
 neighbourhood-0.023
 world-0.023
ini-0.023
 neighborhood-0.023
ag-0.023
Token cereal
Token ID33158
Feature activation-4.93e-02

Pos logit contributions
 eighty+0.004
 seventy+0.004
 ninety+0.004
 fourteen+0.003
 enough+0.003
Neg logit contributions
 neighbourhood-0.003
 world-0.003
ini-0.003
 neighborhood-0.003
ag-0.003
Token was
Token ID373
Feature activation+4.71e-01

Pos logit contributions
 world+0.002
ling+0.002
 neighbourhood+0.002
 neighborhood+0.002
 house+0.002
Neg logit contributions
 eighty-0.003
 seventy-0.003
 ninety-0.003
 fourteen-0.002
 forty-0.002
Token looked
Token ID3114
Feature activation-4.23e-01

Pos logit contributions
ling+0.020
 neighbourhood+0.020
 neighborhood+0.020
 world+0.019
iet+0.019
Neg logit contributions
 eighty-0.024
 seventy-0.024
 ninety-0.024
 fourteen-0.022
 forty-0.020
Token so
Token ID523
Feature activation-4.22e-01

Pos logit contributions
ini+0.014
 neighbourhood+0.014
 world+0.014
ag+0.014
 neighborhood+0.014
Neg logit contributions
 eighty-0.023
 seventy-0.023
 ninety-0.022
 fourteen-0.020
 enough-0.019
Token happy
Token ID3772
Feature activation-6.92e-02

Pos logit contributions
ini+0.016
 neighbourhood+0.015
iet+0.015
 neighborhood+0.015
 world+0.015
Neg logit contributions
 seventy-0.021
 eighty-0.021
 ninety-0.020
 enough-0.019
 fourteen-0.018
Token!
Token ID0
Feature activation+4.47e-01

Pos logit contributions
 world+0.002
 neighbourhood+0.002
 neighborhood+0.002
ling+0.002
 outdoors+0.002
Neg logit contributions
 eighty-0.004
 seventy-0.004
 ninety-0.004
 fourteen-0.003
 forty-0.003
Token It
Token ID632
Feature activation+4.69e-01

Pos logit contributions
 seventy+0.022
 eighty+0.022
 ninety+0.021
 fourteen+0.020
 enough+0.019
Neg logit contributions
ag-0.017
ini-0.016
ling-0.016
iet-0.016
agging-0.016
Token mun
Token ID29856
Feature activation+2.84e+00

Pos logit contributions
 eighty+0.024
 seventy+0.024
 ninety+0.024
 fourteen+0.023
 forty+0.020
Neg logit contributions
ling-0.016
 neighbourhood-0.016
 world-0.016
 neighborhood-0.016
ini-0.016
Tokenched
Token ID1740
Feature activation+1.39e+00

Pos logit contributions
 eighty+0.132
 ninety+0.130
 seventy+0.129
 Rob+0.116
 pret+0.115
Neg logit contributions
ling-0.118
ag-0.117
agging-0.109
iet-0.106
ilt-0.100
Token and
Token ID290
Feature activation+1.09e+00

Pos logit contributions
 eighty+0.075
 seventy+0.075
 ninety+0.072
 fourteen+0.065
 forty+0.061
Neg logit contributions
 world-0.046
ling-0.046
ag-0.046
 neighbourhood-0.045
ini-0.044
Token mun
Token ID29856
Feature activation+2.85e+00

Pos logit contributions
 eighty+0.051
 ninety+0.050
 seventy+0.048
 fault+0.043
 forty+0.043
Neg logit contributions
ling-0.038
 calm-0.034
 around-0.033
 neighbourhood-0.033
 garden-0.033
Tokenched
Token ID1740
Feature activation+1.16e+00

Pos logit contributions
 eighty+0.130
 seventy+0.128
 ninety+0.127
 Rob+0.111
 pret+0.110
Neg logit contributions
ag-0.120
ling-0.118
agging-0.111
iet-0.108
ilt-0.101
Token until
Token ID1566
Feature activation+5.03e-01

Pos logit contributions
 eighty+0.061
 seventy+0.061
 ninety+0.058
 fourteen+0.052
 forty+0.049
Neg logit contributions
ling-0.041
ag-0.041
 world-0.041
 neighbourhood-0.039
iet-0.038
Token it
Token ID340
Feature activation+8.56e-01

Pos logit contributions
 eighty+0.007
 seventy+0.007
 ninety+0.007
 fourteen+0.006
 pret+0.005
Neg logit contributions
ag-0.006
ini-0.006
ling-0.006
 neighbourhood-0.006
 world-0.005
Token was
Token ID373
Feature activation+2.70e-01

Pos logit contributions
 eighty+0.040
 seventy+0.040
 ninety+0.038
 fourteen+0.034
 forty+0.032
Neg logit contributions
ini-0.035
 world-0.035
 neighbourhood-0.035
 neighborhood-0.035
ling-0.035
Token Tim
Token ID5045
Feature activation+1.09e-01

Pos logit contributions
 eighty+0.011
 seventy+0.011
 ninety+0.010
 fourteen+0.009
 enough+0.008
Neg logit contributions
 neighbourhood-0.013
ini-0.013
 world-0.013
ag-0.013
 neighborhood-0.013
Tokenmy
Token ID1820
Feature activation-1.09e+00

Pos logit contributions
 seventy+0.005
 eighty+0.005
 ninety+0.005
 fourteen+0.005
 enough+0.004
Neg logit contributions
ini-0.004
 neighbourhood-0.004
 world-0.004
ag-0.004
 neighborhood-0.004
Token's
Token ID338
Feature activation+1.57e-01

Pos logit contributions
 world+0.041
 neighbourhood+0.041
ini+0.040
 neighborhood+0.040
ag+0.040
Neg logit contributions
 seventy-0.055
 eighty-0.055
 ninety-0.052
 fourteen-0.048
 forty-0.045
Token fault
Token ID8046
Feature activation+2.83e+00

Pos logit contributions
 eighty+0.009
 seventy+0.009
 ninety+0.008
 fourteen+0.008
 forty+0.007
Neg logit contributions
 world-0.006
 neighbourhood-0.006
 neighborhood-0.006
ini-0.006
ag-0.005
Token but
Token ID475
Feature activation+4.81e-01

Pos logit contributions
 eighty+0.148
 seventy+0.147
 ninety+0.142
 fourteen+0.127
 pret+0.123
Neg logit contributions
 world-0.101
ling-0.100
ini-0.099
agging-0.098
ag-0.096
Token Tim
Token ID5045
Feature activation+1.07e-01

Pos logit contributions
 eighty+0.023
 seventy+0.023
 ninety+0.022
 fourteen+0.020
 pret+0.018
Neg logit contributions
 neighbourhood-0.019
ini-0.019
 world-0.019
 neighborhood-0.019
ag-0.018
Tokenmy
Token ID1820
Feature activation-1.10e+00

Pos logit contributions
 eighty+0.005
 seventy+0.005
 ninety+0.005
 fourteen+0.005
 forty+0.004
Neg logit contributions
ag-0.004
ini-0.004
 neighbourhood-0.004
 neighborhood-0.004
 world-0.004
Token felt
Token ID2936
Feature activation+5.29e-01

Pos logit contributions
 neighbourhood+0.048
 neighborhood+0.048
ini+0.048
 world+0.047
ling+0.047
Neg logit contributions
 seventy-0.049
 eighty-0.049
 ninety-0.047
 fourteen-0.043
 forty-0.039
Token guilty
Token ID6717
Feature activation+1.14e+00

Pos logit contributions
 eighty+0.027
 seventy+0.027
 ninety+0.025
 fourteen+0.023
 enough+0.023
Neg logit contributions
ini-0.019
 neighbourhood-0.019
iet-0.019
 world-0.018
 neighborhood-0.018
Tokenawn
Token ID3832
Feature activation-2.47e-01

Pos logit contributions
 neighbourhood+0.006
 world+0.006
ini+0.006
 neighborhood+0.006
ag+0.006
Neg logit contributions
 eighty-0.008
 seventy-0.008
 ninety-0.008
 fourteen-0.007
 enough-0.007
Tokenâ
Token ID22940
Feature activation+2.35e-02

Pos logit contributions
ini+0.010
 neighbourhood+0.010
 world+0.009
 neighborhood+0.009
ag+0.009
Neg logit contributions
 seventy-0.012
 eighty-0.012
 ninety-0.011
 fourteen-0.010
 enough-0.010
Token
Token ID26391
Feature activation+1.07e-01

Pos logit contributions
 seventy+0.001
 eighty+0.001
 ninety+0.001
 fourteen+0.001
 enough+0.001
Neg logit contributions
iet-0.001
 neighborhood-0.001
 neighbourhood-0.001
ini-0.001
 world-0.001
Token
Token ID8151
Feature activation+7.42e-01

Pos logit contributions
 eighty+0.005
 seventy+0.005
 ninety+0.004
 fourteen+0.004
 Fl+0.004
Neg logit contributions
ag-0.004
ini-0.004
ling-0.004
iet-0.004
 world-0.004
Tokens
Token ID82
Feature activation-6.86e-02

Pos logit contributions
 eighty+0.032
 seventy+0.032
 ninety+0.031
 fourteen+0.028
 enough+0.026
Neg logit contributions
 neighbourhood-0.032
ini-0.032
 world-0.031
 neighborhood-0.031
ag-0.031
Token fault
Token ID8046
Feature activation+2.81e+00

Pos logit contributions
 world+0.003
 neighbourhood+0.003
 neighborhood+0.003
ini+0.003
ag+0.003
Neg logit contributions
 seventy-0.003
 eighty-0.003
 ninety-0.003
 fourteen-0.003
 forty-0.002
Token.
Token ID13
Feature activation+2.21e-01

Pos logit contributions
 eighty+0.157
 seventy+0.157
 ninety+0.152
 fourteen+0.139
 pret+0.135
Neg logit contributions
 world-0.087
ling-0.085
 neighbourhood-0.084
ag-0.083
ini-0.083
Token
Token ID220
Feature activation-6.91e-01

Pos logit contributions
 eighty+0.011
 seventy+0.011
 ninety+0.011
 fourteen+0.010
 enough+0.009
Neg logit contributions
 neighbourhood-0.008
 world-0.008
ini-0.008
 neighborhood-0.008
ag-0.008
Token
Token ID198
Feature activation+1.13e-03

Pos logit contributions
 neighbourhood+0.022
ini+0.022
 world+0.022
ag+0.021
 neighborhood+0.021
Neg logit contributions
 eighty-0.038
 seventy-0.038
 ninety-0.036
 fourteen-0.033
 enough-0.032
Token
Token ID198
Feature activation+3.11e-02

Pos logit contributions
 eighty+0.000
 seventy+0.000
 ninety+0.000
 fourteen+0.000
 enough+0.000
Neg logit contributions
 neighbourhood-0.000
 world-0.000
ini-0.000
 neighborhood-0.000
ag-0.000
TokenThe
Token ID464
Feature activation-1.87e-01

Pos logit contributions
 seventy+0.002
 eighty+0.002
 ninety+0.002
 fourteen+0.001
 enough+0.001
Neg logit contributions
 neighbourhood-0.001
 world-0.001
ini-0.001
 neighborhood-0.001
ag-0.001
Token said
Token ID531
Feature activation-5.84e-01

Pos logit contributions
 eighty+0.003
 seventy+0.003
 ninety+0.003
 fourteen+0.003
 forty+0.002
Neg logit contributions
 neighborhood-0.002
 neighbourhood-0.002
iet-0.002
ling-0.002
ag-0.002
Token it
Token ID340
Feature activation+2.94e-01

Pos logit contributions
ag+0.024
 neighbourhood+0.023
 world+0.023
ini+0.023
 neighborhood+0.023
Neg logit contributions
 eighty-0.027
 seventy-0.027
 ninety-0.025
 fourteen-0.023
 enough-0.021
Token was
Token ID373
Feature activation+1.30e-01

Pos logit contributions
 eighty+0.015
 seventy+0.015
 ninety+0.014
 fourteen+0.013
 forty+0.012
Neg logit contributions
 world-0.011
 neighborhood-0.011
ag-0.011
ling-0.011
 neighbourhood-0.011
Token not
Token ID407
Feature activation-2.09e-01

Pos logit contributions
 eighty+0.005
 seventy+0.005
 ninety+0.004
 fourteen+0.004
 enough+0.004
Neg logit contributions
ini-0.007
 neighbourhood-0.007
 world-0.006
 neighborhood-0.006
ag-0.006
Token his
Token ID465
Feature activation+8.45e-01

Pos logit contributions
ag+0.008
 world+0.008
ling+0.008
 outdoors+0.008
agging+0.007
Neg logit contributions
 seventy-0.010
 eighty-0.010
 ninety-0.010
 fourteen-0.009
 forty-0.008
Token fault
Token ID8046
Feature activation+2.81e+00

Pos logit contributions
 eighty+0.033
 seventy+0.033
 ninety+0.032
 fourteen+0.028
 forty+0.025
Neg logit contributions
 world-0.041
 neighbourhood-0.041
 neighborhood-0.040
ini-0.040
ag-0.040
Token.
Token ID13
Feature activation-2.66e-01

Pos logit contributions
 seventy+0.163
 eighty+0.162
 ninety+0.157
 fourteen+0.145
 enough+0.143
Neg logit contributions
 neighbourhood-0.081
ini-0.080
 neighborhood-0.078
iet-0.078
 world-0.077
Token She
Token ID1375
Feature activation-1.59e-01

Pos logit contributions
ini+0.009
ag+0.009
ling+0.008
iet+0.008
 world+0.008
Neg logit contributions
 seventy-0.015
 eighty-0.015
 ninety-0.014
 fourteen-0.013
 Fl-0.013
Token said
Token ID531
Feature activation-2.11e-01

Pos logit contributions
 neighborhood+0.005
ini+0.005
 neighbourhood+0.005
iet+0.005
ling+0.005
Neg logit contributions
 eighty-0.008
 seventy-0.008
 ninety-0.008
 fourteen-0.008
 sixty-0.007
Token the
Token ID262
Feature activation-3.22e-01

Pos logit contributions
 world+0.008
ag+0.008
 neighbourhood+0.008
ling+0.008
 neighborhood+0.008
Neg logit contributions
 seventy-0.009
 eighty-0.009
 ninety-0.009
 fourteen-0.008
 sixty-0.007
Token bird
Token ID6512
Feature activation+3.90e-02

Pos logit contributions
ini+0.012
ag+0.012
iet+0.011
 neighbourhood+0.011
ling+0.011
Neg logit contributions
 eighty-0.016
 seventy-0.016
 ninety-0.016
 fourteen-0.014
 enough-0.014
Token felt
Token ID2936
Feature activation+4.71e-01

Pos logit contributions
 neighbourhood+0.000
ini+0.000
 world+0.000
 neighborhood+0.000
ag+0.000
Neg logit contributions
 eighty-0.000
 seventy-0.000
 ninety-0.000
 fourteen-0.000
 enough-0.000
Token like
Token ID588
Feature activation+1.16e+00

Pos logit contributions
 eighty+0.024
 seventy+0.024
 ninety+0.023
 fourteen+0.021
 enough+0.021
Neg logit contributions
ini-0.017
 neighbourhood-0.016
 neighborhood-0.016
 world-0.016
iet-0.016
Token it
Token ID340
Feature activation+3.24e-01

Pos logit contributions
 seventy+0.055
 eighty+0.055
 ninety+0.052
 fourteen+0.048
 forty+0.044
Neg logit contributions
 world-0.047
 neighbourhood-0.045
 neighborhood-0.044
 outdoors-0.043
ag-0.042
Token was
Token ID373
Feature activation+5.88e-01

Pos logit contributions
 eighty+0.015
 seventy+0.015
 ninety+0.014
 fourteen+0.012
 forty+0.011
Neg logit contributions
 world-0.014
 neighbourhood-0.014
ini-0.014
 neighborhood-0.014
ag-0.014
Token her
Token ID607
Feature activation+1.90e-02

Pos logit contributions
 eighty+0.026
 seventy+0.026
 ninety+0.025
 fourteen+0.022
 enough+0.021
Neg logit contributions
 neighbourhood-0.025
 world-0.024
ini-0.024
ag-0.024
 neighborhood-0.024
Token fault
Token ID8046
Feature activation+2.81e+00

Pos logit contributions
 eighty+0.001
 seventy+0.001
 ninety+0.001
 fourteen+0.001
 enough+0.001
Neg logit contributions
ini-0.001
 neighbourhood-0.001
ag-0.001
 world-0.001
 neighborhood-0.001
Token,
Token ID11
Feature activation+5.33e-02

Pos logit contributions
 eighty+0.157
 seventy+0.157
 ninety+0.152
 fourteen+0.139
 pret+0.132
Neg logit contributions
 world-0.087
ini-0.086
 neighbourhood-0.085
ag-0.084
ling-0.083
Token like
Token ID588
Feature activation+5.46e-01

Pos logit contributions
 seventy+0.002
 eighty+0.002
 ninety+0.002
 fourteen+0.002
 forty+0.002
Neg logit contributions
 world-0.002
 neighbourhood-0.002
 neighborhood-0.002
ini-0.002
ag-0.002
Token she
Token ID673
Feature activation-5.63e-01

Pos logit contributions
 eighty+0.024
 seventy+0.024
 ninety+0.023
 fourteen+0.020
 forty+0.019
Neg logit contributions
 world-0.024
 neighbourhood-0.023
 neighborhood-0.022
ag-0.022
 outdoors-0.022
Token had
Token ID550
Feature activation+2.94e-03

Pos logit contributions
 world+0.024
 neighbourhood+0.024
 neighborhood+0.024
ini+0.024
ag+0.023
Neg logit contributions
 eighty-0.025
 seventy-0.025
 ninety-0.024
 fourteen-0.021
 forty-0.020
Token done
Token ID1760
Feature activation+5.54e-01

Pos logit contributions
 eighty+0.000
 seventy+0.000
 ninety+0.000
 fourteen+0.000
 forty+0.000
Neg logit contributions
 neighbourhood-0.000
 world-0.000
 neighborhood-0.000
ag-0.000
iet-0.000
Token!
Token ID0
Feature activation+4.45e-01

Pos logit contributions
 world+0.007
 neighbourhood+0.007
 neighborhood+0.007
ling+0.006
ini+0.006
Neg logit contributions
 eighty-0.015
 seventy-0.015
 ninety-0.014
 fourteen-0.013
 forty-0.012
Token
Token ID198
Feature activation+2.41e-01

Pos logit contributions
 seventy+0.023
 eighty+0.023
 ninety+0.022
 fourteen+0.020
 enough+0.019
Neg logit contributions
ini-0.015
ag-0.015
 neighbourhood-0.015
 world-0.015
iet-0.015
Token
Token ID198
Feature activation+2.70e-01

Pos logit contributions
 eighty+0.011
 seventy+0.011
 Frost+0.010
 ninety+0.010
 Fl+0.010
Neg logit contributions
rys-0.009
ag-0.009
ini-0.009
iet-0.009
 nation-0.009
TokenLittle
Token ID22253
Feature activation+2.70e-01

Pos logit contributions
 eighty+0.013
 seventy+0.013
 ninety+0.013
 fourteen+0.012
 Frost+0.011
Neg logit contributions
ag-0.010
 world-0.009
 neighbourhood-0.009
ini-0.009
agging-0.009
Token John
Token ID1757
Feature activation+2.22e-01

Pos logit contributions
 eighty+0.010
 seventy+0.010
 ninety+0.009
 fourteen+0.008
 enough+0.008
Neg logit contributions
ini-0.013
 neighbourhood-0.013
ag-0.013
 world-0.013
 neighborhood-0.013
Token mun
Token ID29856
Feature activation+2.78e+00

Pos logit contributions
 eighty+0.011
 seventy+0.011
 ninety+0.011
 fourteen+0.010
 um+0.009
Neg logit contributions
 neighbourhood-0.008
 neighborhood-0.008
ling-0.008
 world-0.008
ini-0.008
Tokenched
Token ID1740
Feature activation+1.54e+00

Pos logit contributions
 eighty+0.133
 seventy+0.130
 ninety+0.130
 pret+0.116
 Rob+0.115
Neg logit contributions
ling-0.112
ag-0.111
agging-0.101
iet-0.099
rys-0.092
Token away
Token ID1497
Feature activation+1.12e+00

Pos logit contributions
 seventy+0.085
 eighty+0.085
 ninety+0.081
 fourteen+0.073
 forty+0.069
Neg logit contributions
ling-0.052
ag-0.051
 world-0.050
 neighbourhood-0.049
ini-0.047
Token happily
Token ID18177
Feature activation+8.10e-01

Pos logit contributions
 seventy+0.058
 eighty+0.058
 ninety+0.055
 fourteen+0.048
 pret+0.047
Neg logit contributions
ling-0.042
 world-0.040
 neighbourhood-0.038
ag-0.038
iet-0.037
Token,
Token ID11
Feature activation+5.02e-01

Pos logit contributions
 eighty+0.040
 seventy+0.040
 ninety+0.038
 fourteen+0.035
 pret+0.032
Neg logit contributions
 neighbourhood-0.030
 world-0.030
ag-0.030
ini-0.030
 neighborhood-0.030
Token welcoming
Token ID22516
Feature activation-1.11e-02

Pos logit contributions
 eighty+0.023
 seventy+0.023
 ninety+0.022
 fourteen+0.020
 pret+0.019
Neg logit contributions
 neighbourhood-0.020
 neighborhood-0.020
 world-0.019
ling-0.019
ini-0.018
Token W
Token ID370
Feature activation-1.68e-01

Pos logit contributions
 eighty+0.025
 seventy+0.024
 ninety+0.024
 fourteen+0.021
 enough+0.021
Neg logit contributions
 world-0.020
 neighbourhood-0.019
ini-0.019
 neighborhood-0.019
umpy-0.019
TokenOO
Token ID6684
Feature activation+2.62e+00

Pos logit contributions
 neighbourhood+0.006
ini+0.006
 world+0.006
 neighborhood+0.006
ag+0.006
Neg logit contributions
 eighty-0.008
 seventy-0.008
 ninety-0.008
 fourteen-0.007
 enough-0.007
TokenF
Token ID37
Feature activation-6.25e-02

Pos logit contributions
 eighty+0.115
 seventy+0.114
 ninety+0.109
 fourteen+0.090
 pret+0.090
Neg logit contributions
 world-0.114
ini-0.111
ag-0.110
iet-0.110
 neighbourhood-0.110
Token!
Token ID0
Feature activation+6.50e-01

Pos logit contributions
ini+0.002
 neighbourhood+0.002
ag+0.002
 world+0.002
 neighborhood+0.002
Neg logit contributions
 eighty-0.003
 seventy-0.003
 ninety-0.003
 fourteen-0.003
 enough-0.003
Token W
Token ID370
Feature activation-1.52e-01

Pos logit contributions
 eighty+0.031
 seventy+0.030
 ninety+0.029
 fourteen+0.027
 enough+0.026
Neg logit contributions
 world-0.025
ini-0.024
 neighbourhood-0.024
 neighborhood-0.024
umpy-0.023
TokenOO
Token ID6684
Feature activation+2.74e+00

Pos logit contributions
 neighbourhood+0.006
ini+0.006
 world+0.006
 neighborhood+0.006
ag+0.005
Neg logit contributions
 eighty-0.007
 seventy-0.007
 ninety-0.007
 fourteen-0.007
 enough-0.006
TokenF
Token ID37
Feature activation-4.26e-02

Pos logit contributions
 eighty+0.122
 seventy+0.121
 ninety+0.117
 pret+0.097
 fourteen+0.096
Neg logit contributions
 world-0.118
ini-0.115
umpy-0.114
 neighbourhood-0.113
ag-0.113
Token!"
Token ID2474
Feature activation+4.06e-01

Pos logit contributions
ini+0.002
 neighbourhood+0.002
 world+0.002
ag+0.002
 neighborhood+0.002
Neg logit contributions
 eighty-0.002
 seventy-0.002
 ninety-0.002
 fourteen-0.002
 enough-0.002
Token
Token ID198
Feature activation+4.37e-02

Pos logit contributions
 seventy+0.013
 eighty+0.013
 ninety+0.012
 fourteen+0.010
 enough+0.009
Neg logit contributions
 neighbourhood-0.023
 world-0.022
ag-0.022
 neighborhood-0.022
ini-0.022
Token
Token ID198
Feature activation+4.49e-02

Pos logit contributions
 Fl+0.002
 seventy+0.002
 eighty+0.002
 Frost+0.002
 ninety+0.002
Neg logit contributions
ini-0.002
agging-0.002
 neighbourhood-0.002
iet-0.002
 neighborhood-0.002
TokenBen
Token ID11696
Feature activation+8.44e-01

Pos logit contributions
 eighty+0.002
 seventy+0.002
 ninety+0.002
 Fl+0.002
 fourteen+0.002
Neg logit contributions
ag-0.002
ini-0.002
 neighbourhood-0.002
 world-0.002
 neighborhood-0.002
Token mouse
Token ID10211
Feature activation-5.56e-01

Pos logit contributions
ag+0.013
ini+0.012
 neighbourhood+0.012
 neighborhood+0.012
agging+0.011
Neg logit contributions
 eighty-0.016
 seventy-0.016
 ninety-0.015
 fourteen-0.014
 enough-0.014
Token watched
Token ID7342
Feature activation-2.79e-01

Pos logit contributions
ling+0.021
 neighbourhood+0.020
 neighborhood+0.020
 world+0.019
ag+0.018
Neg logit contributions
 eighty-0.027
 seventy-0.027
 ninety-0.027
 fourteen-0.025
 kidding-0.023
Token as
Token ID355
Feature activation-1.25e-02

Pos logit contributions
ag+0.013
 world+0.013
ling+0.013
 neighbourhood+0.012
ini+0.012
Neg logit contributions
 eighty-0.011
 seventy-0.011
 ninety-0.011
 fourteen-0.009
 pret-0.009
Token she
Token ID673
Feature activation-4.44e-01

Pos logit contributions
 world+0.001
 neighbourhood+0.001
ini+0.001
 neighborhood+0.001
ag+0.001
Neg logit contributions
 seventy-0.001
 eighty-0.001
 ninety-0.000
 fourteen-0.000
 enough-0.000
Token happily
Token ID18177
Feature activation+3.01e-01

Pos logit contributions
 neighbourhood+0.019
 neighborhood+0.019
ling+0.018
iet+0.017
 world+0.017
Neg logit contributions
 eighty-0.018
 seventy-0.018
 ninety-0.017
 fourteen-0.016
 fault-0.016
Token mun
Token ID29856
Feature activation+2.72e+00

Pos logit contributions
 eighty+0.011
 seventy+0.011
 ninety+0.010
 fourteen+0.009
 Rob+0.009
Neg logit contributions
ling-0.014
ag-0.014
 neighbourhood-0.013
 neighborhood-0.013
 world-0.013
Tokenched
Token ID1740
Feature activation+1.27e+00

Pos logit contributions
 eighty+0.115
 seventy+0.114
 ninety+0.111
 fourteen+0.093
 pret+0.092
Neg logit contributions
ag-0.122
ling-0.121
iet-0.115
agging-0.113
 world-0.112
Token away
Token ID1497
Feature activation+8.71e-01

Pos logit contributions
 eighty+0.065
 seventy+0.065
 ninety+0.062
 fourteen+0.057
 enough+0.053
Neg logit contributions
 neighbourhood-0.045
 world-0.045
ini-0.045
ag-0.045
 neighborhood-0.044
Token on
Token ID319
Feature activation+3.47e-01

Pos logit contributions
 eighty+0.047
 seventy+0.047
 ninety+0.045
 fourteen+0.041
 pret+0.038
Neg logit contributions
 world-0.029
ling-0.029
ag-0.028
 neighbourhood-0.028
iet-0.027
Token the
Token ID262
Feature activation+7.06e-01

Pos logit contributions
 eighty+0.016
 seventy+0.016
 ninety+0.015
 fourteen+0.013
 pret+0.013
Neg logit contributions
 world-0.014
 neighbourhood-0.014
ag-0.013
ling-0.013
iet-0.013
Token y
Token ID331
Feature activation+2.47e-01

Pos logit contributions
 seventy+0.035
 eighty+0.034
 ninety+0.033
 fourteen+0.030
 Fl+0.029
Neg logit contributions
 neighbourhood-0.026
 neighborhood-0.026
ag-0.025
ini-0.025
 world-0.025
Token bad
Token ID2089
Feature activation-1.45e-01

Pos logit contributions
iet+0.018
ini+0.018
ag+0.018
ling+0.017
 neighbourhood+0.016
Neg logit contributions
 eighty-0.032
 seventy-0.032
 ninety-0.032
 enough-0.028
 fourteen-0.028
Token car
Token ID1097
Feature activation+2.32e-01

Pos logit contributions
ini+0.005
 neighbourhood+0.005
ag+0.005
iet+0.005
 neighborhood+0.005
Neg logit contributions
 eighty-0.008
 seventy-0.008
 ninety-0.008
 fourteen-0.007
 enough-0.007
Token was
Token ID373
Feature activation-9.94e-02

Pos logit contributions
 seventy+0.011
 eighty+0.011
 ninety+0.010
 fourteen+0.010
 forty+0.009
Neg logit contributions
 world-0.009
 neighbourhood-0.009
 neighborhood-0.009
ini-0.009
ag-0.009
Token not
Token ID407
Feature activation-3.11e-01

Pos logit contributions
ag+0.004
 world+0.004
 neighbourhood+0.004
 neighborhood+0.004
ling+0.004
Neg logit contributions
 eighty-0.004
 seventy-0.004
 ninety-0.004
 fourteen-0.004
 pret-0.004
Token his
Token ID465
Feature activation+7.77e-01

Pos logit contributions
ag+0.012
 world+0.012
 outdoors+0.011
 neighborhood+0.011
ling+0.011
Neg logit contributions
 seventy-0.015
 eighty-0.015
 ninety-0.014
 fourteen-0.014
 forty-0.012
Token fault
Token ID8046
Feature activation+2.72e+00

Pos logit contributions
 seventy+0.047
 eighty+0.047
 ninety+0.044
 fourteen+0.043
 Frost+0.039
Neg logit contributions
 world-0.024
 neighborhood-0.022
 neighbourhood-0.022
ag-0.020
 house-0.019
Token.
Token ID13
Feature activation-9.82e-02

Pos logit contributions
 eighty+0.171
 seventy+0.171
 ninety+0.166
 fourteen+0.154
 forty+0.146
Neg logit contributions
 world-0.064
 neighbourhood-0.064
ini-0.063
ag-0.063
 neighborhood-0.062
Token She
Token ID1375
Feature activation-5.80e-02

Pos logit contributions
ini+0.003
ag+0.003
 neighbourhood+0.003
ling+0.003
iet+0.003
Neg logit contributions
 seventy-0.005
 eighty-0.005
 ninety-0.005
 fourteen-0.005
 Fl-0.004
Token had
Token ID550
Feature activation-8.98e-03

Pos logit contributions
 neighborhood+0.002
 neighbourhood+0.002
iet+0.002
 world+0.002
ling+0.002
Neg logit contributions
 eighty-0.003
 seventy-0.003
 ninety-0.003
 fourteen-0.003
ward-0.002
Token to
Token ID284
Feature activation+1.55e-01

Pos logit contributions
 neighborhood+0.000
 neighbourhood+0.000
ag+0.000
 world+0.000
 town+0.000
Neg logit contributions
 seventy-0.000
 eighty-0.000
 ninety-0.000
 fourteen-0.000
 Frost-0.000
Token make
Token ID787
Feature activation-2.07e-01

Pos logit contributions
 eighty+0.007
 seventy+0.007
 ninety+0.007
 fourteen+0.006
 enough+0.006
Neg logit contributions
 neighbourhood-0.006
ini-0.006
 world-0.006
ag-0.006
 neighborhood-0.006
Token
Token ID198
Feature activation+6.54e-02

Pos logit contributions
 eighty+0.003
 seventy+0.003
 ninety+0.003
 fourteen+0.003
 enough+0.003
Neg logit contributions
 neighbourhood-0.002
 neighborhood-0.002
 world-0.002
ini-0.002
ag-0.002
Token
Token ID198
Feature activation+1.15e-01

Pos logit contributions
 eighty+0.003
 seventy+0.003
 ninety+0.003
 fourteen+0.003
 Fl+0.003
Neg logit contributions
ini-0.002
iet-0.002
 world-0.002
ag-0.002
 neighbourhood-0.002
TokenThe
Token ID464
Feature activation+4.07e-02

Pos logit contributions
 eighty+0.006
 seventy+0.006
 ninety+0.006
 fourteen+0.006
 enough+0.005
Neg logit contributions
ini-0.004
 neighbourhood-0.004
ag-0.004
 world-0.004
 neighborhood-0.004
Token bunny
Token ID44915
Feature activation-6.14e-02

Pos logit contributions
 eighty+0.002
 seventy+0.002
 ninety+0.002
 fourteen+0.002
 enough+0.001
Neg logit contributions
ini-0.002
 neighbourhood-0.002
ag-0.002
 world-0.002
 neighborhood-0.002
Token happily
Token ID18177
Feature activation+5.12e-01

Pos logit contributions
 neighbourhood+0.002
ling+0.002
 neighborhood+0.002
 world+0.002
ini+0.002
Neg logit contributions
 eighty-0.003
 seventy-0.003
 ninety-0.003
 fourteen-0.003
 um-0.002
Token mun
Token ID29856
Feature activation+2.72e+00

Pos logit contributions
 eighty+0.023
 seventy+0.023
 ninety+0.022
 fourteen+0.021
 pret+0.020
Neg logit contributions
ling-0.020
 neighbourhood-0.019
ag-0.019
 neighborhood-0.018
iet-0.018
Tokenched
Token ID1740
Feature activation+1.46e+00

Pos logit contributions
 eighty+0.121
 seventy+0.120
 ninety+0.118
 pret+0.106
 Rob+0.102
Neg logit contributions
ling-0.117
ag-0.117
agging-0.109
iet-0.106
 world-0.100
Token on
Token ID319
Feature activation+7.18e-01

Pos logit contributions
 seventy+0.079
 eighty+0.078
 ninety+0.075
 fourteen+0.068
 forty+0.064
Neg logit contributions
ling-0.051
ag-0.050
 world-0.049
 neighbourhood-0.047
 outdoors-0.045
Token his
Token ID465
Feature activation+1.17e+00

Pos logit contributions
 eighty+0.034
 seventy+0.034
 ninety+0.032
 pret+0.028
 fourteen+0.028
Neg logit contributions
ling-0.026
 world-0.026
ag-0.026
 neighbourhood-0.025
 nature-0.024
Token carrot
Token ID40562
Feature activation+1.45e+00

Pos logit contributions
 eighty+0.050
 seventy+0.050
 ninety+0.048
 fourteen+0.045
 enough+0.040
Neg logit contributions
 world-0.051
ag-0.051
ling-0.051
ini-0.050
 neighbourhood-0.050
Token and
Token ID290
Feature activation+4.44e-01

Pos logit contributions
 eighty+0.069
 seventy+0.069
 ninety+0.066
 fourteen+0.061
 pret+0.056
Neg logit contributions
 world-0.057
ling-0.057
ag-0.056
 neighbourhood-0.056
ini-0.055
Token said
Token ID531
Feature activation-5.44e-01

Pos logit contributions
 eighty+0.003
 seventy+0.003
 ninety+0.003
 fourteen+0.003
 forty+0.003
Neg logit contributions
 neighborhood-0.003
 neighbourhood-0.003
iet-0.003
ag-0.003
 world-0.003
Token it
Token ID340
Feature activation+3.32e-01

Pos logit contributions
ini+0.025
 neighbourhood+0.025
 world+0.024
 neighborhood+0.024
ag+0.024
Neg logit contributions
 seventy-0.023
 eighty-0.023
 ninety-0.022
 fourteen-0.020
 enough-0.019
Token was
Token ID373
Feature activation+1.37e-01

Pos logit contributions
 eighty+0.018
 seventy+0.017
 ninety+0.017
 fourteen+0.016
 forty+0.014
Neg logit contributions
 world-0.012
 neighborhood-0.011
 neighbourhood-0.011
ling-0.011
ag-0.011
Token not
Token ID407
Feature activation-2.10e-01

Pos logit contributions
 eighty+0.005
 seventy+0.005
 ninety+0.005
 fourteen+0.004
 enough+0.004
Neg logit contributions
ini-0.007
 neighbourhood-0.007
 world-0.007
 neighborhood-0.007
ag-0.007
Token their
Token ID511
Feature activation+5.74e-01

Pos logit contributions
ag+0.007
 world+0.007
 outdoors+0.006
ling+0.006
 neighborhood+0.006
Neg logit contributions
 seventy-0.012
 eighty-0.011
 ninety-0.011
 fourteen-0.011
six-0.010
Token fault
Token ID8046
Feature activation+2.72e+00

Pos logit contributions
 seventy+0.027
 eighty+0.027
 ninety+0.026
 fourteen+0.023
 forty+0.021
Neg logit contributions
 neighbourhood-0.024
 world-0.023
 neighborhood-0.023
ini-0.023
ag-0.023
Token.
Token ID13
Feature activation-3.01e-01

Pos logit contributions
 seventy+0.160
 eighty+0.160
 ninety+0.154
 fourteen+0.144
 enough+0.140
Neg logit contributions
 neighbourhood-0.075
ini-0.075
 neighborhood-0.073
 world-0.073
ag-0.072
Token She
Token ID1375
Feature activation-1.04e-01

Pos logit contributions
ini+0.012
ling+0.011
ag+0.011
iet+0.011
 world+0.010
Neg logit contributions
 seventy-0.015
 eighty-0.015
 ninety-0.015
 Fl-0.014
 fourteen-0.014
Token said
Token ID531
Feature activation-2.15e-01

Pos logit contributions
 neighborhood+0.003
 neighbourhood+0.003
ini+0.003
iet+0.003
ling+0.003
Neg logit contributions
 seventy-0.006
 eighty-0.006
 ninety-0.005
 fourteen-0.005
 sixty-0.005
Token the
Token ID262
Feature activation-3.56e-01

Pos logit contributions
 world+0.008
ag+0.008
 neighbourhood+0.008
 neighborhood+0.008
ini+0.008
Neg logit contributions
 seventy-0.010
 eighty-0.010
 ninety-0.010
 fourteen-0.009
 forty-0.008
Token truck
Token ID7779
Feature activation-3.50e-01

Pos logit contributions
ini+0.013
ag+0.013
 neighbourhood+0.013
iet+0.013
 world+0.012
Neg logit contributions
 eighty-0.018
 seventy-0.018
 ninety-0.017
 fourteen-0.016
 enough-0.015
Token in
Token ID287
Feature activation-3.07e-01

Pos logit contributions
 eighty+0.012
 seventy+0.012
 ninety+0.012
 fourteen+0.011
 pret+0.010
Neg logit contributions
 world-0.008
ini-0.008
ag-0.008
ling-0.008
 neighbourhood-0.008
Token his
Token ID465
Feature activation+5.27e-01

Pos logit contributions
 world+0.013
 neighbourhood+0.012
 neighborhood+0.012
unts+0.012
iet+0.012
Neg logit contributions
 eighty-0.014
 seventy-0.014
 ninety-0.013
 fourteen-0.012
 pret-0.011
Token hands
Token ID2832
Feature activation-6.25e-01

Pos logit contributions
 eighty+0.025
 seventy+0.025
 ninety+0.024
 fourteen+0.022
 pret+0.020
Neg logit contributions
 neighbourhood-0.021
 world-0.021
 neighborhood-0.021
ag-0.020
ini-0.020
Token.
Token ID13
Feature activation-5.92e-02

Pos logit contributions
ling+0.011
ag+0.011
 world+0.011
 neighbourhood+0.010
ini+0.010
Neg logit contributions
 eighty-0.044
 seventy-0.043
 ninety-0.042
 fourteen-0.039
 pret-0.038
Token He
Token ID679
Feature activation+3.04e-01

Pos logit contributions
 neighbourhood+0.002
 neighborhood+0.002
 world+0.002
ag+0.002
ini+0.001
Neg logit contributions
 eighty-0.003
 seventy-0.003
 ninety-0.003
 fourteen-0.003
 enough-0.003
Token mun
Token ID29856
Feature activation+2.71e+00

Pos logit contributions
 eighty+0.016
 seventy+0.015
 ninety+0.015
 fourteen+0.015
 fault+0.015
Neg logit contributions
ling-0.010
 neighbourhood-0.009
 neighborhood-0.009
iet-0.008
 world-0.008
Tokenched
Token ID1740
Feature activation+1.41e+00

Pos logit contributions
 eighty+0.132
 ninety+0.130
 seventy+0.129
 Rob+0.118
 pret+0.115
Neg logit contributions
ling-0.105
ag-0.105
agging-0.096
iet-0.093
ilt-0.086
Token on
Token ID319
Feature activation+8.45e-01

Pos logit contributions
 seventy+0.084
 eighty+0.083
 ninety+0.080
 fourteen+0.072
 sixty+0.070
Neg logit contributions
ling-0.046
ag-0.041
 world-0.038
 outdoors-0.036
 around-0.036
Token it
Token ID340
Feature activation+8.55e-01

Pos logit contributions
 eighty+0.043
 seventy+0.043
 ninety+0.041
 Mechan+0.038
 pret+0.037
Neg logit contributions
ling-0.028
ag-0.026
 world-0.026
 nature-0.025
 outdoors-0.024
Token happily
Token ID18177
Feature activation+6.82e-01

Pos logit contributions
 eighty+0.047
 seventy+0.047
 ninety+0.045
 fourteen+0.042
 pret+0.041
Neg logit contributions
 world-0.028
ling-0.028
ag-0.026
 outdoors-0.025
 neighbourhood-0.024
Token.
Token ID13
Feature activation-1.65e-02

Pos logit contributions
 eighty+0.038
 seventy+0.038
 ninety+0.036
 fourteen+0.034
 pret+0.032
Neg logit contributions
 world-0.021
ag-0.021
 neighbourhood-0.021
ling-0.020
 neighborhood-0.020
Token and
Token ID290
Feature activation+1.27e-01

Pos logit contributions
 neighbourhood+0.008
 neighborhood+0.008
ini+0.008
ling+0.008
iet+0.008
Neg logit contributions
 eighty-0.011
 seventy-0.011
 ninety-0.010
 fourteen-0.009
 Frost-0.009
Token her
Token ID607
Feature activation-2.08e-01

Pos logit contributions
 eighty+0.007
 seventy+0.007
 ninety+0.006
 fourteen+0.006
 forty+0.005
Neg logit contributions
 neighbourhood-0.004
 neighborhood-0.004
 world-0.004
ag-0.004
ini-0.004
Token mom
Token ID1995
Feature activation+7.55e-02

Pos logit contributions
iet+0.009
ag+0.009
ini+0.009
ling+0.009
 neighbourhood+0.009
Neg logit contributions
 seventy-0.009
 eighty-0.009
 ninety-0.008
 enough-0.007
 Fl-0.007
Token sorted
Token ID23243
Feature activation-1.55e-01

Pos logit contributions
 seventy+0.003
 eighty+0.003
 ninety+0.003
 fourteen+0.003
 enough+0.003
Neg logit contributions
 world-0.003
ag-0.003
ini-0.003
 neighbourhood-0.003
iet-0.003
Token the
Token ID262
Feature activation-8.52e-01

Pos logit contributions
 neighbourhood+0.007
 world+0.007
 neighborhood+0.006
ag+0.006
ini+0.006
Neg logit contributions
 eighty-0.007
 seventy-0.007
 ninety-0.007
 fourteen-0.006
 pret-0.006
Token av
Token ID1196
Feature activation-3.01e+00

Pos logit contributions
 neighbourhood+0.035
 world+0.034
ini+0.034
 neighborhood+0.034
ag+0.034
Neg logit contributions
 eighty-0.039
 seventy-0.039
 ninety-0.038
 fourteen-0.034
 enough-0.032
Tokenoc
Token ID420
Feature activation-9.39e-01

Pos logit contributions
ag+0.137
 neighbourhood+0.136
 neighborhood+0.134
ini+0.134
iet+0.133
Neg logit contributions
 seventy-0.123
 eighty-0.123
 ninety-0.117
 fourteen-0.101
 enough-0.100
Tokenados
Token ID22484
Feature activation-1.56e+00

Pos logit contributions
ini+0.040
 neighbourhood+0.040
 world+0.040
 neighborhood+0.040
ag+0.039
Neg logit contributions
 eighty-0.041
 seventy-0.041
 ninety-0.039
 fourteen-0.035
 enough-0.033
Token into
Token ID656
Feature activation-4.66e-01

Pos logit contributions
 neighbourhood+0.056
 world+0.056
 neighborhood+0.055
ini+0.055
ag+0.054
Neg logit contributions
 eighty-0.080
 seventy-0.080
 ninety-0.076
 fourteen-0.070
 forty-0.065
Token piles
Token ID30935
Feature activation-6.28e-01

Pos logit contributions
 neighbourhood+0.019
 world+0.019
 neighborhood+0.019
ini+0.018
iet+0.017
Neg logit contributions
 eighty-0.022
 seventy-0.022
 ninety-0.021
 fourteen-0.019
 pret-0.017
Token according
Token ID1864
Feature activation-4.38e-02

Pos logit contributions
 neighbourhood+0.017
ini+0.017
 world+0.017
ag+0.017
 neighborhood+0.017
Neg logit contributions
 eighty-0.037
 seventy-0.037
 ninety-0.036
 fourteen-0.033
 enough-0.032
Token and
Token ID290
Feature activation-2.28e-02

Pos logit contributions
 seventy+0.022
 eighty+0.022
 ninety+0.021
 fourteen+0.020
 forty+0.018
Neg logit contributions
 world-0.016
 neighbourhood-0.016
ag-0.015
ini-0.015
 neighborhood-0.015
Token went
Token ID1816
Feature activation+1.84e-01

Pos logit contributions
 neighbourhood+0.001
 neighborhood+0.001
 world+0.001
ling+0.001
ini+0.001
Neg logit contributions
 eighty-0.001
 seventy-0.001
 ninety-0.001
 fourteen-0.001
 forty-0.001
Token to
Token ID284
Feature activation-3.66e-01

Pos logit contributions
 eighty+0.006
 seventy+0.006
 ninety+0.006
 enough+0.006
 Fl+0.005
Neg logit contributions
ini-0.010
iet-0.010
 world-0.009
 neighbourhood-0.009
 neighborhood-0.009
Token apologize
Token ID16521
Feature activation-4.63e-01

Pos logit contributions
 neighbourhood+0.013
 world+0.013
 neighborhood+0.013
ag+0.013
 outdoors+0.012
Neg logit contributions
 eighty-0.019
 seventy-0.019
 ninety-0.018
 fourteen-0.017
 forty-0.015
Token to
Token ID284
Feature activation-1.21e-01

Pos logit contributions
 world+0.016
ag+0.016
 neighbourhood+0.015
ling+0.015
 neighborhood+0.015
Neg logit contributions
 seventy-0.026
 eighty-0.025
 ninety-0.024
 fourteen-0.023
 forty-0.021
Token Rabb
Token ID22502
Feature activation-2.91e+00

Pos logit contributions
 world+0.005
 neighbourhood+0.005
ag+0.005
 neighborhood+0.005
ling+0.005
Neg logit contributions
 seventy-0.005
 eighty-0.005
 ninety-0.005
 fourteen-0.004
 enough-0.004
Tokenie
Token ID494
Feature activation-6.50e-01

Pos logit contributions
 neighborhood+0.107
 neighbourhood+0.106
 hurricane+0.100
 world+0.100
ini+0.099
Neg logit contributions
 eighty-0.127
 seventy-0.123
 ninety-0.122
 enough-0.112
 fourteen-0.107
Token.
Token ID13
Feature activation+3.72e-02

Pos logit contributions
 neighbourhood+0.022
 neighborhood+0.022
ini+0.022
ag+0.021
iet+0.021
Neg logit contributions
 eighty-0.034
 seventy-0.034
 ninety-0.033
 enough-0.030
 Fl-0.030
Token Fox
Token ID5426
Feature activation-2.59e-01

Pos logit contributions
 eighty+0.002
 seventy+0.002
 ninety+0.002
 fourteen+0.002
 enough+0.002
Neg logit contributions
 neighbourhood-0.001
 world-0.001
 neighborhood-0.001
ini-0.001
ag-0.001
Tokeny
Token ID88
Feature activation+4.33e-01

Pos logit contributions
ling+0.008
ini+0.008
 neighbourhood+0.008
 world+0.007
 neighborhood+0.007
Neg logit contributions
 eighty-0.015
 seventy-0.015
 ninety-0.015
 fourteen-0.014
 forty-0.013
Token said
Token ID531
Feature activation+3.16e-01

Pos logit contributions
 eighty+0.022
 seventy+0.022
 ninety+0.021
 fourteen+0.020
 forty+0.019
Neg logit contributions
 neighbourhood-0.016
 neighborhood-0.016
ling-0.016
 world-0.015
iet-0.015
Token You
Token ID921
Feature activation-2.80e-01

Pos logit contributions
ling+0.009
 neighbourhood+0.009
ini+0.009
 neighborhood+0.009
 world+0.009
Neg logit contributions
 seventy-0.009
 eighty-0.009
 ninety-0.008
 Fl-0.007
 fourteen-0.007
Token have
Token ID423
Feature activation+2.78e-01

Pos logit contributions
ag+0.011
 world+0.011
 neighbourhood+0.011
ini+0.011
 neighborhood+0.011
Neg logit contributions
 seventy-0.014
 eighty-0.014
 ninety-0.013
 fourteen-0.012
 forty-0.011
Token a
Token ID257
Feature activation+4.04e-01

Pos logit contributions
 seventy+0.014
 eighty+0.014
 ninety+0.013
 fourteen+0.012
 forty+0.011
Neg logit contributions
 neighborhood-0.010
 world-0.010
 neighbourhood-0.010
ag-0.010
 language-0.009
Token rabbit
Token ID22746
Feature activation-6.29e-01

Pos logit contributions
 eighty+0.022
 seventy+0.022
 ninety+0.021
 fourteen+0.019
 enough+0.018
Neg logit contributions
 neighbourhood-0.013
ini-0.013
 world-0.013
ag-0.013
 neighborhood-0.013
Token.
Token ID13
Feature activation-2.28e-01

Pos logit contributions
 world+0.019
ag+0.019
 neighbourhood+0.018
 neighborhood+0.018
ini+0.018
Neg logit contributions
 eighty-0.036
 seventy-0.036
 ninety-0.035
 fourteen-0.032
 forty-0.030
Token Rabb
Token ID22502
Feature activation-2.90e+00

Pos logit contributions
 neighbourhood+0.009
ini+0.009
 world+0.009
 neighborhood+0.009
ag+0.009
Neg logit contributions
 seventy-0.011
 eighty-0.011
 ninety-0.010
 fourteen-0.009
 Fl-0.009
Tokenits
Token ID896
Feature activation+4.30e-01

Pos logit contributions
 neighbourhood+0.090
 neighborhood+0.089
 world+0.081
 outdoors+0.078
ini+0.077
Neg logit contributions
 eighty-0.152
 ninety-0.148
 seventy-0.148
 fourteen-0.143
 enough-0.133
Token are
Token ID389
Feature activation-1.90e-02

Pos logit contributions
 seventy+0.022
 eighty+0.022
 ninety+0.021
 fourteen+0.018
 forty+0.018
Neg logit contributions
ag-0.017
 world-0.016
iet-0.016
ling-0.016
umpy-0.014
Token for
Token ID329
Feature activation-8.61e-02

Pos logit contributions
ag+0.001
 world+0.001
ling+0.001
iet+0.001
 neighbourhood+0.001
Neg logit contributions
 seventy-0.001
 eighty-0.001
 ninety-0.001
 fourteen-0.001
 forty-0.001
Token babies
Token ID11903
Feature activation+1.60e-01

Pos logit contributions
 world+0.003
ag+0.003
 neighbourhood+0.003
 neighborhood+0.003
 nature+0.003
Neg logit contributions
 eighty-0.005
 seventy-0.005
 ninety-0.004
 fourteen-0.004
 um-0.004
Token.
Token ID13
Feature activation-1.76e-01

Pos logit contributions
 eighty+0.008
 seventy+0.008
 ninety+0.008
 fourteen+0.007
 enough+0.007
Neg logit contributions
 neighbourhood-0.005
ini-0.005
 world-0.005
ag-0.005
 neighborhood-0.005
Token "
Token ID366
Feature activation-1.04e-02

Pos logit contributions
 eighty+0.040
 seventy+0.040
 ninety+0.039
 fourteen+0.036
 enough+0.034
Neg logit contributions
 world-0.026
 neighbourhood-0.025
 neighborhood-0.025
ag-0.024
ling-0.024
TokenI
Token ID40
Feature activation+6.28e-01

Pos logit contributions
ag+0.000
 neighbourhood+0.000
ini+0.000
 neighborhood+0.000
 world+0.000
Neg logit contributions
 seventy-0.001
 eighty-0.001
 ninety-0.001
 fourteen-0.000
 enough-0.000
Token'm
Token ID1101
Feature activation+6.95e-01

Pos logit contributions
 seventy+0.029
 eighty+0.029
 ninety+0.028
 fourteen+0.026
 forty+0.024
Neg logit contributions
 world-0.026
 neighbourhood-0.026
ini-0.026
 neighborhood-0.025
ag-0.025
Token sorry
Token ID7926
Feature activation-7.80e-01

Pos logit contributions
 seventy+0.042
 eighty+0.042
 ninety+0.041
 fourteen+0.038
 pret+0.036
Neg logit contributions
ag-0.018
 world-0.017
 neighbourhood-0.017
ini-0.017
 neighborhood-0.017
Token,
Token ID11
Feature activation-1.40e-01

Pos logit contributions
 world+0.032
ling+0.031
 neighbourhood+0.030
ini+0.030
 neighborhood+0.030
Neg logit contributions
 eighty-0.036
 seventy-0.036
 ninety-0.035
 fourteen-0.032
 forty-0.029
Token Rabb
Token ID22502
Feature activation-2.75e+00

Pos logit contributions
 world+0.006
ini+0.005
 neighbourhood+0.005
ling+0.005
 neighborhood+0.005
Neg logit contributions
 eighty-0.006
 seventy-0.006
 ninety-0.006
 fourteen-0.006
 pret-0.005
Tokenie
Token ID494
Feature activation-6.26e-01

Pos logit contributions
 neighbourhood+0.085
 neighborhood+0.085
 world+0.078
ini+0.075
 outdoors+0.074
Neg logit contributions
 eighty-0.147
 seventy-0.144
 ninety-0.142
 fourteen-0.130
 enough-0.124
Token.
Token ID13
Feature activation+6.20e-02

Pos logit contributions
 neighbourhood+0.023
ag+0.023
 neighborhood+0.023
iet+0.022
ini+0.022
Neg logit contributions
 seventy-0.031
 eighty-0.031
 ninety-0.029
 enough-0.027
 fourteen-0.027
Token I
Token ID314
Feature activation-1.22e-02

Pos logit contributions
 seventy+0.002
 eighty+0.002
 ninety+0.002
 fourteen+0.002
 Fl+0.002
Neg logit contributions
iet-0.003
ag-0.003
 neighbourhood-0.003
 neighborhood-0.003
ling-0.003
Token didn
Token ID1422
Feature activation-1.28e-01

Pos logit contributions
 neighbourhood+0.001
ini+0.001
 world+0.001
ag+0.001
 neighborhood+0.001
Neg logit contributions
 eighty-0.000
 seventy-0.000
 ninety-0.000
 fourteen-0.000
 enough-0.000
Token't
Token ID470
Feature activation-4.48e-01

Pos logit contributions
ag+0.005
 world+0.005
ini+0.005
 neighbourhood+0.005
 neighborhood+0.005
Neg logit contributions
 seventy-0.006
 eighty-0.006
 ninety-0.006
 fourteen-0.005
 enough-0.005
Token and
Token ID290
Feature activation+5.56e-01

Pos logit contributions
ag+0.015
 world+0.014
ling+0.014
 neighbourhood+0.014
iet+0.014
Neg logit contributions
 seventy-0.019
 eighty-0.019
 ninety-0.018
 fourteen-0.016
 pret-0.016
Token try
Token ID1949
Feature activation-1.12e-01

Pos logit contributions
 eighty+0.026
 seventy+0.026
 ninety+0.025
 fourteen+0.023
 enough+0.022
Neg logit contributions
 neighbourhood-0.021
ag-0.021
 world-0.021
 neighborhood-0.021
iet-0.020
Token to
Token ID284
Feature activation-5.22e-01

Pos logit contributions
 world+0.004
ag+0.004
 neighbourhood+0.004
ling+0.004
 neighborhood+0.004
Neg logit contributions
 eighty-0.006
 seventy-0.006
 ninety-0.005
 fourteen-0.005
 um-0.004
Token grab
Token ID5552
Feature activation-6.38e-01

Pos logit contributions
 neighbourhood+0.021
ini+0.021
 world+0.021
ag+0.021
 neighborhood+0.021
Neg logit contributions
 eighty-0.024
 seventy-0.024
 ninety-0.023
 fourteen-0.021
 enough-0.020
Token the
Token ID262
Feature activation-5.80e-01

Pos logit contributions
ag+0.023
 world+0.023
ling+0.022
iet+0.022
 neighbourhood+0.021
Neg logit contributions
 eighty-0.032
 seventy-0.032
 ninety-0.030
 fourteen-0.026
 Mechan-0.025
Token av
Token ID1196
Feature activation-2.74e+00

Pos logit contributions
 neighbourhood+0.023
ini+0.023
ag+0.023
 neighborhood+0.023
 world+0.022
Neg logit contributions
 eighty-0.027
 seventy-0.027
 ninety-0.026
 fourteen-0.024
 enough-0.023
Tokenoc
Token ID420
Feature activation-4.75e-01

Pos logit contributions
 neighbourhood+0.121
ini+0.120
 world+0.119
ag+0.119
 neighborhood+0.119
Neg logit contributions
 eighty-0.116
 seventy-0.116
 ninety-0.110
 fourteen-0.099
 enough-0.093
Tokenados
Token ID22484
Feature activation-1.12e+00

Pos logit contributions
 neighbourhood+0.021
ini+0.021
 world+0.021
ag+0.021
 neighborhood+0.021
Neg logit contributions
 eighty-0.020
 seventy-0.020
 ninety-0.019
 fourteen-0.017
 enough-0.016
Token.
Token ID13
Feature activation-2.79e-01

Pos logit contributions
 world+0.031
ag+0.031
ling+0.030
 neighbourhood+0.030
 neighborhood+0.029
Neg logit contributions
 eighty-0.065
 seventy-0.065
 ninety-0.062
 fourteen-0.057
 pret-0.055
Token But
Token ID887
Feature activation-1.26e-01

Pos logit contributions
 neighbourhood+0.007
 world+0.007
 neighborhood+0.007
ag+0.007
ini+0.006
Neg logit contributions
 eighty-0.017
 seventy-0.017
 ninety-0.016
 fourteen-0.015
 enough-0.014
Token the
Token ID262
Feature activation-4.64e-01

Pos logit contributions
 nature+0.003
ling+0.003
 hurricane+0.003
 town+0.003
 nation+0.003
Neg logit contributions
 eighty-0.007
 seventy-0.007
 ninety-0.007
 Mechan-0.006
seven-0.006
Token It
Token ID632
Feature activation+6.24e-01

Pos logit contributions
 eighty+0.012
 seventy+0.012
 ninety+0.011
 fourteen+0.010
 enough+0.010
Neg logit contributions
 neighbourhood-0.007
 world-0.007
 neighborhood-0.007
ini-0.007
ag-0.007
Token turned
Token ID2900
Feature activation+9.28e-01

Pos logit contributions
 eighty+0.031
 seventy+0.031
 ninety+0.030
 fourteen+0.027
 forty+0.026
Neg logit contributions
ling-0.025
 world-0.023
ini-0.023
 neighbourhood-0.023
iet-0.023
Token out
Token ID503
Feature activation+7.26e-01

Pos logit contributions
 eighty+0.047
 seventy+0.047
 ninety+0.045
 fourteen+0.041
 forty+0.038
Neg logit contributions
ling-0.033
ag-0.033
 world-0.033
 neighbourhood-0.033
ini-0.033
Token that
Token ID326
Feature activation-5.16e-01

Pos logit contributions
 eighty+0.032
 seventy+0.031
 ninety+0.030
 fourteen+0.027
 enough+0.025
Neg logit contributions
 neighbourhood-0.031
 world-0.031
ini-0.031
ag-0.031
 neighborhood-0.031
Token the
Token ID262
Feature activation-7.40e-01

Pos logit contributions
 world+0.022
 neighbourhood+0.020
 neighborhood+0.020
ling+0.019
ini+0.019
Neg logit contributions
 seventy-0.025
 eighty-0.025
 ninety-0.024
 fourteen-0.021
 forty-0.020
Token av
Token ID1196
Feature activation-2.70e+00

Pos logit contributions
ag+0.024
ini+0.024
 neighbourhood+0.023
iet+0.023
 neighborhood+0.023
Neg logit contributions
 eighty-0.040
 seventy-0.040
 ninety-0.038
 fourteen-0.035
 enough-0.034
Tokenoc
Token ID420
Feature activation-2.82e-01

Pos logit contributions
 neighbourhood+0.117
ini+0.117
 world+0.116
ag+0.115
 neighborhood+0.115
Neg logit contributions
 eighty-0.117
 seventy-0.117
 ninety-0.111
 fourteen-0.100
 enough-0.094
Tokenados
Token ID22484
Feature activation-9.27e-01

Pos logit contributions
ini+0.012
 neighbourhood+0.012
 world+0.011
 neighborhood+0.011
ag+0.011
Neg logit contributions
 eighty-0.013
 seventy-0.013
 ninety-0.012
 fourteen-0.011
 Fl-0.010
Token were
Token ID547
Feature activation+8.29e-01

Pos logit contributions
 world+0.041
 neighbourhood+0.041
 neighborhood+0.040
ling+0.040
ini+0.040
Neg logit contributions
 eighty-0.041
 seventy-0.040
 ninety-0.039
 fourteen-0.034
 forty-0.033
Token bad
Token ID2089
Feature activation+2.42e-01

Pos logit contributions
 eighty+0.040
 seventy+0.040
 ninety+0.039
 fourteen+0.034
 pret+0.033
Neg logit contributions
 world-0.032
ling-0.032
 neighbourhood-0.031
ag-0.031
 neighborhood-0.031
Token and
Token ID290
Feature activation+1.54e-01

Pos logit contributions
 eighty+0.014
 seventy+0.014
 ninety+0.013
 fourteen+0.012
 forty+0.011
Neg logit contributions
 world-0.008
ling-0.008
 neighbourhood-0.008
 neighborhood-0.008
ag-0.007
Token make
Token ID787
Feature activation-5.43e-01

Pos logit contributions
 neighbourhood+0.031
 neighborhood+0.031
 world+0.031
ag+0.030
ini+0.030
Neg logit contributions
 seventy-0.038
 eighty-0.038
 ninety-0.037
 fourteen-0.033
 forty-0.030
Token it
Token ID340
Feature activation-2.03e-01

Pos logit contributions
ag+0.023
 world+0.023
 neighbourhood+0.022
ini+0.022
 neighborhood+0.022
Neg logit contributions
 eighty-0.024
 seventy-0.024
 ninety-0.023
 fourteen-0.020
 pret-0.018
Token whole
Token ID2187
Feature activation-1.34e-01

Pos logit contributions
 world+0.006
ag+0.006
ling+0.006
ini+0.006
 house+0.006
Neg logit contributions
 seventy-0.011
 eighty-0.011
 ninety-0.011
 fourteen-0.010
 Rob-0.010
Token again
Token ID757
Feature activation-2.69e-01

Pos logit contributions
 world+0.006
 neighborhood+0.005
 neighbourhood+0.005
ling+0.005
umpy+0.005
Neg logit contributions
 seventy-0.007
 eighty-0.007
 ninety-0.007
 Nib-0.006
 fourteen-0.006
Token.
Token ID13
Feature activation-4.36e-01

Pos logit contributions
ini+0.011
 neighbourhood+0.011
ag+0.011
 world+0.011
 neighborhood+0.011
Neg logit contributions
 eighty-0.012
 seventy-0.012
 ninety-0.012
 fourteen-0.011
 enough-0.010
Token Amen
Token ID34717
Feature activation-2.67e+00

Pos logit contributions
 neighbourhood+0.017
ini+0.017
 world+0.017
 neighborhood+0.017
ag+0.017
Neg logit contributions
 eighty-0.020
 seventy-0.020
 ninety-0.019
 fourteen-0.017
 enough-0.017
Token."
Token ID526
Feature activation+1.44e-02

Pos logit contributions
ini+0.110
 neighbourhood+0.109
 world+0.108
 neighborhood+0.107
ag+0.107
Neg logit contributions
 eighty-0.121
 seventy-0.121
 ninety-0.116
 fourteen-0.105
 enough-0.099
Token
Token ID198
Feature activation-4.02e-01

Pos logit contributions
 eighty+0.001
 seventy+0.001
 ninety+0.001
 fourteen+0.001
 enough+0.001
Neg logit contributions
 neighbourhood-0.000
 world-0.000
ini-0.000
 neighborhood-0.000
ag-0.000
Token
Token ID198
Feature activation-3.52e-01

Pos logit contributions
ini+0.020
ling+0.019
agging+0.019
 world+0.019
 neighbourhood+0.019
Neg logit contributions
 eighty-0.014
 seventy-0.014
 Fl-0.014
 Frost-0.013
 ninety-0.013
TokenThen
Token ID6423
Feature activation+6.59e-02

Pos logit contributions
ini+0.016
ag+0.016
 world+0.016
ling+0.015
agging+0.015
Neg logit contributions
 eighty-0.015
 seventy-0.015
 ninety-0.013
 Fl-0.013
 fourteen-0.012
Token,
Token ID11
Feature activation-3.73e-01

Pos logit contributions
 seventy+0.003
 eighty+0.003
 ninety+0.003
 fourteen+0.003
 forty+0.003
Neg logit contributions
 neighbourhood-0.002
 neighborhood-0.002
 world-0.002
 nation-0.002
iet-0.002
Token the
Token ID262
Feature activation-2.07e-01

Pos logit contributions
ling+0.025
iet+0.024
 world+0.024
 neighbourhood+0.024
ini+0.024
Neg logit contributions
 eighty-0.039
 seventy-0.039
 ninety-0.037
 mun-0.031
 pret-0.031
Token dolls
Token ID36062
Feature activation-5.87e-01

Pos logit contributions
ag+0.008
ini+0.008
 neighbourhood+0.008
 neighborhood+0.008
iet+0.008
Neg logit contributions
 seventy-0.010
 eighty-0.010
 ninety-0.010
 fourteen-0.009
 enough-0.008
Token and
Token ID290
Feature activation-2.53e-01

Pos logit contributions
ini+0.025
 neighbourhood+0.025
 world+0.025
ag+0.025
 neighborhood+0.024
Neg logit contributions
 seventy-0.026
 eighty-0.026
 ninety-0.025
 fourteen-0.022
 enough-0.021
Token the
Token ID262
Feature activation-2.54e-01

Pos logit contributions
 neighbourhood+0.009
 neighborhood+0.009
 world+0.009
iet+0.008
ling+0.008
Neg logit contributions
 eighty-0.013
 seventy-0.013
 ninety-0.012
 fourteen-0.011
 Frost-0.010
Token nap
Token ID25422
Feature activation+2.10e-01

Pos logit contributions
ini+0.010
 neighbourhood+0.010
ag+0.010
 world+0.010
 neighborhood+0.010
Neg logit contributions
 seventy-0.012
 eighty-0.012
 ninety-0.011
 fourteen-0.010
 enough-0.010
Tokenkins
Token ID5331
Feature activation-2.65e+00

Pos logit contributions
 seventy+0.012
 eighty+0.012
 ninety+0.011
 fourteen+0.010
 enough+0.010
Neg logit contributions
 neighbourhood-0.006
ini-0.006
 neighborhood-0.006
 world-0.006
ag-0.006
Token inside
Token ID2641
Feature activation-1.99e-01

Pos logit contributions
 neighbourhood+0.096
 world+0.096
ini+0.095
ag+0.095
 neighborhood+0.095
Neg logit contributions
 eighty-0.132
 seventy-0.132
 ninety-0.126
 fourteen-0.115
 enough-0.108
Token.
Token ID13
Feature activation-1.73e-01

Pos logit contributions
ini+0.006
 neighbourhood+0.006
ag+0.006
iet+0.006
 neighborhood+0.006
Neg logit contributions
 seventy-0.011
 eighty-0.011
 ninety-0.010
 fourteen-0.010
 enough-0.009
Token She
Token ID1375
Feature activation-8.33e-02

Pos logit contributions
ag+0.006
ini+0.006
ling+0.006
agging+0.006
iet+0.006
Neg logit contributions
 seventy-0.009
 eighty-0.009
 ninety-0.008
 Fl-0.008
 fourteen-0.008
Token realized
Token ID6939
Feature activation+3.23e-01

Pos logit contributions
 neighborhood+0.003
iet+0.003
 neighbourhood+0.003
ini+0.003
ling+0.003
Neg logit contributions
 eighty-0.004
 seventy-0.004
 ninety-0.004
 fourteen-0.004
 enough-0.003
Token that
Token ID326
Feature activation-2.69e-01

Pos logit contributions
 eighty+0.016
 seventy+0.015
 ninety+0.015
 fourteen+0.013
 pret+0.012
Neg logit contributions
 neighbourhood-0.012
 world-0.012
 neighborhood-0.012
ini-0.012
iet-0.012
Token so
Token ID523
Feature activation+6.68e-02

Pos logit contributions
 ninety+0.008
 seventy+0.008
 eighty+0.008
 fourteen+0.007
 um+0.007
Neg logit contributions
 neighbourhood-0.007
 neighborhood-0.007
 world-0.007
ag-0.006
 outdoors-0.006
Token happy
Token ID3772
Feature activation+2.92e-01

Pos logit contributions
 seventy+0.004
 eighty+0.004
 ninety+0.004
 fourteen+0.003
 enough+0.003
Neg logit contributions
 neighbourhood-0.002
ini-0.002
 world-0.002
 neighborhood-0.002
ag-0.002
Token and
Token ID290
Feature activation-7.86e-02

Pos logit contributions
 eighty+0.015
 seventy+0.015
 ninety+0.014
 fourteen+0.013
 forty+0.012
Neg logit contributions
 world-0.011
 neighbourhood-0.011
 neighborhood-0.011
ini-0.011
ag-0.011
Token brought
Token ID3181
Feature activation+2.61e-01

Pos logit contributions
 neighbourhood+0.003
ini+0.003
 world+0.003
 neighborhood+0.003
ag+0.003
Neg logit contributions
 eighty-0.004
 seventy-0.004
 ninety-0.004
 fourteen-0.003
 enough-0.003
Token the
Token ID262
Feature activation-9.60e-01

Pos logit contributions
 seventy+0.010
 eighty+0.010
 ninety+0.009
 fourteen+0.008
 enough+0.008
Neg logit contributions
ini-0.013
 neighbourhood-0.013
 world-0.013
 neighborhood-0.013
ag-0.013
Token av
Token ID1196
Feature activation-2.63e+00

Pos logit contributions
iet+0.038
ag+0.038
ini+0.037
 neighbourhood+0.036
 neighborhood+0.035
Neg logit contributions
 eighty-0.046
 seventy-0.046
 ninety-0.044
 fourteen-0.039
 enough-0.038
Tokenoc
Token ID420
Feature activation-3.31e-01

Pos logit contributions
 neighbourhood+0.129
 neighborhood+0.128
ag+0.128
iet+0.127
 world+0.127
Neg logit contributions
 eighty-0.100
 seventy-0.099
 ninety-0.095
 fourteen-0.079
 enough-0.077
Tokenados
Token ID22484
Feature activation-1.21e+00

Pos logit contributions
iet+0.016
 neighbourhood+0.015
ag+0.015
 world+0.015
 neighborhood+0.015
Neg logit contributions
 seventy-0.013
 eighty-0.013
 ninety-0.013
 fourteen-0.011
 enough-0.010
Token home
Token ID1363
Feature activation-8.64e-02

Pos logit contributions
 world+0.044
 neighbourhood+0.043
ag+0.043
 neighborhood+0.042
ini+0.042
Neg logit contributions
 seventy-0.060
 eighty-0.060
 ninety-0.058
 fourteen-0.054
 pret-0.051
Token to
Token ID284
Feature activation-3.35e-01

Pos logit contributions
 neighbourhood+0.002
ini+0.002
 world+0.002
ag+0.002
 neighborhood+0.002
Neg logit contributions
 eighty-0.005
 seventy-0.005
 ninety-0.005
 fourteen-0.005
 pret-0.004
Token share
Token ID2648
Feature activation+5.79e-02

Pos logit contributions
ini+0.016
 neighbourhood+0.015
ag+0.015
iet+0.015
 neighborhood+0.015
Neg logit contributions
 seventy-0.014
 eighty-0.014
 ninety-0.013
 fourteen-0.012
 enough-0.011
Token set
Token ID900
Feature activation-3.50e-01

Pos logit contributions
 neighbourhood+0.023
 neighborhood+0.023
 world+0.023
ag+0.022
ini+0.022
Neg logit contributions
 eighty-0.024
 seventy-0.024
 ninety-0.023
 fourteen-0.020
 forty-0.019
Token the
Token ID262
Feature activation-8.28e-01

Pos logit contributions
ini+0.016
 neighbourhood+0.016
 neighborhood+0.016
 world+0.016
ag+0.015
Neg logit contributions
 seventy-0.015
 eighty-0.015
 ninety-0.014
 fourteen-0.013
 enough-0.012
Token plates
Token ID13375
Feature activation-8.34e-01

Pos logit contributions
ini+0.036
ag+0.035
 neighbourhood+0.035
iet+0.034
 neighborhood+0.034
Neg logit contributions
 eighty-0.036
 seventy-0.036
 ninety-0.035
 fourteen-0.031
 enough-0.029
Token and
Token ID290
Feature activation+2.58e-02

Pos logit contributions
 world+0.025
 neighbourhood+0.025
ag+0.025
ini+0.025
 neighborhood+0.025
Neg logit contributions
 eighty-0.046
 seventy-0.046
 ninety-0.045
 fourteen-0.041
 forty-0.039
Token nap
Token ID25422
Feature activation+1.14e-01

Pos logit contributions
 eighty+0.001
 seventy+0.001
 ninety+0.001
 fourteen+0.001
 enough+0.001
Neg logit contributions
 neighbourhood-0.001
 neighborhood-0.001
 world-0.001
 nation-0.001
 town-0.001
Tokenkins
Token ID5331
Feature activation-2.61e+00

Pos logit contributions
 seventy+0.007
 eighty+0.007
 ninety+0.006
 fourteen+0.006
 enough+0.006
Neg logit contributions
ini-0.003
 neighbourhood-0.003
 neighborhood-0.003
ag-0.003
 world-0.003
Token.
Token ID13
Feature activation-4.88e-01

Pos logit contributions
 world+0.066
ag+0.065
 neighbourhood+0.065
 neighborhood+0.064
ling+0.063
Neg logit contributions
 seventy-0.159
 eighty-0.158
 ninety-0.154
 fourteen-0.143
 forty-0.136
Token They
Token ID1119
Feature activation+3.22e-01

Pos logit contributions
ini+0.016
ag+0.016
ling+0.016
iet+0.015
 world+0.015
Neg logit contributions
 eighty-0.027
 seventy-0.027
 ninety-0.026
 fourteen-0.024
 Fl-0.023
Token waited
Token ID13488
Feature activation-4.92e-01

Pos logit contributions
 seventy+0.016
 eighty+0.016
 ninety+0.015
 fourteen+0.014
iously+0.014
Neg logit contributions
 neighborhood-0.011
iet-0.011
 neighbourhood-0.011
 nation-0.011
ling-0.010
Token for
Token ID329
Feature activation-2.14e-01

Pos logit contributions
 world+0.022
ag+0.022
ling+0.022
agging+0.022
iet+0.021
Neg logit contributions
 eighty-0.020
 seventy-0.020
 ninety-0.020
 fourteen-0.017
 Rob-0.016
Token the
Token ID262
Feature activation-3.13e-01

Pos logit contributions
 world+0.009
 neighbourhood+0.009
ag+0.009
 neighborhood+0.009
iet+0.009
Neg logit contributions
 eighty-0.010
 seventy-0.010
 ninety-0.009
 fourteen-0.008
 pret-0.008
Token and
Token ID290
Feature activation-3.40e-01

Pos logit contributions
ini+0.014
 neighbourhood+0.014
 world+0.014
 neighborhood+0.014
ag+0.014
Neg logit contributions
 seventy-0.015
 eighty-0.015
 ninety-0.015
 fourteen-0.013
 enough-0.013
Token pick
Token ID2298
Feature activation-2.05e-01

Pos logit contributions
ini+0.015
 neighbourhood+0.014
 world+0.014
 neighborhood+0.014
ag+0.014
Neg logit contributions
 seventy-0.015
 eighty-0.015
 ninety-0.014
 fourteen-0.013
 enough-0.013
Token up
Token ID510
Feature activation-2.11e-01

Pos logit contributions
ling+0.007
ag+0.007
 world+0.007
ini+0.006
 neighbourhood+0.006
Neg logit contributions
 eighty-0.011
 seventy-0.011
 ninety-0.011
 fourteen-0.010
 forty-0.009
Token their
Token ID511
Feature activation-2.19e-01

Pos logit contributions
ag+0.009
 neighbourhood+0.009
 world+0.009
 neighborhood+0.009
ini+0.009
Neg logit contributions
 seventy-0.009
 eighty-0.009
 ninety-0.009
 fourteen-0.008
 forty-0.007
Token nap
Token ID25422
Feature activation+1.22e-01

Pos logit contributions
ini+0.009
 neighbourhood+0.009
 world+0.009
ag+0.009
 neighborhood+0.009
Neg logit contributions
 eighty-0.010
 seventy-0.010
 ninety-0.009
 fourteen-0.008
 enough-0.008
Tokenkins
Token ID5331
Feature activation-2.61e+00

Pos logit contributions
 eighty+0.007
 seventy+0.007
 ninety+0.007
 fourteen+0.006
 enough+0.006
Neg logit contributions
ini-0.004
 neighbourhood-0.004
 world-0.004
 neighborhood-0.003
ag-0.003
Token.
Token ID13
Feature activation-1.72e-01

Pos logit contributions
 world+0.055
 neighbourhood+0.054
ling+0.053
 neighborhood+0.052
ag+0.052
Neg logit contributions
 eighty-0.168
 seventy-0.168
 ninety-0.164
 fourteen-0.152
 forty-0.147
Token They
Token ID1119
Feature activation+3.69e-01

Pos logit contributions
ini+0.005
 neighbourhood+0.005
ag+0.005
 world+0.005
 neighborhood+0.005
Neg logit contributions
 seventy-0.010
 eighty-0.010
 ninety-0.009
 fourteen-0.009
 Fl-0.008
Token said
Token ID531
Feature activation-2.68e-01

Pos logit contributions
 eighty+0.016
 seventy+0.016
 ninety+0.015
 fourteen+0.014
 enough+0.013
Neg logit contributions
ini-0.016
 neighbourhood-0.016
 world-0.016
ag-0.015
 neighborhood-0.015
Token that
Token ID326
Feature activation-1.70e-01

Pos logit contributions
ag+0.009
 world+0.009
 nation+0.008
ling+0.008
 neighbourhood+0.008
Neg logit contributions
 seventy-0.013
 eighty-0.013
 ninety-0.013
 fourteen-0.011
 sixty-0.011
Token nap
Token ID25422
Feature activation+5.73e-01

Pos logit contributions
 world+0.006
 neighborhood+0.006
 neighbourhood+0.006
 nation+0.006
 nature+0.006
Neg logit contributions
 seventy-0.008
 eighty-0.008
 ninety-0.008
 fourteen-0.007
iously-0.007
Token.
Token ID13
Feature activation-1.72e-01

Pos logit contributions
ini+0.024
iet+0.023
 neighbourhood+0.023
 neighborhood+0.023
ag+0.023
Neg logit contributions
 seventy-0.034
 eighty-0.033
 ninety-0.032
 enough-0.031
 fourteen-0.030
Token She
Token ID1375
Feature activation-7.99e-02

Pos logit contributions
ini+0.006
ag+0.006
ling+0.006
 world+0.005
 neighbourhood+0.005
Neg logit contributions
 seventy-0.009
 eighty-0.009
 ninety-0.009
 fourteen-0.008
 Fl-0.008
Token used
Token ID973
Feature activation-2.43e-01

Pos logit contributions
 neighbourhood+0.003
 neighborhood+0.003
iet+0.002
 world+0.002
 house+0.002
Neg logit contributions
 eighty-0.004
 seventy-0.004
 ninety-0.004
 fourteen-0.004
 forty-0.004
Token a
Token ID257
Feature activation+1.76e-02

Pos logit contributions
ag+0.009
 world+0.008
 neighbourhood+0.008
 outdoors+0.008
 lens+0.008
Neg logit contributions
 eighty-0.012
 seventy-0.012
 ninety-0.011
 fourteen-0.010
 pret-0.010
Token m
Token ID285
Feature activation-1.07e+00

Pos logit contributions
 seventy+0.001
 eighty+0.001
 ninety+0.001
 fourteen+0.001
 enough+0.001
Neg logit contributions
ini-0.001
 neighbourhood-0.001
ag-0.001
 world-0.001
 neighborhood-0.001
Tokenitt
Token ID715
Feature activation-2.60e+00

Pos logit contributions
 neighbourhood+0.031
ini+0.030
 world+0.030
ag+0.030
 neighborhood+0.030
Neg logit contributions
 eighty-0.062
 seventy-0.062
 ninety-0.060
 fourteen-0.055
 enough-0.053
Token to
Token ID284
Feature activation-1.55e-01

Pos logit contributions
ling+0.073
ag+0.073
ini+0.071
 world+0.069
iet+0.069
Neg logit contributions
 eighty-0.152
 seventy-0.152
 ninety-0.145
 fourteen-0.132
 Fl-0.130
Token take
Token ID1011
Feature activation-2.84e-01

Pos logit contributions
ag+0.006
ini+0.006
 neighbourhood+0.006
 world+0.006
iet+0.006
Neg logit contributions
 eighty-0.007
 seventy-0.007
 ninety-0.007
 fourteen-0.006
 enough-0.006
Token out
Token ID503
Feature activation+6.06e-02

Pos logit contributions
 world+0.012
ag+0.012
 outdoors+0.012
 neighbourhood+0.011
 town+0.011
Neg logit contributions
 eighty-0.013
 seventy-0.012
 ninety-0.012
 fourteen-0.010
 puck-0.009
Token the
Token ID262
Feature activation-7.28e-01

Pos logit contributions
 eighty+0.003
 seventy+0.003
 ninety+0.002
 fourteen+0.002
 pret+0.002
Neg logit contributions
 world-0.003
ag-0.003
 neighbourhood-0.003
iet-0.003
ling-0.003
Token pan
Token ID3425
Feature activation+2.37e-02

Pos logit contributions
 world+0.030
 neighbourhood+0.029
 neighborhood+0.028
ini+0.028
ag+0.027
Neg logit contributions
 eighty-0.034
 seventy-0.034
 ninety-0.033
 fourteen-0.029
 forty-0.028
Token came
Token ID1625
Feature activation+4.86e-01

Pos logit contributions
ini+0.041
 neighbourhood+0.040
 world+0.040
ag+0.040
 neighborhood+0.040
Neg logit contributions
 eighty-0.036
 seventy-0.036
 ninety-0.034
 fourteen-0.031
 enough-0.029
Token to
Token ID284
Feature activation-1.65e-01

Pos logit contributions
 seventy+0.019
 eighty+0.019
 ninety+0.018
 enough+0.017
 fourteen+0.017
Neg logit contributions
ini-0.022
ag-0.022
iet-0.021
 neighbourhood-0.021
 neighborhood-0.021
Token the
Token ID262
Feature activation+3.12e-02

Pos logit contributions
 world+0.006
 neighbourhood+0.006
 neighborhood+0.006
ag+0.006
ini+0.006
Neg logit contributions
 eighty-0.008
 seventy-0.008
 ninety-0.008
 fourteen-0.007
 enough-0.006
Token garden
Token ID11376
Feature activation-3.37e-01

Pos logit contributions
 seventy+0.002
 eighty+0.002
 ten+0.002
 fourteen+0.002
 ninety+0.002
Neg logit contributions
ag-0.001
agons-0.001
agging-0.001
ini-0.001
ling-0.001
Token.
Token ID13
Feature activation+1.73e-01

Pos logit contributions
 world+0.013
 neighbourhood+0.013
 neighborhood+0.013
ini+0.013
ag+0.012
Neg logit contributions
 eighty-0.017
 seventy-0.016
 ninety-0.016
 fourteen-0.014
 pret-0.013
Token Rabb
Token ID22502
Feature activation-2.60e+00

Pos logit contributions
 eighty+0.010
 seventy+0.010
 ninety+0.010
 fourteen+0.009
 enough+0.009
Neg logit contributions
 neighbourhood-0.005
 world-0.005
 neighborhood-0.005
ag-0.005
ini-0.004
Tokenits
Token ID896
Feature activation+8.23e-01

Pos logit contributions
 neighbourhood+0.071
ini+0.071
 world+0.070
 neighborhood+0.070
ag+0.068
Neg logit contributions
 eighty-0.154
 seventy-0.153
 ninety-0.148
 fourteen-0.138
 enough-0.132
Token,
Token ID11
Feature activation+1.39e-01

Pos logit contributions
 eighty+0.047
 seventy+0.047
 ninety+0.046
 fourteen+0.040
 forty+0.040
Neg logit contributions
ling-0.028
ag-0.026
iet-0.026
 world-0.025
 outdoors-0.024
Token birds
Token ID10087
Feature activation-5.60e-01

Pos logit contributions
 seventy+0.006
 eighty+0.006
 ninety+0.006
 fourteen+0.005
 enough+0.005
Neg logit contributions
 world-0.006
 neighbourhood-0.006
 neighborhood-0.006
ling-0.005
ini-0.005
Token and
Token ID290
Feature activation+4.21e-01

Pos logit contributions
 neighbourhood+0.023
ini+0.023
 world+0.023
ag+0.023
 neighborhood+0.023
Neg logit contributions
 eighty-0.025
 seventy-0.025
 ninety-0.024
 fourteen-0.022
 enough-0.020
Token butterflies
Token ID44571
Feature activation-2.52e-01

Pos logit contributions
 eighty+0.024
 seventy+0.024
 ninety+0.023
 fourteen+0.021
 enough+0.020
Neg logit contributions
 neighbourhood-0.013
 world-0.013
ini-0.013
 neighborhood-0.013
ag-0.012
Token,
Token ID11
Feature activation+7.39e-01

Pos logit contributions
 eighty+0.019
 seventy+0.019
 ninety+0.018
 fourteen+0.016
 enough+0.015
Neg logit contributions
ini-0.016
 world-0.016
 neighbourhood-0.016
ag-0.016
 neighborhood-0.016
Token "
Token ID366
Feature activation+1.33e-01

Pos logit contributions
 eighty+0.036
 seventy+0.036
 ninety+0.035
 fourteen+0.032
 pret+0.030
Neg logit contributions
 neighbourhood-0.027
 world-0.027
 neighborhood-0.026
ini-0.026
ling-0.026
TokenThank
Token ID10449
Feature activation+1.54e-01

Pos logit contributions
 eighty+0.007
 seventy+0.007
 ninety+0.006
 fourteen+0.006
 enough+0.005
Neg logit contributions
 neighbourhood-0.005
ag-0.005
ini-0.005
 world-0.005
 neighborhood-0.005
Token you
Token ID345
Feature activation+1.23e-01

Pos logit contributions
 seventy+0.006
 eighty+0.006
 ninety+0.006
 Fl+0.005
 fourteen+0.005
Neg logit contributions
 neighborhood-0.008
 neighbourhood-0.008
ini-0.008
iet-0.007
ag-0.007
Token.
Token ID13
Feature activation+3.93e-01

Pos logit contributions
 seventy+0.005
 eighty+0.005
 ninety+0.005
 fourteen+0.004
 pret+0.004
Neg logit contributions
 world-0.006
ini-0.006
 neighbourhood-0.006
ag-0.006
iet-0.005
Token Rabb
Token ID22502
Feature activation-2.59e+00

Pos logit contributions
 eighty+0.018
 seventy+0.017
 ninety+0.017
 fourteen+0.016
 enough+0.015
Neg logit contributions
ini-0.016
 world-0.016
 neighbourhood-0.016
 neighborhood-0.016
ag-0.015
Tokenits
Token ID896
Feature activation+8.28e-01

Pos logit contributions
 neighbourhood+0.075
 neighborhood+0.075
 world+0.071
ini+0.068
ag+0.065
Neg logit contributions
 eighty-0.145
 seventy-0.143
 ninety-0.141
 fourteen-0.132
 enough-0.125
Token are
Token ID389
Feature activation+5.81e-01

Pos logit contributions
 seventy+0.039
 eighty+0.039
 ninety+0.037
 fourteen+0.033
 forty+0.031
Neg logit contributions
 world-0.034
ag-0.034
ling-0.033
iet-0.033
 neighbourhood-0.033
Token calm
Token ID9480
Feature activation+3.51e-01

Pos logit contributions
 seventy+0.027
 eighty+0.027
 ninety+0.026
 fourteen+0.024
 pret+0.023
Neg logit contributions
 world-0.023
 neighbourhood-0.023
ag-0.022
ini-0.022
 neighborhood-0.022
Token creatures
Token ID8109
Feature activation-3.11e-01

Pos logit contributions
 eighty+0.016
 seventy+0.016
 ninety+0.015
 fourteen+0.014
 enough+0.013
Neg logit contributions
 neighbourhood-0.014
ini-0.014
 world-0.014
ag-0.014
 neighborhood-0.014
Token,
Token ID11
Feature activation+2.45e-01

Pos logit contributions
 world+0.013
 neighbourhood+0.013
ini+0.012
ag+0.012
 neighborhood+0.012
Neg logit contributions
 seventy-0.014
 eighty-0.014
 ninety-0.014
 fourteen-0.012
 forty-0.012
Token Thank
Token ID6952
Feature activation+2.58e-01

Pos logit contributions
ini+0.011
 neighbourhood+0.011
ag+0.011
 world+0.011
 neighborhood+0.011
Neg logit contributions
 seventy-0.017
 eighty-0.017
 ninety-0.016
 fourteen-0.015
 enough-0.014
Token you
Token ID345
Feature activation-1.62e-01

Pos logit contributions
 seventy+0.010
 eighty+0.010
 ninety+0.010
 fourteen+0.008
 enough+0.008
Neg logit contributions
ini-0.013
 neighbourhood-0.012
 neighborhood-0.012
 world-0.012
iet-0.012
Token,
Token ID11
Feature activation-1.94e-01

Pos logit contributions
 world+0.006
ag+0.006
iet+0.006
agging+0.006
 neighborhood+0.006
Neg logit contributions
 seventy-0.008
 eighty-0.008
 ninety-0.007
 pret-0.007
 fourteen-0.006
Token God
Token ID1793
Feature activation-5.01e-01

Pos logit contributions
 world+0.007
 neighbourhood+0.007
iet+0.007
 neighborhood+0.007
 nation+0.007
Neg logit contributions
 eighty-0.008
 seventy-0.008
 ninety-0.008
 fourteen-0.008
ward-0.007
Token.
Token ID13
Feature activation-2.17e-01

Pos logit contributions
 world+0.014
ling+0.013
 neighborhood+0.012
umpy+0.012
ag+0.011
Neg logit contributions
 seventy-0.029
 eighty-0.029
 ninety-0.029
 fourteen-0.026
 puck-0.026
Token Amen
Token ID34717
Feature activation-2.57e+00

Pos logit contributions
 neighbourhood+0.007
ini+0.007
 world+0.006
ag+0.006
 neighborhood+0.006
Neg logit contributions
 eighty-0.012
 seventy-0.012
 ninety-0.012
 fourteen-0.011
 enough-0.010
Token."
Token ID526
Feature activation+7.83e-02

Pos logit contributions
ini+0.117
 neighbourhood+0.117
 neighborhood+0.115
iet+0.114
 world+0.114
Neg logit contributions
 eighty-0.106
 seventy-0.106
 ninety-0.101
 fourteen-0.090
 enough-0.085
Token
Token ID198
Feature activation-2.25e-01

Pos logit contributions
 seventy+0.004
 eighty+0.004
 ninety+0.004
 fourteen+0.004
 enough+0.004
Neg logit contributions
 neighbourhood-0.002
 world-0.002
 neighborhood-0.002
ag-0.002
ini-0.002
Token
Token ID198
Feature activation-1.74e-01

Pos logit contributions
ini+0.010
iet+0.010
ling+0.010
 neighbourhood+0.010
ag+0.010
Neg logit contributions
 eighty-0.010
 seventy-0.009
 ninety-0.009
 Frost-0.009
 Fl-0.008
TokenHe
Token ID1544
Feature activation+6.87e-01

Pos logit contributions
ini+0.007
ag+0.007
 world+0.007
ling+0.007
iet+0.007
Neg logit contributions
 eighty-0.009
 seventy-0.009
 ninety-0.008
 Frost-0.007
 Fl-0.007
Token opened
Token ID4721
Feature activation+6.18e-02

Pos logit contributions
 eighty+0.031
 ninety+0.030
 seventy+0.030
 fourteen+0.030
ward+0.027
Neg logit contributions
iet-0.026
 neighborhood-0.025
 neighbourhood-0.025
ling-0.024
 world-0.024
Token
Token ID198
Feature activation-5.12e-01

Pos logit contributions
ini+0.027
ag+0.026
iet+0.025
ling+0.025
 neighbourhood+0.025
Neg logit contributions
 eighty-0.026
 seventy-0.026
 ninety-0.024
 Frost-0.023
 Fl-0.023
TokenThey
Token ID2990
Feature activation+7.04e-01

Pos logit contributions
ag+0.023
ini+0.023
 world+0.022
 neighbourhood+0.021
agging+0.021
Neg logit contributions
 eighty-0.022
 seventy-0.022
 ninety-0.020
 fourteen-0.019
 Frost-0.019
Token get
Token ID651
Feature activation-7.38e-01

Pos logit contributions
 eighty+0.033
 seventy+0.033
 ninety+0.031
 fourteen+0.028
 enough+0.027
Neg logit contributions
 neighbourhood-0.027
ag-0.027
iet-0.027
 world-0.027
 neighborhood-0.027
Token a
Token ID257
Feature activation-3.09e-01

Pos logit contributions
 neighbourhood+0.029
 world+0.028
ag+0.028
ini+0.028
 neighborhood+0.028
Neg logit contributions
 eighty-0.035
 seventy-0.035
 ninety-0.034
 fourteen-0.031
 forty-0.028
Token m
Token ID285
Feature activation-9.87e-01

Pos logit contributions
 world+0.014
 neighbourhood+0.013
ini+0.013
ag+0.013
 neighborhood+0.013
Neg logit contributions
 eighty-0.013
 seventy-0.013
 ninety-0.012
 fourteen-0.011
 enough-0.010
Tokenitt
Token ID715
Feature activation-2.56e+00

Pos logit contributions
 neighbourhood+0.035
 world+0.034
 neighborhood+0.034
iet+0.032
ag+0.031
Neg logit contributions
 seventy-0.051
 eighty-0.051
 ninety-0.049
 fourteen-0.047
 enough-0.043
Token and
Token ID290
Feature activation-2.08e-01

Pos logit contributions
ini+0.086
ag+0.085
 neighbourhood+0.085
 world+0.085
 neighborhood+0.083
Neg logit contributions
 eighty-0.134
 seventy-0.134
 ninety-0.129
 fourteen-0.118
 enough-0.112
Token a
Token ID257
Feature activation-1.20e-01

Pos logit contributions
 neighbourhood+0.008
 world+0.008
 neighborhood+0.008
iet+0.008
ag+0.008
Neg logit contributions
 eighty-0.009
 seventy-0.009
 ninety-0.008
 fourteen-0.007
 forty-0.007
Token spat
Token ID15246
Feature activation-1.34e-01

Pos logit contributions
 world+0.005
 neighbourhood+0.005
ini+0.005
 neighborhood+0.005
ag+0.005
Neg logit contributions
 eighty-0.005
 seventy-0.005
 ninety-0.005
 fourteen-0.004
 enough-0.004
Tokenula
Token ID4712
Feature activation-5.86e-01

Pos logit contributions
ag+0.005
 world+0.005
ling+0.005
iet+0.005
agging+0.004
Neg logit contributions
 eighty-0.007
 seventy-0.007
 ninety-0.007
 fourteen-0.006
 forty-0.006
Token.
Token ID13
Feature activation-4.13e-01

Pos logit contributions
ling+0.014
 world+0.013
ini+0.013
ag+0.013
 house+0.012
Neg logit contributions
 eighty-0.038
 seventy-0.037
 ninety-0.036
 forty-0.030
 sixty-0.030
Token us
Token ID514
Feature activation-8.48e-01

Pos logit contributions
ini+0.020
 neighbourhood+0.019
 neighborhood+0.019
ag+0.019
umpy+0.019
Neg logit contributions
 seventy-0.028
 eighty-0.028
 ninety-0.026
 enough-0.024
 fourteen-0.024
Token safe
Token ID3338
Feature activation-3.27e-01

Pos logit contributions
 world+0.028
ag+0.028
 neighbourhood+0.027
 neighborhood+0.027
ini+0.027
Neg logit contributions
 seventy-0.046
 eighty-0.045
 ninety-0.044
 fourteen-0.041
 forty-0.038
Token and
Token ID290
Feature activation-2.71e-02

Pos logit contributions
 world+0.013
 neighbourhood+0.012
 neighborhood+0.012
ini+0.012
ag+0.012
Neg logit contributions
 seventy-0.016
 eighty-0.016
 ninety-0.015
 fourteen-0.014
 forty-0.013
Token happy
Token ID3772
Feature activation-3.34e-01

Pos logit contributions
ini+0.001
iet+0.001
 neighbourhood+0.001
ag+0.001
 neighborhood+0.001
Neg logit contributions
 eighty-0.001
 seventy-0.001
 ninety-0.001
 enough-0.001
 Fl-0.001
Token.
Token ID13
Feature activation-2.94e-01

Pos logit contributions
 world+0.013
 neighbourhood+0.012
 neighborhood+0.012
ag+0.012
ini+0.012
Neg logit contributions
 eighty-0.017
 seventy-0.017
 ninety-0.016
 fourteen-0.015
 forty-0.014
Token Amen
Token ID34717
Feature activation-2.56e+00

Pos logit contributions
ini+0.011
ag+0.011
 neighbourhood+0.011
ling+0.011
 world+0.011
Neg logit contributions
 seventy-0.015
 eighty-0.015
 ninety-0.014
 fourteen-0.013
 Fl-0.012
Token."
Token ID526
Feature activation+1.88e-01

Pos logit contributions
ini+0.122
 neighbourhood+0.119
iet+0.117
 neighborhood+0.117
ag+0.116
Neg logit contributions
 eighty-0.103
 seventy-0.103
 ninety-0.097
 fourteen-0.086
 enough-0.082
Token
Token ID198
Feature activation-1.70e-01

Pos logit contributions
 eighty+0.010
 seventy+0.010
 ninety+0.010
 fourteen+0.009
 enough+0.009
Neg logit contributions
 neighbourhood-0.006
 world-0.006
ini-0.006
 neighborhood-0.006
ag-0.006
Token
Token ID198
Feature activation-1.23e-01

Pos logit contributions
ini+0.008
ag+0.007
 neighbourhood+0.007
iet+0.007
 neighborhood+0.007
Neg logit contributions
 eighty-0.008
 seventy-0.007
 ninety-0.007
 Fl-0.007
 Frost-0.007
Token"
Token ID1
Feature activation+2.47e-01

Pos logit contributions
ini+0.005
ag+0.005
 neighbourhood+0.005
 world+0.005
 neighborhood+0.005
Neg logit contributions
 eighty-0.006
 seventy-0.006
 ninety-0.005
 fourteen-0.005
 Fl-0.005
TokenA
Token ID32
Feature activation+9.53e-01

Pos logit contributions
 eighty+0.013
 seventy+0.013
 ninety+0.012
 fourteen+0.011
 enough+0.011
Neg logit contributions
 neighbourhood-0.008
 world-0.008
 neighborhood-0.008
ini-0.008
ag-0.008
Token,
Token ID11
Feature activation+1.48e-02

Pos logit contributions
 neighbourhood+0.009
ini+0.009
 world+0.009
ag+0.009
 neighborhood+0.009
Neg logit contributions
 eighty-0.015
 seventy-0.015
 ninety-0.015
 fourteen-0.014
 enough-0.013
Token and
Token ID290
Feature activation-7.30e-01

Pos logit contributions
 seventy+0.001
 eighty+0.001
 ninety+0.001
 fourteen+0.001
 forty+0.001
Neg logit contributions
 world-0.001
 neighbourhood-0.001
 neighborhood-0.001
ini-0.001
ag-0.001
Token all
Token ID477
Feature activation+1.30e-01

Pos logit contributions
 neighbourhood+0.030
 world+0.029
 neighborhood+0.029
ini+0.028
ling+0.028
Neg logit contributions
 eighty-0.032
 seventy-0.032
 ninety-0.031
 fourteen-0.028
 forty-0.026
Token the
Token ID262
Feature activation-9.90e-01

Pos logit contributions
 eighty+0.005
 seventy+0.005
 ninety+0.005
 fourteen+0.005
 enough+0.005
Neg logit contributions
ini-0.006
 neighbourhood-0.006
 neighborhood-0.006
iet-0.006
ag-0.005
Token other
Token ID584
Feature activation-5.80e-01

Pos logit contributions
ag+0.033
ini+0.033
iet+0.032
 neighbourhood+0.032
 neighborhood+0.031
Neg logit contributions
 eighty-0.052
 seventy-0.052
 ninety-0.050
 fourteen-0.046
 enough-0.044
Token av
Token ID1196
Feature activation-2.56e+00

Pos logit contributions
 neighbourhood+0.018
 world+0.018
ini+0.018
 neighborhood+0.018
ag+0.018
Neg logit contributions
 eighty-0.032
 seventy-0.032
 ninety-0.031
 fourteen-0.029
 enough-0.027
Tokenoc
Token ID420
Feature activation-1.69e-01

Pos logit contributions
 neighbourhood+0.111
ini+0.111
 world+0.110
ag+0.109
 neighborhood+0.109
Neg logit contributions
 eighty-0.111
 seventy-0.111
 ninety-0.105
 fourteen-0.095
 enough-0.089
Tokenados
Token ID22484
Feature activation-1.01e+00

Pos logit contributions
ini+0.007
 neighbourhood+0.007
 world+0.007
 neighborhood+0.007
ag+0.007
Neg logit contributions
 eighty-0.008
 seventy-0.008
 ninety-0.007
 fourteen-0.007
 Fl-0.006
Token were
Token ID547
Feature activation+1.16e+00

Pos logit contributions
 neighbourhood+0.041
 neighborhood+0.041
 world+0.041
ag+0.040
ling+0.040
Neg logit contributions
 seventy-0.047
 eighty-0.047
 ninety-0.045
 fourteen-0.040
 forty-0.038
Token very
Token ID845
Feature activation-2.86e-01

Pos logit contributions
 seventy+0.054
 eighty+0.054
 ninety+0.052
 fourteen+0.047
 pret+0.044
Neg logit contributions
 neighbourhood-0.047
 world-0.047
ini-0.047
 neighborhood-0.046
ag-0.046
Token happy
Token ID3772
Feature activation+4.45e-01

Pos logit contributions
iet+0.011
ag+0.011
 neighbourhood+0.011
 neighborhood+0.011
 world+0.010
Neg logit contributions
 eighty-0.014
 seventy-0.014
 ninety-0.013
 enough-0.012
 fourteen-0.012
Token house
Token ID2156
Feature activation-1.06e-01

Pos logit contributions
 seventy+0.049
 eighty+0.049
 ninety+0.047
 fourteen+0.043
 enough+0.040
Neg logit contributions
 world-0.036
 neighbourhood-0.036
 neighborhood-0.036
ini-0.035
ag-0.034
Token and
Token ID290
Feature activation-2.66e-02

Pos logit contributions
 world+0.004
 neighborhood+0.003
 neighbourhood+0.003
ini+0.003
ag+0.003
Neg logit contributions
 seventy-0.006
 eighty-0.006
 ninety-0.006
 fourteen-0.005
 forty-0.005
Token our
Token ID674
Feature activation+3.80e-01

Pos logit contributions
 neighbourhood+0.001
 neighborhood+0.001
 world+0.001
ag+0.001
ini+0.001
Neg logit contributions
 eighty-0.001
 seventy-0.001
 ninety-0.001
 fourteen-0.001
 forty-0.001
Token family
Token ID1641
Feature activation-4.93e-01

Pos logit contributions
 seventy+0.020
 eighty+0.020
 ninety+0.019
 fourteen+0.018
 enough+0.017
Neg logit contributions
 neighbourhood-0.013
 world-0.013
ini-0.013
 neighborhood-0.013
ag-0.013
Token.
Token ID13
Feature activation-8.04e-02

Pos logit contributions
 world+0.016
 neighbourhood+0.015
 neighborhood+0.015
ag+0.014
ling+0.014
Neg logit contributions
 eighty-0.028
 seventy-0.028
 ninety-0.027
 fourteen-0.025
 forty-0.024
Token Amen
Token ID34717
Feature activation-2.54e+00

Pos logit contributions
ag+0.003
ini+0.003
 neighbourhood+0.003
iet+0.003
ling+0.003
Neg logit contributions
 seventy-0.004
 eighty-0.004
 ninety-0.004
 fourteen-0.004
 Fl-0.003
Token."
Token ID526
Feature activation+3.04e-01

Pos logit contributions
ini+0.121
 neighbourhood+0.120
iet+0.118
ling+0.116
 neighborhood+0.116
Neg logit contributions
 eighty-0.098
 seventy-0.098
 ninety-0.093
 fourteen-0.084
 enough-0.079
Token
Token ID198
Feature activation-1.04e-01

Pos logit contributions
 eighty+0.017
 seventy+0.017
 ninety+0.016
 fourteen+0.015
 enough+0.014
Neg logit contributions
 neighbourhood-0.010
 world-0.010
ini-0.009
 neighborhood-0.009
ag-0.009
Token
Token ID198
Feature activation-4.95e-02

Pos logit contributions
ini+0.004
 neighbourhood+0.004
 world+0.004
ag+0.004
 neighborhood+0.004
Neg logit contributions
 eighty-0.005
 seventy-0.005
 ninety-0.005
 fourteen-0.005
 Fl-0.004
TokenMom
Token ID29252
Feature activation-1.31e-01

Pos logit contributions
ini+0.001
ag+0.001
 neighbourhood+0.001
 world+0.001
 neighborhood+0.001
Neg logit contributions
 eighty-0.003
 seventy-0.003
 ninety-0.003
 fourteen-0.003
 Fl-0.002
Token smiled
Token ID13541
Feature activation-2.37e-01

Pos logit contributions
 neighborhood+0.005
iet+0.005
 neighbourhood+0.005
ini+0.005
 world+0.005
Neg logit contributions
 eighty-0.006
 seventy-0.006
 ninety-0.006
 fourteen-0.006
 enough-0.005
Token to
Token ID284
Feature activation-9.29e-02

Pos logit contributions
 seventy+0.028
 eighty+0.028
 ninety+0.026
 fourteen+0.024
 forty+0.022
Neg logit contributions
 neighbourhood-0.026
 world-0.026
 neighborhood-0.026
ini-0.026
ag-0.025
Token hop
Token ID1725
Feature activation+4.55e-01

Pos logit contributions
ini+0.004
 neighbourhood+0.004
 world+0.004
ag+0.004
 neighborhood+0.004
Neg logit contributions
 eighty-0.004
 seventy-0.004
 ninety-0.004
 fourteen-0.004
 enough-0.004
Token inside
Token ID2641
Feature activation+1.84e-01

Pos logit contributions
 eighty+0.024
 seventy+0.024
 ninety+0.023
 fourteen+0.021
 enough+0.021
Neg logit contributions
ini-0.015
 neighbourhood-0.015
 neighborhood-0.015
 world-0.015
ag-0.015
Token,
Token ID11
Feature activation+1.03e-02

Pos logit contributions
 eighty+0.009
 seventy+0.009
 ninety+0.008
 fourteen+0.008
 pret+0.007
Neg logit contributions
 world-0.007
 neighbourhood-0.007
 neighborhood-0.007
ini-0.007
ag-0.007
Token a
Token ID257
Feature activation+1.52e-02

Pos logit contributions
 seventy+0.000
 eighty+0.000
 ninety+0.000
 fourteen+0.000
 pret+0.000
Neg logit contributions
 world-0.000
 neighbourhood-0.000
 neighborhood-0.000
 hurricane-0.000
 outdoors-0.000
Token gru
Token ID22848
Feature activation-2.52e+00

Pos logit contributions
 seventy+0.001
 eighty+0.001
 ninety+0.001
 Fl+0.001
 enough+0.001
Neg logit contributions
ag-0.001
ini-0.001
agging-0.001
ags-0.001
iet-0.000
Tokenff
Token ID487
Feature activation-3.70e-02

Pos logit contributions
 neighbourhood+0.081
ini+0.081
ag+0.080
 world+0.080
 neighborhood+0.079
Neg logit contributions
 seventy-0.135
 eighty-0.134
 ninety-0.130
 fourteen-0.119
 enough-0.116
Token voice
Token ID3809
Feature activation-7.68e-01

Pos logit contributions
 neighbourhood+0.001
ag+0.001
 neighborhood+0.001
ags+0.001
agging+0.001
Neg logit contributions
 seventy-0.002
 eighty-0.002
 ninety-0.002
 enough-0.002
 Frost-0.002
Token shouted
Token ID17293
Feature activation-4.78e-01

Pos logit contributions
 neighbourhood+0.038
ini+0.037
 neighborhood+0.037
 world+0.037
iet+0.035
Neg logit contributions
 seventy-0.029
 eighty-0.029
 ninety-0.027
 fourteen-0.025
 enough-0.025
Token "
Token ID366
Feature activation+5.73e-01

Pos logit contributions
ini+0.021
 neighbourhood+0.020
 neighborhood+0.020
 world+0.020
ag+0.020
Neg logit contributions
 seventy-0.021
 eighty-0.021
 ninety-0.020
 fourteen-0.018
 enough-0.018
TokenNO
Token ID15285
Feature activation+8.42e-01

Pos logit contributions
 eighty+0.030
 seventy+0.030
 ninety+0.028
 fourteen+0.026
 pret+0.025
Neg logit contributions
ini-0.021
 world-0.021
 neighbourhood-0.020
ag-0.019
ling-0.019
Token mom
Token ID1995
Feature activation+6.37e-01

Pos logit contributions
 eighty+0.003
 seventy+0.003
 ninety+0.003
 fourteen+0.003
 enough+0.003
Neg logit contributions
 neighbourhood-0.002
 world-0.002
ini-0.002
 neighborhood-0.002
ag-0.002
Token if
Token ID611
Feature activation-2.87e-01

Pos logit contributions
 eighty+0.021
 seventy+0.021
 ninety+0.019
 fourteen+0.017
 enough+0.015
Neg logit contributions
 neighbourhood-0.034
ini-0.034
 world-0.034
ag-0.034
 neighborhood-0.034
Token she
Token ID673
Feature activation-2.22e-01

Pos logit contributions
 neighbourhood+0.011
ini+0.011
 world+0.011
 neighborhood+0.011
ag+0.011
Neg logit contributions
 eighty-0.014
 seventy-0.014
 ninety-0.013
 fourteen-0.012
 enough-0.011
Token could
Token ID714
Feature activation+3.75e-01

Pos logit contributions
 neighbourhood+0.007
 neighborhood+0.007
ini+0.007
 world+0.007
ag+0.007
Neg logit contributions
 eighty-0.012
 seventy-0.012
 ninety-0.011
 fourteen-0.011
 forty-0.010
Token at
Token ID379
Feature activation+2.40e-01

Pos logit contributions
 seventy+0.015
 eighty+0.015
 ninety+0.014
 fourteen+0.012
 enough+0.012
Neg logit contributions
iet-0.017
ini-0.017
 neighbourhood-0.017
 neighborhood-0.017
 world-0.017
Token least
Token ID1551
Feature activation+1.85e+00

Pos logit contributions
 seventy+0.011
 eighty+0.011
 ninety+0.010
 fourteen+0.010
 um+0.008
Neg logit contributions
 world-0.010
 neighbourhood-0.010
 neighborhood-0.010
ini-0.009
 town-0.009
Token polish
Token ID25245
Feature activation+9.30e-01

Pos logit contributions
 eighty+0.082
 seventy+0.082
 ninety+0.078
 enough+0.072
 fourteen+0.069
Neg logit contributions
ini-0.082
iet-0.077
 neighbourhood-0.076
 world-0.076
 neighborhood-0.075
Token it
Token ID340
Feature activation+2.76e-01

Pos logit contributions
 seventy+0.041
 eighty+0.041
 ninety+0.039
 fourteen+0.035
 forty+0.032
Neg logit contributions
 world-0.040
ini-0.039
 neighbourhood-0.039
ag-0.039
ling-0.038
Token.
Token ID13
Feature activation-2.98e-02

Pos logit contributions
 eighty+0.018
 seventy+0.018
 ninety+0.018
 fourteen+0.017
 Frost+0.016
Neg logit contributions
 world-0.006
ag-0.006
ling-0.006
 neighbourhood-0.005
 neighborhood-0.005
Token Her
Token ID2332
Feature activation-3.81e-01

Pos logit contributions
ini+0.001
iet+0.001
ag+0.001
ling+0.001
 world+0.001
Neg logit contributions
 seventy-0.001
 eighty-0.001
 ninety-0.001
 fourteen-0.001
 Fl-0.001
Token mom
Token ID1995
Feature activation+3.84e-01

Pos logit contributions
ag+0.016
ini+0.016
iet+0.016
 neighbourhood+0.016
ling+0.016
Neg logit contributions
 seventy-0.017
 eighty-0.017
 ninety-0.016
 enough-0.014
 Fl-0.014
Token's
Token ID338
Feature activation+7.21e-02

Pos logit contributions
 neighbourhood+0.006
 neighborhood+0.005
ling+0.005
iet+0.005
ini+0.005
Neg logit contributions
 eighty-0.007
 seventy-0.007
 ninety-0.007
 fourteen-0.006
 forty-0.006
Token mom
Token ID1995
Feature activation+2.78e-01

Pos logit contributions
 eighty+0.004
 seventy+0.004
 ninety+0.003
 fourteen+0.003
 enough+0.003
Neg logit contributions
ini-0.003
 neighbourhood-0.003
ag-0.003
 world-0.003
 neighborhood-0.003
Token said
Token ID531
Feature activation+1.51e-01

Pos logit contributions
 eighty+0.013
 seventy+0.013
 ninety+0.012
 fourteen+0.011
 forty+0.010
Neg logit contributions
 neighbourhood-0.011
ini-0.011
 neighborhood-0.011
ling-0.011
 world-0.011
Token,
Token ID11
Feature activation+5.20e-01

Pos logit contributions
 eighty+0.008
 seventy+0.008
 ninety+0.008
 fourteen+0.007
 pret+0.007
Neg logit contributions
 world-0.005
ag-0.005
ling-0.005
 neighbourhood-0.005
 outdoors-0.004
Token "
Token ID366
Feature activation-7.80e-02

Pos logit contributions
 eighty+0.027
 seventy+0.027
 ninety+0.026
 fourteen+0.024
 enough+0.022
Neg logit contributions
 world-0.018
 neighbourhood-0.017
 neighborhood-0.017
ini-0.017
ag-0.016
TokenOh
Token ID5812
Feature activation+1.97e+00

Pos logit contributions
ag+0.003
 neighbourhood+0.003
ling+0.003
agging+0.003
 neighborhood+0.003
Neg logit contributions
 seventy-0.004
 eighty-0.004
 ninety-0.004
 fourteen-0.004
 enough-0.003
Token no
Token ID645
Feature activation+6.30e-01

Pos logit contributions
 eighty+0.091
 seventy+0.090
 ninety+0.088
 Frost+0.071
 fourteen+0.070
Neg logit contributions
 world-0.080
iet-0.080
ini-0.078
 neighbourhood-0.078
 neighborhood-0.078
Token!
Token ID0
Feature activation+2.50e-01

Pos logit contributions
 eighty+0.038
 seventy+0.037
 ninety+0.036
 Frost+0.030
 Nib+0.029
Neg logit contributions
 world-0.020
 neighbourhood-0.018
 neighborhood-0.018
ag-0.018
 outdoors-0.018
Token It
Token ID632
Feature activation+6.00e-01

Pos logit contributions
 eighty+0.012
 seventy+0.011
 ninety+0.011
 fourteen+0.010
 enough+0.009
Neg logit contributions
ini-0.010
 neighbourhood-0.010
 world-0.010
 neighborhood-0.010
ag-0.010
Token looks
Token ID3073
Feature activation-2.47e-01

Pos logit contributions
 eighty+0.032
 seventy+0.032
 ninety+0.031
 fourteen+0.027
 forty+0.026
Neg logit contributions
 world-0.022
ini-0.022
 neighbourhood-0.021
ling-0.021
 neighborhood-0.021
Token like
Token ID588
Feature activation+7.13e-01

Pos logit contributions
 world+0.007
ag+0.007
ling+0.007
 outdoors+0.007
 neighbourhood+0.007
Neg logit contributions
 seventy-0.015
 eighty-0.015
 ninety-0.015
 fourteen-0.013
 sixty-0.012
Token pretending
Token ID23662
Feature activation-9.88e-02

Pos logit contributions
 seventy+0.008
 eighty+0.007
 ninety+0.007
 fourteen+0.007
 enough+0.006
Neg logit contributions
 world-0.007
 neighbourhood-0.007
 neighborhood-0.007
ling-0.006
ini-0.006
Token to
Token ID284
Feature activation+2.02e-01

Pos logit contributions
 world+0.004
 neighbourhood+0.004
ag+0.004
 neighborhood+0.004
ling+0.004
Neg logit contributions
 seventy-0.005
 eighty-0.005
 ninety-0.005
 fourteen-0.004
 forty-0.004
Token sail
Token ID14936
Feature activation+1.28e+00

Pos logit contributions
 seventy+0.010
 eighty+0.010
 ninety+0.009
 fourteen+0.009
 enough+0.008
Neg logit contributions
 world-0.008
ag-0.007
 nation-0.007
 neighbourhood-0.007
 neighborhood-0.007
Token the
Token ID262
Feature activation+1.83e-02

Pos logit contributions
 eighty+0.065
 seventy+0.065
 ninety+0.062
 fourteen+0.057
 enough+0.054
Neg logit contributions
 neighbourhood-0.046
ini-0.046
 world-0.045
ag-0.045
 neighborhood-0.045
Token high
Token ID1029
Feature activation+1.72e-01

Pos logit contributions
 eighty+0.001
 seventy+0.001
 ninety+0.001
 fourteen+0.001
 enough+0.001
Neg logit contributions
ini-0.001
 neighbourhood-0.001
ag-0.001
 neighborhood-0.001
iet-0.001
Token seas
Token ID21547
Feature activation+2.39e+00

Pos logit contributions
 eighty+0.009
 seventy+0.009
 ninety+0.009
 fourteen+0.008
 enough+0.008
Neg logit contributions
ini-0.006
 neighbourhood-0.006
ag-0.006
 neighborhood-0.006
iet-0.006
Token and
Token ID290
Feature activation+2.45e-01

Pos logit contributions
 eighty+0.115
 seventy+0.115
 ninety+0.110
 fourteen+0.101
 enough+0.096
Neg logit contributions
 neighbourhood-0.092
ini-0.091
 neighborhood-0.090
 world-0.089
ag-0.088
Token search
Token ID2989
Feature activation-8.08e-01

Pos logit contributions
 seventy+0.012
 eighty+0.012
 ninety+0.011
 fourteen+0.010
 enough+0.010
Neg logit contributions
 world-0.009
 neighbourhood-0.009
 neighborhood-0.009
ini-0.009
ag-0.009
Token for
Token ID329
Feature activation-4.65e-01

Pos logit contributions
ini+0.031
 neighbourhood+0.031
 neighborhood+0.030
iet+0.030
 world+0.030
Neg logit contributions
 eighty-0.039
 seventy-0.039
 ninety-0.037
 fourteen-0.034
 enough-0.033
Token treasure
Token ID14068
Feature activation+6.38e-01

Pos logit contributions
 world+0.019
 neighbourhood+0.019
ag+0.019
 neighborhood+0.019
ini+0.018
Neg logit contributions
 seventy-0.022
 eighty-0.022
 ninety-0.020
 fourteen-0.019
 forty-0.017
Token.
Token ID13
Feature activation+1.79e-01

Pos logit contributions
 eighty+0.037
 seventy+0.037
 ninety+0.036
 fourteen+0.033
 enough+0.032
Neg logit contributions
 neighbourhood-0.018
ini-0.018
 world-0.018
ag-0.018
 neighborhood-0.018
Token chuckled
Token ID39889
Feature activation-8.53e-02

Pos logit contributions
 eighty+0.016
 seventy+0.015
 ninety+0.015
 fourteen+0.014
 pret+0.013
Neg logit contributions
ini-0.015
 neighbourhood-0.015
ling-0.015
iet-0.015
 neighborhood-0.015
Token and
Token ID290
Feature activation+7.58e-01

Pos logit contributions
ini+0.004
 neighbourhood+0.004
 neighborhood+0.004
 world+0.004
ag+0.004
Neg logit contributions
 seventy-0.004
 eighty-0.004
 ninety-0.003
 enough-0.003
 fourteen-0.003
Token said
Token ID531
Feature activation-2.55e-01

Pos logit contributions
 eighty+0.039
 seventy+0.038
 ninety+0.038
 fourteen+0.035
 fault+0.032
Neg logit contributions
ling-0.023
 neighbourhood-0.023
iet-0.022
 neighborhood-0.022
ini-0.022
Token,
Token ID11
Feature activation+6.25e-01

Pos logit contributions
ling+0.007
ag+0.007
ini+0.007
agging+0.007
 outdoors+0.006
Neg logit contributions
 eighty-0.014
 seventy-0.014
 ninety-0.014
 fourteen-0.013
 least-0.012
Token "
Token ID366
Feature activation-3.29e-02

Pos logit contributions
 eighty+0.030
 seventy+0.030
 ninety+0.029
 fourteen+0.026
 enough+0.025
Neg logit contributions
 neighbourhood-0.023
 world-0.023
 neighborhood-0.023
ini-0.023
ag-0.022
TokenOh
Token ID5812
Feature activation+2.01e+00

Pos logit contributions
ini+0.001
 world+0.001
 neighbourhood+0.001
 neighborhood+0.001
iet+0.001
Neg logit contributions
 seventy-0.002
 eighty-0.002
 ninety-0.002
 pret-0.002
 enough-0.002
Token no
Token ID645
Feature activation+7.69e-01

Pos logit contributions
 seventy+0.121
 eighty+0.119
 ninety+0.116
 puck+0.108
six+0.107
Neg logit contributions
ay-0.035
lee-0.035
 hurricane-0.033
 kind-0.033
 nation-0.032
Token,
Token ID11
Feature activation+1.44e-01

Pos logit contributions
 eighty+0.047
 seventy+0.045
 ninety+0.044
 Frost+0.037
 um+0.036
Neg logit contributions
 world-0.021
 neighbourhood-0.020
agging-0.020
 neighborhood-0.020
ag-0.020
Token no
Token ID645
Feature activation+7.13e-01

Pos logit contributions
 seventy+0.007
 eighty+0.007
 ninety+0.006
 pret+0.006
 sixty+0.006
Neg logit contributions
 world-0.005
ini-0.005
 hurricane-0.005
 nature-0.005
ay-0.004
Token,
Token ID11
Feature activation+5.84e-02

Pos logit contributions
 eighty+0.037
 seventy+0.036
 ninety+0.035
 fourteen+0.030
 Frost+0.029
Neg logit contributions
 world-0.027
 neighbourhood-0.025
ini-0.025
ling-0.025
 neighborhood-0.024
Token no
Token ID645
Feature activation+7.67e-01

Pos logit contributions
 seventy+0.003
 eighty+0.003
 ninety+0.003
 pret+0.002
 sixty+0.002
Neg logit contributions
 world-0.002
ini-0.002
 hurricane-0.002
 nature-0.002
iet-0.002
Token dog
Token ID3290
Feature activation-2.86e-01

Pos logit contributions
ini+0.024
ag+0.024
iet+0.023
anne+0.022
 neighbourhood+0.022
Neg logit contributions
 seventy-0.023
 eighty-0.023
 ninety-0.022
 Fl-0.020
 enough-0.019
Token was
Token ID373
Feature activation-8.45e-02

Pos logit contributions
ling+0.011
 neighborhood+0.011
 neighbourhood+0.011
iet+0.011
ag+0.011
Neg logit contributions
 eighty-0.014
 seventy-0.013
 ninety-0.013
 fourteen-0.012
 enough-0.011
Token still
Token ID991
Feature activation+5.20e-01

Pos logit contributions
ini+0.003
 neighbourhood+0.003
 world+0.003
 neighborhood+0.003
iet+0.003
Neg logit contributions
 seventy-0.004
 eighty-0.004
 ninety-0.004
 enough-0.003
 fourteen-0.003
Token w
Token ID266
Feature activation-3.41e-01

Pos logit contributions
 seventy+0.024
 eighty+0.024
 ninety+0.022
 enough+0.021
 Fl+0.021
Neg logit contributions
ini-0.021
 neighbourhood-0.020
iet-0.020
ag-0.020
 neighborhood-0.020
Tokenagging
Token ID16406
Feature activation-6.65e-01

Pos logit contributions
ini+0.007
 world+0.007
 neighbourhood+0.007
 neighborhood+0.007
 hurricane+0.006
Neg logit contributions
 seventy-0.022
 eighty-0.021
 fourteen-0.021
 ninety-0.021
 enough-0.020
Token its
Token ID663
Feature activation+2.21e+00

Pos logit contributions
ag+0.032
 world+0.032
 neighbourhood+0.032
ini+0.031
iet+0.031
Neg logit contributions
 seventy-0.026
 eighty-0.026
 ninety-0.025
 fourteen-0.022
 enough-0.020
Token tail
Token ID7894
Feature activation+1.30e-02

Pos logit contributions
 eighty+0.101
 seventy+0.101
 ninety+0.096
 fourteen+0.087
 enough+0.082
Neg logit contributions
 neighbourhood-0.090
ini-0.090
 world-0.090
ag-0.089
 neighborhood-0.089
Token.
Token ID13
Feature activation-1.67e-01

Pos logit contributions
 eighty+0.001
 seventy+0.001
 ninety+0.001
 fourteen+0.001
 pret+0.001
Neg logit contributions
 world-0.000
 neighbourhood-0.000
ag-0.000
 neighborhood-0.000
ling-0.000
Token He
Token ID679
Feature activation+3.33e-01

Pos logit contributions
ini+0.007
 neighbourhood+0.007
ag+0.007
 world+0.007
iet+0.006
Neg logit contributions
 eighty-0.008
 seventy-0.008
 ninety-0.007
 fourteen-0.007
 Fl-0.007
Token came
Token ID1625
Feature activation+1.26e-01

Pos logit contributions
 eighty+0.017
 seventy+0.017
 ninety+0.016
 fourteen+0.015
 enough+0.014
Neg logit contributions
 neighborhood-0.012
ling-0.012
 neighbourhood-0.012
iet-0.012
ag-0.011
Token to
Token ID284
Feature activation-1.37e-01

Pos logit contributions
 seventy+0.005
 eighty+0.005
 ninety+0.005
 fourteen+0.004
 enough+0.004
Neg logit contributions
ini-0.006
 neighbourhood-0.005
 world-0.005
 neighborhood-0.005
ag-0.005
Token had
Token ID550
Feature activation+1.69e-01

Pos logit contributions
 world+0.012
 neighbourhood+0.012
 neighborhood+0.012
ling+0.012
ini+0.012
Neg logit contributions
 seventy-0.014
 eighty-0.014
 ninety-0.013
 fourteen-0.012
 forty-0.011
Token caused
Token ID4073
Feature activation+7.52e-01

Pos logit contributions
 seventy+0.008
 eighty+0.008
 ninety+0.007
 fourteen+0.006
 pret+0.006
Neg logit contributions
 world-0.007
ag-0.007
 neighbourhood-0.007
 neighborhood-0.007
unts-0.006
Token a
Token ID257
Feature activation+7.48e-01

Pos logit contributions
 eighty+0.036
 seventy+0.036
 ninety+0.033
 fourteen+0.030
 pret+0.027
Neg logit contributions
 world-0.033
 neighbourhood-0.032
 neighborhood-0.031
ag-0.031
 outdoors-0.029
Token lot
Token ID1256
Feature activation+8.13e-01

Pos logit contributions
 seventy+0.040
 eighty+0.040
 ninety+0.038
 fourteen+0.035
 enough+0.033
Neg logit contributions
 world-0.026
 neighbourhood-0.026
ini-0.025
 neighborhood-0.025
ag-0.024
Token of
Token ID286
Feature activation+1.90e-01

Pos logit contributions
 eighty+0.056
 seventy+0.056
 ninety+0.054
 fourteen+0.050
 pret+0.048
Neg logit contributions
ag-0.015
 world-0.015
ini-0.015
 neighbourhood-0.015
 neighborhood-0.014
Token trouble
Token ID5876
Feature activation+4.57e-01

Pos logit contributions
 eighty+0.010
 seventy+0.010
 ninety+0.010
 fourteen+0.009
 forty+0.008
Neg logit contributions
 neighbourhood-0.007
 world-0.007
ag-0.006
 neighborhood-0.006
ini-0.006
Token.
Token ID13
Feature activation+3.27e-01

Pos logit contributions
 seventy+0.027
 eighty+0.027
 ninety+0.026
 fourteen+0.024
 forty+0.023
Neg logit contributions
 world-0.013
 neighbourhood-0.013
ag-0.013
 neighborhood-0.012
ling-0.012
Token<|endoftext|>
Token ID50256
Feature activation+9.77e-03

Pos logit contributions
 eighty+0.018
 seventy+0.018
 ninety+0.018
 fourteen+0.016
 enough+0.016
Neg logit contributions
 neighbourhood-0.010
 world-0.010
 neighborhood-0.010
ini-0.010
ag-0.009
TokenOnce
Token ID7454
Feature activation+6.94e-01

Pos logit contributions
 seventy+0.000
 eighty+0.000
 ninety+0.000
 Fl+0.000
 fourteen+0.000
Neg logit contributions
ini-0.000
ag-0.000
iet-0.000
 neighbourhood-0.000
ling-0.000
Token there
Token ID612
Feature activation+1.08e+00

Pos logit contributions
 eighty+0.028
 seventy+0.027
 ninety+0.026
 fourteen+0.023
 enough+0.023
Neg logit contributions
ini-0.032
 neighbourhood-0.031
 neighborhood-0.031
iet-0.031
ag-0.031
Token was
Token ID373
Feature activation-4.75e-01

Pos logit contributions
 seventy+0.062
 eighty+0.062
 ninety+0.060
 fourteen+0.056
six+0.052
Neg logit contributions
 world-0.037
 neighbourhood-0.036
 neighborhood-0.035
ini-0.032
ling-0.031
Token our
Token ID674
Feature activation+6.68e-01

Pos logit contributions
 seventy+0.033
 eighty+0.032
 ninety+0.031
 fourteen+0.028
 enough+0.027
Neg logit contributions
 neighbourhood-0.029
 world-0.029
ini-0.028
ag-0.028
 neighborhood-0.028
Token heroes
Token ID10281
Feature activation-2.93e-01

Pos logit contributions
 eighty+0.034
 seventy+0.034
 ninety+0.032
 fourteen+0.029
 enough+0.028
Neg logit contributions
ini-0.024
ag-0.024
 neighbourhood-0.024
iet-0.023
 world-0.023
Token too
Token ID1165
Feature activation-1.59e-01

Pos logit contributions
 world+0.009
 neighbourhood+0.009
ini+0.008
 neighborhood+0.008
ag+0.008
Neg logit contributions
 seventy-0.017
 eighty-0.017
 ninety-0.016
 fourteen-0.015
 forty-0.014
Token.
Token ID13
Feature activation-2.05e-01

Pos logit contributions
ini+0.006
ag+0.005
 neighbourhood+0.005
 neighborhood+0.005
iet+0.005
Neg logit contributions
 eighty-0.008
 seventy-0.008
 ninety-0.008
 fourteen-0.007
 enough-0.007
Token You
Token ID921
Feature activation-9.73e-02

Pos logit contributions
 neighbourhood+0.008
ini+0.008
 world+0.008
 neighborhood+0.008
ag+0.008
Neg logit contributions
 eighty-0.010
 seventy-0.010
 ninety-0.010
 fourteen-0.009
 enough-0.008
Token are
Token ID389
Feature activation+6.17e-01

Pos logit contributions
ag+0.004
 world+0.004
 neighbourhood+0.004
 neighborhood+0.004
iet+0.003
Neg logit contributions
 seventy-0.005
 eighty-0.005
 ninety-0.005
 fourteen-0.004
 forty-0.004
Token very
Token ID845
Feature activation-3.63e-01

Pos logit contributions
 seventy+0.028
 eighty+0.028
 ninety+0.026
 fourteen+0.024
 enough+0.022
Neg logit contributions
 neighbourhood-0.026
 world-0.026
ag-0.025
ini-0.025
 neighborhood-0.025
Token brave
Token ID14802
Feature activation-2.32e-01

Pos logit contributions
 neighbourhood+0.013
 neighborhood+0.013
iet+0.012
ini+0.012
 world+0.012
Neg logit contributions
 eighty-0.019
 seventy-0.019
 ninety-0.018
 Frost-0.016
 Fl-0.016
Token and
Token ID290
Feature activation+3.29e-01

Pos logit contributions
 world+0.009
 neighbourhood+0.008
 neighborhood+0.008
ini+0.008
ag+0.008
Neg logit contributions
 eighty-0.012
 seventy-0.012
 ninety-0.011
 fourteen-0.010
 forty-0.010
Token kind
Token ID1611
Feature activation-1.49e+00

Pos logit contributions
 eighty+0.019
 seventy+0.019
 ninety+0.019
 fourteen+0.017
 enough+0.016
Neg logit contributions
 neighbourhood-0.009
ini-0.009
 world-0.009
ag-0.009
 neighborhood-0.009
Token."
Token ID526
Feature activation+2.12e-01

Pos logit contributions
 world+0.044
 neighbourhood+0.042
 neighborhood+0.041
ag+0.041
ini+0.040
Neg logit contributions
 eighty-0.087
 seventy-0.087
 ninety-0.084
 fourteen-0.078
 forty-0.073
Token he
Token ID339
Feature activation-4.85e-01

Pos logit contributions
 world+0.002
 neighborhood+0.002
 neighbourhood+0.002
 outdoors+0.001
ag+0.001
Neg logit contributions
 seventy-0.002
 eighty-0.002
 ninety-0.002
 fourteen-0.002
 forty-0.002
Token could
Token ID714
Feature activation-1.65e-03

Pos logit contributions
 neighbourhood+0.017
 neighborhood+0.017
 world+0.017
ini+0.016
ling+0.016
Neg logit contributions
 seventy-0.024
 eighty-0.024
 ninety-0.024
 fourteen-0.023
 forty-0.021
Token not
Token ID407
Feature activation-4.31e-01

Pos logit contributions
 neighbourhood+0.000
ini+0.000
 world+0.000
 neighborhood+0.000
ag+0.000
Neg logit contributions
 eighty-0.000
 seventy-0.000
 ninety-0.000
 fourteen-0.000
 enough-0.000
Token believe
Token ID1975
Feature activation+2.99e-01

Pos logit contributions
ini+0.023
 neighbourhood+0.023
iet+0.023
 neighborhood+0.023
 world+0.023
Neg logit contributions
 seventy-0.014
 eighty-0.014
 ninety-0.014
 fourteen-0.012
 enough-0.011
Token it
Token ID340
Feature activation+6.32e-01

Pos logit contributions
 eighty+0.009
 seventy+0.009
 ninety+0.008
 enough+0.007
 fourteen+0.007
Neg logit contributions
 neighbourhood-0.018
ini-0.017
 neighborhood-0.017
 world-0.017
ag-0.017
Token!
Token ID0
Feature activation+2.03e-01

Pos logit contributions
 eighty+0.042
 seventy+0.042
 ninety+0.040
 fourteen+0.038
 forty+0.035
Neg logit contributions
 world-0.015
ling-0.014
 neighbourhood-0.013
ag-0.013
 neighborhood-0.013
Token
Token ID198
Feature activation-1.29e-01

Pos logit contributions
 eighty+0.011
 seventy+0.011
 ninety+0.010
 fourteen+0.010
 enough+0.009
Neg logit contributions
 neighbourhood-0.007
ini-0.007
 world-0.007
 neighborhood-0.007
ag-0.007
Token
Token ID198
Feature activation-8.54e-02

Pos logit contributions
ini+0.004
 neighbourhood+0.004
 world+0.004
 neighborhood+0.004
iet+0.004
Neg logit contributions
 eighty-0.007
 seventy-0.007
 ninety-0.007
 fourteen-0.006
 Fl-0.006
TokenSpot
Token ID32565
Feature activation-3.31e-02

Pos logit contributions
 neighbourhood+0.003
ini+0.003
 world+0.003
ag+0.003
 neighborhood+0.003
Neg logit contributions
 eighty-0.004
 seventy-0.004
 ninety-0.004
 fourteen-0.004
 enough-0.004
Token put
Token ID1234
Feature activation+9.73e-02

Pos logit contributions
 neighborhood+0.001
 neighbourhood+0.001
ling+0.001
 world+0.001
iet+0.001
Neg logit contributions
 seventy-0.002
 eighty-0.002
 ninety-0.002
 fourteen-0.002
 fault-0.002
Token the
Token ID262
Feature activation-9.52e-01

Pos logit contributions
 seventy+0.005
 eighty+0.005
 ninety+0.005
 fourteen+0.005
iously+0.004
Neg logit contributions
ag-0.003
ling-0.003
 world-0.003
 neighbourhood-0.003
iet-0.003
Token?"
Token ID1701
Feature activation+2.81e-01

Pos logit contributions
 seventy+0.061
 eighty+0.060
 ninety+0.058
 fourteen+0.053
 forty+0.049
Neg logit contributions
 world-0.042
 neighborhood-0.041
 neighbourhood-0.041
ag-0.039
iet-0.039
Token
Token ID198
Feature activation-1.01e-01

Pos logit contributions
 seventy+0.009
 eighty+0.009
 Fl+0.008
 Frost+0.008
 ninety+0.008
Neg logit contributions
ini-0.017
ag-0.016
iet-0.016
ilt-0.016
ling-0.015
Token
Token ID198
Feature activation-3.38e-02

Pos logit contributions
ini+0.006
ling+0.005
 hurricane+0.005
 neighbourhood+0.005
 nation+0.005
Neg logit contributions
 seventy-0.003
 Fl-0.003
 eighty-0.003
 Frost-0.003
 ninety-0.003
Token"
Token ID1
Feature activation+3.56e-01

Pos logit contributions
ini+0.002
ag+0.002
 world+0.002
 hurricane+0.002
 neighbourhood+0.002
Neg logit contributions
 eighty-0.001
 seventy-0.001
 ninety-0.001
 Fl-0.001
 fourteen-0.001
TokenYes
Token ID5297
Feature activation+1.67e-01

Pos logit contributions
 eighty+0.018
 seventy+0.018
 ninety+0.017
 fourteen+0.016
 enough+0.015
Neg logit contributions
ini-0.013
 neighbourhood-0.012
 world-0.012
 neighborhood-0.012
ag-0.012
Token,
Token ID11
Feature activation+4.48e-01

Pos logit contributions
 eighty+0.009
 seventy+0.009
 ninety+0.009
 pret+0.008
 fourteen+0.007
Neg logit contributions
iet-0.005
ling-0.005
ag-0.005
 world-0.005
ini-0.005
Token we
Token ID356
Feature activation+3.94e-01

Pos logit contributions
 eighty+0.022
 seventy+0.022
 ninety+0.021
 fourteen+0.019
 forty+0.018
Neg logit contributions
 world-0.016
ini-0.016
iet-0.016
ag-0.016
 neighbourhood-0.016
Token did
Token ID750
Feature activation-4.35e-01

Pos logit contributions
 seventy+0.017
 eighty+0.017
 ninety+0.016
 fourteen+0.014
 forty+0.013
Neg logit contributions
 neighbourhood-0.017
ini-0.017
 neighborhood-0.017
ag-0.017
iet-0.017
Token!"
Token ID2474
Feature activation+1.73e-01

Pos logit contributions
ag+0.019
 world+0.018
 neighbourhood+0.018
 neighborhood+0.017
ling+0.017
Neg logit contributions
 seventy-0.019
 eighty-0.019
 ninety-0.018
 fourteen-0.016
 forty-0.015
Token Lily
Token ID20037
Feature activation+5.76e-01

Pos logit contributions
 eighty+0.007
 seventy+0.007
 ninety+0.006
 fourteen+0.006
 enough+0.005
Neg logit contributions
ini-0.009
 neighbourhood-0.009
 world-0.008
ag-0.008
 neighborhood-0.008
Token and
Token ID290
Feature activation+4.91e-01

Pos logit contributions
 eighty+0.030
 seventy+0.030
 ninety+0.029
 fourteen+0.027
 forty+0.025
Neg logit contributions
 world-0.019
 neighbourhood-0.019
 neighborhood-0.019
ag-0.019
iet-0.018
Token It
Token ID632
Feature activation+5.42e-01

Pos logit contributions
 seventy+0.013
 eighty+0.013
 ninety+0.012
 fourteen+0.011
 Fl+0.011
Neg logit contributions
ini-0.010
 neighbourhood-0.010
 world-0.010
iet-0.010
ag-0.010
Token was
Token ID373
Feature activation+3.17e-01

Pos logit contributions
 seventy+0.027
 eighty+0.027
 ninety+0.026
 fourteen+0.023
 forty+0.021
Neg logit contributions
ling-0.022
 world-0.022
ini-0.021
 neighbourhood-0.021
iet-0.021
Token so
Token ID523
Feature activation+4.95e-02

Pos logit contributions
 seventy+0.017
 eighty+0.016
 ninety+0.016
 pret+0.014
 um+0.014
Neg logit contributions
 world-0.013
 nature-0.012
 outdoors-0.011
 neighbourhood-0.011
 neighborhood-0.011
Token much
Token ID881
Feature activation+4.13e-01

Pos logit contributions
 eighty+0.003
 seventy+0.003
 ninety+0.002
 fourteen+0.002
 enough+0.002
Neg logit contributions
 neighbourhood-0.002
ini-0.002
 neighborhood-0.002
ag-0.002
iet-0.002
Token fun
Token ID1257
Feature activation-5.20e-01

Pos logit contributions
 eighty+0.018
 seventy+0.018
 ninety+0.017
 fourteen+0.016
 enough+0.016
Neg logit contributions
ini-0.017
iet-0.017
 neighbourhood-0.017
ag-0.017
 neighborhood-0.017
Token!
Token ID0
Feature activation+2.71e-01

Pos logit contributions
 world+0.020
 neighbourhood+0.019
ini+0.018
 neighborhood+0.018
ag+0.018
Neg logit contributions
 seventy-0.027
 eighty-0.027
 ninety-0.026
 fourteen-0.023
 forty-0.022
Token<|endoftext|>
Token ID50256
Feature activation+1.65e-02

Pos logit contributions
 seventy+0.012
 eighty+0.012
 ninety+0.011
 Fl+0.011
 fourteen+0.010
Neg logit contributions
iet-0.011
ini-0.011
 neighbourhood-0.011
 world-0.011
ag-0.011
TokenOnce
Token ID7454
Feature activation+5.90e-01

Pos logit contributions
 seventy+0.001
 eighty+0.001
 ninety+0.001
 Fl+0.001
 fourteen+0.001
Neg logit contributions
ini-0.001
iet-0.001
ag-0.001
 neighbourhood-0.001
ling-0.001
Token upon
Token ID2402
Feature activation+9.38e-01

Pos logit contributions
 eighty+0.021
 seventy+0.021
 ninety+0.020
 Fl+0.019
 enough+0.018
Neg logit contributions
ini-0.029
 neighbourhood-0.028
 neighborhood-0.028
agons-0.027
 world-0.027
Token a
Token ID257
Feature activation+5.45e-01

Pos logit contributions
 seventy+0.029
 eighty+0.029
 ninety+0.028
 fourteen+0.024
 forty+0.021
Neg logit contributions
 world-0.053
ag-0.052
 neighbourhood-0.052
 neighborhood-0.051
iet-0.051
Token time
Token ID640
Feature activation+1.02e-01

Pos logit contributions
 eighty+0.031
 seventy+0.031
 ninety+0.029
 fourteen+0.027
 enough+0.026
Neg logit contributions
ini-0.017
 neighbourhood-0.016
 world-0.015
 neighborhood-0.015
ag-0.015
Token and
Token ID290
Feature activation+3.29e-01

Pos logit contributions
 eighty+0.020
 seventy+0.020
 ninety+0.019
 fourteen+0.017
 enough+0.017
Neg logit contributions
ini-0.015
 neighbourhood-0.015
ag-0.015
 world-0.015
 neighborhood-0.015
Token fresh
Token ID4713
Feature activation-1.22e-01

Pos logit contributions
 seventy+0.016
 eighty+0.016
 ninety+0.015
 fourteen+0.014
 um+0.013
Neg logit contributions
 world-0.013
 neighbourhood-0.013
 neighborhood-0.013
ling-0.012
ini-0.012
Token air
Token ID1633
Feature activation-5.40e-01

Pos logit contributions
ini+0.004
 neighbourhood+0.004
iet+0.004
ag+0.004
 neighborhood+0.004
Neg logit contributions
 seventy-0.007
 eighty-0.007
 ninety-0.006
 fourteen-0.006
 enough-0.006
Token.
Token ID13
Feature activation+1.45e-01

Pos logit contributions
ini+0.017
 neighbourhood+0.017
 neighborhood+0.017
iet+0.017
 world+0.016
Neg logit contributions
 eighty-0.029
 seventy-0.029
 ninety-0.029
 enough-0.026
 fourteen-0.026
Token
Token ID198
Feature activation+1.57e-01

Pos logit contributions
 eighty+0.007
 seventy+0.007
 ninety+0.006
 fourteen+0.006
 enough+0.006
Neg logit contributions
ini-0.006
 neighbourhood-0.006
 world-0.006
ag-0.006
 neighborhood-0.006
Token
Token ID198
Feature activation+1.93e-01

Pos logit contributions
 eighty+0.007
 seventy+0.007
 ninety+0.007
 fourteen+0.006
 Fl+0.006
Neg logit contributions
ini-0.006
 neighbourhood-0.006
 world-0.006
ag-0.006
 neighborhood-0.006
TokenAs
Token ID1722
Feature activation-5.28e-02

Pos logit contributions
 eighty+0.008
 seventy+0.008
 ninety+0.007
 fourteen+0.006
 Fl+0.006
Neg logit contributions
ini-0.009
ag-0.009
 neighbourhood-0.009
 world-0.009
 neighborhood-0.009
Token they
Token ID484
Feature activation-3.05e-01

Pos logit contributions
 neighbourhood+0.002
 world+0.002
 neighborhood+0.002
iet+0.002
ini+0.002
Neg logit contributions
 seventy-0.002
 eighty-0.002
 ninety-0.002
 fourteen-0.002
 forty-0.002
Token were
Token ID547
Feature activation+2.50e-01

Pos logit contributions
 neighbourhood+0.011
ling+0.011
 neighborhood+0.011
ag+0.010
iet+0.010
Neg logit contributions
 seventy-0.015
 eighty-0.015
 ninety-0.014
 fault-0.014
 fourteen-0.014
Token riding
Token ID10311
Feature activation+5.45e-01

Pos logit contributions
 seventy+0.013
 eighty+0.013
 ninety+0.013
 fourteen+0.012
 pret+0.011
Neg logit contributions
ag-0.008
agging-0.008
 neighbourhood-0.008
 neighborhood-0.008
ling-0.007
Token,
Token ID11
Feature activation-1.48e-01

Pos logit contributions
 seventy+0.028
 eighty+0.027
 ninety+0.026
 fourteen+0.024
 forty+0.022
Neg logit contributions
 neighbourhood-0.020
ag-0.020
 world-0.020
 neighborhood-0.020
ling-0.020
Token was
Token ID373
Feature activation-5.19e-01

Pos logit contributions
 eighty+0.055
 seventy+0.055
 ninety+0.053
 fourteen+0.050
six+0.045
Neg logit contributions
 world-0.035
 neighbourhood-0.034
 neighborhood-0.033
ini-0.032
ling-0.030
Token a
Token ID257
Feature activation+8.33e-02

Pos logit contributions
 world+0.028
 neighbourhood+0.028
 neighborhood+0.028
 outdoors+0.028
ag+0.028
Neg logit contributions
 seventy-0.017
 eighty-0.017
 ninety-0.016
 fourteen-0.014
 forty-0.012
Token little
Token ID1310
Feature activation+1.72e-01

Pos logit contributions
 eighty+0.003
 seventy+0.003
 ninety+0.003
 Fl+0.003
 Frost+0.003
Neg logit contributions
ag-0.004
agging-0.004
ags-0.004
 languages-0.004
ini-0.004
Token girl
Token ID2576
Feature activation-5.11e-02

Pos logit contributions
 eighty+0.008
 seventy+0.008
 ninety+0.007
 Fl+0.007
 Frost+0.007
Neg logit contributions
ag-0.007
 neighbourhood-0.007
iet-0.007
agging-0.007
ini-0.007
Token,
Token ID11
Feature activation+1.98e-01

Pos logit contributions
agons+0.003
agging+0.003
ags+0.003
ag+0.003
reci+0.003
Neg logit contributions
 Fl-0.002
 no-0.002
 seventy-0.002
 eighty-0.001
 Pe-0.001
Token who
Token ID508
Feature activation-4.27e-01

Pos logit contributions
 eighty+0.009
 seventy+0.009
 ninety+0.008
 fourteen+0.007
 Fl+0.007
Neg logit contributions
ini-0.009
 neighbourhood-0.009
ag-0.009
 world-0.009
 neighborhood-0.008
Token was
Token ID373
Feature activation-3.29e-01

Pos logit contributions
agging+0.019
ini+0.019
iet+0.019
ag+0.019
agons+0.019
Neg logit contributions
 eighty-0.017
 seventy-0.016
 Fl-0.015
 ninety-0.015
 pret-0.014
Token very
Token ID845
Feature activation-7.75e-01

Pos logit contributions
 neighbourhood+0.013
ini+0.013
 world+0.013
 neighborhood+0.013
ag+0.013
Neg logit contributions
 eighty-0.016
 seventy-0.016
 ninety-0.015
 fourteen-0.014
 enough-0.013
Token sad
Token ID6507
Feature activation+5.12e-01

Pos logit contributions
iet+0.031
ag+0.029
 neighbourhood+0.029
 lens+0.029
 neighborhood+0.029
Neg logit contributions
 eighty-0.036
 seventy-0.036
 ninety-0.034
 Fl-0.032
 Ell-0.031
Token.
Token ID13
Feature activation+2.05e-01

Pos logit contributions
 eighty+0.032
 seventy+0.032
 ninety+0.031
 fourteen+0.028
 forty+0.027
Neg logit contributions
 world-0.015
 neighbourhood-0.014
 neighborhood-0.014
ag-0.013
ling-0.013
Token She
Token ID1375
Feature activation+5.13e-02

Pos logit contributions
 seventy+0.010
 eighty+0.010
 ninety+0.010
 Fl+0.009
 Ell+0.009
Neg logit contributions
ling-0.008
ini-0.008
iet-0.008
agons-0.007
ag-0.007
Token dolls
Token ID36062
Feature activation-3.34e-01

Pos logit contributions
ini+0.006
 neighbourhood+0.005
ag+0.005
 world+0.005
 neighborhood+0.005
Neg logit contributions
 eighty-0.006
 seventy-0.006
 ninety-0.006
 fourteen-0.005
 Fl-0.005
Token.
Token ID13
Feature activation-1.61e-01

Pos logit contributions
ini+0.012
 neighbourhood+0.011
iet+0.011
ag+0.011
 neighborhood+0.011
Neg logit contributions
 eighty-0.017
 seventy-0.017
 ninety-0.017
 enough-0.015
 fourteen-0.015
Token She
Token ID1375
Feature activation-1.64e-01

Pos logit contributions
 world+0.005
 neighbourhood+0.005
 neighborhood+0.005
ag+0.005
iet+0.005
Neg logit contributions
 eighty-0.009
 seventy-0.009
 ninety-0.008
 fourteen-0.008
 enough-0.007
Token had
Token ID550
Feature activation-3.75e-01

Pos logit contributions
 neighborhood+0.005
 neighbourhood+0.005
 world+0.005
iet+0.005
ag+0.005
Neg logit contributions
 seventy-0.009
 eighty-0.009
 ninety-0.009
 fourteen-0.009
 enough-0.008
Token many
Token ID867
Feature activation+4.24e-01

Pos logit contributions
 neighbourhood+0.021
 neighborhood+0.021
 world+0.020
ag+0.020
iet+0.020
Neg logit contributions
 seventy-0.012
 eighty-0.012
 ninety-0.011
 fourteen-0.010
 forty-0.009
Token dolls
Token ID36062
Feature activation-2.23e-01

Pos logit contributions
 seventy+0.018
 eighty+0.017
 ninety+0.017
 enough+0.015
 pret+0.015
Neg logit contributions
ini-0.019
iet-0.018
 neighbourhood-0.018
 world-0.018
 neighborhood-0.018
Token and
Token ID290
Feature activation+1.11e-02

Pos logit contributions
ini+0.010
 neighbourhood+0.010
 world+0.010
ag+0.009
 neighborhood+0.009
Neg logit contributions
 eighty-0.010
 seventy-0.010
 ninety-0.009
 fourteen-0.008
 enough-0.008
Token gave
Token ID2921
Feature activation-3.32e-01

Pos logit contributions
 seventy+0.001
 eighty+0.001
 ninety+0.000
 fourteen+0.000
 forty+0.000
Neg logit contributions
 neighbourhood-0.000
ag-0.000
 world-0.000
 neighborhood-0.000
ini-0.000
Token them
Token ID606
Feature activation-3.46e-02

Pos logit contributions
ag+0.011
 world+0.011
 neighbourhood+0.011
 neighborhood+0.010
ini+0.010
Neg logit contributions
 eighty-0.018
 seventy-0.018
 ninety-0.017
 fourteen-0.016
 enough-0.015
Token names
Token ID3891
Feature activation-4.00e-01

Pos logit contributions
ag+0.001
 world+0.001
 neighbourhood+0.001
 neighborhood+0.001
iet+0.001
Neg logit contributions
 seventy-0.002
 eighty-0.002
 ninety-0.001
 fourteen-0.001
 enough-0.001
Token and
Token ID290
Feature activation-3.80e-01

Pos logit contributions
ag+0.010
 world+0.010
 neighbourhood+0.010
ini+0.010
 neighborhood+0.010
Neg logit contributions
 eighty-0.024
 seventy-0.024
 ninety-0.023
 fourteen-0.022
 enough-0.021
Token they
Token ID484
Feature activation-7.64e-01

Pos logit contributions
 neighbourhood+0.012
 neighborhood+0.012
 world+0.012
iet+0.011
 town+0.011
Neg logit contributions
 seventy-0.016
 eighty-0.016
 ninety-0.015
 fourteen-0.013
 forty-0.012
Token saw
Token ID2497
Feature activation-8.47e-01

Pos logit contributions
 neighbourhood+0.026
 neighborhood+0.026
iet+0.024
 world+0.024
ling+0.023
Neg logit contributions
 seventy-0.040
 eighty-0.040
 ninety-0.038
 fourteen-0.036
 pret-0.034
Token a
Token ID257
Feature activation+2.21e-01

Pos logit contributions
 neighbourhood+0.041
 neighborhood+0.041
 world+0.040
ag+0.038
iet+0.038
Neg logit contributions
 seventy-0.034
 eighty-0.033
 ninety-0.031
 fourteen-0.027
 pret-0.025
Token man
Token ID582
Feature activation-5.03e-01

Pos logit contributions
 eighty+0.012
 seventy+0.012
 ninety+0.012
 fourteen+0.010
 Fl+0.010
Neg logit contributions
ag-0.007
ini-0.007
iet-0.007
agging-0.007
 neighbourhood-0.007
Token acting
Token ID7205
Feature activation-1.12e-01

Pos logit contributions
iet+0.025
ini+0.024
agons+0.024
 neighbourhood+0.023
ag+0.023
Neg logit contributions
 seventy-0.020
 eighty-0.020
 ninety-0.019
 enough-0.017
 fifty-0.016
Token funny
Token ID8258
Feature activation-4.81e-02

Pos logit contributions
ini+0.004
 neighbourhood+0.004
 neighborhood+0.004
 world+0.004
ag+0.004
Neg logit contributions
 eighty-0.006
 seventy-0.006
 ninety-0.006
 fourteen-0.005
 enough-0.005
Token.
Token ID13
Feature activation+1.05e-01

Pos logit contributions
ini+0.002
 neighbourhood+0.002
 neighborhood+0.002
ag+0.002
 world+0.002
Neg logit contributions
 eighty-0.002
 seventy-0.002
 ninety-0.002
 fourteen-0.002
 enough-0.002
Token He
Token ID679
Feature activation+3.17e-01

Pos logit contributions
 eighty+0.005
 seventy+0.005
 ninety+0.004
 fourteen+0.004
 enough+0.004
Neg logit contributions
ini-0.005
 neighbourhood-0.005
 world-0.004
ag-0.004
 neighborhood-0.004
Token was
Token ID373
Feature activation+2.02e-01

Pos logit contributions
 seventy+0.015
 eighty+0.015
 ninety+0.015
 fourteen+0.014
 enough+0.013
Neg logit contributions
 neighbourhood-0.012
 neighborhood-0.012
 world-0.012
ling-0.011
iet-0.011
Token juggling
Token ID48097
Feature activation+7.76e-01

Pos logit contributions
 eighty+0.009
 seventy+0.009
 ninety+0.009
 fourteen+0.008
 enough+0.008
Neg logit contributions
 neighbourhood-0.008
 world-0.008
ini-0.008
ag-0.008
 neighborhood-0.008
Token and
Token ID290
Feature activation+8.04e-01

Pos logit contributions
 seventy+0.040
 eighty+0.039
 ninety+0.037
 fourteen+0.033
 forty+0.031
Neg logit contributions
ag-0.030
 world-0.029
ling-0.028
iet-0.027
 neighbourhood-0.027
TokenBut
Token ID1537
Feature activation+4.37e-01

Pos logit contributions
 eighty+0.005
 seventy+0.005
 fourteen+0.004
 ninety+0.004
 oh+0.004
Neg logit contributions
ag-0.006
 world-0.006
 neighbourhood-0.006
ini-0.006
agging-0.006
Token then
Token ID788
Feature activation-1.85e-01

Pos logit contributions
 seventy+0.014
 eighty+0.014
 ninety+0.013
 fourteen+0.011
 pret+0.010
Neg logit contributions
 neighbourhood-0.023
 neighborhood-0.023
ini-0.023
 world-0.023
ag-0.023
Token,
Token ID11
Feature activation-8.13e-02

Pos logit contributions
 neighbourhood+0.007
 neighborhood+0.007
 world+0.007
ini+0.007
iet+0.007
Neg logit contributions
 seventy-0.009
 eighty-0.009
 ninety-0.009
 fourteen-0.008
 forty-0.007
Token Tim
Token ID5045
Feature activation-1.65e-01

Pos logit contributions
 neighbourhood+0.004
 neighborhood+0.004
 world+0.004
ini+0.004
 outdoors+0.004
Neg logit contributions
 seventy-0.003
 eighty-0.003
 ninety-0.003
 fourteen-0.002
 sixty-0.002
Tokenmy
Token ID1820
Feature activation-1.23e+00

Pos logit contributions
 neighbourhood+0.007
ini+0.007
ag+0.007
 world+0.007
 neighborhood+0.007
Neg logit contributions
 eighty-0.007
 seventy-0.007
 ninety-0.007
 fourteen-0.006
 enough-0.006
Token had
Token ID550
Feature activation-1.94e-01

Pos logit contributions
 neighbourhood+0.052
ini+0.052
 world+0.051
 neighborhood+0.051
ag+0.051
Neg logit contributions
 seventy-0.055
 eighty-0.055
 ninety-0.052
 fourteen-0.047
 enough-0.044
Token an
Token ID281
Feature activation+8.29e-01

Pos logit contributions
 neighborhood+0.010
 neighbourhood+0.009
 world+0.009
 town+0.009
 nation+0.008
Neg logit contributions
 seventy-0.008
 eighty-0.008
 ninety-0.007
 fourteen-0.006
 sixty-0.006
Token idea
Token ID2126
Feature activation-1.18e+00

Pos logit contributions
 eighty+0.048
 seventy+0.048
 ninety+0.046
 fourteen+0.043
 enough+0.042
Neg logit contributions
 neighbourhood-0.024
ini-0.023
 world-0.023
 neighborhood-0.023
ag-0.022
Token.
Token ID13
Feature activation+2.80e-01

Pos logit contributions
 neighbourhood+0.041
ini+0.041
 neighborhood+0.040
iet+0.039
 world+0.039
Neg logit contributions
 seventy-0.061
 eighty-0.060
 ninety-0.058
 enough-0.055
 fourteen-0.054
Token He
Token ID679
Feature activation+3.09e-01

Pos logit contributions
 seventy+0.011
 eighty+0.011
 ninety+0.011
 enough+0.009
 Fl+0.009
Neg logit contributions
iet-0.013
ag-0.013
ling-0.013
ini-0.013
agging-0.013
Token said
Token ID531
Feature activation+9.65e-02

Pos logit contributions
 seventy+0.015
 eighty+0.015
 ninety+0.014
 fourteen+0.013
 forty+0.012
Neg logit contributions
 neighborhood-0.012
 neighbourhood-0.011
ling-0.011
ini-0.011
iet-0.011
Token home
Token ID1363
Feature activation-2.67e-01

Pos logit contributions
ini+0.003
 neighbourhood+0.003
ag+0.003
 neighborhood+0.003
 world+0.003
Neg logit contributions
 eighty-0.002
 seventy-0.002
 ninety-0.002
 fourteen-0.002
 enough-0.002
Token,
Token ID11
Feature activation-1.85e-01

Pos logit contributions
 neighbourhood+0.010
 world+0.010
 neighborhood+0.010
ling+0.010
ini+0.010
Neg logit contributions
 eighty-0.014
 seventy-0.013
 ninety-0.013
 fourteen-0.012
 forty-0.011
Token they
Token ID484
Feature activation-5.97e-01

Pos logit contributions
 neighbourhood+0.007
 world+0.007
 neighborhood+0.007
iet+0.007
ini+0.007
Neg logit contributions
 eighty-0.009
 seventy-0.009
 ninety-0.008
 fourteen-0.007
 forty-0.007
Token saw
Token ID2497
Feature activation-9.83e-01

Pos logit contributions
 neighbourhood+0.023
 neighborhood+0.023
iet+0.022
 world+0.022
ag+0.021
Neg logit contributions
 seventy-0.029
 eighty-0.029
 ninety-0.028
 fourteen-0.026
 forty-0.024
Token a
Token ID257
Feature activation+3.07e-01

Pos logit contributions
 neighbourhood+0.049
 world+0.049
 neighborhood+0.048
ag+0.047
ini+0.047
Neg logit contributions
 seventy-0.039
 eighty-0.039
 ninety-0.036
 fourteen-0.029
 sixty-0.027
Token gr
Token ID1036
Feature activation-1.48e+00

Pos logit contributions
 eighty+0.018
 seventy+0.018
 ninety+0.017
 fourteen+0.016
 enough+0.015
Neg logit contributions
ag-0.009
ini-0.009
 neighbourhood-0.009
iet-0.009
 neighborhood-0.008
Tokenumpy
Token ID32152
Feature activation-3.79e-01

Pos logit contributions
 neighbourhood+0.030
 neighborhood+0.029
 outdoors+0.027
 world+0.024
 languages+0.023
Neg logit contributions
 eighty-0.086
 seventy-0.085
 ninety-0.085
 fourteen-0.082
 enough-0.077
Token man
Token ID582
Feature activation-3.86e-01

Pos logit contributions
 world+0.012
 neighbourhood+0.012
ini+0.012
 neighborhood+0.011
ag+0.011
Neg logit contributions
 eighty-0.021
 seventy-0.021
 ninety-0.020
 fourteen-0.019
 enough-0.018
Token.
Token ID13
Feature activation+6.45e-02

Pos logit contributions
iet+0.018
ini+0.018
agons+0.017
 neighbourhood+0.017
ag+0.016
Neg logit contributions
 seventy-0.016
 eighty-0.016
 ninety-0.015
 enough-0.014
 Fl-0.013
Token He
Token ID679
Feature activation+3.83e-01

Pos logit contributions
 seventy+0.003
 eighty+0.003
 ninety+0.003
 fourteen+0.002
 Fl+0.002
Neg logit contributions
ini-0.003
 neighbourhood-0.003
ling-0.003
ag-0.003
 world-0.003
Token was
Token ID373
Feature activation+5.64e-02

Pos logit contributions
 eighty+0.017
 seventy+0.017
 ninety+0.017
 fourteen+0.015
 enough+0.014
Neg logit contributions
 neighbourhood-0.016
 neighborhood-0.016
 world-0.016
ini-0.016
ag-0.015
Token is
Token ID318
Feature activation-2.96e-01

Pos logit contributions
 eighty+0.009
 seventy+0.009
 ninety+0.008
 fourteen+0.008
 enough+0.007
Neg logit contributions
ini-0.009
 neighbourhood-0.009
ag-0.009
 world-0.009
 neighborhood-0.009
Token not
Token ID407
Feature activation-2.96e-01

Pos logit contributions
 world+0.012
 neighbourhood+0.012
 neighborhood+0.012
ag+0.012
ini+0.012
Neg logit contributions
 eighty-0.014
 seventy-0.014
 ninety-0.013
 fourteen-0.012
 forty-0.011
Token good
Token ID922
Feature activation+7.00e-01

Pos logit contributions
 world+0.011
ag+0.011
ling+0.011
 neighbourhood+0.010
 neighborhood+0.010
Neg logit contributions
 seventy-0.015
 eighty-0.015
 ninety-0.014
 fourteen-0.013
 forty-0.012
Token for
Token ID329
Feature activation+6.89e-02

Pos logit contributions
 eighty+0.040
 seventy+0.039
 ninety+0.038
 forty+0.033
 fourteen+0.032
Neg logit contributions
 world-0.023
ag-0.022
ling-0.021
 neighbourhood-0.021
 neighborhood-0.021
Token pump
Token ID8901
Feature activation+4.93e-01

Pos logit contributions
 eighty+0.004
 seventy+0.004
 ninety+0.004
 fourteen+0.003
 pret+0.003
Neg logit contributions
 world-0.002
 neighbourhood-0.002
ag-0.002
 neighborhood-0.002
ini-0.002
Tokenkins
Token ID5331
Feature activation-1.44e+00

Pos logit contributions
 eighty+0.027
 seventy+0.027
 ninety+0.026
 fourteen+0.024
 enough+0.024
Neg logit contributions
 neighbourhood-0.015
ini-0.015
 neighborhood-0.015
ag-0.015
 world-0.015
Token,"
Token ID553
Feature activation+9.15e-02

Pos logit contributions
 world+0.046
 neighbourhood+0.045
ag+0.045
ini+0.044
 neighborhood+0.044
Neg logit contributions
 seventy-0.079
 eighty-0.079
 ninety-0.076
 fourteen-0.070
 forty-0.066
Token Pam
Token ID24497
Feature activation+8.73e-01

Pos logit contributions
 eighty+0.004
 seventy+0.004
 ninety+0.003
 fourteen+0.003
 Fl+0.003
Neg logit contributions
ini-0.004
iet-0.004
ag-0.004
agging-0.004
 neighbourhood-0.004
Token says
Token ID1139
Feature activation+3.52e-01

Pos logit contributions
 eighty+0.042
 seventy+0.042
 ninety+0.040
 fourteen+0.036
 enough+0.034
Neg logit contributions
 neighbourhood-0.034
ag-0.033
 world-0.033
 neighborhood-0.033
ini-0.033
Token.
Token ID13
Feature activation-7.01e-02

Pos logit contributions
 seventy+0.018
 eighty+0.017
 ninety+0.017
 enough+0.016
 Frost+0.015
Neg logit contributions
ini-0.014
 neighbourhood-0.014
iet-0.014
 neighborhood-0.014
 world-0.014
Token
Token ID198
Feature activation-2.45e-01

Pos logit contributions
ini+0.003
iet+0.003
ag+0.003
ling+0.003
 neighbourhood+0.003
Neg logit contributions
 seventy-0.003
 eighty-0.003
 ninety-0.003
 Fl-0.003
 fourteen-0.003
Token said
Token ID531
Feature activation-2.88e-01

Pos logit contributions
 seventy+0.015
 eighty+0.015
 ninety+0.014
 fourteen+0.013
 forty+0.012
Neg logit contributions
 neighbourhood-0.014
 neighborhood-0.014
ag-0.013
 world-0.013
ini-0.013
Token that
Token ID326
Feature activation-1.16e-01

Pos logit contributions
 neighbourhood+0.012
 world+0.012
ag+0.012
ini+0.012
 neighborhood+0.012
Neg logit contributions
 seventy-0.013
 eighty-0.013
 ninety-0.012
 fourteen-0.011
 forty-0.010
Token sharing
Token ID7373
Feature activation-8.12e-02

Pos logit contributions
 world+0.005
 neighbourhood+0.005
 neighborhood+0.005
ag+0.005
ini+0.005
Neg logit contributions
 seventy-0.005
 eighty-0.005
 ninety-0.005
 fourteen-0.005
 forty-0.004
Token and
Token ID290
Feature activation+2.27e-01

Pos logit contributions
 world+0.003
ling+0.003
ag+0.003
 neighbourhood+0.003
 neighborhood+0.003
Neg logit contributions
 seventy-0.004
 eighty-0.004
 ninety-0.004
 fourteen-0.003
 forty-0.003
Token being
Token ID852
Feature activation-8.06e-01

Pos logit contributions
 eighty+0.010
 seventy+0.010
 ninety+0.010
 fourteen+0.009
 forty+0.008
Neg logit contributions
 world-0.010
 neighbourhood-0.009
 neighborhood-0.009
ag-0.009
ini-0.009
Token kind
Token ID1611
Feature activation-1.49e+00

Pos logit contributions
 neighbourhood+0.024
ag+0.023
 world+0.023
ini+0.023
 neighborhood+0.023
Neg logit contributions
 seventy-0.046
 eighty-0.046
 ninety-0.045
 fourteen-0.041
 enough-0.039
Token is
Token ID318
Feature activation-1.93e-02

Pos logit contributions
 world+0.053
 neighborhood+0.051
 neighbourhood+0.051
ag+0.051
ling+0.048
Neg logit contributions
 eighty-0.081
 seventy-0.081
 ninety-0.077
 fourteen-0.069
 forty-0.067
Token better
Token ID1365
Feature activation+8.22e-02

Pos logit contributions
 neighbourhood+0.001
 world+0.001
ini+0.001
ag+0.001
 neighborhood+0.001
Neg logit contributions
 seventy-0.001
 eighty-0.001
 ninety-0.001
 fourteen-0.001
 enough-0.001
Token than
Token ID621
Feature activation+7.82e-02

Pos logit contributions
 eighty+0.004
 seventy+0.004
 ninety+0.004
 fourteen+0.003
 enough+0.003
Neg logit contributions
ini-0.003
 neighbourhood-0.003
 world-0.003
ag-0.003
 neighborhood-0.003
Token fighting
Token ID4330
Feature activation-1.47e-01

Pos logit contributions
 eighty+0.004
 seventy+0.004
 ninety+0.003
 fourteen+0.003
 forty+0.003
Neg logit contributions
ag-0.003
 world-0.003
 neighbourhood-0.003
 neighborhood-0.003
agging-0.003
Token and
Token ID290
Feature activation-8.93e-02

Pos logit contributions
 world+0.004
ag+0.004
 neighbourhood+0.004
 neighborhood+0.004
ini+0.004
Neg logit contributions
 seventy-0.009
 eighty-0.009
 ninety-0.008
 fourteen-0.008
 forty-0.007
Token shell
Token ID7582
Feature activation+6.18e-01

Pos logit contributions
 world+0.007
 neighbourhood+0.007
 neighborhood+0.007
ini+0.007
ag+0.007
Neg logit contributions
 eighty-0.011
 seventy-0.011
 ninety-0.011
 fourteen-0.010
 enough-0.009
Token that
Token ID326
Feature activation-7.14e-02

Pos logit contributions
 eighty+0.023
 seventy+0.023
 ninety+0.021
 enough+0.019
 fourteen+0.018
Neg logit contributions
 neighbourhood-0.030
ini-0.030
iet-0.030
ag-0.030
 neighborhood-0.029
Token was
Token ID373
Feature activation-2.57e-01

Pos logit contributions
 world+0.002
 neighbourhood+0.002
ini+0.002
 neighborhood+0.002
ag+0.002
Neg logit contributions
 seventy-0.004
 eighty-0.004
 ninety-0.004
 fourteen-0.003
 enough-0.003
Token rare
Token ID4071
Feature activation-8.05e-02

Pos logit contributions
ag+0.008
 world+0.008
ini+0.008
 neighbourhood+0.008
iet+0.008
Neg logit contributions
 seventy-0.014
 eighty-0.014
 ninety-0.013
 fourteen-0.013
 pret-0.012
Token and
Token ID290
Feature activation-1.09e-01

Pos logit contributions
 neighbourhood+0.002
ini+0.002
 world+0.002
 neighborhood+0.002
ag+0.002
Neg logit contributions
 eighty-0.005
 seventy-0.005
 ninety-0.004
 fourteen-0.004
 enough-0.004
Token very
Token ID845
Feature activation-1.49e+00

Pos logit contributions
 neighbourhood+0.004
ini+0.004
ag+0.004
 world+0.004
 neighborhood+0.004
Neg logit contributions
 eighty-0.006
 seventy-0.006
 ninety-0.006
 fourteen-0.005
 enough-0.005
Token pretty
Token ID2495
Feature activation-7.78e-01

Pos logit contributions
 neighbourhood+0.054
 neighborhood+0.053
iet+0.049
 world+0.049
ini+0.049
Neg logit contributions
 eighty-0.079
 seventy-0.077
 ninety-0.076
 Fl-0.069
 enough-0.066
Token.
Token ID13
Feature activation+4.80e-02

Pos logit contributions
ag+0.024
ini+0.024
 neighbourhood+0.024
iet+0.023
 neighborhood+0.023
Neg logit contributions
 seventy-0.043
 eighty-0.043
 ninety-0.041
 enough-0.038
 fourteen-0.038
Token Every
Token ID3887
Feature activation-8.08e-01

Pos logit contributions
 eighty+0.003
 seventy+0.003
 ninety+0.003
 fourteen+0.002
 enough+0.002
Neg logit contributions
 neighbourhood-0.002
ini-0.002
 world-0.002
ag-0.002
 neighborhood-0.002
Token day
Token ID1110
Feature activation-3.99e-01

Pos logit contributions
ini+0.022
 neighbourhood+0.021
ag+0.021
 neighborhood+0.021
 world+0.021
Neg logit contributions
 eighty-0.048
 seventy-0.048
 ninety-0.047
 fourteen-0.043
 enough-0.042
Token she
Token ID673
Feature activation-8.60e-01

Pos logit contributions
 world+0.017
 neighbourhood+0.017
ini+0.016
 neighborhood+0.016
ag+0.016
Neg logit contributions
 seventy-0.018
 eighty-0.018
 ninety-0.017
 fourteen-0.015
 pret-0.014
Token collect
Token ID2824
Feature activation+2.13e-01

Pos logit contributions
 seventy+0.022
 eighty+0.022
 ninety+0.021
 fourteen+0.019
 pret+0.019
Neg logit contributions
ini-0.018
 neighbourhood-0.018
ling-0.017
 neighborhood-0.017
 world-0.017
Token or
Token ID393
Feature activation-4.81e-01

Pos logit contributions
 eighty+0.010
 seventy+0.010
 ninety+0.010
 enough+0.009
 Fl+0.008
Neg logit contributions
ini-0.008
 neighbourhood-0.008
 neighborhood-0.008
iet-0.008
 world-0.008
Tokennam
Token ID7402
Feature activation-1.29e+00

Pos logit contributions
ini+0.025
iet+0.024
agging+0.024
 neighbourhood+0.024
ag+0.023
Neg logit contributions
 eighty-0.018
 seventy-0.017
 ninety-0.017
 enough-0.014
 Fl-0.014
Tokenents
Token ID658
Feature activation+1.39e+00

Pos logit contributions
 neighbourhood+0.073
iet+0.072
 world+0.072
 neighborhood+0.072
ag+0.072
Neg logit contributions
 eighty-0.039
 seventy-0.039
 ninety-0.037
 fourteen-0.029
 enough-0.028
Token.
Token ID13
Feature activation+8.57e-02

Pos logit contributions
 seventy+0.086
 eighty+0.085
 ninety+0.082
 fourteen+0.077
 forty+0.072
Neg logit contributions
ag-0.035
 world-0.035
 neighbourhood-0.034
ini-0.034
 neighborhood-0.033
Token Everywhere
Token ID48201
Feature activation-1.51e+00

Pos logit contributions
 seventy+0.004
 eighty+0.004
 ninety+0.004
 Fl+0.004
 forty+0.004
Neg logit contributions
ling-0.003
ini-0.003
iet-0.003
agging-0.003
ag-0.003
Token she
Token ID673
Feature activation-7.57e-01

Pos logit contributions
 neighbourhood+0.056
 world+0.056
 neighborhood+0.055
ini+0.055
ag+0.055
Neg logit contributions
 seventy-0.075
 eighty-0.075
 ninety-0.072
 fourteen-0.065
 enough-0.061
Token went
Token ID1816
Feature activation+3.66e-01

Pos logit contributions
ag+0.033
agging+0.032
 neighbourhood+0.032
iet+0.032
ini+0.032
Neg logit contributions
 eighty-0.033
 seventy-0.033
 ninety-0.031
 pret-0.027
 enough-0.026
Token she
Token ID673
Feature activation-6.61e-01

Pos logit contributions
 eighty+0.014
 seventy+0.014
 ninety+0.014
 fourteen+0.012
 enough+0.012
Neg logit contributions
ini-0.017
iet-0.017
 neighbourhood-0.017
ag-0.017
 neighborhood-0.017
Token searched
Token ID16499
Feature activation-2.41e-01

Pos logit contributions
 neighbourhood+0.027
 neighborhood+0.027
 world+0.027
ini+0.027
ag+0.026
Neg logit contributions
 seventy-0.031
 eighty-0.031
 ninety-0.029
 fourteen-0.028
 enough-0.025
Token for
Token ID329
Feature activation-1.74e-01

Pos logit contributions
ini+0.011
 neighbourhood+0.011
 neighborhood+0.010
ag+0.010
iet+0.010
Neg logit contributions
 eighty-0.010
 seventy-0.010
 ninety-0.010
 fourteen-0.009
 enough-0.008