TOP ACTIVATIONS
MAX = 0.059

says . She hears the noise and comes to the attic
Mom hears the noise and comes to the garage
very much . She hugged it every night and played with
. Lily had a bath tub in her bathroom . She
every night . She hugs it and squee zes it tight
Mom hears the noise . She comes to see
Mom hears the noise and comes to see what
Mom hears the noise and comes to see what
Mom heard the noise and came to see what
Mom hears the noise and comes to the room
Mom hears the noise and comes to see what
. His name was Joe , and he was wearing red
shop . She gave Po ppy some money and said "
Mom heard the noise and came to see what
safe . His parents made him feel so special and loved
very excited . She gave Ben a great big hug and
mess . Mom hears the noise and comes to see .
to end . Sarah gave Jack one last squeeze and they
Mom hears the noise . She comes to see
Mom hears the noise and comes to the room

MIN ACTIVATIONS
MIN = -0.079

and saw a magical fair yt ale creature called a n
it first ? â M ummy said
tall hills and passed through vibrant forests . After several days
lot of different animals and lush green trees .
came across rolling hills with lush green grass . The sun
Finally Lucy â s mom said ,
Don â t worry , Jack .
, why don â t you take a break
you want ? â Alex wanted the
! Why don â t you come with me
my friend ? â The chicken said yes and
, if that â s what you want to
felt bad and said ," I 'm sorry , let me
Don â t worry , Mat ty
I do ? â Her dad replied ,
, please ? â Her mom
. They promised to play and eat popcorn together every day
he just couldn â t reach it . Then
Don â t worry , I â
â Sam â s eyes lit up and

INTERVAL 0.025 - 0.059
CONTAINS 1.829%

balloon !" Tom said . " I want to touch it
pond would always offer her a fun surprise . <|endoftext|> Once
. Lily wanted to help Max feel better .
goes straight . It does not bump or crash . It
get it if she wanted it . Molly was so excited

INTERVAL -0.010 - 0.025
CONTAINS 78.042%

he had a surprise for her . He gave her a
if you need extra help ." And that
was a girl called Lucy . She loved to pick things
wanted it very badly . Mom said they could buy it
visit the sm elly nation , but one day a brave

INTERVAL -0.044 - -0.010
CONTAINS 20.100%

" Not yet , we have to cut it first ,"
she was done , she said , " Thank you ,
told T ane that it was a sign of hope and
them and makes them mad . You could have been hurt
gentle tap , and spoke softly to him . He said

INTERVAL -0.079 - -0.044
CONTAINS 0.030%

animals like lions , ze br as , and gir aff
. You are rude . I do not want to play
end . <|endoftext|> Mom my had a plan . She wanted
to hurt your feelings . I like your stick . It
. This worked until a gust of wind blew the tw
Token says
Token ID1139
Feature activation-3.38e-03

Pos logit contributions
 someone+0.012
Maybe+0.012
 one+0.011
 dad+0.011
 somewhere+0.011
Neg logit contributions
ocol-0.016
 rib-0.013
 microsc-0.012
 gymn-0.012
rou-0.012
Token.
Token ID13
Feature activation+2.37e-02

Pos logit contributions
ocol+0.007
 rib+0.006
 microsc+0.006
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
Mom-0.002
Token She
Token ID1375
Feature activation-3.70e-03

Pos logit contributions
 somewhere+0.024
Maybe+0.024
 someone+0.024
ths+0.024
 one+0.024
Neg logit contributions
ocol-0.028
 rib-0.026
 gymn-0.024
 microsc-0.024
 tant-0.022
Token hears
Token ID23180
Feature activation+2.34e-02

Pos logit contributions
ocol+0.007
 rib+0.006
 microsc+0.005
 gymn+0.005
rou+0.005
Neg logit contributions
Maybe-0.003
 someone-0.003
 one-0.003
 dad-0.003
Mom-0.003
Token the
Token ID262
Feature activation-8.80e-03

Pos logit contributions
Maybe+0.024
How+0.022
Nothing+0.020
Mom+0.020
 next+0.020
Neg logit contributions
ocol-0.039
 rib-0.032
 microsc-0.029
 gymn-0.028
 tant-0.027
Token noise
Token ID7838
Feature activation+5.93e-02

Pos logit contributions
ocol+0.018
 rib+0.015
 microsc+0.015
 gymn+0.014
rou+0.014
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.005
 dad-0.005
 next-0.005
Token and
Token ID290
Feature activation+2.17e-02

Pos logit contributions
cil+0.091
Someone+0.080
 Mysteries+0.079
 gentlemen+0.077
 Rays+0.077
Neg logit contributions
ocol-0.054
 gymn-0.044
 rib-0.042
oke-0.041
 tant-0.041
Token comes
Token ID2058
Feature activation+5.71e-03

Pos logit contributions
Mom+0.021
Maybe+0.020
Dad+0.019
 someone+0.019
 somewhere+0.019
Neg logit contributions
ocol-0.035
 rib-0.032
 gymn-0.029
 microsc-0.028
rou-0.027
Token to
Token ID284
Feature activation-1.38e-02

Pos logit contributions
Maybe+0.007
 someone+0.006
How+0.006
 one+0.006
Mom+0.006
Neg logit contributions
ocol-0.008
 rib-0.007
 microsc-0.006
 gymn-0.006
rou-0.006
Token the
Token ID262
Feature activation-2.46e-02

Pos logit contributions
ocol+0.030
 rib+0.025
 microsc+0.024
rou+0.024
pill+0.024
Neg logit contributions
 someone-0.008
Maybe-0.008
 dad-0.008
 one-0.008
 next-0.007
Token attic
Token ID46113
Feature activation-7.45e-03

Pos logit contributions
ocol+0.052
 rib+0.041
 microsc+0.040
pill+0.040
rou+0.040
Neg logit contributions
 someone-0.018
 next-0.018
 dad-0.018
Maybe-0.017
 one-0.017
Token
Token ID198
Feature activation+1.96e-02

Pos logit contributions
Maybe+0.006
 one+0.006
 next+0.006
 dad+0.006
 someone+0.006
Neg logit contributions
ocol-0.014
 rib-0.012
 microsc-0.012
 gymn-0.011
 topp-0.011
Token
Token ID198
Feature activation+1.89e-02

Pos logit contributions
 one+0.016
Maybe+0.016
 someone+0.015
How+0.015
 dad+0.015
Neg logit contributions
ocol-0.031
 microsc-0.026
 rib-0.026
rou-0.025
 gymn-0.025
TokenMom
Token ID29252
Feature activation+1.27e-02

Pos logit contributions
 one+0.012
 someone+0.012
 dad+0.012
Maybe+0.012
 look+0.012
Neg logit contributions
ocol-0.034
 rib-0.028
 microsc-0.028
 gol-0.027
rou-0.027
Token hears
Token ID23180
Feature activation+2.01e-02

Pos logit contributions
 someone+0.011
Maybe+0.011
 one+0.010
 dad+0.010
 somewhere+0.010
Neg logit contributions
ocol-0.022
 rib-0.019
rou-0.018
 microsc-0.018
 gymn-0.018
Token the
Token ID262
Feature activation+4.01e-03

Pos logit contributions
Maybe+0.019
How+0.017
Mom+0.016
Nothing+0.016
 next+0.016
Neg logit contributions
ocol-0.034
 rib-0.029
 microsc-0.027
 gymn-0.025
 topp-0.024
Token noise
Token ID7838
Feature activation+5.90e-02

Pos logit contributions
Maybe+0.003
 someone+0.003
 one+0.003
How+0.003
Mom+0.003
Neg logit contributions
ocol-0.008
 rib-0.006
 microsc-0.006
 gymn-0.006
 topp-0.006
Token and
Token ID290
Feature activation+2.52e-02

Pos logit contributions
cil+0.089
Someone+0.082
Maybe+0.080
 Mysteries+0.079
Mother+0.079
Neg logit contributions
ocol-0.052
 gymn-0.043
oke-0.042
 rib-0.040
opot-0.036
Token comes
Token ID2058
Feature activation-7.45e-04

Pos logit contributions
Mom+0.025
Maybe+0.024
 someone+0.023
cil+0.023
 somewhere+0.023
Neg logit contributions
ocol-0.039
 rib-0.036
 gymn-0.031
 microsc-0.031
pill-0.029
Token to
Token ID284
Feature activation-1.57e-02

Pos logit contributions
ocol+0.001
 rib+0.001
 microsc+0.001
 gymn+0.001
rou+0.001
Neg logit contributions
Maybe-0.001
 someone-0.001
 one-0.001
 dad-0.001
Mom-0.001
Token the
Token ID262
Feature activation+2.02e-04

Pos logit contributions
ocol+0.036
 rib+0.030
rou+0.029
 microsc+0.029
 topp+0.029
Neg logit contributions
 someone-0.008
Maybe-0.007
 dad-0.007
 one-0.007
 next-0.007
Token garage
Token ID15591
Feature activation+1.05e-02

Pos logit contributions
Maybe+0.000
 someone+0.000
 one+0.000
 dad+0.000
Mom+0.000
Neg logit contributions
ocol-0.000
 rib-0.000
 microsc-0.000
 gymn-0.000
rou-0.000
Token very
Token ID845
Feature activation+3.21e-02

Pos logit contributions
ocol+0.009
 rib+0.007
 microsc+0.007
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.006
 someone-0.006
 one-0.006
 dad-0.006
 next-0.006
Token much
Token ID881
Feature activation-1.26e-02

Pos logit contributions
How+0.030
Maybe+0.030
What+0.029
Where+0.027
 someone+0.025
Neg logit contributions
ocol-0.059
hy-0.046
 microsc-0.046
 rib-0.045
rou-0.044
Token.
Token ID13
Feature activation-2.42e-02

Pos logit contributions
ocol+0.028
 rib+0.024
 microsc+0.023
rou+0.022
 gymn+0.022
Neg logit contributions
 one-0.006
 someone-0.006
Maybe-0.006
 dad-0.006
 next-0.006
Token She
Token ID1375
Feature activation+5.60e-03

Pos logit contributions
ocol+0.051
 rib+0.043
rou+0.041
 microsc+0.040
 topp+0.040
Neg logit contributions
 one-0.016
 someone-0.015
Maybe-0.014
 dad-0.014
 everything-0.013
Token hugged
Token ID41130
Feature activation+3.19e-03

Pos logit contributions
 someone+0.005
Maybe+0.005
 dad+0.005
Mom+0.005
 one+0.005
Neg logit contributions
ocol-0.009
 rib-0.008
 microsc-0.007
 gymn-0.007
rou-0.007
Token it
Token ID340
Feature activation+5.86e-02

Pos logit contributions
Maybe+0.003
 someone+0.002
How+0.002
Mom+0.002
 next+0.002
Neg logit contributions
ocol-0.006
 rib-0.005
 microsc-0.005
 gymn-0.005
 topp-0.004
Token every
Token ID790
Feature activation+1.91e-02

Pos logit contributions
Nothing+0.076
Where+0.074
What+0.073
How+0.072
Maybe+0.072
Neg logit contributions
 rib-0.065
 anx-0.058
 stuffed-0.055
rou-0.054
 tre-0.054
Token night
Token ID1755
Feature activation-5.82e-03

Pos logit contributions
Maybe+0.017
How+0.015
 someone+0.015
They+0.015
Nothing+0.015
Neg logit contributions
ocol-0.033
 rib-0.028
 gymn-0.026
 topp-0.026
 gol-0.025
Token and
Token ID290
Feature activation-1.07e-02

Pos logit contributions
ocol+0.011
 rib+0.009
 microsc+0.009
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token played
Token ID2826
Feature activation-1.41e-02

Pos logit contributions
ocol+0.021
 rib+0.018
 microsc+0.017
 gymn+0.016
rou+0.016
Neg logit contributions
Maybe-0.008
 someone-0.007
 one-0.007
 dad-0.007
 next-0.007
Token with
Token ID351
Feature activation-2.73e-03

Pos logit contributions
ocol+0.031
 rib+0.026
 microsc+0.025
rou+0.025
ifying+0.024
Neg logit contributions
 someone-0.008
 one-0.008
 next-0.007
Maybe-0.007
 dad-0.007
Token.
Token ID13
Feature activation-3.33e-03

Pos logit contributions
Maybe+0.001
 someone+0.001
 dad+0.001
 one+0.001
How+0.001
Neg logit contributions
ocol-0.004
 rib-0.003
 microsc-0.003
rou-0.003
 gymn-0.003
Token Lily
Token ID20037
Feature activation+4.58e-03

Pos logit contributions
ocol+0.006
 rib+0.005
 microsc+0.005
 gymn+0.005
rou+0.005
Neg logit contributions
Maybe-0.003
 someone-0.002
 dad-0.002
 one-0.002
 next-0.002
Token had
Token ID550
Feature activation-1.47e-02

Pos logit contributions
 someone+0.004
Maybe+0.004
 one+0.004
 dad+0.004
How+0.004
Neg logit contributions
ocol-0.007
 rib-0.006
 microsc-0.006
 gymn-0.006
rou-0.006
Token a
Token ID257
Feature activation-1.26e-02

Pos logit contributions
ocol+0.038
 rib+0.032
 microsc+0.031
rou+0.031
 gymn+0.031
Neg logit contributions
 someone-0.003
 one-0.003
Maybe-0.003
 dad-0.003
Mom-0.002
Token bath
Token ID7837
Feature activation+2.14e-02

Pos logit contributions
ocol+0.025
 rib+0.020
 microsc+0.020
rou+0.019
 topp+0.019
Neg logit contributions
 dad-0.009
 one-0.009
 someone-0.009
Maybe-0.009
 next-0.008
Tokentub
Token ID37995
Feature activation+5.80e-02

Pos logit contributions
Maybe+0.023
What+0.022
How+0.022
 someone+0.022
Mom+0.021
Neg logit contributions
ocol-0.030
 rib-0.026
 gymn-0.024
rou-0.024
 gol-0.023
Token in
Token ID287
Feature activation+1.24e-02

Pos logit contributions
Maybe+0.071
Someone+0.068
What+0.067
Mother+0.066
Nothing+0.065
Neg logit contributions
ocol-0.075
 rib-0.065
hy-0.062
 tre-0.061
rou-0.060
Token her
Token ID607
Feature activation+1.34e-03

Pos logit contributions
Maybe+0.013
How+0.011
 dad+0.011
Nothing+0.011
ths+0.011
Neg logit contributions
ocol-0.020
 rib-0.017
 microsc-0.016
 gymn-0.015
 topp-0.014
Token bathroom
Token ID12436
Feature activation+2.96e-02

Pos logit contributions
Maybe+0.001
 someone+0.001
 one+0.001
How+0.001
What+0.001
Neg logit contributions
ocol-0.003
 rib-0.002
 microsc-0.002
 gymn-0.002
rou-0.002
Token.
Token ID13
Feature activation-7.28e-03

Pos logit contributions
Maybe+0.030
What+0.029
Someone+0.027
How+0.027
Nothing+0.027
Neg logit contributions
ocol-0.046
 rib-0.041
rou-0.038
 gymn-0.038
 topp-0.038
Token She
Token ID1375
Feature activation-2.63e-03

Pos logit contributions
ocol+0.014
 rib+0.012
 microsc+0.011
rou+0.011
 gymn+0.011
Neg logit contributions
Maybe-0.006
 someone-0.005
 one-0.005
 dad-0.005
Mom-0.005
Token every
Token ID790
Feature activation+3.29e-02

Pos logit contributions
Maybe+0.009
How+0.008
Mom+0.007
 someone+0.007
 next+0.007
Neg logit contributions
ocol-0.018
 rib-0.015
 microsc-0.014
 gymn-0.014
rou-0.013
Token night
Token ID1755
Feature activation-4.53e-03

Pos logit contributions
Maybe+0.035
Nothing+0.032
What+0.031
How+0.031
Mom+0.030
Neg logit contributions
ocol-0.051
 rib-0.043
 gymn-0.040
ifying-0.038
 topp-0.038
Token.
Token ID13
Feature activation-1.44e-02

Pos logit contributions
ocol+0.009
 rib+0.008
 microsc+0.008
 gymn+0.008
rou+0.007
Neg logit contributions
Maybe-0.003
 someone-0.003
 one-0.002
 dad-0.002
Mom-0.002
Token She
Token ID1375
Feature activation+6.42e-03

Pos logit contributions
ocol+0.029
 rib+0.024
 microsc+0.023
rou+0.023
 topp+0.023
Neg logit contributions
 one-0.010
 someone-0.010
Maybe-0.010
 dad-0.009
Mom-0.009
Token hugs
Token ID40657
Feature activation+6.49e-03

Pos logit contributions
 someone+0.007
Maybe+0.007
Mom+0.007
 dad+0.006
 one+0.006
Neg logit contributions
ocol-0.010
 rib-0.009
 microsc-0.008
 gymn-0.008
rou-0.007
Token it
Token ID340
Feature activation+5.74e-02

Pos logit contributions
Maybe+0.005
How+0.004
 someone+0.004
Mom+0.004
 next+0.004
Neg logit contributions
ocol-0.012
 rib-0.011
 gymn-0.010
 microsc-0.010
rou-0.009
Token and
Token ID290
Feature activation+6.91e-03

Pos logit contributions
Nothing+0.079
Where+0.077
What+0.077
Maybe+0.076
How+0.075
Neg logit contributions
 rib-0.060
 stuffed-0.054
 tre-0.049
 gymn-0.048
rou-0.047
Token squee
Token ID13170
Feature activation+2.80e-03

Pos logit contributions
Maybe+0.006
Mom+0.006
 someone+0.006
 next+0.006
How+0.006
Neg logit contributions
ocol-0.012
 rib-0.010
 microsc-0.009
 gymn-0.009
 topp-0.009
Tokenzes
Token ID12271
Feature activation-3.56e-03

Pos logit contributions
Maybe+0.003
 someone+0.003
 one+0.003
 next+0.002
 dad+0.002
Neg logit contributions
ocol-0.005
 rib-0.004
 microsc-0.004
 gymn-0.004
 topp-0.004
Token it
Token ID340
Feature activation+4.09e-02

Pos logit contributions
ocol+0.008
 rib+0.007
 microsc+0.006
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.002
 someone-0.002
 dad-0.002
 one-0.002
 next-0.001
Token tight
Token ID5381
Feature activation-1.51e-02

Pos logit contributions
Dad+0.046
Someone+0.045
Maybe+0.044
Nothing+0.044
What+0.044
Neg logit contributions
 rib-0.054
ocol-0.049
 gymn-0.047
 topp-0.043
hy-0.043
Token
Token ID198
Feature activation+1.38e-02

Pos logit contributions
Maybe+0.009
 one+0.009
 next+0.009
 someone+0.008
 somewhere+0.008
Neg logit contributions
ocol-0.019
 rib-0.016
 microsc-0.016
 gymn-0.015
 topp-0.015
Token
Token ID198
Feature activation+1.02e-02

Pos logit contributions
 one+0.011
Maybe+0.011
 someone+0.010
 dad+0.010
How+0.010
Neg logit contributions
ocol-0.023
 rib-0.019
 microsc-0.019
 gymn-0.019
rou-0.018
TokenMom
Token ID29252
Feature activation+1.35e-02

Pos logit contributions
 someone+0.007
Maybe+0.007
 dad+0.007
 one+0.007
 somewhere+0.006
Neg logit contributions
ocol-0.019
 rib-0.016
 microsc-0.015
 gymn-0.015
ifying-0.015
Token hears
Token ID23180
Feature activation+1.21e-02

Pos logit contributions
 someone+0.012
Maybe+0.012
 one+0.011
 dad+0.011
 somewhere+0.011
Neg logit contributions
ocol-0.023
 rib-0.020
rou-0.019
 microsc-0.018
 gymn-0.018
Token the
Token ID262
Feature activation-9.01e-03

Pos logit contributions
Maybe+0.010
How+0.009
Mom+0.009
 someone+0.009
 next+0.009
Neg logit contributions
ocol-0.022
 rib-0.019
 microsc-0.017
 gymn-0.017
 topp-0.017
Token noise
Token ID7838
Feature activation+5.68e-02

Pos logit contributions
ocol+0.019
 rib+0.016
 microsc+0.015
 gymn+0.015
rou+0.015
Neg logit contributions
 someone-0.005
Maybe-0.005
 one-0.005
 dad-0.005
 next-0.005
Token.
Token ID13
Feature activation+1.51e-02

Pos logit contributions
cil+0.084
Someone+0.078
Maybe+0.078
Mother+0.076
 gentlemen+0.075
Neg logit contributions
ocol-0.054
 gymn-0.044
oke-0.042
 rib-0.041
opot-0.038
Token She
Token ID1375
Feature activation+8.55e-03

Pos logit contributions
Maybe+0.015
 somewhere+0.015
How+0.015
 one+0.015
 someone+0.015
Neg logit contributions
ocol-0.021
 rib-0.019
 microsc-0.018
 gymn-0.017
pill-0.016
Token comes
Token ID2058
Feature activation+2.25e-03

Pos logit contributions
 someone+0.007
Maybe+0.007
 dad+0.007
Mom+0.007
 one+0.007
Neg logit contributions
ocol-0.016
 rib-0.013
 gymn-0.012
 microsc-0.012
rou-0.012
Token to
Token ID284
Feature activation-3.65e-03

Pos logit contributions
Maybe+0.003
 someone+0.003
How+0.003
 one+0.003
Mom+0.003
Neg logit contributions
ocol-0.003
 rib-0.002
 microsc-0.002
 gymn-0.002
rou-0.002
Token see
Token ID766
Feature activation+2.81e-02

Pos logit contributions
ocol+0.007
 rib+0.006
 microsc+0.006
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
 next-0.002
Token
Token ID198
Feature activation+1.95e-02

Pos logit contributions
Maybe+0.005
 one+0.005
 next+0.005
 someone+0.005
 dad+0.005
Neg logit contributions
ocol-0.012
 rib-0.011
 microsc-0.010
 gymn-0.010
 gol-0.010
Token
Token ID198
Feature activation+1.69e-02

Pos logit contributions
Maybe+0.017
 one+0.017
How+0.016
 someone+0.016
 looked+0.015
Neg logit contributions
ocol-0.031
 microsc-0.025
 rib-0.025
 gymn-0.024
rou-0.024
TokenMom
Token ID29252
Feature activation+1.00e-02

Pos logit contributions
 somewhere+0.010
Maybe+0.010
 one+0.010
 looked+0.010
 someone+0.010
Neg logit contributions
ocol-0.031
 rib-0.026
 microsc-0.025
 gymn-0.025
rou-0.025
Token hears
Token ID23180
Feature activation+1.97e-02

Pos logit contributions
 someone+0.009
Maybe+0.008
 one+0.008
 dad+0.008
Mom+0.007
Neg logit contributions
ocol-0.018
 rib-0.015
rou-0.014
 microsc-0.014
 gymn-0.014
Token the
Token ID262
Feature activation-8.61e-03

Pos logit contributions
Maybe+0.020
How+0.017
Mom+0.017
Nothing+0.017
 next+0.016
Neg logit contributions
ocol-0.034
 rib-0.028
 microsc-0.026
 gymn-0.025
pill-0.024
Token noise
Token ID7838
Feature activation+5.68e-02

Pos logit contributions
ocol+0.018
 rib+0.016
 microsc+0.015
 gymn+0.015
rou+0.015
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.005
 dad-0.005
 next-0.005
Token and
Token ID290
Feature activation+2.41e-02

Pos logit contributions
cil+0.084
Someone+0.076
Maybe+0.075
Mother+0.075
 gentlemen+0.075
Neg logit contributions
ocol-0.055
 gymn-0.045
oke-0.043
 rib-0.042
opot-0.038
Token comes
Token ID2058
Feature activation-7.17e-03

Pos logit contributions
Mom+0.023
Maybe+0.023
 someone+0.022
 somewhere+0.021
cil+0.021
Neg logit contributions
ocol-0.039
 rib-0.035
 gymn-0.031
 microsc-0.030
 spec-0.028
Token to
Token ID284
Feature activation-1.32e-02

Pos logit contributions
ocol+0.011
 rib+0.009
 microsc+0.009
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.008
 someone-0.008
 one-0.008
 dad-0.008
 next-0.007
Token see
Token ID766
Feature activation+2.47e-02

Pos logit contributions
ocol+0.029
 rib+0.025
 microsc+0.024
rou+0.024
 topp+0.024
Neg logit contributions
 someone-0.007
Maybe-0.007
 dad-0.006
 one-0.006
 next-0.006
Token what
Token ID644
Feature activation+9.73e-03

Pos logit contributions
Maybe+0.017
ths+0.017
Days+0.016
Someone+0.016
cil+0.016
Neg logit contributions
ocol-0.046
 rib-0.039
 microsc-0.038
rou-0.034
 gymn-0.034
Token
Token ID198
Feature activation+1.99e-02

Pos logit contributions
Maybe+0.005
 one+0.005
 next+0.005
 dad+0.005
 someone+0.005
Neg logit contributions
ocol-0.010
 rib-0.009
 microsc-0.008
 gymn-0.008
 gol-0.008
Token
Token ID198
Feature activation+2.08e-02

Pos logit contributions
 one+0.017
Maybe+0.017
 dad+0.016
 someone+0.016
How+0.016
Neg logit contributions
ocol-0.031
 microsc-0.025
 rib-0.025
 gymn-0.024
rou-0.024
TokenMom
Token ID29252
Feature activation+1.28e-02

Pos logit contributions
 dad+0.018
 look+0.018
 one+0.018
 somewhere+0.018
 someone+0.017
Neg logit contributions
ocol-0.032
 microsc-0.026
 rib-0.026
 gol-0.025
oc-0.025
Token hears
Token ID23180
Feature activation+1.37e-02

Pos logit contributions
 someone+0.011
Maybe+0.011
 one+0.011
 dad+0.010
Mom+0.010
Neg logit contributions
ocol-0.022
 rib-0.019
rou-0.018
 microsc-0.018
 gymn-0.017
Token the
Token ID262
Feature activation-2.62e-02

Pos logit contributions
Maybe+0.012
How+0.011
Mom+0.010
Nothing+0.010
 next+0.010
Neg logit contributions
ocol-0.025
 rib-0.020
 microsc-0.019
 gymn-0.018
 topp-0.018
Token noise
Token ID7838
Feature activation+5.66e-02

Pos logit contributions
ocol+0.065
 rib+0.052
ifying+0.052
pill+0.052
 microsc+0.052
Neg logit contributions
 next-0.011
 dad-0.011
 someone-0.010
 one-0.010
 mom-0.009
Token and
Token ID290
Feature activation+2.60e-02

Pos logit contributions
cil+0.087
Maybe+0.080
Someone+0.079
How+0.079
 gentlemen+0.077
Neg logit contributions
ocol-0.050
oke-0.041
 gymn-0.039
 rib-0.038
opot-0.034
Token comes
Token ID2058
Feature activation-5.79e-03

Pos logit contributions
Mom+0.026
Maybe+0.025
 someone+0.024
cil+0.024
 somewhere+0.024
Neg logit contributions
ocol-0.040
 rib-0.037
 microsc-0.032
 gymn-0.032
pill-0.030
Token to
Token ID284
Feature activation-1.70e-02

Pos logit contributions
ocol+0.009
 rib+0.007
 microsc+0.007
 gymn+0.007
rou+0.007
Neg logit contributions
Maybe-0.007
 someone-0.006
 one-0.006
 dad-0.006
 next-0.006
Token see
Token ID766
Feature activation+1.56e-02

Pos logit contributions
ocol+0.038
 rib+0.032
rou+0.032
 microsc+0.031
pill+0.031
Neg logit contributions
 someone-0.009
Maybe-0.008
 dad-0.008
 one-0.008
 next-0.007
Token what
Token ID644
Feature activation+1.22e-03

Pos logit contributions
Maybe+0.009
cil+0.008
How+0.008
Mom+0.008
 dad+0.008
Neg logit contributions
ocol-0.031
 rib-0.027
 microsc-0.026
rou-0.024
 gymn-0.023
Token
Token ID198
Feature activation+2.03e-02

Pos logit contributions
ocol+0.000
 rib+0.000
 microsc+0.000
 gymn+0.000
 topp+0.000
Neg logit contributions
Maybe-0.000
 one-0.000
 someone-0.000
 dad-0.000
 next-0.000
Token
Token ID198
Feature activation+1.74e-02

Pos logit contributions
 one+0.018
Maybe+0.017
How+0.017
 someone+0.017
 dad+0.017
Neg logit contributions
ocol-0.031
 microsc-0.025
 rib-0.025
rou-0.025
 gymn-0.024
TokenMom
Token ID29252
Feature activation+8.55e-03

Pos logit contributions
 one+0.013
 someone+0.013
 dad+0.012
 looked+0.012
 somewhere+0.012
Neg logit contributions
ocol-0.029
 rib-0.024
 microsc-0.024
rou-0.024
oc-0.024
Token heard
Token ID2982
Feature activation+2.00e-02

Pos logit contributions
 someone+0.007
Maybe+0.007
 one+0.007
 dad+0.006
 somewhere+0.006
Neg logit contributions
ocol-0.015
 rib-0.013
rou-0.012
 microsc-0.012
 gymn-0.012
Token the
Token ID262
Feature activation+6.65e-03

Pos logit contributions
Maybe+0.020
How+0.018
cil+0.017
Nothing+0.017
 next+0.017
Neg logit contributions
ocol-0.034
 rib-0.028
 microsc-0.026
 gymn-0.024
 topp-0.023
Token noise
Token ID7838
Feature activation+5.61e-02

Pos logit contributions
Maybe+0.005
 someone+0.005
How+0.005
 one+0.005
Mom+0.005
Neg logit contributions
ocol-0.012
 rib-0.010
 microsc-0.010
 gymn-0.010
 topp-0.009
Token and
Token ID290
Feature activation+2.81e-02

Pos logit contributions
cil+0.084
Maybe+0.076
How+0.076
 gentlemen+0.075
Someone+0.074
Neg logit contributions
ocol-0.054
 gymn-0.042
oke-0.042
 rib-0.042
hy-0.037
Token came
Token ID1625
Feature activation+2.15e-03

Pos logit contributions
cil+0.026
Maybe+0.026
ble+0.026
 somewhere+0.025
 one+0.025
Neg logit contributions
ocol-0.041
 rib-0.038
 gymn-0.032
 microsc-0.032
oke-0.031
Token to
Token ID284
Feature activation-1.56e-02

Pos logit contributions
Maybe+0.003
 someone+0.003
 one+0.002
 dad+0.002
Mom+0.002
Neg logit contributions
ocol-0.003
 rib-0.002
 microsc-0.002
 gymn-0.002
rou-0.002
Token see
Token ID766
Feature activation+2.51e-02

Pos logit contributions
ocol+0.035
 rib+0.030
rou+0.029
 microsc+0.029
 topp+0.029
Neg logit contributions
 someone-0.007
Maybe-0.007
 dad-0.007
 one-0.007
 next-0.006
Token what
Token ID644
Feature activation+4.13e-03

Pos logit contributions
cil+0.016
Maybe+0.016
Someone+0.015
ths+0.015
ble+0.015
Neg logit contributions
ocol-0.047
 rib-0.040
 microsc-0.038
 gymn-0.035
rou-0.034
Token
Token ID198
Feature activation+1.25e-02

Pos logit contributions
Maybe+0.004
 one+0.004
 next+0.004
 someone+0.004
 somewhere+0.004
Neg logit contributions
ocol-0.010
 rib-0.009
 microsc-0.009
 gymn-0.008
 gol-0.008
Token
Token ID198
Feature activation+9.38e-03

Pos logit contributions
 one+0.010
Maybe+0.009
 someone+0.009
 dad+0.009
How+0.009
Neg logit contributions
ocol-0.022
 rib-0.018
 microsc-0.018
 gymn-0.018
rou-0.017
TokenMom
Token ID29252
Feature activation+1.44e-02

Pos logit contributions
Maybe+0.007
 one+0.006
 someone+0.006
 dad+0.006
 somewhere+0.006
Neg logit contributions
ocol-0.017
 rib-0.014
 microsc-0.014
 gymn-0.014
 gol-0.014
Token hears
Token ID23180
Feature activation+1.69e-02

Pos logit contributions
 someone+0.013
Maybe+0.012
 one+0.012
 dad+0.012
 somewhere+0.011
Neg logit contributions
ocol-0.024
 rib-0.021
rou-0.020
 microsc-0.020
 gymn-0.020
Token the
Token ID262
Feature activation-1.18e-02

Pos logit contributions
Maybe+0.016
How+0.014
 next+0.014
Mom+0.014
Nothing+0.014
Neg logit contributions
ocol-0.029
 rib-0.025
 microsc-0.023
 gymn-0.021
 topp-0.021
Token noise
Token ID7838
Feature activation+5.61e-02

Pos logit contributions
ocol+0.026
 rib+0.022
 microsc+0.021
rou+0.021
 gymn+0.021
Neg logit contributions
 someone-0.006
Maybe-0.006
 dad-0.006
 next-0.006
 one-0.006
Token and
Token ID290
Feature activation+2.26e-02

Pos logit contributions
cil+0.086
Someone+0.079
How+0.078
Maybe+0.078
 gentlemen+0.077
Neg logit contributions
ocol-0.049
 gymn-0.042
 rib-0.041
oke-0.039
hy-0.035
Token comes
Token ID2058
Feature activation-2.99e-03

Pos logit contributions
Mom+0.022
Maybe+0.021
cil+0.021
 someone+0.021
 somewhere+0.021
Neg logit contributions
ocol-0.035
 rib-0.033
 gymn-0.028
 microsc-0.028
rou-0.027
Token to
Token ID284
Feature activation-1.34e-02

Pos logit contributions
ocol+0.005
 rib+0.004
 microsc+0.003
 gymn+0.003
rou+0.003
Neg logit contributions
Maybe-0.003
 someone-0.003
 one-0.003
 dad-0.003
Mom-0.003
Token the
Token ID262
Feature activation-2.66e-02

Pos logit contributions
ocol+0.030
 rib+0.025
 microsc+0.024
rou+0.024
pill+0.024
Neg logit contributions
 someone-0.007
Maybe-0.007
 dad-0.007
 one-0.007
 next-0.006
Token room
Token ID2119
Feature activation+5.00e-03

Pos logit contributions
ocol+0.060
 rib+0.048
pill+0.046
rou+0.045
 gol+0.045
Neg logit contributions
 next-0.019
 someone-0.019
 dad-0.018
Maybe-0.017
Mom-0.017
Token
Token ID198
Feature activation+1.75e-02

Pos logit contributions
Maybe+0.009
 someone+0.009
 one+0.009
 next+0.009
 dad+0.008
Neg logit contributions
ocol-0.019
 rib-0.017
 microsc-0.016
 gymn-0.015
 topp-0.015
Token
Token ID198
Feature activation+7.28e-03

Pos logit contributions
 one+0.014
Maybe+0.014
 someone+0.013
 dad+0.013
How+0.013
Neg logit contributions
ocol-0.028
 rib-0.024
 microsc-0.023
 gymn-0.023
pri-0.022
TokenMom
Token ID29252
Feature activation+1.83e-02

Pos logit contributions
Maybe+0.004
 someone+0.004
 one+0.004
 dad+0.004
 somewhere+0.004
Neg logit contributions
ocol-0.014
 rib-0.012
 microsc-0.011
 gymn-0.011
ifying-0.011
Token hears
Token ID23180
Feature activation+1.35e-02

Pos logit contributions
 someone+0.018
Maybe+0.017
 somewhere+0.016
 one+0.016
 dad+0.016
Neg logit contributions
ocol-0.030
 rib-0.026
rou-0.025
 microsc-0.024
 gymn-0.024
Token the
Token ID262
Feature activation+1.79e-03

Pos logit contributions
Maybe+0.011
Mom+0.010
How+0.010
 next+0.010
 someone+0.010
Neg logit contributions
ocol-0.025
 rib-0.021
 microsc-0.019
 gymn-0.019
 topp-0.018
Token noise
Token ID7838
Feature activation+5.61e-02

Pos logit contributions
Maybe+0.001
 someone+0.001
 one+0.001
How+0.001
Mom+0.001
Neg logit contributions
ocol-0.003
 rib-0.003
 microsc-0.003
 gymn-0.003
 topp-0.003
Token and
Token ID290
Feature activation+2.85e-02

Pos logit contributions
cil+0.079
Someone+0.074
Maybe+0.074
How+0.073
Mother+0.073
Neg logit contributions
ocol-0.058
 gymn-0.048
 rib-0.047
oke-0.042
hy-0.042
Token comes
Token ID2058
Feature activation-1.54e-03

Pos logit contributions
Mom+0.031
ble+0.030
 somewhere+0.029
cil+0.029
Maybe+0.029
Neg logit contributions
ocol-0.041
 rib-0.038
 gymn-0.032
 microsc-0.032
 spec-0.031
Token to
Token ID284
Feature activation-4.65e-03

Pos logit contributions
ocol+0.002
 rib+0.002
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
Mom-0.002
Token see
Token ID766
Feature activation+3.16e-02

Pos logit contributions
ocol+0.010
 rib+0.009
 microsc+0.008
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
 next-0.002
Token what
Token ID644
Feature activation+1.18e-02

Pos logit contributions
 forth+0.020
ths+0.020
cil+0.019
Days+0.019
Years+0.018
Neg logit contributions
ocol-0.059
 rib-0.050
 microsc-0.046
 gymn-0.044
ifying-0.042
Token.
Token ID13
Feature activation+6.49e-03

Pos logit contributions
ocol+0.017
 rib+0.014
 microsc+0.014
 gymn+0.013
rou+0.013
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.005
 dad-0.005
 next-0.004
Token His
Token ID2399
Feature activation+1.57e-03

Pos logit contributions
Maybe+0.008
 next+0.007
 someone+0.007
 somewhere+0.007
How+0.007
Neg logit contributions
ocol-0.009
 rib-0.007
 microsc-0.007
 topp-0.007
 gymn-0.007
Token name
Token ID1438
Feature activation+2.16e-02

Pos logit contributions
Maybe+0.001
 someone+0.001
How+0.001
 one+0.001
Mom+0.001
Neg logit contributions
ocol-0.003
 rib-0.003
 microsc-0.002
 gymn-0.002
rou-0.002
Token was
Token ID373
Feature activation-1.76e-02

Pos logit contributions
How+0.024
Maybe+0.024
 dad+0.023
 gentlemen+0.023
Mom+0.023
Neg logit contributions
ocol-0.028
 rib-0.024
 gymn-0.022
hy-0.022
 topp-0.021
Token Joe
Token ID5689
Feature activation-1.10e-02

Pos logit contributions
ocol+0.032
 rib+0.026
 microsc+0.024
rou+0.024
 gymn+0.024
Neg logit contributions
 someone-0.017
 dad-0.016
 one-0.016
Maybe-0.015
 everything-0.015
Token,
Token ID11
Feature activation+5.60e-02

Pos logit contributions
ocol+0.024
 rib+0.020
 microsc+0.020
 gymn+0.019
rou+0.019
Neg logit contributions
 one-0.006
 someone-0.006
Maybe-0.006
 dad-0.006
 next-0.005
Token and
Token ID290
Feature activation+1.47e-02

Pos logit contributions
 halftime+0.060
Days+0.058
Parent+0.056
Mom+0.055
ths+0.054
Neg logit contributions
 rib-0.065
ocol-0.064
rou-0.063
 gymn-0.061
 such-0.059
Token he
Token ID339
Feature activation-4.27e-04

Pos logit contributions
Maybe+0.012
Mom+0.011
How+0.011
 somewhere+0.010
 someone+0.010
Neg logit contributions
ocol-0.025
 microsc-0.022
 rib-0.022
 gymn-0.020
rou-0.020
Token was
Token ID373
Feature activation-9.93e-03

Pos logit contributions
ocol+0.001
 rib+0.001
 microsc+0.001
 gymn+0.001
rou+0.001
Neg logit contributions
Maybe-0.000
 someone-0.000
 one-0.000
 dad-0.000
Mom-0.000
Token wearing
Token ID5762
Feature activation-3.28e-03

Pos logit contributions
ocol+0.021
 rib+0.018
 microsc+0.017
 topp+0.016
 gymn+0.016
Neg logit contributions
 someone-0.006
 one-0.006
Maybe-0.006
 dad-0.006
 next-0.006
Token red
Token ID2266
Feature activation-1.26e-02

Pos logit contributions
ocol+0.007
 rib+0.006
 microsc+0.006
 gymn+0.006
 topp+0.006
Neg logit contributions
Maybe-0.001
 someone-0.001
 one-0.001
 dad-0.001
 next-0.001
Token shop
Token ID6128
Feature activation-6.70e-03

Pos logit contributions
ocol+0.026
 rib+0.022
 microsc+0.021
rou+0.020
 gymn+0.020
Neg logit contributions
 someone-0.009
Maybe-0.009
 one-0.009
 dad-0.009
 next-0.009
Token.
Token ID13
Feature activation+9.77e-03

Pos logit contributions
ocol+0.014
 rib+0.012
 microsc+0.011
 gymn+0.011
rou+0.011
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token She
Token ID1375
Feature activation+8.27e-03

Pos logit contributions
Maybe+0.008
 someone+0.008
 dad+0.008
 next+0.008
Mom+0.008
Neg logit contributions
ocol-0.016
 rib-0.014
 microsc-0.014
 gymn-0.013
pill-0.013
Token gave
Token ID2921
Feature activation+1.18e-02

Pos logit contributions
 someone+0.007
Maybe+0.007
 one+0.007
How+0.007
What+0.007
Neg logit contributions
ocol-0.014
 rib-0.012
rou-0.011
 gymn-0.011
 microsc-0.011
Token Po
Token ID7695
Feature activation+1.58e-02

Pos logit contributions
Maybe+0.012
How+0.011
 dad+0.011
Mom+0.011
 someone+0.010
Neg logit contributions
ocol-0.020
 rib-0.017
 microsc-0.015
 gymn-0.015
pill-0.014
Tokenppy
Token ID14097
Feature activation+5.58e-02

Pos logit contributions
Maybe+0.019
 dad+0.018
 someone+0.018
Mom+0.018
How+0.018
Neg logit contributions
ocol-0.023
 rib-0.019
rou-0.018
 microsc-0.017
pill-0.016
Token some
Token ID617
Feature activation-2.34e-02

Pos logit contributions
Maybe+0.048
How+0.045
Nothing+0.045
 knock+0.044
 forth+0.044
Neg logit contributions
 rib-0.085
ocol-0.083
 microsc-0.075
ifying-0.071
rou-0.070
Token money
Token ID1637
Feature activation+8.07e-03

Pos logit contributions
ocol+0.044
 microsc+0.037
rou+0.036
 rib+0.036
 gymn+0.036
Neg logit contributions
 one-0.018
 someone-0.018
 dad-0.017
 something-0.017
 next-0.017
Token and
Token ID290
Feature activation+1.50e-02

Pos logit contributions
Maybe+0.008
 someone+0.008
Mom+0.008
How+0.007
What+0.007
Neg logit contributions
ocol-0.013
 rib-0.011
 microsc-0.010
rou-0.010
 topp-0.010
Token said
Token ID531
Feature activation-1.91e-02

Pos logit contributions
Maybe+0.011
Mom+0.010
How+0.010
What+0.010
Nothing+0.010
Neg logit contributions
ocol-0.027
 rib-0.025
 microsc-0.023
 gymn-0.022
 topp-0.022
Token "
Token ID366
Feature activation+1.28e-02

Pos logit contributions
ocol+0.041
 rib+0.034
 microsc+0.034
rou+0.034
 gol+0.033
Neg logit contributions
 someone-0.011
 one-0.010
 dad-0.010
 everything-0.009
 they-0.009
Token
Token ID198
Feature activation+1.94e-02

Pos logit contributions
Maybe+0.004
 one+0.004
 next+0.004
 dad+0.004
 someone+0.004
Neg logit contributions
ocol-0.011
 rib-0.009
 microsc-0.009
 gymn-0.009
 gol-0.009
Token
Token ID198
Feature activation+1.49e-02

Pos logit contributions
 one+0.017
Maybe+0.017
How+0.016
 looked+0.015
 someone+0.015
Neg logit contributions
ocol-0.031
 microsc-0.025
 rib-0.025
 gymn-0.024
rou-0.024
TokenMom
Token ID29252
Feature activation+1.66e-02

Pos logit contributions
 one+0.011
Maybe+0.011
 looked+0.010
 someone+0.010
 dad+0.010
Neg logit contributions
ocol-0.026
 rib-0.021
 microsc-0.021
 gol-0.021
ifying-0.020
Token heard
Token ID2982
Feature activation+1.92e-02

Pos logit contributions
 someone+0.015
 one+0.014
Maybe+0.014
 dad+0.013
 somewhere+0.013
Neg logit contributions
ocol-0.027
 rib-0.024
rou-0.023
 microsc-0.022
 gymn-0.021
Token the
Token ID262
Feature activation-6.22e-03

Pos logit contributions
Maybe+0.019
How+0.016
Mom+0.016
 next+0.016
cil+0.016
Neg logit contributions
ocol-0.033
 rib-0.028
 microsc-0.026
 gymn-0.024
pill-0.023
Token noise
Token ID7838
Feature activation+5.57e-02

Pos logit contributions
ocol+0.013
 rib+0.011
 microsc+0.010
 gymn+0.010
rou+0.010
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.003
Token and
Token ID290
Feature activation+3.09e-02

Pos logit contributions
cil+0.082
 gentlemen+0.076
Mother+0.075
Someone+0.075
How+0.075
Neg logit contributions
ocol-0.053
 gymn-0.043
 rib-0.043
oke-0.040
hy-0.038
Token came
Token ID1625
Feature activation+4.42e-03

Pos logit contributions
cil+0.031
ble+0.031
Maybe+0.030
 somewhere+0.029
 one+0.029
Neg logit contributions
ocol-0.042
 rib-0.041
 microsc-0.034
 gymn-0.033
rou-0.032
Token to
Token ID284
Feature activation-1.36e-02

Pos logit contributions
Maybe+0.005
 someone+0.005
 one+0.005
 dad+0.005
How+0.005
Neg logit contributions
ocol-0.006
 rib-0.005
 microsc-0.005
 gymn-0.005
rou-0.005
Token see
Token ID766
Feature activation+3.00e-02

Pos logit contributions
ocol+0.031
 rib+0.026
 microsc+0.025
rou+0.025
 topp+0.025
Neg logit contributions
Maybe-0.006
 someone-0.006
 dad-0.006
 one-0.006
 next-0.005
Token what
Token ID644
Feature activation+7.54e-03

Pos logit contributions
ths+0.023
Days+0.022
Maybe+0.022
Years+0.021
Someone+0.021
Neg logit contributions
ocol-0.055
 rib-0.045
 microsc-0.045
rou-0.040
 gymn-0.039
Token safe
Token ID3338
Feature activation-4.35e-03

Pos logit contributions
Maybe+0.016
Mom+0.015
 someone+0.013
 dad+0.013
Dad+0.013
Neg logit contributions
ocol-0.034
 rib-0.029
rou-0.028
 gymn-0.027
 microsc-0.027
Token.
Token ID13
Feature activation-4.33e-03

Pos logit contributions
ocol+0.009
 rib+0.007
 microsc+0.007
 gymn+0.007
rou+0.007
Neg logit contributions
Maybe-0.003
 someone-0.003
 one-0.003
 dad-0.003
Mom-0.003
Token His
Token ID2399
Feature activation+1.63e-02

Pos logit contributions
ocol+0.007
 rib+0.006
 microsc+0.006
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token parents
Token ID3397
Feature activation+2.39e-03

Pos logit contributions
Maybe+0.013
How+0.012
What+0.011
Nothing+0.010
Mom+0.010
Neg logit contributions
ocol-0.031
 rib-0.027
 microsc-0.027
 gymn-0.026
 former-0.025
Token made
Token ID925
Feature activation-9.19e-03

Pos logit contributions
Maybe+0.002
 someone+0.002
 one+0.002
 dad+0.002
How+0.002
Neg logit contributions
ocol-0.004
 rib-0.004
 microsc-0.003
rou-0.003
 gymn-0.003
Token him
Token ID683
Feature activation+5.57e-02

Pos logit contributions
ocol+0.017
 rib+0.014
 microsc+0.014
 gymn+0.013
rou+0.013
Neg logit contributions
 someone-0.007
Maybe-0.007
 one-0.007
 dad-0.007
 next-0.007
Token feel
Token ID1254
Feature activation+4.36e-03

Pos logit contributions
Mom+0.049
Where+0.048
How+0.047
Maybe+0.047
What+0.047
Neg logit contributions
ocol-0.086
 rib-0.082
rou-0.079
hy-0.076
 gymn-0.074
Token so
Token ID523
Feature activation-5.33e-03

Pos logit contributions
Maybe+0.003
 dad+0.003
 someone+0.003
Mom+0.003
 one+0.003
Neg logit contributions
ocol-0.009
 rib-0.007
 microsc-0.007
rou-0.007
 gymn-0.007
Token special
Token ID2041
Feature activation-1.04e-02

Pos logit contributions
ocol+0.011
 rib+0.009
 microsc+0.009
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.004
 someone-0.003
 one-0.003
 dad-0.003
 next-0.003
Token and
Token ID290
Feature activation-1.86e-02

Pos logit contributions
ocol+0.021
 rib+0.018
 microsc+0.017
 gymn+0.017
rou+0.017
Neg logit contributions
Maybe-0.006
 someone-0.006
 one-0.006
 dad-0.006
 next-0.006
Token loved
Token ID6151
Feature activation-1.73e-02

Pos logit contributions
ocol+0.040
 rib+0.033
 microsc+0.032
 gymn+0.031
pill+0.031
Neg logit contributions
 someone-0.012
Maybe-0.011
 one-0.011
 dad-0.011
 next-0.011
Token very
Token ID845
Feature activation-2.23e-03

Pos logit contributions
Maybe+0.005
 someone+0.004
Mom+0.004
 one+0.004
How+0.004
Neg logit contributions
ocol-0.010
 rib-0.009
 gymn-0.008
rou-0.008
 microsc-0.008
Token excited
Token ID6568
Feature activation-4.39e-03

Pos logit contributions
ocol+0.004
 rib+0.004
 microsc+0.003
 gymn+0.003
rou+0.003
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
Mom-0.002
Token.
Token ID13
Feature activation-1.65e-03

Pos logit contributions
ocol+0.008
 rib+0.007
 microsc+0.007
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.003
 someone-0.003
 one-0.003
 dad-0.003
Mom-0.003
Token She
Token ID1375
Feature activation-1.22e-02

Pos logit contributions
ocol+0.003
 rib+0.003
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.001
 someone-0.001
 one-0.001
 dad-0.001
Mom-0.001
Token gave
Token ID2921
Feature activation+8.04e-03

Pos logit contributions
ocol+0.025
 rib+0.020
 microsc+0.020
 gymn+0.019
rou+0.019
Neg logit contributions
Maybe-0.008
 someone-0.008
 dad-0.008
 one-0.008
 next-0.008
Token Ben
Token ID3932
Feature activation+5.56e-02

Pos logit contributions
Maybe+0.008
 someone+0.007
Mom+0.007
How+0.007
Nothing+0.007
Neg logit contributions
ocol-0.014
 rib-0.011
 microsc-0.011
 gymn-0.010
 topp-0.010
Token a
Token ID257
Feature activation-1.07e-02

Pos logit contributions
Maybe+0.051
Nothing+0.047
ths+0.046
How+0.046
Where+0.046
Neg logit contributions
 rib-0.083
ocol-0.082
 microsc-0.075
 such-0.070
rou-0.070
Token great
Token ID1049
Feature activation+7.16e-03

Pos logit contributions
ocol+0.022
 rib+0.019
 microsc+0.018
 gymn+0.018
rou+0.018
Neg logit contributions
Maybe-0.006
 someone-0.006
 one-0.006
 dad-0.006
 next-0.006
Token big
Token ID1263
Feature activation-2.57e-05

Pos logit contributions
Maybe+0.007
 someone+0.007
How+0.006
Mom+0.006
 dad+0.006
Neg logit contributions
ocol-0.012
 rib-0.010
 gymn-0.009
 microsc-0.009
 topp-0.009
Token hug
Token ID16225
Feature activation-3.04e-02

Pos logit contributions
ocol+0.000
 rib+0.000
 microsc+0.000
 gymn+0.000
 topp+0.000
Neg logit contributions
Maybe-0.000
 someone-0.000
 dad-0.000
Mom-0.000
How-0.000
Token and
Token ID290
Feature activation+2.64e-03

Pos logit contributions
ocol+0.069
 microsc+0.058
 rib+0.057
 gymn+0.055
pill+0.054
Neg logit contributions
 one-0.017
 next-0.015
 someone-0.015
 dad-0.015
Maybe-0.014
Token mess
Token ID2085
Feature activation+1.07e-03

Pos logit contributions
Maybe+0.005
 someone+0.004
How+0.004
 one+0.004
 dad+0.004
Neg logit contributions
ocol-0.008
 rib-0.006
 microsc-0.006
 gymn-0.006
 topp-0.006
Token.
Token ID13
Feature activation+1.38e-02

Pos logit contributions
Maybe+0.001
 someone+0.001
 dad+0.001
 one+0.001
Mom+0.001
Neg logit contributions
ocol-0.002
 rib-0.002
 microsc-0.002
 gymn-0.002
rou-0.002
Token Mom
Token ID11254
Feature activation+1.43e-02

Pos logit contributions
 next+0.012
Maybe+0.012
 one+0.012
 somewhere+0.011
How+0.011
Neg logit contributions
ocol-0.022
 microsc-0.019
 rib-0.018
 gymn-0.017
 gol-0.017
Token hears
Token ID23180
Feature activation+2.12e-02

Pos logit contributions
 someone+0.014
Maybe+0.013
 one+0.013
 dad+0.013
Mom+0.012
Neg logit contributions
ocol-0.024
 rib-0.020
rou-0.019
 gymn-0.019
 topp-0.019
Token the
Token ID262
Feature activation-6.76e-03

Pos logit contributions
Maybe+0.021
How+0.018
Mom+0.018
Nothing+0.017
 next+0.017
Neg logit contributions
ocol-0.036
 rib-0.031
 microsc-0.028
 gymn-0.027
rou-0.026
Token noise
Token ID7838
Feature activation+5.56e-02

Pos logit contributions
ocol+0.014
 rib+0.012
 microsc+0.011
 gymn+0.011
rou+0.011
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token and
Token ID290
Feature activation+2.01e-02

Pos logit contributions
cil+0.082
Someone+0.076
Maybe+0.075
 gentlemen+0.074
How+0.073
Neg logit contributions
ocol-0.052
oke-0.041
 gymn-0.040
 rib-0.039
rou-0.036
Token comes
Token ID2058
Feature activation-4.52e-03

Pos logit contributions
Mom+0.018
Maybe+0.018
 someone+0.017
cil+0.017
 one+0.017
Neg logit contributions
ocol-0.034
 rib-0.030
 gymn-0.027
 microsc-0.026
rou-0.025
Token to
Token ID284
Feature activation-1.70e-02

Pos logit contributions
ocol+0.007
 rib+0.006
 microsc+0.006
 gymn+0.005
rou+0.005
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.005
 dad-0.005
Mom-0.004
Token see
Token ID766
Feature activation+1.66e-02

Pos logit contributions
ocol+0.038
 rib+0.032
rou+0.031
 microsc+0.031
pill+0.031
Neg logit contributions
 someone-0.009
 dad-0.009
Maybe-0.009
 one-0.008
 next-0.008
Token.
Token ID13
Feature activation+1.36e-02

Pos logit contributions
Maybe+0.011
cil+0.010
Someone+0.010
Mom+0.010
How+0.010
Neg logit contributions
ocol-0.032
 rib-0.027
 microsc-0.026
rou-0.025
 gymn-0.024
Token to
Token ID284
Feature activation-5.63e-03

Pos logit contributions
ocol+0.035
 rib+0.028
 microsc+0.027
rou+0.026
pill+0.026
Neg logit contributions
 someone-0.016
 one-0.016
Maybe-0.015
 dad-0.015
 next-0.015
Token end
Token ID886
Feature activation-5.24e-03

Pos logit contributions
ocol+0.011
 rib+0.010
 microsc+0.009
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.003
 dad-0.003
 next-0.003
Token.
Token ID13
Feature activation+7.96e-03

Pos logit contributions
ocol+0.011
 rib+0.009
 microsc+0.009
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.003
 someone-0.003
 one-0.003
 dad-0.003
 next-0.003
Token Sarah
Token ID10490
Feature activation-1.72e-03

Pos logit contributions
Maybe+0.007
 someone+0.006
 next+0.006
 one+0.006
 dad+0.006
Neg logit contributions
ocol-0.013
 microsc-0.011
 rib-0.011
 gymn-0.011
rou-0.011
Token gave
Token ID2921
Feature activation+9.41e-04

Pos logit contributions
ocol+0.003
 rib+0.003
 microsc+0.003
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.001
 someone-0.001
 one-0.001
 dad-0.001
Mom-0.001
Token Jack
Token ID3619
Feature activation+5.55e-02

Pos logit contributions
Maybe+0.001
 someone+0.001
How+0.001
Mom+0.001
 next+0.001
Neg logit contributions
ocol-0.002
 rib-0.001
 microsc-0.001
 gymn-0.001
rou-0.001
Token one
Token ID530
Feature activation+1.78e-02

Pos logit contributions
How+0.045
Where+0.044
 knock+0.044
What+0.044
Maybe+0.044
Neg logit contributions
 rib-0.084
ocol-0.081
rou-0.077
 such-0.075
 tre-0.074
Token last
Token ID938
Feature activation+2.18e-02

Pos logit contributions
Maybe+0.015
How+0.014
 someone+0.014
Mom+0.014
What+0.014
Neg logit contributions
ocol-0.030
 rib-0.027
rou-0.025
 topp-0.024
 gymn-0.024
Token squeeze
Token ID21229
Feature activation-8.26e-03

Pos logit contributions
How+0.022
Maybe+0.022
Mom+0.022
What+0.021
 someone+0.021
Neg logit contributions
ocol-0.034
 rib-0.030
rou-0.026
 gymn-0.026
hy-0.026
Token and
Token ID290
Feature activation+5.23e-03

Pos logit contributions
ocol+0.017
 rib+0.014
 microsc+0.014
 gymn+0.013
rou+0.013
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.005
 dad-0.005
 next-0.005
Token they
Token ID484
Feature activation-8.20e-03

Pos logit contributions
Maybe+0.004
 someone+0.003
Mom+0.003
How+0.003
 one+0.003
Neg logit contributions
ocol-0.010
 rib-0.009
 microsc-0.008
 gymn-0.008
rou-0.008
Token
Token ID198
Feature activation+1.92e-02

Pos logit contributions
Maybe+0.006
 one+0.006
 next+0.006
 someone+0.006
 dad+0.006
Neg logit contributions
ocol-0.016
 rib-0.014
 microsc-0.013
 gymn-0.013
 topp-0.013
Token
Token ID198
Feature activation+1.83e-02

Pos logit contributions
Maybe+0.016
 one+0.016
How+0.015
 dad+0.015
 someone+0.015
Neg logit contributions
ocol-0.031
 rib-0.026
 microsc-0.025
 gymn-0.025
rou-0.024
TokenMom
Token ID29252
Feature activation+1.51e-02

Pos logit contributions
Maybe+0.012
 look+0.011
 someone+0.011
 one+0.011
 dad+0.011
Neg logit contributions
ocol-0.033
 rib-0.027
ifying-0.027
 microsc-0.026
 gymn-0.026
Token hears
Token ID23180
Feature activation+1.03e-02

Pos logit contributions
 someone+0.014
Maybe+0.013
 one+0.013
 dad+0.012
 somewhere+0.012
Neg logit contributions
ocol-0.026
 rib-0.022
rou-0.021
 microsc-0.021
 gymn-0.020
Token the
Token ID262
Feature activation-1.04e-02

Pos logit contributions
Maybe+0.008
Mom+0.007
How+0.007
 someone+0.007
 next+0.007
Neg logit contributions
ocol-0.019
 rib-0.016
 microsc-0.015
 gymn-0.015
 topp-0.015
Token noise
Token ID7838
Feature activation+5.55e-02

Pos logit contributions
ocol+0.022
 rib+0.019
 microsc+0.018
rou+0.018
 gymn+0.018
Neg logit contributions
 someone-0.006
Maybe-0.006
 dad-0.006
 one-0.006
 next-0.006
Token.
Token ID13
Feature activation+1.89e-02

Pos logit contributions
cil+0.078
Maybe+0.073
How+0.072
Someone+0.072
Mother+0.072
Neg logit contributions
ocol-0.057
 gymn-0.047
 rib-0.046
oke-0.043
hy-0.041
Token She
Token ID1375
Feature activation+4.59e-03

Pos logit contributions
 somewhere+0.019
 one+0.019
 someone+0.019
Maybe+0.019
How+0.019
Neg logit contributions
ocol-0.024
 rib-0.022
 microsc-0.021
 gymn-0.020
pill-0.019
Token comes
Token ID2058
Feature activation+2.57e-03

Pos logit contributions
Maybe+0.004
 someone+0.004
 one+0.003
 dad+0.003
Mom+0.003
Neg logit contributions
ocol-0.009
 rib-0.007
 microsc-0.007
 gymn-0.007
rou-0.007
Token to
Token ID284
Feature activation-7.41e-03

Pos logit contributions
Maybe+0.003
 someone+0.003
How+0.003
 one+0.003
Mom+0.003
Neg logit contributions
ocol-0.004
 rib-0.003
 microsc-0.003
 gymn-0.003
rou-0.003
Token see
Token ID766
Feature activation+2.93e-02

Pos logit contributions
ocol+0.016
 rib+0.013
 microsc+0.013
rou+0.012
 gymn+0.012
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token
Token ID198
Feature activation+2.60e-02

Pos logit contributions
 one+0.006
 dad+0.006
 someone+0.006
 next+0.006
Maybe+0.006
Neg logit contributions
ocol-0.014
 rib-0.012
 microsc-0.012
 gymn-0.012
 topp-0.011
Token
Token ID198
Feature activation+2.66e-02

Pos logit contributions
 one+0.025
 forth+0.025
Maybe+0.024
How+0.023
 looked+0.023
Neg logit contributions
ocol-0.038
 microsc-0.031
 rib-0.030
 gymn-0.029
rou-0.029
TokenMom
Token ID29252
Feature activation+1.09e-02

Pos logit contributions
 look+0.021
 one+0.021
 someone+0.020
 somewhere+0.020
 dad+0.020
Neg logit contributions
ocol-0.043
 microsc-0.036
 rib-0.034
 gymn-0.033
ifying-0.033
Token hears
Token ID23180
Feature activation+2.05e-02

Pos logit contributions
 someone+0.010
Maybe+0.009
 one+0.009
 dad+0.009
Mom+0.009
Neg logit contributions
ocol-0.019
 rib-0.016
rou-0.015
 microsc-0.015
 gymn-0.015
Token the
Token ID262
Feature activation-5.64e-03

Pos logit contributions
Maybe+0.019
How+0.017
Mom+0.017
Nothing+0.016
 next+0.016
Neg logit contributions
ocol-0.036
 rib-0.030
 microsc-0.028
 gymn-0.026
 topp-0.025
Token noise
Token ID7838
Feature activation+5.55e-02

Pos logit contributions
ocol+0.012
 rib+0.010
 microsc+0.009
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.004
 someone-0.003
 one-0.003
 dad-0.003
 next-0.003
Token and
Token ID290
Feature activation+2.04e-02

Pos logit contributions
cil+0.081
Someone+0.074
Maybe+0.073
 gentlemen+0.072
Mother+0.072
Neg logit contributions
ocol-0.055
 gymn-0.044
 rib-0.043
oke-0.042
rou-0.037
Token comes
Token ID2058
Feature activation-6.07e-03

Pos logit contributions
Mom+0.019
Maybe+0.019
 someone+0.018
Dad+0.017
 somewhere+0.017
Neg logit contributions
ocol-0.033
 rib-0.030
 gymn-0.027
 microsc-0.026
pill-0.025
Token to
Token ID284
Feature activation-1.23e-02

Pos logit contributions
ocol+0.010
 rib+0.008
 microsc+0.007
 gymn+0.007
rou+0.007
Neg logit contributions
Maybe-0.007
 someone-0.006
 one-0.006
 dad-0.006
 next-0.006
Token the
Token ID262
Feature activation-1.83e-02

Pos logit contributions
ocol+0.027
 rib+0.023
 microsc+0.022
rou+0.022
 topp+0.022
Neg logit contributions
Maybe-0.006
 someone-0.006
 one-0.006
 dad-0.006
 next-0.006
Token room
Token ID2119
Feature activation+7.39e-03

Pos logit contributions
ocol+0.037
 rib+0.030
 microsc+0.028
 topp+0.028
opot+0.028
Neg logit contributions
 next-0.014
 someone-0.014
Maybe-0.014
 one-0.014
 dad-0.014
Token and
Token ID290
Feature activation+5.92e-03

Pos logit contributions
ocol+0.012
 rib+0.010
 microsc+0.010
 gymn+0.010
rou+0.010
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token saw
Token ID2497
Feature activation+1.92e-03

Pos logit contributions
Maybe+0.004
Mom+0.004
 someone+0.004
How+0.004
 dad+0.004
Neg logit contributions
ocol-0.011
 rib-0.009
 microsc-0.009
 gymn-0.008
rou-0.008
Token a
Token ID257
Feature activation+2.26e-03

Pos logit contributions
Maybe+0.001
Mom+0.001
How+0.001
 someone+0.001
 dad+0.001
Neg logit contributions
ocol-0.004
 rib-0.004
 microsc-0.003
rou-0.003
 gymn-0.003
Token magical
Token ID10883
Feature activation-7.90e-03

Pos logit contributions
Maybe+0.002
 someone+0.002
 one+0.002
 dad+0.002
 next+0.002
Neg logit contributions
ocol-0.004
 rib-0.004
 microsc-0.003
 gymn-0.003
rou-0.003
Token fair
Token ID3148
Feature activation+1.72e-03

Pos logit contributions
ocol+0.015
 rib+0.013
 microsc+0.012
 gymn+0.012
rou+0.012
Neg logit contributions
Maybe-0.006
 someone-0.006
 one-0.006
 dad-0.006
 next-0.005
Tokenyt
Token ID20760
Feature activation-6.26e-02

Pos logit contributions
Maybe+0.001
 someone+0.001
Mom+0.001
How+0.001
 one+0.001
Neg logit contributions
ocol-0.003
 rib-0.003
 microsc-0.003
 gymn-0.003
rou-0.003
Tokenale
Token ID1000
Feature activation-1.62e-02

Pos logit contributions
 microsc+0.112
ocol+0.112
pill+0.110
 topp+0.110
 gymn+0.109
Neg logit contributions
 one-0.042
 end-0.037
 back-0.036
 next-0.036
 where-0.036
Token creature
Token ID7185
Feature activation-5.86e-03

Pos logit contributions
ocol+0.032
 rib+0.027
 microsc+0.026
 gymn+0.025
rou+0.025
Neg logit contributions
 someone-0.011
 one-0.011
 next-0.011
Maybe-0.011
 dad-0.010
Token called
Token ID1444
Feature activation-1.81e-02

Pos logit contributions
ocol+0.012
 rib+0.010
 microsc+0.009
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token a
Token ID257
Feature activation-1.11e-02

Pos logit contributions
ocol+0.037
 rib+0.031
 microsc+0.029
 gymn+0.029
 topp+0.029
Neg logit contributions
 someone-0.012
 one-0.012
 dad-0.012
Maybe-0.011
 next-0.011
Token n
Token ID299
Feature activation-1.85e-02

Pos logit contributions
ocol+0.021
 rib+0.018
 microsc+0.017
rou+0.016
 gymn+0.016
Neg logit contributions
 someone-0.009
Maybe-0.008
 one-0.008
 dad-0.008
 next-0.008
Token it
Token ID340
Feature activation+1.67e-02

Pos logit contributions
ocol+0.002
 rib+0.002
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
Mom-0.001
 one-0.001
Token first
Token ID717
Feature activation+8.73e-03

Pos logit contributions
Maybe+0.013
Mom+0.012
 dad+0.012
 someone+0.011
What+0.011
Neg logit contributions
ocol-0.030
 rib-0.026
 gymn-0.025
 microsc-0.024
rou-0.024
Token?
Token ID30
Feature activation-1.26e-02

Pos logit contributions
Maybe+0.006
 someone+0.006
Mom+0.005
 dad+0.005
What+0.005
Neg logit contributions
ocol-0.017
 rib-0.014
 gymn-0.014
 microsc-0.013
rou-0.013
Tokenâ
Token ID22940
Feature activation-1.29e-02

Pos logit contributions
ocol+0.022
 rib+0.018
 microsc+0.018
rou+0.017
 gymn+0.017
Neg logit contributions
Maybe-0.012
 someone-0.011
 one-0.011
 dad-0.011
Mom-0.011
Token
Token ID26391
Feature activation+3.21e-03

Pos logit contributions
ocol+0.017
 rib+0.013
 microsc+0.012
 gymn+0.011
 topp+0.011
Neg logit contributions
Maybe-0.018
 someone-0.018
 one-0.018
 next-0.018
 dad-0.017
Token
Token ID8151
Feature activation-6.20e-02

Pos logit contributions
Maybe+0.003
 someone+0.003
 one+0.002
 next+0.002
 dad+0.002
Neg logit contributions
ocol-0.006
 rib-0.005
 microsc-0.005
 gymn-0.005
rou-0.005
Token M
Token ID337
Feature activation-2.40e-02

Pos logit contributions
ocol+0.138
 rib+0.118
 microsc+0.108
 former+0.106
 topp+0.105
Neg logit contributions
 one-0.039
Maybe-0.038
 dad-0.037
 they-0.037
 someone-0.037
Tokenummy
Token ID13513
Feature activation+1.60e-02

Pos logit contributions
ocol+0.038
 rib+0.031
 microsc+0.029
 topp+0.028
ifying+0.028
Neg logit contributions
 dad-0.026
 next-0.026
 someone-0.025
 one-0.025
Maybe-0.025
Token said
Token ID531
Feature activation-2.32e-02

Pos logit contributions
Maybe+0.016
 someone+0.015
How+0.015
Mom+0.015
What+0.015
Neg logit contributions
ocol-0.026
 rib-0.022
 gymn-0.020
 microsc-0.020
 topp-0.020
Token
Token ID6184
Feature activation-1.24e-02

Pos logit contributions
ocol+0.043
 rib+0.035
 microsc+0.035
rou+0.034
pill+0.033
Neg logit contributions
 someone-0.020
 one-0.020
 dad-0.019
Maybe-0.019
 next-0.019
Token
Token ID95
Feature activation-1.73e-02

Pos logit contributions
ocol+0.019
 rib+0.016
 microsc+0.015
rou+0.014
 topp+0.014
Neg logit contributions
Maybe-0.014
 someone-0.013
 one-0.013
 dad-0.013
 next-0.013
Token tall
Token ID7331
Feature activation-3.53e-02

Pos logit contributions
ocol+0.033
 rib+0.027
 microsc+0.027
rou+0.026
 gymn+0.026
Neg logit contributions
 one-0.009
 someone-0.008
Maybe-0.008
 next-0.008
 everything-0.008
Token hills
Token ID18639
Feature activation-3.04e-02

Pos logit contributions
ocol+0.078
rou+0.068
ifying+0.065
 microsc+0.064
 gol+0.064
Neg logit contributions
 one-0.019
 next-0.016
 everything-0.015
 someone-0.015
 looked-0.013
Token and
Token ID290
Feature activation+9.78e-04

Pos logit contributions
ocol+0.074
 rib+0.061
 microsc+0.061
 topp+0.058
rou+0.058
Neg logit contributions
 one-0.014
 next-0.012
 fast-0.010
 someone-0.010
 everything-0.010
Token passed
Token ID3804
Feature activation-2.27e-02

Pos logit contributions
Maybe+0.001
 someone+0.001
 dad+0.001
Mom+0.001
 one+0.001
Neg logit contributions
ocol-0.002
 rib-0.002
 microsc-0.002
 gymn-0.002
rou-0.001
Token through
Token ID832
Feature activation+2.61e-03

Pos logit contributions
ocol+0.046
 rib+0.037
rou+0.035
 microsc+0.035
 gymn+0.035
Neg logit contributions
 someone-0.018
 one-0.018
 everything-0.017
Maybe-0.017
 next-0.017
Token vibrant
Token ID21266
Feature activation-6.09e-02

Pos logit contributions
Maybe+0.002
Mom+0.002
 dad+0.002
 someone+0.002
How+0.002
Neg logit contributions
ocol-0.005
 rib-0.004
 microsc-0.004
 gymn-0.004
 topp-0.004
Token forests
Token ID17039
Feature activation-4.35e-02

Pos logit contributions
ocol+0.137
liv+0.126
oke+0.125
 gol+0.125
rou+0.125
Neg logit contributions
 one-0.029
 everything-0.028
 things-0.023
 water-0.021
 something-0.020
Token.
Token ID13
Feature activation+5.00e-03

Pos logit contributions
ocol+0.111
 rib+0.093
 microsc+0.091
 gol+0.089
 gymn+0.089
Neg logit contributions
 one-0.015
 next-0.014
 someone-0.010
 fast-0.009
 everything-0.009
Token After
Token ID2293
Feature activation+2.35e-02

Pos logit contributions
Maybe+0.005
Mom+0.005
 dad+0.005
 someone+0.005
 next+0.005
Neg logit contributions
ocol-0.008
 rib-0.006
 microsc-0.006
 gymn-0.006
 topp-0.006
Token several
Token ID1811
Feature activation+1.45e-02

Pos logit contributions
Maybe+0.018
Mom+0.018
How+0.016
Parent+0.015
 dad+0.015
Neg logit contributions
ocol-0.044
 rib-0.039
 microsc-0.037
 gymn-0.036
pill-0.034
Token days
Token ID1528
Feature activation-2.65e-02

Pos logit contributions
Maybe+0.014
Mom+0.013
 dad+0.013
How+0.013
 someone+0.013
Neg logit contributions
ocol-0.026
 rib-0.021
 microsc-0.019
 gymn-0.019
 topp-0.019
Token lot
Token ID1256
Feature activation-1.17e-02

Pos logit contributions
Maybe+0.006
 someone+0.005
 one+0.005
 next+0.005
How+0.005
Neg logit contributions
ocol-0.013
 rib-0.011
 microsc-0.010
 gymn-0.010
rou-0.010
Token of
Token ID286
Feature activation-1.72e-02

Pos logit contributions
ocol+0.017
 rib+0.014
 microsc+0.013
 gymn+0.013
 topp+0.012
Neg logit contributions
Maybe-0.014
 someone-0.014
 one-0.014
 next-0.014
 dad-0.013
Token different
Token ID1180
Feature activation-2.73e-02

Pos logit contributions
ocol+0.034
 rib+0.028
 microsc+0.027
rou+0.027
 gymn+0.027
Neg logit contributions
 someone-0.012
 one-0.012
Maybe-0.012
 dad-0.012
 next-0.011
Token animals
Token ID4695
Feature activation-3.05e-03

Pos logit contributions
ocol+0.056
 rib+0.045
rou+0.044
 microsc+0.043
 gol+0.043
Neg logit contributions
 one-0.020
 someone-0.019
 everything-0.018
 something-0.017
 next-0.017
Token and
Token ID290
Feature activation-1.14e-02

Pos logit contributions
ocol+0.006
 rib+0.005
 microsc+0.005
 gymn+0.005
rou+0.005
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
Mom-0.002
Token lush
Token ID37408
Feature activation-6.09e-02

Pos logit contributions
ocol+0.023
 rib+0.019
 microsc+0.019
rou+0.018
 gymn+0.018
Neg logit contributions
 someone-0.007
Maybe-0.007
 one-0.007
 dad-0.007
 next-0.007
Token green
Token ID4077
Feature activation-4.16e-02

Pos logit contributions
ocol+0.136
rou+0.126
ifying+0.124
pri+0.123
bek+0.122
Neg logit contributions
 one-0.025
 water-0.022
 everything-0.020
 next-0.017
 fast-0.017
Token trees
Token ID7150
Feature activation-2.27e-02

Pos logit contributions
ocol+0.092
rou+0.079
 gol+0.077
pri+0.077
 gymn+0.076
Neg logit contributions
 one-0.021
 someone-0.017
 next-0.016
 everything-0.015
 water-0.015
Token.
Token ID13
Feature activation+5.42e-03

Pos logit contributions
ocol+0.051
 rib+0.043
 microsc+0.042
 gymn+0.041
rou+0.041
Neg logit contributions
 one-0.011
 next-0.010
 someone-0.009
Maybe-0.009
 they-0.008
Token
Token ID198
Feature activation+4.35e-03

Pos logit contributions
Maybe+0.005
 next+0.004
 someone+0.004
 one+0.004
 dad+0.004
Neg logit contributions
ocol-0.009
 rib-0.008
 microsc-0.008
 gymn-0.007
pill-0.007
Token
Token ID198
Feature activation-4.20e-03

Pos logit contributions
Maybe+0.003
 someone+0.003
 one+0.003
 dad+0.003
 next+0.003
Neg logit contributions
ocol-0.008
 rib-0.007
 microsc-0.007
 gymn-0.007
rou-0.007
Token came
Token ID1625
Feature activation+7.61e-04

Pos logit contributions
ocol+0.004
 rib+0.004
 microsc+0.003
 gymn+0.003
rou+0.003
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
Mom-0.002
Token across
Token ID1973
Feature activation-3.05e-03

Pos logit contributions
Maybe+0.001
 someone+0.001
 dad+0.001
Mom+0.001
 one+0.001
Neg logit contributions
ocol-0.001
 rib-0.001
 microsc-0.001
rou-0.001
 gymn-0.001
Token rolling
Token ID10708
Feature activation-1.40e-02

Pos logit contributions
ocol+0.007
 rib+0.006
 microsc+0.006
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
 one-0.001
Mom-0.001
Token hills
Token ID18639
Feature activation-2.65e-02

Pos logit contributions
ocol+0.029
 rib+0.024
 microsc+0.023
rou+0.023
 gymn+0.022
Neg logit contributions
 one-0.009
 someone-0.009
Maybe-0.009
 next-0.009
 dad-0.008
Token with
Token ID351
Feature activation-4.17e-03

Pos logit contributions
ocol+0.063
 rib+0.052
 microsc+0.051
 topp+0.049
 gymn+0.049
Neg logit contributions
 one-0.013
 next-0.012
Maybe-0.010
 someone-0.010
 everything-0.009
Token lush
Token ID37408
Feature activation-6.07e-02

Pos logit contributions
ocol+0.008
 rib+0.007
 microsc+0.007
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.003
 someone-0.003
 one-0.003
 dad-0.003
 next-0.003
Token green
Token ID4077
Feature activation-4.43e-02

Pos logit contributions
ocol+0.134
rou+0.122
liv+0.120
ifying+0.120
pri+0.119
Neg logit contributions
 one-0.026
 water-0.025
 everything-0.024
 sun-0.021
 fast-0.021
Token grass
Token ID8701
Feature activation-1.42e-02

Pos logit contributions
ocol+0.099
rou+0.087
pri+0.084
liv+0.084
hy+0.084
Neg logit contributions
 one-0.022
 water-0.019
 everything-0.017
 someone-0.015
 next-0.015
Token.
Token ID13
Feature activation+6.13e-03

Pos logit contributions
ocol+0.031
 rib+0.026
 microsc+0.025
 gymn+0.025
rou+0.024
Neg logit contributions
 one-0.007
Maybe-0.007
 someone-0.007
 next-0.007
 dad-0.007
Token The
Token ID383
Feature activation+1.23e-02

Pos logit contributions
Maybe+0.006
 someone+0.006
 dad+0.006
 next+0.006
Mom+0.006
Neg logit contributions
ocol-0.010
 rib-0.009
 microsc-0.008
 gymn-0.008
pill-0.008
Token sun
Token ID4252
Feature activation-4.95e-04

Pos logit contributions
Maybe+0.011
 someone+0.010
Nothing+0.010
Mom+0.010
How+0.009
Neg logit contributions
ocol-0.021
 rib-0.019
 gol-0.018
 gymn-0.017
 topp-0.016
Token
Token ID198
Feature activation+1.05e-02

Pos logit contributions
Maybe+0.014
 dad+0.013
 someone+0.013
 one+0.013
 next+0.013
Neg logit contributions
ocol-0.030
 microsc-0.024
 rib-0.024
rou-0.024
 gymn-0.024
TokenFinally
Token ID11158
Feature activation+1.87e-02

Pos logit contributions
 someone+0.008
 dad+0.008
 one+0.008
Maybe+0.008
 somewhere+0.008
Neg logit contributions
ocol-0.019
 rib-0.016
 microsc-0.015
 gymn-0.015
rou-0.015
Token Lucy
Token ID22162
Feature activation-1.20e-02

Pos logit contributions
Maybe+0.020
How+0.018
Mom+0.018
What+0.018
Nothing+0.016
Neg logit contributions
ocol-0.031
 rib-0.025
 microsc-0.023
 gymn-0.023
 topp-0.022
Tokenâ
Token ID22940
Feature activation-1.34e-02

Pos logit contributions
ocol+0.025
 rib+0.021
 microsc+0.020
 gymn+0.019
 topp+0.019
Neg logit contributions
Maybe-0.008
 someone-0.007
 one-0.007
 dad-0.007
 next-0.007
Token
Token ID26391
Feature activation+6.02e-03

Pos logit contributions
ocol+0.018
 rib+0.014
 microsc+0.013
 gymn+0.012
rou+0.012
Neg logit contributions
Maybe-0.018
 one-0.018
 someone-0.018
 next-0.018
Mom-0.018
Token
Token ID8151
Feature activation-6.04e-02

Pos logit contributions
Maybe+0.006
 someone+0.006
 one+0.006
 next+0.005
 dad+0.005
Neg logit contributions
ocol-0.010
 rib-0.008
 microsc-0.008
rou-0.008
 gymn-0.008
Tokens
Token ID82
Feature activation+6.08e-03

Pos logit contributions
ocol+0.124
 rib+0.104
 microsc+0.096
 former+0.093
 topp+0.092
Neg logit contributions
 one-0.048
Maybe-0.048
 everything-0.047
 they-0.047
 there-0.045
Token mom
Token ID1995
Feature activation-5.23e-04

Pos logit contributions
Maybe+0.004
How+0.003
 someone+0.003
What+0.003
Mom+0.003
Neg logit contributions
ocol-0.012
 rib-0.011
 microsc-0.010
rou-0.010
 gymn-0.010
Token said
Token ID531
Feature activation-1.57e-02

Pos logit contributions
ocol+0.001
 rib+0.001
 microsc+0.001
rou+0.001
 gymn+0.001
Neg logit contributions
Maybe-0.000
 someone-0.000
 one-0.000
 dad-0.000
Mom-0.000
Token,
Token ID11
Feature activation+7.01e-03

Pos logit contributions
ocol+0.033
 rib+0.027
 microsc+0.027
rou+0.026
 topp+0.026
Neg logit contributions
 someone-0.010
 one-0.010
 dad-0.009
Maybe-0.009
 next-0.009
Token
Token ID6184
Feature activation-9.82e-03

Pos logit contributions
Maybe+0.007
 someone+0.007
Mom+0.007
 one+0.007
How+0.007
Neg logit contributions
ocol-0.011
 rib-0.009
 microsc-0.008
 gymn-0.008
rou-0.008
Token
Token ID129
Feature activation-2.53e-02

Pos logit contributions
Maybe+0.012
 someone+0.011
 one+0.011
 next+0.011
 dad+0.011
Neg logit contributions
ocol-0.022
 rib-0.018
 gymn-0.018
rou-0.018
 microsc-0.017
Token
Token ID241
Feature activation-9.06e-03

Pos logit contributions
ocol+0.039
 rib+0.033
 microsc+0.030
 topp+0.030
rou+0.028
Neg logit contributions
 someone-0.028
 one-0.028
 next-0.027
 dad-0.027
Mom-0.027
TokenDon
Token ID3987
Feature activation-2.30e-02

Pos logit contributions
ocol+0.018
 rib+0.015
 microsc+0.014
 gymn+0.014
rou+0.014
Neg logit contributions
Maybe-0.006
 someone-0.006
 one-0.006
 dad-0.006
 next-0.006
Tokenâ
Token ID22940
Feature activation-8.96e-03

Pos logit contributions
ocol+0.047
 rib+0.039
 microsc+0.039
 topp+0.038
rou+0.036
Neg logit contributions
 someone-0.017
Maybe-0.017
 next-0.017
Mom-0.016
 one-0.016
Token
Token ID26391
Feature activation+4.71e-03

Pos logit contributions
ocol+0.011
 rib+0.008
 microsc+0.008
 gymn+0.007
rou+0.007
Neg logit contributions
Maybe-0.013
 someone-0.013
 one-0.013
 next-0.012
 dad-0.012
Token
Token ID8151
Feature activation-6.01e-02

Pos logit contributions
Maybe+0.004
 someone+0.004
 one+0.004
 dad+0.004
 next+0.004
Neg logit contributions
ocol-0.008
 rib-0.007
 microsc-0.006
 gymn-0.006
rou-0.006
Tokent
Token ID83
Feature activation+3.52e-03

Pos logit contributions
ocol+0.121
 rib+0.105
 microsc+0.096
 former+0.092
 topp+0.091
Neg logit contributions
 one-0.046
Maybe-0.045
 someone-0.044
 they-0.043
 next-0.043
Token worry
Token ID5490
Feature activation-5.92e-03

Pos logit contributions
Maybe+0.003
 someone+0.003
 dad+0.003
How+0.003
Mom+0.003
Neg logit contributions
ocol-0.006
 rib-0.005
 microsc-0.005
 gymn-0.005
rou-0.005
Token,
Token ID11
Feature activation-9.85e-03

Pos logit contributions
ocol+0.012
 rib+0.010
 microsc+0.009
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token Jack
Token ID3619
Feature activation-1.05e-02

Pos logit contributions
ocol+0.020
 rib+0.017
 microsc+0.016
 gymn+0.016
rou+0.016
Neg logit contributions
Maybe-0.007
 someone-0.007
 one-0.006
 dad-0.006
 next-0.006
Token.
Token ID13
Feature activation+3.74e-03

Pos logit contributions
ocol+0.020
 rib+0.017
 microsc+0.016
 gymn+0.016
rou+0.016
Neg logit contributions
Maybe-0.008
 someone-0.008
 one-0.008
 dad-0.007
 next-0.007
Token,
Token ID11
Feature activation+3.99e-04

Pos logit contributions
 dad+0.001
 someone+0.001
 one+0.001
Maybe+0.001
 everything+0.001
Neg logit contributions
ocol-0.002
 rib-0.001
 microsc-0.001
 gymn-0.001
 topp-0.001
Token why
Token ID1521
Feature activation-1.86e-02

Pos logit contributions
Maybe+0.000
 someone+0.000
 one+0.000
 dad+0.000
Mom+0.000
Neg logit contributions
ocol-0.001
 rib-0.001
 microsc-0.001
 gymn-0.001
rou-0.001
Token don
Token ID836
Feature activation-2.26e-03

Pos logit contributions
ocol+0.033
 rib+0.027
 microsc+0.026
rou+0.025
 gymn+0.025
Neg logit contributions
 someone-0.017
 one-0.017
Maybe-0.016
 dad-0.016
 everything-0.016
Tokenâ
Token ID22940
Feature activation-8.45e-03

Pos logit contributions
ocol+0.004
 rib+0.003
 microsc+0.003
 gymn+0.003
rou+0.003
Neg logit contributions
Maybe-0.002
 someone-0.002
 dad-0.002
 one-0.002
Mom-0.002
Token
Token ID26391
Feature activation-4.71e-03

Pos logit contributions
ocol+0.012
 rib+0.009
 microsc+0.008
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.011
 someone-0.011
 one-0.011
 next-0.011
 dad-0.011
Token
Token ID8151
Feature activation-6.01e-02

Pos logit contributions
ocol+0.009
 rib+0.008
 microsc+0.007
 gymn+0.007
rou+0.007
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.003
 dad-0.003
 next-0.003
Tokent
Token ID83
Feature activation-2.98e-03

Pos logit contributions
ocol+0.120
 rib+0.100
 microsc+0.096
 former+0.089
 topp+0.088
Neg logit contributions
Maybe-0.050
 one-0.049
 they-0.047
 someone-0.047
 everything-0.045
Token you
Token ID345
Feature activation-4.87e-03

Pos logit contributions
ocol+0.006
 rib+0.005
 microsc+0.004
 gymn+0.004
rou+0.004
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
Mom-0.002
Token take
Token ID1011
Feature activation+1.44e-02

Pos logit contributions
ocol+0.008
 rib+0.006
 microsc+0.006
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.005
 dad-0.005
 next-0.005
Token a
Token ID257
Feature activation+2.77e-02

Pos logit contributions
Maybe+0.009
Mom+0.008
How+0.008
 dad+0.008
 next+0.007
Neg logit contributions
ocol-0.029
 rib-0.026
 microsc-0.024
 gymn-0.024
 topp-0.023
Token break
Token ID2270
Feature activation+4.20e-03

Pos logit contributions
Maybe+0.031
How+0.029
What+0.028
Mom+0.028
 someone+0.028
Neg logit contributions
ocol-0.042
 rib-0.037
 gymn-0.034
 microsc-0.033
 tant-0.032
Token you
Token ID345
Feature activation-1.08e-02

Pos logit contributions
Maybe+0.004
 someone+0.003
 dad+0.003
Mom+0.003
 one+0.003
Neg logit contributions
ocol-0.008
 rib-0.007
 microsc-0.007
 gymn-0.006
 topp-0.006
Token want
Token ID765
Feature activation+5.84e-03

Pos logit contributions
ocol+0.021
 rib+0.017
 microsc+0.016
 gymn+0.016
rou+0.016
Neg logit contributions
Maybe-0.008
 someone-0.008
 one-0.008
 dad-0.008
 next-0.008
Token?
Token ID30
Feature activation-1.84e-03

Pos logit contributions
Maybe+0.006
Mom+0.005
 someone+0.005
 dad+0.005
How+0.005
Neg logit contributions
ocol-0.010
 rib-0.008
 microsc-0.008
 gymn-0.008
 topp-0.007
Tokenâ
Token ID22940
Feature activation-8.84e-03

Pos logit contributions
ocol+0.003
 rib+0.003
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
 next-0.002
Token
Token ID26391
Feature activation+1.77e-02

Pos logit contributions
ocol+0.012
 rib+0.009
 microsc+0.009
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.012
 someone-0.012
 one-0.011
 next-0.011
 dad-0.011
Token
Token ID8151
Feature activation-5.99e-02

Pos logit contributions
Maybe+0.015
 someone+0.015
 one+0.014
 next+0.014
What+0.014
Neg logit contributions
ocol-0.029
 rib-0.024
 microsc-0.024
 gymn-0.024
opot-0.024
Token
Token ID198
Feature activation-8.02e-03

Pos logit contributions
ocol+0.126
 rib+0.108
 former+0.098
 microsc+0.098
 topp+0.096
Neg logit contributions
 one-0.043
 someone-0.042
 dad-0.041
 they-0.041
Maybe-0.040
Token
Token ID198
Feature activation-1.03e-02

Pos logit contributions
ocol+0.017
 rib+0.014
 microsc+0.014
 gymn+0.013
rou+0.013
Neg logit contributions
Maybe-0.005
 someone-0.004
 one-0.004
 dad-0.004
Mom-0.004
TokenAlex
Token ID15309
Feature activation-4.10e-03

Pos logit contributions
ocol+0.020
 rib+0.016
 microsc+0.015
rou+0.015
 gymn+0.015
Neg logit contributions
Maybe-0.008
 someone-0.008
 one-0.008
Mom-0.008
 next-0.008
Token wanted
Token ID2227
Feature activation-8.74e-04

Pos logit contributions
ocol+0.007
 rib+0.006
 microsc+0.006
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.004
 someone-0.003
 one-0.003
 dad-0.003
 next-0.003
Token the
Token ID262
Feature activation+4.59e-03

Pos logit contributions
ocol+0.001
 rib+0.001
 microsc+0.001
 gymn+0.001
 topp+0.001
Neg logit contributions
Maybe-0.001
 someone-0.001
Mom-0.001
 dad-0.001
 one-0.001
Token!
Token ID0
Feature activation-3.73e-03

Pos logit contributions
Maybe+0.001
 someone+0.001
Mom+0.001
 dad+0.001
How+0.001
Neg logit contributions
ocol-0.003
 rib-0.002
 microsc-0.002
 gymn-0.002
rou-0.002
Token Why
Token ID4162
Feature activation-1.81e-02

Pos logit contributions
ocol+0.007
 rib+0.005
 microsc+0.005
 gymn+0.005
rou+0.005
Neg logit contributions
Maybe-0.003
 someone-0.003
 one-0.003
 dad-0.003
 next-0.003
Token don
Token ID836
Feature activation+4.92e-04

Pos logit contributions
ocol+0.032
 rib+0.025
 microsc+0.024
rou+0.023
 gol+0.023
Neg logit contributions
 someone-0.019
 one-0.018
Maybe-0.018
 dad-0.018
 everything-0.017
Tokenâ
Token ID22940
Feature activation-5.96e-03

Pos logit contributions
Maybe+0.000
 someone+0.000
 dad+0.000
 one+0.000
Mom+0.000
Neg logit contributions
ocol-0.001
 rib-0.001
 microsc-0.001
 gymn-0.001
rou-0.001
Token
Token ID26391
Feature activation-2.75e-03

Pos logit contributions
ocol+0.008
 rib+0.006
 microsc+0.006
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.008
 someone-0.008
 one-0.007
 dad-0.007
 next-0.007
Token
Token ID8151
Feature activation-5.96e-02

Pos logit contributions
ocol+0.005
 rib+0.004
 microsc+0.004
 gymn+0.004
rou+0.004
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
 next-0.002
Tokent
Token ID83
Feature activation-1.90e-03

Pos logit contributions
ocol+0.123
 rib+0.105
 microsc+0.098
 former+0.093
 topp+0.092
Neg logit contributions
Maybe-0.045
 one-0.044
 someone-0.042
 they-0.042
 there-0.041
Token you
Token ID345
Feature activation-9.91e-04

Pos logit contributions
ocol+0.004
 rib+0.003
 microsc+0.003
 gymn+0.003
rou+0.003
Neg logit contributions
Maybe-0.001
 someone-0.001
 one-0.001
 dad-0.001
Mom-0.001
Token come
Token ID1282
Feature activation+4.90e-03

Pos logit contributions
ocol+0.002
 rib+0.001
 microsc+0.001
 gymn+0.001
rou+0.001
Neg logit contributions
Maybe-0.001
 someone-0.001
 one-0.001
 dad-0.001
Mom-0.001
Token with
Token ID351
Feature activation+5.51e-03

Pos logit contributions
Maybe+0.004
 someone+0.004
 dad+0.004
Mom+0.004
 one+0.004
Neg logit contributions
ocol-0.009
 rib-0.008
 gymn-0.007
 microsc-0.007
rou-0.007
Token me
Token ID502
Feature activation-1.25e-03

Pos logit contributions
Maybe+0.005
 someone+0.004
How+0.004
Mom+0.004
What+0.004
Neg logit contributions
ocol-0.010
 rib-0.008
 microsc-0.008
 gymn-0.008
rou-0.007
Token my
Token ID616
Feature activation-1.14e-02

Pos logit contributions
Maybe+0.009
 someone+0.008
How+0.008
 dad+0.007
Mom+0.007
Neg logit contributions
ocol-0.018
 rib-0.015
rou-0.014
 gymn-0.014
 microsc-0.014
Token friend
Token ID1545
Feature activation-2.52e-03

Pos logit contributions
ocol+0.021
 rib+0.017
 microsc+0.016
 gymn+0.016
 topp+0.016
Neg logit contributions
 one-0.010
 someone-0.010
Maybe-0.010
 dad-0.010
 next-0.009
Token?
Token ID30
Feature activation+7.61e-04

Pos logit contributions
ocol+0.005
 rib+0.004
 microsc+0.004
 gymn+0.004
rou+0.004
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
Mom-0.002
Tokenâ
Token ID22940
Feature activation-1.63e-02

Pos logit contributions
Maybe+0.001
 someone+0.001
 dad+0.001
 one+0.001
 next+0.001
Neg logit contributions
ocol-0.001
 rib-0.001
 microsc-0.001
 gymn-0.001
rou-0.001
Token
Token ID26391
Feature activation+1.04e-02

Pos logit contributions
ocol+0.022
 rib+0.017
 microsc+0.016
 gymn+0.015
 topp+0.014
Neg logit contributions
Maybe-0.022
 next-0.022
 someone-0.022
 one-0.022
Mom-0.021
Token
Token ID8151
Feature activation-5.93e-02

Pos logit contributions
Maybe+0.009
 someone+0.009
 one+0.009
 next+0.009
Mom+0.009
Neg logit contributions
ocol-0.017
 rib-0.014
 microsc-0.014
 gymn-0.014
rou-0.014
Token The
Token ID383
Feature activation+5.09e-03

Pos logit contributions
ocol+0.130
 rib+0.112
 topp+0.102
 microsc+0.102
 former+0.101
Neg logit contributions
 one-0.038
 someone-0.037
 dad-0.037
 they-0.035
 everything-0.035
Token chicken
Token ID9015
Feature activation+1.36e-02

Pos logit contributions
Maybe+0.005
 someone+0.005
Mom+0.005
 dad+0.004
How+0.004
Neg logit contributions
ocol-0.009
 rib-0.007
 gymn-0.007
 microsc-0.007
 topp-0.006
Token said
Token ID531
Feature activation-2.45e-02

Pos logit contributions
Maybe+0.013
 someone+0.012
Mom+0.012
How+0.012
What+0.012
Neg logit contributions
ocol-0.023
 rib-0.019
 microsc-0.018
 gymn-0.018
rou-0.017
Token yes
Token ID3763
Feature activation-1.85e-02

Pos logit contributions
ocol+0.049
 microsc+0.040
 rib+0.040
rou+0.038
 topp+0.038
Neg logit contributions
 someone-0.018
 one-0.018
 dad-0.017
 next-0.016
Maybe-0.016
Token and
Token ID290
Feature activation-7.08e-03

Pos logit contributions
ocol+0.040
 rib+0.033
 microsc+0.032
pill+0.031
 topp+0.031
Neg logit contributions
 one-0.012
 someone-0.011
Maybe-0.011
 next-0.011
 dad-0.011
Token,
Token ID11
Feature activation-1.39e-03

Pos logit contributions
Maybe+0.013
How+0.012
 someone+0.012
Mom+0.012
What+0.012
Neg logit contributions
ocol-0.024
 rib-0.021
 gymn-0.019
rou-0.018
 microsc-0.018
Token if
Token ID611
Feature activation-8.81e-03

Pos logit contributions
ocol+0.003
 rib+0.002
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.001
 someone-0.001
 one-0.001
 dad-0.001
Mom-0.001
Token that
Token ID326
Feature activation+8.03e-03

Pos logit contributions
ocol+0.018
 rib+0.015
 microsc+0.014
rou+0.014
 gymn+0.014
Neg logit contributions
 someone-0.006
Maybe-0.006
 one-0.006
 dad-0.006
 next-0.006
Tokenâ
Token ID22940
Feature activation-5.25e-03

Pos logit contributions
Maybe+0.007
Mom+0.007
 dad+0.007
 someone+0.007
What+0.007
Neg logit contributions
ocol-0.014
 rib-0.012
 gymn-0.011
 microsc-0.011
 topp-0.011
Token
Token ID26391
Feature activation+6.80e-03

Pos logit contributions
ocol+0.006
 rib+0.005
 microsc+0.004
 gymn+0.004
rou+0.004
Neg logit contributions
Maybe-0.008
 someone-0.008
 one-0.008
 dad-0.008
 next-0.007
Token
Token ID8151
Feature activation-5.93e-02

Pos logit contributions
Maybe+0.006
 someone+0.006
 one+0.006
 dad+0.006
 next+0.006
Neg logit contributions
ocol-0.012
 rib-0.010
 gymn-0.009
 microsc-0.009
rou-0.009
Tokens
Token ID82
Feature activation+1.36e-02

Pos logit contributions
ocol+0.115
 rib+0.100
 former+0.091
 microsc+0.091
 topp+0.086
Neg logit contributions
 they-0.048
 one-0.048
Maybe-0.047
 someone-0.047
 there-0.046
Token what
Token ID644
Feature activation-1.54e-02

Pos logit contributions
Maybe+0.013
Mom+0.012
How+0.011
What+0.011
 someone+0.011
Neg logit contributions
ocol-0.023
 rib-0.020
 gymn-0.018
rou-0.018
 microsc-0.018
Token you
Token ID345
Feature activation-1.12e-02

Pos logit contributions
ocol+0.029
 rib+0.023
 microsc+0.022
rou+0.022
 gymn+0.021
Neg logit contributions
 someone-0.013
 one-0.013
Maybe-0.013
 dad-0.013
 next-0.012
Token want
Token ID765
Feature activation-1.62e-03

Pos logit contributions
ocol+0.020
 rib+0.016
 microsc+0.015
 gymn+0.015
rou+0.015
Neg logit contributions
Maybe-0.010
 someone-0.010
 one-0.010
 dad-0.010
 next-0.010
Token to
Token ID284
Feature activation-1.32e-02

Pos logit contributions
ocol+0.003
 rib+0.003
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
 one-0.001
Mom-0.001
Token felt
Token ID2936
Feature activation+1.10e-02

Pos logit contributions
ocol+0.003
 rib+0.003
 microsc+0.003
 gymn+0.003
rou+0.003
Neg logit contributions
Maybe-0.001
 someone-0.001
 one-0.001
 dad-0.001
Mom-0.001
Token bad
Token ID2089
Feature activation+5.12e-03

Pos logit contributions
Maybe+0.008
 dad+0.008
Mom+0.008
 one+0.008
How+0.007
Neg logit contributions
ocol-0.020
 rib-0.016
 microsc-0.016
 gymn-0.016
hy-0.015
Token and
Token ID290
Feature activation-1.46e-02

Pos logit contributions
Maybe+0.004
Mom+0.004
 someone+0.004
 dad+0.004
How+0.004
Neg logit contributions
ocol-0.009
 rib-0.008
 microsc-0.007
 gymn-0.007
rou-0.007
Token said
Token ID531
Feature activation-1.05e-02

Pos logit contributions
ocol+0.030
 rib+0.025
 microsc+0.024
 gymn+0.023
rou+0.023
Neg logit contributions
 someone-0.010
Maybe-0.010
 one-0.010
 dad-0.009
 next-0.009
Token,"
Token ID553
Feature activation-4.68e-04

Pos logit contributions
ocol+0.022
 rib+0.018
 microsc+0.018
rou+0.017
 gymn+0.017
Neg logit contributions
 someone-0.007
Maybe-0.006
 one-0.006
 dad-0.006
 next-0.006
TokenI
Token ID40
Feature activation-5.92e-02

Pos logit contributions
ocol+0.001
 rib+0.001
 microsc+0.001
 gymn+0.001
 topp+0.001
Neg logit contributions
 someone-0.000
Maybe-0.000
 dad-0.000
 one-0.000
Mom-0.000
Token'm
Token ID1101
Feature activation-1.01e-02

Pos logit contributions
ocol+0.134
 microsc+0.108
 rib+0.106
ifying+0.103
pill+0.100
Neg logit contributions
Maybe-0.050
 turned-0.045
 not-0.044
 looked-0.044
 one-0.043
Token sorry
Token ID7926
Feature activation+1.00e-02

Pos logit contributions
ocol+0.022
 rib+0.019
 microsc+0.018
 gymn+0.018
 topp+0.018
Neg logit contributions
 someone-0.005
 one-0.005
Maybe-0.005
 dad-0.005
 next-0.004
Token,
Token ID11
Feature activation+1.49e-03

Pos logit contributions
Maybe+0.011
 someone+0.010
How+0.010
 dad+0.009
What+0.009
Neg logit contributions
ocol-0.015
 rib-0.013
 gymn-0.012
rou-0.012
 microsc-0.012
Token let
Token ID1309
Feature activation-5.25e-03

Pos logit contributions
Maybe+0.001
 someone+0.001
 one+0.001
 dad+0.001
Mom+0.001
Neg logit contributions
ocol-0.003
 rib-0.002
 microsc-0.002
 gymn-0.002
rou-0.002
Token me
Token ID502
Feature activation-2.21e-03

Pos logit contributions
ocol+0.010
 rib+0.008
 microsc+0.008
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token
Token ID129
Feature activation-2.57e-02

Pos logit contributions
Maybe+0.012
 someone+0.012
 one+0.012
 next+0.011
What+0.011
Neg logit contributions
ocol-0.019
 rib-0.016
 microsc-0.016
rou-0.016
 gymn-0.015
Token
Token ID241
Feature activation-8.92e-03

Pos logit contributions
ocol+0.040
 rib+0.034
 topp+0.031
 microsc+0.031
ifying+0.030
Neg logit contributions
 someone-0.028
 next-0.028
 one-0.028
 dad-0.027
Mom-0.027
TokenDon
Token ID3987
Feature activation-2.70e-02

Pos logit contributions
ocol+0.018
 rib+0.015
 microsc+0.014
 gymn+0.014
rou+0.014
Neg logit contributions
Maybe-0.006
 someone-0.006
 one-0.006
 dad-0.006
 next-0.006
Tokenâ
Token ID22940
Feature activation-1.12e-02

Pos logit contributions
ocol+0.058
 rib+0.048
 microsc+0.048
 topp+0.046
rou+0.045
Neg logit contributions
 next-0.019
 someone-0.019
Maybe-0.019
Mom-0.018
 one-0.017
Token
Token ID26391
Feature activation+3.50e-03

Pos logit contributions
ocol+0.015
 rib+0.011
 microsc+0.010
rou+0.010
 gymn+0.010
Neg logit contributions
Maybe-0.016
 someone-0.015
 one-0.015
 next-0.015
Mom-0.015
Token
Token ID8151
Feature activation-5.90e-02

Pos logit contributions
Maybe+0.003
 someone+0.003
 one+0.003
 dad+0.003
 next+0.003
Neg logit contributions
ocol-0.006
 rib-0.005
 microsc-0.005
 gymn-0.005
rou-0.005
Tokent
Token ID83
Feature activation+1.06e-02

Pos logit contributions
ocol+0.122
 rib+0.106
 microsc+0.095
 former+0.095
 topp+0.092
Neg logit contributions
 someone-0.044
 they-0.043
 one-0.043
Mom-0.043
Maybe-0.043
Token worry
Token ID5490
Feature activation-5.50e-03

Pos logit contributions
Maybe+0.010
How+0.009
Mom+0.009
 dad+0.009
What+0.009
Neg logit contributions
ocol-0.018
 rib-0.015
 gymn-0.014
 microsc-0.014
hy-0.014
Token,
Token ID11
Feature activation-5.46e-03

Pos logit contributions
ocol+0.011
 rib+0.009
 microsc+0.009
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.003
Token Mat
Token ID6550
Feature activation-3.38e-02

Pos logit contributions
ocol+0.011
 rib+0.009
 microsc+0.009
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.003
 dad-0.003
 next-0.003
Tokenty
Token ID774
Feature activation-2.86e-03

Pos logit contributions
ocol+0.062
 rib+0.050
 microsc+0.048
 topp+0.048
 former+0.047
Neg logit contributions
 next-0.032
 someone-0.031
 one-0.030
 dad-0.030
Maybe-0.029
Token I
Token ID314
Feature activation-2.30e-02

Pos logit contributions
ocol+0.029
 rib+0.025
 microsc+0.024
rou+0.023
 topp+0.023
Neg logit contributions
 someone-0.007
Maybe-0.006
 one-0.006
 dad-0.006
 next-0.006
Token do
Token ID466
Feature activation+3.14e-03

Pos logit contributions
ocol+0.043
 rib+0.035
 microsc+0.033
 topp+0.033
pill+0.032
Neg logit contributions
 someone-0.021
 one-0.021
Maybe-0.020
 next-0.020
 dad-0.020
Token?
Token ID30
Feature activation+2.39e-03

Pos logit contributions
Maybe+0.002
 someone+0.002
 dad+0.002
Mom+0.002
 one+0.002
Neg logit contributions
ocol-0.006
 rib-0.005
 gymn-0.005
 microsc-0.005
rou-0.005
Tokenâ
Token ID22940
Feature activation-6.29e-03

Pos logit contributions
Maybe+0.002
 someone+0.002
 dad+0.002
 one+0.002
 next+0.002
Neg logit contributions
ocol-0.004
 rib-0.003
 microsc-0.003
 gymn-0.003
rou-0.003
Token
Token ID26391
Feature activation+1.56e-02

Pos logit contributions
ocol+0.008
 rib+0.006
 microsc+0.005
 gymn+0.005
rou+0.005
Neg logit contributions
Maybe-0.009
 someone-0.009
 one-0.009
 dad-0.009
 next-0.009
Token
Token ID8151
Feature activation-5.90e-02

Pos logit contributions
Maybe+0.014
 someone+0.014
 one+0.014
 next+0.013
What+0.013
Neg logit contributions
ocol-0.026
 rib-0.021
 microsc-0.021
rou-0.021
 gymn-0.021
Token Her
Token ID2332
Feature activation-7.34e-03

Pos logit contributions
ocol+0.120
 rib+0.104
 topp+0.093
 former+0.093
 microsc+0.091
Neg logit contributions
 one-0.047
 someone-0.046
 dad-0.046
 they-0.045
 there-0.044
Token dad
Token ID9955
Feature activation+8.72e-03

Pos logit contributions
ocol+0.014
 rib+0.012
 microsc+0.011
 gymn+0.011
rou+0.011
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.005
 dad-0.005
 next-0.005
Token replied
Token ID8712
Feature activation-1.61e-02

Pos logit contributions
Maybe+0.008
 someone+0.008
Mom+0.008
How+0.007
 dad+0.007
Neg logit contributions
ocol-0.015
 rib-0.012
 microsc-0.012
rou-0.011
 gymn-0.011
Token,
Token ID11
Feature activation-1.95e-03

Pos logit contributions
ocol+0.033
 rib+0.028
 microsc+0.028
rou+0.027
 gymn+0.027
Neg logit contributions
 someone-0.010
 one-0.010
Maybe-0.009
 dad-0.009
 next-0.009
Token
Token ID6184
Feature activation-8.71e-03

Pos logit contributions
ocol+0.003
 rib+0.003
 microsc+0.003
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
Mom-0.002
Token,
Token ID11
Feature activation-3.83e-04

Pos logit contributions
ocol+0.021
 rib+0.018
 microsc+0.017
 gymn+0.017
rou+0.017
Neg logit contributions
Maybe-0.007
 someone-0.007
 one-0.007
 dad-0.006
 next-0.006
Token please
Token ID3387
Feature activation+1.58e-02

Pos logit contributions
ocol+0.001
 rib+0.001
 microsc+0.001
 gymn+0.001
rou+0.001
Neg logit contributions
Maybe-0.000
 someone-0.000
 one-0.000
 dad-0.000
Mom-0.000
Token?
Token ID30
Feature activation-6.64e-03

Pos logit contributions
Maybe+0.013
 someone+0.011
Mom+0.011
What+0.011
How+0.011
Neg logit contributions
ocol-0.030
 rib-0.025
 gymn-0.023
 microsc-0.023
 topp-0.022
Tokenâ
Token ID22940
Feature activation-8.61e-03

Pos logit contributions
ocol+0.011
 rib+0.009
 microsc+0.009
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.007
 someone-0.007
 one-0.006
 dad-0.006
 next-0.006
Token
Token ID26391
Feature activation+1.13e-02

Pos logit contributions
ocol+0.011
 rib+0.008
 microsc+0.008
 gymn+0.007
rou+0.007
Neg logit contributions
Maybe-0.012
 someone-0.012
 one-0.012
 dad-0.012
 next-0.012
Token
Token ID8151
Feature activation-5.88e-02

Pos logit contributions
Maybe+0.011
 someone+0.011
 one+0.010
 next+0.010
Mom+0.010
Neg logit contributions
ocol-0.018
 rib-0.015
 microsc-0.015
 gymn-0.015
rou-0.014
Token
Token ID220
Feature activation-7.42e-03

Pos logit contributions
ocol+0.129
 rib+0.112
 microsc+0.101
 former+0.101
 topp+0.100
Neg logit contributions
 one-0.039
 dad-0.038
 someone-0.037
 mom-0.036
Mom-0.035
Token
Token ID198
Feature activation-8.55e-03

Pos logit contributions
ocol+0.015
 rib+0.013
 microsc+0.013
 gymn+0.012
rou+0.012
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token
Token ID198
Feature activation-1.09e-02

Pos logit contributions
ocol+0.018
 rib+0.016
 microsc+0.015
 gymn+0.015
rou+0.015
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.004
 dad-0.004
Mom-0.004
TokenHer
Token ID9360
Feature activation-9.71e-03

Pos logit contributions
ocol+0.022
 rib+0.019
 microsc+0.018
 topp+0.017
rou+0.017
Neg logit contributions
Maybe-0.008
 someone-0.007
Mom-0.007
 one-0.007
 dad-0.007
Token mom
Token ID1995
Feature activation+4.89e-03

Pos logit contributions
ocol+0.025
 rib+0.022
 microsc+0.021
 gymn+0.021
rou+0.021
Neg logit contributions
 dad-0.001
 someone-0.001
Maybe-0.001
 one-0.001
 next-0.001
Token.
Token ID13
Feature activation+6.71e-03

Pos logit contributions
ocol+0.004
 rib+0.003
 microsc+0.003
 gymn+0.003
rou+0.003
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
 one-0.001
Mom-0.001
Token They
Token ID1119
Feature activation-1.72e-02

Pos logit contributions
Maybe+0.005
 someone+0.005
 one+0.005
 dad+0.005
 next+0.005
Neg logit contributions
ocol-0.012
 rib-0.010
 microsc-0.010
 gymn-0.009
pill-0.009
Token promised
Token ID8072
Feature activation-1.22e-02

Pos logit contributions
ocol+0.034
 rib+0.027
 microsc+0.026
 gymn+0.025
 topp+0.025
Neg logit contributions
Maybe-0.014
 one-0.014
 dad-0.014
 someone-0.013
 next-0.013
Token to
Token ID284
Feature activation-2.25e-02

Pos logit contributions
ocol+0.021
 rib+0.016
 microsc+0.015
 topp+0.015
rou+0.015
Neg logit contributions
 someone-0.013
 one-0.013
Maybe-0.013
 dad-0.013
 next-0.013
Token play
Token ID711
Feature activation-9.04e-03

Pos logit contributions
ocol+0.046
 rib+0.038
 microsc+0.037
pill+0.037
rou+0.035
Neg logit contributions
 one-0.016
 dad-0.015
Mom-0.015
 someone-0.015
 next-0.015
Token and
Token ID290
Feature activation-5.87e-02

Pos logit contributions
ocol+0.019
 rib+0.016
 microsc+0.015
rou+0.015
 gymn+0.015
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.005
 dad-0.005
 next-0.005
Token eat
Token ID4483
Feature activation-1.10e-02

Pos logit contributions
ocol+0.159
pill+0.127
rou+0.126
aman+0.124
ifying+0.123
Neg logit contributions
 they-0.033
 not-0.029
 one-0.028
 look-0.025
 mom-0.025
Token popcorn
Token ID38914
Feature activation-7.63e-04

Pos logit contributions
ocol+0.023
 rib+0.019
 microsc+0.019
rou+0.018
 gymn+0.018
Neg logit contributions
Maybe-0.007
 someone-0.007
 one-0.007
 dad-0.006
 next-0.006
Token together
Token ID1978
Feature activation-4.43e-03

Pos logit contributions
ocol+0.001
 rib+0.001
 microsc+0.001
 gymn+0.001
rou+0.001
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
 one-0.001
Mom-0.001
Token every
Token ID790
Feature activation+2.50e-02

Pos logit contributions
ocol+0.009
 rib+0.008
 microsc+0.007
 gymn+0.007
rou+0.007
Neg logit contributions
Maybe-0.003
 someone-0.003
 one-0.003
 dad-0.003
Mom-0.002
Token day
Token ID1110
Feature activation-3.72e-03

Pos logit contributions
Maybe+0.026
Nothing+0.024
 someone+0.023
What+0.023
Someone+0.023
Neg logit contributions
ocol-0.037
 topp-0.031
 gymn-0.031
 rib-0.031
pri-0.029
Token he
Token ID339
Feature activation+1.29e-02

Pos logit contributions
ocol+0.029
 rib+0.024
 microsc+0.023
 gymn+0.022
rou+0.022
Neg logit contributions
Maybe-0.013
 one-0.013
 someone-0.013
 dad-0.012
 something-0.012
Token just
Token ID655
Feature activation-9.08e-03

Pos logit contributions
Maybe+0.011
 someone+0.010
Mom+0.010
How+0.010
What+0.010
Neg logit contributions
ocol-0.023
 rib-0.019
 microsc-0.018
 gymn-0.018
rou-0.018
Token couldn
Token ID3521
Feature activation-2.05e-02

Pos logit contributions
ocol+0.017
 rib+0.014
 microsc+0.014
 gymn+0.013
rou+0.013
Neg logit contributions
Maybe-0.007
 someone-0.007
 one-0.007
 dad-0.007
 next-0.007
Tokenâ
Token ID22940
Feature activation-1.63e-02

Pos logit contributions
ocol+0.039
 rib+0.032
 microsc+0.031
rou+0.030
 topp+0.030
Neg logit contributions
 someone-0.017
 one-0.017
Maybe-0.016
 next-0.016
 dad-0.015
Token
Token ID26391
Feature activation-2.55e-03

Pos logit contributions
ocol+0.022
 rib+0.018
 microsc+0.017
rou+0.015
 topp+0.015
Neg logit contributions
Maybe-0.021
 one-0.021
 someone-0.021
 next-0.021
Mom-0.020
Token
Token ID8151
Feature activation-5.86e-02

Pos logit contributions
ocol+0.005
 rib+0.004
 microsc+0.004
 gymn+0.004
rou+0.004
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
 next-0.002
Tokent
Token ID83
Feature activation+1.78e-02

Pos logit contributions
ocol+0.118
 rib+0.098
 microsc+0.095
 topp+0.090
 gymn+0.087
Neg logit contributions
 one-0.047
Maybe-0.046
 someone-0.045
 they-0.044
 next-0.044
Token reach
Token ID3151
Feature activation+1.54e-02

Pos logit contributions
Maybe+0.015
Mom+0.014
How+0.014
 dad+0.014
What+0.014
Neg logit contributions
ocol-0.032
 rib-0.026
 gymn-0.025
hy-0.024
 microsc-0.024
Token it
Token ID340
Feature activation+1.92e-02

Pos logit contributions
Maybe+0.012
Mom+0.011
How+0.011
 dad+0.011
 someone+0.010
Neg logit contributions
ocol-0.029
 rib-0.025
 microsc-0.023
 gymn-0.023
rou-0.022
Token.
Token ID13
Feature activation-7.44e-03

Pos logit contributions
Maybe+0.016
Mom+0.015
 dad+0.015
How+0.014
What+0.014
Neg logit contributions
ocol-0.034
 rib-0.029
 gymn-0.027
 microsc-0.027
rou-0.026
Token Then
Token ID3244
Feature activation+4.33e-03

Pos logit contributions
ocol+0.014
 rib+0.012
 microsc+0.011
 gymn+0.011
rou+0.011
Neg logit contributions
Maybe-0.006
 someone-0.006
 one-0.006
 dad-0.006
 next-0.006
Token
Token ID129
Feature activation-2.27e-02

Pos logit contributions
Maybe+0.023
 one+0.023
 someone+0.023
What+0.022
Mom+0.022
Neg logit contributions
ocol-0.033
 rib-0.027
opot-0.027
 microsc-0.027
oc-0.027
Token
Token ID241
Feature activation-1.87e-02

Pos logit contributions
ocol+0.035
 rib+0.029
 microsc+0.027
 topp+0.027
ifying+0.026
Neg logit contributions
 one-0.025
 someone-0.025
 next-0.025
 dad-0.024
Maybe-0.024
TokenDon
Token ID3987
Feature activation-2.79e-02

Pos logit contributions
ocol+0.037
 rib+0.031
 microsc+0.030
 gymn+0.029
 topp+0.028
Neg logit contributions
Maybe-0.015
 someone-0.014
 one-0.013
 dad-0.013
Mom-0.013
Tokenâ
Token ID22940
Feature activation-8.52e-03

Pos logit contributions
ocol+0.060
 rib+0.050
 microsc+0.050
 topp+0.047
rou+0.047
Neg logit contributions
 next-0.020
 someone-0.019
Maybe-0.019
 one-0.018
Mom-0.018
Token
Token ID26391
Feature activation+8.32e-03

Pos logit contributions
ocol+0.012
 rib+0.010
 microsc+0.009
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.011
 someone-0.011
 one-0.010
 next-0.010
 dad-0.010
Token
Token ID8151
Feature activation-5.85e-02

Pos logit contributions
Maybe+0.007
 one+0.007
 someone+0.007
Mom+0.007
 next+0.007
Neg logit contributions
ocol-0.014
 rib-0.012
 microsc-0.011
 gymn-0.011
rou-0.011
Tokent
Token ID83
Feature activation+1.29e-02

Pos logit contributions
ocol+0.121
 rib+0.108
 former+0.102
 microsc+0.097
 gymn+0.094
Neg logit contributions
 someone-0.041
 one-0.040
Mom-0.040
 they-0.040
 there-0.039
Token worry
Token ID5490
Feature activation-1.55e-02

Pos logit contributions
Maybe+0.012
How+0.012
Mom+0.011
What+0.011
 dad+0.011
Neg logit contributions
ocol-0.022
 rib-0.017
 microsc-0.017
 gymn-0.017
hy-0.016
Token,
Token ID11
Feature activation-1.23e-02

Pos logit contributions
ocol+0.032
 rib+0.028
 microsc+0.026
rou+0.026
 gymn+0.026
Neg logit contributions
 someone-0.010
 one-0.009
 dad-0.009
Maybe-0.009
 next-0.009
Token I
Token ID314
Feature activation-1.64e-02

Pos logit contributions
ocol+0.026
 rib+0.022
 microsc+0.021
 gymn+0.021
rou+0.021
Neg logit contributions
 someone-0.007
Maybe-0.007
 one-0.007
 dad-0.007
 next-0.007
Tokenâ
Token ID22940
Feature activation-3.35e-03

Pos logit contributions
ocol+0.029
 rib+0.023
 microsc+0.022
 topp+0.021
rou+0.021
Neg logit contributions
Maybe-0.016
 someone-0.016
 one-0.016
 dad-0.015
 next-0.015
Tokenâ
Token ID22940
Feature activation-5.32e-03

Pos logit contributions
ocol+0.022
 rib+0.018
 microsc+0.018
rou+0.017
 gymn+0.017
Neg logit contributions
Maybe-0.013
 someone-0.013
 one-0.012
 dad-0.012
Mom-0.012
Token
Token ID26391
Feature activation+6.57e-03

Pos logit contributions
ocol+0.007
 rib+0.005
 microsc+0.005
 gymn+0.005
rou+0.005
Neg logit contributions
Maybe-0.007
 someone-0.007
 one-0.007
 dad-0.007
 next-0.007
Token Sam
Token ID3409
Feature activation-6.93e-03

Pos logit contributions
Maybe+0.006
 someone+0.006
 one+0.006
 next+0.006
 dad+0.006
Neg logit contributions
ocol-0.011
 rib-0.009
 microsc-0.009
rou-0.009
 gymn-0.009
Tokenâ
Token ID22940
Feature activation-1.22e-02

Pos logit contributions
ocol+0.013
 rib+0.011
 microsc+0.010
 gymn+0.010
rou+0.010
Neg logit contributions
Maybe-0.006
 someone-0.006
 one-0.005
 dad-0.005
 next-0.005
Token
Token ID26391
Feature activation+5.37e-03

Pos logit contributions
ocol+0.017
 rib+0.013
 microsc+0.012
 gymn+0.011
rou+0.011
Neg logit contributions
Maybe-0.016
 someone-0.016
 one-0.016
 next-0.016
Mom-0.015
Token
Token ID8151
Feature activation-5.84e-02

Pos logit contributions
Maybe+0.005
 someone+0.005
 one+0.005
 next+0.005
 dad+0.005
Neg logit contributions
ocol-0.009
 rib-0.007
 microsc-0.007
rou-0.007
 gymn-0.007
Tokens
Token ID82
Feature activation+2.88e-03

Pos logit contributions
ocol+0.115
 rib+0.096
 microsc+0.090
 former+0.087
 gymn+0.084
Neg logit contributions
Maybe-0.049
 one-0.048
 someone-0.046
 everything-0.046
 they-0.045
Token eyes
Token ID2951
Feature activation-3.78e-03

Pos logit contributions
Maybe+0.003
 someone+0.002
 one+0.002
Mom+0.002
How+0.002
Neg logit contributions
ocol-0.005
 rib-0.004
 microsc-0.004
rou-0.004
 gymn-0.004
Token lit
Token ID6578
Feature activation-2.57e-02

Pos logit contributions
ocol+0.006
 rib+0.005
 microsc+0.005
 gymn+0.005
rou+0.005
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.003
 dad-0.003
Mom-0.003
Token up
Token ID510
Feature activation-1.83e-02

Pos logit contributions
ocol+0.058
 rib+0.048
 microsc+0.047
pill+0.046
opot+0.046
Neg logit contributions
 one-0.014
 someone-0.013
 next-0.012
 something-0.012
 everything-0.012
Token and
Token ID290
Feature activation+1.81e-03

Pos logit contributions
ocol+0.038
 rib+0.032
 microsc+0.031
 gymn+0.030
rou+0.030
Neg logit contributions
 someone-0.012
 one-0.011
Maybe-0.011
 next-0.011
 dad-0.011
Token balloon
Token ID21190
Feature activation+2.32e-02

Pos logit contributions
Maybe+0.018
 someone+0.016
Mom+0.016
How+0.016
 dad+0.016
Neg logit contributions
ocol-0.029
 rib-0.026
 topp-0.023
 gymn-0.023
ifying-0.022
Token!"
Token ID2474
Feature activation+1.01e-02

Pos logit contributions
Mom+0.028
Maybe+0.028
 dad+0.027
What+0.027
 someone+0.027
Neg logit contributions
ocol-0.031
 rib-0.027
 topp-0.024
 gymn-0.024
rou-0.023
Token Tom
Token ID4186
Feature activation-4.88e-03

Pos logit contributions
Maybe+0.009
How+0.008
 next+0.008
Mom+0.008
 somewhere+0.008
Neg logit contributions
ocol-0.017
 rib-0.015
 microsc-0.014
 gymn-0.013
 topp-0.013
Token said
Token ID531
Feature activation-7.43e-03

Pos logit contributions
ocol+0.009
 rib+0.007
 microsc+0.007
 gymn+0.007
rou+0.007
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token.
Token ID13
Feature activation+2.45e-02

Pos logit contributions
ocol+0.016
 rib+0.013
 microsc+0.013
 gymn+0.013
rou+0.012
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token "
Token ID366
Feature activation+3.03e-02

Pos logit contributions
 forth+0.031
ths+0.030
eks+0.029
 somewhere+0.028
 end+0.028
Neg logit contributions
ocol-0.027
 rib-0.024
 gymn-0.022
hy-0.022
 topp-0.021
TokenI
Token ID40
Feature activation-6.66e-03

Pos logit contributions
 dad+0.026
 someone+0.024
 mom+0.023
 everything+0.023
 what+0.022
Neg logit contributions
ocol-0.047
 rib-0.046
pri-0.043
ifying-0.040
 gymn-0.040
Token want
Token ID765
Feature activation+1.22e-02

Pos logit contributions
ocol+0.012
 rib+0.010
 microsc+0.010
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.006
 someone-0.006
 one-0.006
 dad-0.005
 next-0.005
Token to
Token ID284
Feature activation+1.52e-03

Pos logit contributions
Maybe+0.014
Mom+0.013
How+0.013
What+0.012
Nothing+0.012
Neg logit contributions
ocol-0.018
 rib-0.016
 topp-0.014
 microsc-0.014
 gymn-0.013
Token touch
Token ID3638
Feature activation+5.20e-03

Pos logit contributions
Maybe+0.001
 someone+0.001
 next+0.001
How+0.001
 dad+0.001
Neg logit contributions
ocol-0.003
 rib-0.002
 microsc-0.002
 gymn-0.002
 topp-0.002
Token it
Token ID340
Feature activation+1.47e-02

Pos logit contributions
Maybe+0.003
 dad+0.003
 someone+0.003
Mom+0.002
How+0.002
Neg logit contributions
ocol-0.011
 rib-0.010
 microsc-0.009
 gymn-0.009
 topp-0.009
Token pond
Token ID16723
Feature activation+5.78e-03

Pos logit contributions
Maybe+0.005
 someone+0.005
How+0.004
Mom+0.004
 dad+0.004
Neg logit contributions
ocol-0.010
 rib-0.009
 gymn-0.008
 topp-0.008
 microsc-0.008
Token would
Token ID561
Feature activation-6.13e-03

Pos logit contributions
Maybe+0.005
 someone+0.005
Mom+0.005
 dad+0.005
 one+0.005
Neg logit contributions
ocol-0.010
 rib-0.008
 microsc-0.008
 gymn-0.008
rou-0.008
Token always
Token ID1464
Feature activation+8.99e-03

Pos logit contributions
ocol+0.012
 rib+0.010
 microsc+0.010
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token offer
Token ID2897
Feature activation-3.86e-03

Pos logit contributions
Maybe+0.008
Mom+0.008
 dad+0.007
 someone+0.007
How+0.007
Neg logit contributions
ocol-0.016
 rib-0.013
 gymn-0.013
 microsc-0.012
rou-0.012
Token her
Token ID607
Feature activation+3.21e-02

Pos logit contributions
ocol+0.007
 rib+0.006
 microsc+0.006
 gymn+0.006
rou+0.005
Neg logit contributions
Maybe-0.003
 someone-0.003
 one-0.003
 dad-0.003
Mom-0.003
Token a
Token ID257
Feature activation+3.32e-02

Pos logit contributions
Mom+0.028
Maybe+0.025
 Blood+0.025
Dad+0.025
What+0.024
Neg logit contributions
ocol-0.050
 rib-0.050
 microsc-0.045
 gymn-0.045
rou-0.044
Token fun
Token ID1257
Feature activation+2.50e-02

Pos logit contributions
Mom+0.038
Maybe+0.036
What+0.036
How+0.035
Parent+0.035
Neg logit contributions
ocol-0.040
 rib-0.039
 gymn-0.038
rou-0.037
 microsc-0.035
Token surprise
Token ID5975
Feature activation-6.94e-04

Pos logit contributions
Mom+0.025
Maybe+0.024
How+0.023
What+0.023
Someone+0.023
Neg logit contributions
ocol-0.037
 rib-0.034
 gymn-0.034
 gol-0.032
 topp-0.032
Token.
Token ID13
Feature activation+1.36e-03

Pos logit contributions
ocol+0.001
 rib+0.001
 microsc+0.001
 gymn+0.001
rou+0.001
Neg logit contributions
Maybe-0.001
 someone-0.001
 one-0.001
 dad-0.001
Mom-0.001
Token<|endoftext|>
Token ID50256
Feature activation-7.01e-03

Pos logit contributions
Maybe+0.001
 someone+0.001
 dad+0.001
 one+0.001
 next+0.001
Neg logit contributions
ocol-0.003
 rib-0.002
 microsc-0.002
 gymn-0.002
rou-0.002
TokenOnce
Token ID7454
Feature activation-2.44e-02

Pos logit contributions
ocol+0.014
 rib+0.012
 microsc+0.012
 gymn+0.011
rou+0.011
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token.
Token ID13
Feature activation-5.98e-03

Pos logit contributions
ocol+0.017
 rib+0.014
 microsc+0.014
 gymn+0.013
rou+0.013
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token Lily
Token ID20037
Feature activation+1.98e-02

Pos logit contributions
ocol+0.012
 rib+0.010
 microsc+0.010
 gymn+0.010
rou+0.010
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.003
 next-0.003
Token wanted
Token ID2227
Feature activation-7.11e-05

Pos logit contributions
 somewhere+0.018
 someone+0.018
How+0.018
What+0.018
Maybe+0.017
Neg logit contributions
ocol-0.027
 rib-0.026
rou-0.025
 microsc-0.024
 gymn-0.024
Token to
Token ID284
Feature activation-5.81e-03

Pos logit contributions
ocol+0.000
 rib+0.000
 microsc+0.000
 gymn+0.000
 topp+0.000
Neg logit contributions
Maybe-0.000
Mom-0.000
 someone-0.000
How-0.000
 one-0.000
Token help
Token ID1037
Feature activation-2.42e-03

Pos logit contributions
ocol+0.011
 rib+0.009
 microsc+0.009
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token Max
Token ID5436
Feature activation+3.47e-02

Pos logit contributions
ocol+0.004
 rib+0.004
 microsc+0.003
 gymn+0.003
rou+0.003
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
Mom-0.002
Token feel
Token ID1254
Feature activation-2.63e-03

Pos logit contributions
Mom+0.044
Dad+0.041
Maybe+0.040
How+0.039
Someone+0.038
Neg logit contributions
ocol-0.043
 gymn-0.038
 microsc-0.038
 rib-0.037
 such-0.035
Token better
Token ID1365
Feature activation-1.51e-02

Pos logit contributions
ocol+0.005
 rib+0.005
 microsc+0.004
 gymn+0.004
rou+0.004
Neg logit contributions
Maybe-0.002
 someone-0.001
 dad-0.001
 one-0.001
Mom-0.001
Token.
Token ID13
Feature activation+2.50e-03

Pos logit contributions
ocol+0.034
 rib+0.028
 microsc+0.027
 topp+0.026
rou+0.026
Neg logit contributions
 one-0.008
 someone-0.008
 next-0.008
Maybe-0.008
 dad-0.008
Token
Token ID198
Feature activation-4.85e-03

Pos logit contributions
Maybe+0.002
 someone+0.002
 one+0.002
 dad+0.002
 next+0.002
Neg logit contributions
ocol-0.004
 rib-0.004
 microsc-0.004
 gymn-0.004
rou-0.004
Token
Token ID198
Feature activation-1.07e-02

Pos logit contributions
ocol+0.010
 rib+0.009
 microsc+0.008
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.003
 someone-0.003
 one-0.003
 dad-0.003
 next-0.003
Token goes
Token ID2925
Feature activation+2.14e-03

Pos logit contributions
Maybe+0.011
Mom+0.010
 someone+0.010
 dad+0.010
What+0.010
Neg logit contributions
ocol-0.023
 rib-0.020
 microsc-0.018
 topp-0.018
rou-0.018
Token straight
Token ID3892
Feature activation-8.91e-03

Pos logit contributions
Maybe+0.002
 someone+0.002
 dad+0.002
Mom+0.002
 one+0.002
Neg logit contributions
ocol-0.004
 rib-0.003
 microsc-0.003
 gymn-0.003
rou-0.003
Token.
Token ID13
Feature activation+2.28e-03

Pos logit contributions
ocol+0.016
 rib+0.013
 microsc+0.013
 gymn+0.013
rou+0.012
Neg logit contributions
Maybe-0.007
 someone-0.007
 one-0.007
 dad-0.007
 next-0.007
Token It
Token ID632
Feature activation-5.93e-03

Pos logit contributions
Maybe+0.002
 someone+0.002
 one+0.002
 dad+0.002
 next+0.002
Neg logit contributions
ocol-0.004
 rib-0.003
 microsc-0.003
 gymn-0.003
rou-0.003
Token does
Token ID857
Feature activation+1.56e-02

Pos logit contributions
ocol+0.011
 rib+0.009
 microsc+0.009
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.005
 dad-0.004
 next-0.004
Token not
Token ID407
Feature activation+2.80e-02

Pos logit contributions
Maybe+0.007
Mom+0.007
 dad+0.006
How+0.006
 someone+0.006
Neg logit contributions
ocol-0.034
 rib-0.030
 gymn-0.028
 microsc-0.028
rou-0.028
Token bump
Token ID13852
Feature activation-4.19e-03

Pos logit contributions
Maybe+0.029
How+0.029
Mom+0.029
What+0.027
Where+0.026
Neg logit contributions
ocol-0.044
 rib-0.038
 gymn-0.034
hy-0.034
 gol-0.033
Token or
Token ID393
Feature activation+1.78e-02

Pos logit contributions
ocol+0.008
 rib+0.007
 microsc+0.007
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.003
 someone-0.003
 one-0.003
 dad-0.003
Mom-0.003
Token crash
Token ID7014
Feature activation+2.33e-03

Pos logit contributions
Maybe+0.016
Mom+0.015
How+0.014
 someone+0.013
What+0.013
Neg logit contributions
ocol-0.032
 rib-0.028
 gymn-0.025
 microsc-0.025
 topp-0.025
Token.
Token ID13
Feature activation+6.90e-04

Pos logit contributions
Maybe+0.002
 someone+0.002
 dad+0.002
 one+0.002
Mom+0.002
Neg logit contributions
ocol-0.004
 rib-0.004
 microsc-0.004
 gymn-0.004
rou-0.004
Token It
Token ID632
Feature activation-2.25e-02

Pos logit contributions
Maybe+0.001
 someone+0.001
 one+0.001
 dad+0.001
 next+0.001
Neg logit contributions
ocol-0.001
 rib-0.001
 microsc-0.001
 gymn-0.001
rou-0.001
Token get
Token ID651
Feature activation+2.71e-02

Pos logit contributions
Maybe+0.019
How+0.018
 someone+0.018
What+0.017
Someone+0.017
Neg logit contributions
ocol-0.030
 rib-0.026
 gymn-0.024
ifying-0.023
 microsc-0.023
Token it
Token ID340
Feature activation+2.70e-02

Pos logit contributions
Maybe+0.024
How+0.020
Mom+0.020
Days+0.020
cil+0.019
Neg logit contributions
ocol-0.052
 rib-0.045
 microsc-0.039
 gymn-0.037
 topp-0.036
Token if
Token ID611
Feature activation+4.11e-03

Pos logit contributions
Maybe+0.026
How+0.025
What+0.025
Mom+0.025
Nothing+0.025
Neg logit contributions
ocol-0.041
 rib-0.038
 gymn-0.033
 microsc-0.033
rou-0.032
Token she
Token ID673
Feature activation+1.02e-02

Pos logit contributions
Maybe+0.002
Mom+0.002
 someone+0.002
 dad+0.002
 one+0.002
Neg logit contributions
ocol-0.009
 rib-0.008
 microsc-0.007
 gymn-0.007
 topp-0.007
Token wanted
Token ID2227
Feature activation+2.33e-02

Pos logit contributions
 someone+0.009
Maybe+0.009
Mom+0.008
 one+0.008
What+0.008
Neg logit contributions
ocol-0.018
 rib-0.015
 gymn-0.014
 microsc-0.014
 topp-0.014
Token it
Token ID340
Feature activation+3.14e-02

Pos logit contributions
Maybe+0.025
How+0.023
Mom+0.023
Nothing+0.022
What+0.021
Neg logit contributions
ocol-0.038
 rib-0.034
 microsc-0.030
 gymn-0.030
 topp-0.027
Token.
Token ID13
Feature activation-3.91e-03

Pos logit contributions
Maybe+0.033
Mom+0.032
Parent+0.031
What+0.031
Someone+0.031
Neg logit contributions
ocol-0.043
 rib-0.041
 gymn-0.038
 microsc-0.035
hy-0.034
Token Molly
Token ID30236
Feature activation-7.66e-03

Pos logit contributions
ocol+0.007
 rib+0.006
 microsc+0.006
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.003
 someone-0.003
 one-0.003
 dad-0.003
 next-0.003
Token was
Token ID373
Feature activation+7.88e-03

Pos logit contributions
ocol+0.015
 rib+0.012
 microsc+0.012
 gymn+0.011
rou+0.011
Neg logit contributions
Maybe-0.006
 someone-0.006
 one-0.006
 dad-0.006
 next-0.006
Token so
Token ID523
Feature activation-2.97e-03

Pos logit contributions
Maybe+0.006
Mom+0.006
 someone+0.006
How+0.005
 dad+0.005
Neg logit contributions
ocol-0.014
 rib-0.012
 gymn-0.012
rou-0.012
 microsc-0.011
Token excited
Token ID6568
Feature activation+9.43e-03

Pos logit contributions
ocol+0.006
 rib+0.005
 microsc+0.005
rou+0.005
 gymn+0.005
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
 next-0.002
Token he
Token ID339
Feature activation-1.53e-03

Pos logit contributions
ocol+0.025
 rib+0.021
 microsc+0.020
rou+0.019
 topp+0.019
Neg logit contributions
 someone-0.008
 one-0.007
Maybe-0.007
 dad-0.007
 next-0.007
Token had
Token ID550
Feature activation-2.48e-02

Pos logit contributions
ocol+0.003
 rib+0.002
 microsc+0.002
rou+0.002
 gymn+0.002
Neg logit contributions
Maybe-0.001
 someone-0.001
 one-0.001
 dad-0.001
Mom-0.001
Token a
Token ID257
Feature activation+1.38e-02

Pos logit contributions
ocol+0.052
 rib+0.039
oc+0.038
ifying+0.037
opot+0.037
Neg logit contributions
 one-0.023
 someone-0.023
 dad-0.022
 something-0.022
 everything-0.020
Token surprise
Token ID5975
Feature activation-1.91e-02

Pos logit contributions
Maybe+0.013
Mom+0.013
 someone+0.013
How+0.013
 next+0.013
Neg logit contributions
ocol-0.021
 rib-0.019
 gymn-0.018
 microsc-0.017
 gol-0.017
Token for
Token ID329
Feature activation+9.58e-03

Pos logit contributions
ocol+0.035
 rib+0.028
 microsc+0.027
pill+0.027
rou+0.026
Neg logit contributions
 one-0.017
 someone-0.017
 dad-0.017
 next-0.017
Maybe-0.016
Token her
Token ID607
Feature activation-1.19e-03

Pos logit contributions
Maybe+0.010
Mom+0.009
How+0.009
Nothing+0.008
What+0.008
Neg logit contributions
ocol-0.016
 rib-0.014
 microsc-0.013
 gymn-0.012
 topp-0.012
Token.
Token ID13
Feature activation-1.32e-02

Pos logit contributions
ocol+0.002
 rib+0.002
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.001
 someone-0.001
Mom-0.001
 one-0.001
 dad-0.001
Token He
Token ID679
Feature activation+4.63e-03

Pos logit contributions
ocol+0.027
 rib+0.022
 microsc+0.021
 gymn+0.020
rou+0.020
Neg logit contributions
 someone-0.010
Maybe-0.010
 dad-0.009
 one-0.009
Mom-0.009
Token gave
Token ID2921
Feature activation+5.91e-03

Pos logit contributions
 someone+0.004
Maybe+0.004
 one+0.004
How+0.004
What+0.004
Neg logit contributions
ocol-0.008
 rib-0.007
 microsc-0.007
 gymn-0.006
rou-0.006
Token her
Token ID607
Feature activation+1.16e-02

Pos logit contributions
Maybe+0.005
How+0.005
Mom+0.005
Nothing+0.004
 someone+0.004
Neg logit contributions
ocol-0.011
 rib-0.009
 microsc-0.008
 gymn-0.008
 topp-0.008
Token a
Token ID257
Feature activation-1.01e-02

Pos logit contributions
Maybe+0.007
Mom+0.006
How+0.006
Nothing+0.006
 someone+0.006
Neg logit contributions
ocol-0.023
 rib-0.021
 microsc-0.020
 topp-0.019
rou-0.019
Token if
Token ID611
Feature activation+1.23e-02

Pos logit contributions
ocol+0.031
 rib+0.026
 microsc+0.025
rou+0.024
 gymn+0.024
Neg logit contributions
 someone-0.008
 one-0.008
Maybe-0.008
 next-0.008
 dad-0.008
Token you
Token ID345
Feature activation+1.64e-04

Pos logit contributions
Maybe+0.011
Mom+0.010
How+0.010
 dad+0.009
What+0.009
Neg logit contributions
ocol-0.022
 rib-0.018
 microsc-0.017
 gymn-0.017
 topp-0.016
Token need
Token ID761
Feature activation+2.59e-02

Pos logit contributions
Maybe+0.000
 someone+0.000
 one+0.000
 dad+0.000
Mom+0.000
Neg logit contributions
ocol-0.000
 rib-0.000
 microsc-0.000
 gymn-0.000
rou-0.000
Token extra
Token ID3131
Feature activation+2.20e-02

Pos logit contributions
Maybe+0.029
How+0.026
Mom+0.025
What+0.024
Nothing+0.024
Neg logit contributions
ocol-0.041
 rib-0.035
 gymn-0.031
 microsc-0.031
 topp-0.031
Token help
Token ID1037
Feature activation+1.60e-02

Pos logit contributions
Maybe+0.022
Mom+0.021
How+0.020
What+0.020
Parent+0.019
Neg logit contributions
ocol-0.038
 rib-0.032
 topp-0.029
 microsc-0.029
 gymn-0.028
Token."
Token ID526
Feature activation+9.63e-03

Pos logit contributions
Maybe+0.015
Mom+0.013
How+0.013
Nothing+0.013
What+0.012
Neg logit contributions
ocol-0.028
 rib-0.023
 gymn-0.022
 microsc-0.021
 topp-0.021
Token
Token ID220
Feature activation-1.84e-03

Pos logit contributions
Maybe+0.008
 dad+0.008
 someone+0.008
Mom+0.008
 next+0.008
Neg logit contributions
ocol-0.016
 rib-0.014
 microsc-0.013
pill-0.013
 gymn-0.013
Token
Token ID198
Feature activation+3.38e-03

Pos logit contributions
ocol+0.004
 rib+0.003
 microsc+0.003
 gymn+0.003
rou+0.003
Neg logit contributions
Maybe-0.001
 someone-0.001
 one-0.001
 dad-0.001
 next-0.001
Token
Token ID198
Feature activation+4.64e-04

Pos logit contributions
Maybe+0.002
 someone+0.002
 dad+0.002
 one+0.002
 next+0.002
Neg logit contributions
ocol-0.006
 rib-0.006
 microsc-0.005
 gymn-0.005
rou-0.005
TokenAnd
Token ID1870
Feature activation-4.70e-03

Pos logit contributions
 dad+0.001
 someone+0.001
 one+0.001
Maybe+0.001
 next+0.001
Neg logit contributions
ocol-0.001
 rib-0.001
 microsc-0.001
 gymn-0.000
rou-0.000
Token that
Token ID326
Feature activation+1.07e-02

Pos logit contributions
ocol+0.010
 rib+0.008
 microsc+0.008
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.003
 someone-0.003
 one-0.003
 dad-0.003
Mom-0.003
Token was
Token ID373
Feature activation-9.34e-03

Pos logit contributions
ocol+0.026
 rib+0.021
 microsc+0.020
rou+0.020
 gymn+0.020
Neg logit contributions
Maybe-0.010
 someone-0.010
 one-0.009
 dad-0.009
 next-0.009
Token a
Token ID257
Feature activation+1.19e-02

Pos logit contributions
ocol+0.023
 rib+0.020
 microsc+0.019
 gymn+0.019
rou+0.019
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
 next-0.002
Token girl
Token ID2576
Feature activation+5.88e-03

Pos logit contributions
Maybe+0.011
 someone+0.011
 one+0.010
 somewhere+0.010
How+0.010
Neg logit contributions
ocol-0.020
 rib-0.017
 gymn-0.016
 microsc-0.016
rou-0.015
Token called
Token ID1444
Feature activation-7.82e-03

Pos logit contributions
Maybe+0.006
 someone+0.005
How+0.005
 dad+0.005
What+0.005
Neg logit contributions
ocol-0.010
 rib-0.008
 microsc-0.008
rou-0.008
 gymn-0.008
Token Lucy
Token ID22162
Feature activation-6.80e-03

Pos logit contributions
ocol+0.015
 rib+0.013
 microsc+0.012
 gymn+0.012
rou+0.012
Neg logit contributions
Maybe-0.006
 someone-0.006
 one-0.006
 dad-0.005
 next-0.005
Token.
Token ID13
Feature activation-9.17e-04

Pos logit contributions
ocol+0.014
 rib+0.012
 microsc+0.012
 gymn+0.012
rou+0.011
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.003
Token She
Token ID1375
Feature activation+2.09e-03

Pos logit contributions
ocol+0.002
 rib+0.001
 microsc+0.001
 gymn+0.001
rou+0.001
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
 one-0.001
 next-0.001
Token loved
Token ID6151
Feature activation-1.78e-02

Pos logit contributions
Maybe+0.002
 someone+0.002
 one+0.002
 dad+0.002
How+0.002
Neg logit contributions
ocol-0.003
 rib-0.003
 microsc-0.003
 gymn-0.003
rou-0.003
Token to
Token ID284
Feature activation+1.48e-02

Pos logit contributions
ocol+0.030
 rib+0.024
rou+0.023
 microsc+0.023
ifying+0.022
Neg logit contributions
 dad-0.018
 one-0.018
 someone-0.018
 next-0.017
 everything-0.017
Token pick
Token ID2298
Feature activation-4.36e-03

Pos logit contributions
Maybe+0.016
How+0.015
 someone+0.014
Mom+0.014
Nothing+0.014
Neg logit contributions
ocol-0.024
 rib-0.021
 gymn-0.019
 microsc-0.019
 topp-0.018
Token things
Token ID1243
Feature activation+2.07e-02

Pos logit contributions
ocol+0.009
 rib+0.008
 microsc+0.007
 gymn+0.007
rou+0.007
Neg logit contributions
Maybe-0.003
 someone-0.003
 one-0.003
 dad-0.003
 next-0.002
Token wanted
Token ID2227
Feature activation+1.39e-02

Pos logit contributions
Maybe+0.006
 next+0.006
Mom+0.006
 someone+0.006
How+0.006
Neg logit contributions
ocol-0.014
 rib-0.013
 microsc-0.012
 gymn-0.012
 topp-0.011
Token it
Token ID340
Feature activation+2.77e-02

Pos logit contributions
Maybe+0.018
How+0.017
Mom+0.017
What+0.017
 next+0.017
Neg logit contributions
ocol-0.017
 rib-0.015
 microsc-0.013
 gymn-0.013
 topp-0.013
Token very
Token ID845
Feature activation+2.15e-02

Pos logit contributions
Someone+0.032
Maybe+0.031
Mom+0.031
Dad+0.030
Parent+0.030
Neg logit contributions
 rib-0.033
 gymn-0.032
ocol-0.032
 microsc-0.029
 such-0.028
Token badly
Token ID11234
Feature activation+1.18e-02

Pos logit contributions
Maybe+0.016
What+0.016
How+0.016
Where+0.015
Mom+0.015
Neg logit contributions
ocol-0.037
rou-0.032
 rib-0.032
 microsc-0.032
 gymn-0.031
Token.
Token ID13
Feature activation+1.28e-03

Pos logit contributions
Mom+0.010
Maybe+0.010
What+0.009
How+0.009
 someone+0.009
Neg logit contributions
ocol-0.019
 rib-0.017
 gymn-0.016
rou-0.016
 microsc-0.016
Token Mom
Token ID11254
Feature activation+1.08e-02

Pos logit contributions
Maybe+0.001
 someone+0.001
 one+0.001
 next+0.001
 dad+0.001
Neg logit contributions
ocol-0.002
 rib-0.002
 microsc-0.002
 gymn-0.002
 topp-0.002
Token said
Token ID531
Feature activation-2.52e-02

Pos logit contributions
 someone+0.011
Maybe+0.011
 somewhere+0.011
How+0.010
 something+0.010
Neg logit contributions
ocol-0.016
 rib-0.014
rou-0.013
 topp-0.013
 microsc-0.013
Token they
Token ID484
Feature activation+2.33e-03

Pos logit contributions
ocol+0.059
 former+0.051
rou+0.049
 microsc+0.049
 rib+0.049
Neg logit contributions
 someone-0.013
 one-0.012
 dad-0.012
 they-0.012
 something-0.011
Token could
Token ID714
Feature activation+1.46e-02

Pos logit contributions
 someone+0.003
Maybe+0.003
 one+0.003
How+0.003
 next+0.003
Neg logit contributions
ocol-0.003
 rib-0.003
 microsc-0.003
 gymn-0.002
rou-0.002
Token buy
Token ID2822
Feature activation+1.49e-02

Pos logit contributions
Maybe+0.016
How+0.015
 someone+0.015
What+0.015
Someone+0.015
Neg logit contributions
ocol-0.021
 rib-0.017
 gymn-0.016
hy-0.016
 topp-0.016
Token it
Token ID340
Feature activation+2.49e-02

Pos logit contributions
Maybe+0.009
Mom+0.008
How+0.008
 next+0.007
Nothing+0.007
Neg logit contributions
ocol-0.031
 rib-0.027
 topp-0.025
 microsc-0.024
 gymn-0.024
Token visit
Token ID3187
Feature activation+1.52e-03

Pos logit contributions
ocol+0.034
 rib+0.028
 microsc+0.027
rou+0.027
 topp+0.026
Neg logit contributions
 someone-0.010
 one-0.010
Maybe-0.010
 next-0.009
 dad-0.009
Token the
Token ID262
Feature activation-1.30e-03

Pos logit contributions
Maybe+0.001
Mom+0.001
 someone+0.001
 dad+0.001
 one+0.001
Neg logit contributions
ocol-0.003
 rib-0.003
 microsc-0.002
 gymn-0.002
rou-0.002
Token sm
Token ID895
Feature activation+5.08e-03

Pos logit contributions
ocol+0.002
 rib+0.002
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.001
 someone-0.001
 one-0.001
 dad-0.001
Mom-0.001
Tokenelly
Token ID6148
Feature activation+1.08e-02

Pos logit contributions
Maybe+0.008
 someone+0.008
 dad+0.008
 one+0.008
Mom+0.007
Neg logit contributions
ocol-0.006
 rib-0.004
 microsc-0.004
 gol-0.004
rou-0.004
Token nation
Token ID3277
Feature activation+6.22e-03

Pos logit contributions
Maybe+0.011
Mom+0.010
How+0.010
What+0.010
 someone+0.010
Neg logit contributions
ocol-0.018
 rib-0.015
 microsc-0.014
 gymn-0.014
 gol-0.013
Token,
Token ID11
Feature activation+2.99e-03

Pos logit contributions
Maybe+0.005
 someone+0.004
Mom+0.004
 dad+0.004
 one+0.004
Neg logit contributions
ocol-0.012
 rib-0.010
 microsc-0.009
 gymn-0.009
rou-0.009
Token but
Token ID475
Feature activation-2.26e-02

Pos logit contributions
Maybe+0.002
 someone+0.002
Mom+0.002
 dad+0.002
 one+0.002
Neg logit contributions
ocol-0.006
 rib-0.005
 microsc-0.005
 gymn-0.005
rou-0.005
Token one
Token ID530
Feature activation+5.15e-03

Pos logit contributions
ocol+0.057
 rib+0.048
pill+0.047
 microsc+0.047
rou+0.047
Neg logit contributions
 one-0.007
 someone-0.007
 dad-0.006
 they-0.006
 next-0.006
Token day
Token ID1110
Feature activation+5.49e-03

Pos logit contributions
Maybe+0.004
Mom+0.003
How+0.003
 someone+0.003
What+0.003
Neg logit contributions
ocol-0.010
 rib-0.008
 microsc-0.008
 gymn-0.008
rou-0.008
Token a
Token ID257
Feature activation+1.12e-02

Pos logit contributions
Maybe+0.003
Mom+0.003
How+0.003
 next+0.003
What+0.003
Neg logit contributions
ocol-0.011
 rib-0.010
 microsc-0.009
 gymn-0.009
 topp-0.009
Token brave
Token ID14802
Feature activation+9.09e-03

Pos logit contributions
Maybe+0.010
How+0.009
 one+0.009
 someone+0.009
Mom+0.009
Neg logit contributions
ocol-0.019
 rib-0.016
 gymn-0.016
 microsc-0.016
 gol-0.015
Token"
Token ID1
Feature activation+1.01e-02

Pos logit contributions
ocol+0.008
 rib+0.007
 microsc+0.007
 gymn+0.007
rou+0.007
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
 next-0.002
TokenNot
Token ID3673
Feature activation+6.72e-04

Pos logit contributions
 someone+0.010
 dad+0.010
 one+0.010
 everything+0.010
 what+0.009
Neg logit contributions
ocol-0.016
 rib-0.014
 gymn-0.013
 microsc-0.012
 gol-0.012
Token yet
Token ID1865
Feature activation+1.13e-02

Pos logit contributions
Maybe+0.001
 someone+0.001
Mom+0.001
 dad+0.001
How+0.001
Neg logit contributions
ocol-0.001
 rib-0.001
 microsc-0.001
 gymn-0.001
rou-0.001
Token,
Token ID11
Feature activation-1.24e-03

Pos logit contributions
Maybe+0.011
Mom+0.010
How+0.010
What+0.010
 someone+0.009
Neg logit contributions
ocol-0.019
 rib-0.016
 microsc-0.015
 gymn-0.015
 topp-0.015
Token we
Token ID356
Feature activation-1.04e-02

Pos logit contributions
ocol+0.002
 rib+0.002
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.001
 someone-0.001
 one-0.001
 dad-0.001
Mom-0.001
Token have
Token ID423
Feature activation-1.10e-02

Pos logit contributions
ocol+0.018
 rib+0.015
 microsc+0.014
 gymn+0.014
rou+0.014
Neg logit contributions
Maybe-0.010
 someone-0.009
 one-0.009
 dad-0.009
 next-0.009
Token to
Token ID284
Feature activation+8.28e-03

Pos logit contributions
ocol+0.019
 rib+0.015
 microsc+0.014
 gymn+0.014
rou+0.014
Neg logit contributions
 someone-0.011
Maybe-0.011
 one-0.011
 dad-0.011
 next-0.011
Token cut
Token ID2005
Feature activation+1.64e-02

Pos logit contributions
Maybe+0.007
 someone+0.007
How+0.006
 dad+0.006
What+0.006
Neg logit contributions
ocol-0.015
 rib-0.012
 gymn-0.012
 microsc-0.012
 topp-0.011
Token it
Token ID340
Feature activation+1.31e-02

Pos logit contributions
Maybe+0.012
How+0.010
Nothing+0.010
Mom+0.010
 dad+0.010
Neg logit contributions
ocol-0.033
 rib-0.028
 topp-0.025
 microsc-0.025
 gymn-0.024
Token first
Token ID717
Feature activation+1.26e-02

Pos logit contributions
Maybe+0.012
 dad+0.011
 someone+0.011
What+0.011
Mom+0.011
Neg logit contributions
ocol-0.022
 rib-0.019
 microsc-0.017
 topp-0.017
 gymn-0.017
Token,"
Token ID553
Feature activation+3.34e-03

Pos logit contributions
Maybe+0.013
 someone+0.012
How+0.012
What+0.012
Mom+0.012
Neg logit contributions
ocol-0.020
 rib-0.017
 topp-0.016
 microsc-0.015
 gymn-0.015
Token she
Token ID673
Feature activation-1.81e-02

Pos logit contributions
Maybe+0.009
How+0.008
 someone+0.007
 somewhere+0.007
 next+0.007
Neg logit contributions
ocol-0.027
 rib-0.025
 gymn-0.024
 microsc-0.023
 gol-0.022
Token was
Token ID373
Feature activation+3.11e-03

Pos logit contributions
ocol+0.045
 rib+0.038
 microsc+0.037
 former+0.036
 topp+0.036
Neg logit contributions
 one-0.006
Maybe-0.006
 someone-0.005
 looked-0.005
 dad-0.005
Token done
Token ID1760
Feature activation-8.12e-03

Pos logit contributions
Maybe+0.003
 someone+0.003
How+0.003
Mom+0.003
 one+0.003
Neg logit contributions
ocol-0.005
 rib-0.004
rou-0.004
 gymn-0.004
 microsc-0.004
Token,
Token ID11
Feature activation+3.00e-03

Pos logit contributions
ocol+0.016
 rib+0.014
 microsc+0.013
rou+0.013
 gymn+0.013
Neg logit contributions
 someone-0.005
Maybe-0.005
 one-0.005
 dad-0.005
 next-0.005
Token she
Token ID673
Feature activation-1.18e-02

Pos logit contributions
Maybe+0.001
 someone+0.001
 next+0.001
Mom+0.001
How+0.001
Neg logit contributions
ocol-0.006
 rib-0.006
 microsc-0.005
 gymn-0.005
rou-0.005
Token said
Token ID531
Feature activation-1.76e-02

Pos logit contributions
ocol+0.025
 rib+0.020
 microsc+0.020
 gymn+0.019
 topp+0.019
Neg logit contributions
Maybe-0.008
 someone-0.008
 dad-0.008
 one-0.008
 next-0.007
Token,
Token ID11
Feature activation+7.49e-03

Pos logit contributions
ocol+0.038
 rib+0.031
 microsc+0.031
pill+0.030
 topp+0.030
Neg logit contributions
 one-0.011
 someone-0.011
 dad-0.011
 everything-0.010
 next-0.010
Token "
Token ID366
Feature activation-1.03e-02

Pos logit contributions
Maybe+0.008
 someone+0.007
 one+0.007
Mom+0.007
 dad+0.007
Neg logit contributions
ocol-0.012
 rib-0.010
 gymn-0.009
 microsc-0.009
rou-0.009
TokenThank
Token ID10449
Feature activation-1.03e-02

Pos logit contributions
ocol+0.021
 rib+0.018
 microsc+0.017
rou+0.017
 gymn+0.017
Neg logit contributions
Maybe-0.007
 someone-0.006
Mom-0.006
 one-0.006
 next-0.006
Token you
Token ID345
Feature activation-8.98e-03

Pos logit contributions
ocol+0.020
 rib+0.017
 microsc+0.017
rou+0.016
 gymn+0.016
Neg logit contributions
 someone-0.007
Maybe-0.007
 one-0.007
 next-0.007
 dad-0.007
Token,
Token ID11
Feature activation-1.35e-02

Pos logit contributions
ocol+0.018
 rib+0.015
 microsc+0.014
 gymn+0.014
 topp+0.014
Neg logit contributions
 someone-0.006
Maybe-0.006
 one-0.006
 dad-0.006
 next-0.006
Token told
Token ID1297
Feature activation-7.55e-03

Pos logit contributions
 someone+0.004
Maybe+0.004
 one+0.004
Mom+0.004
How+0.004
Neg logit contributions
ocol-0.009
 rib-0.008
 microsc-0.007
rou-0.007
 gymn-0.007
Token T
Token ID309
Feature activation-9.13e-03

Pos logit contributions
ocol+0.013
 rib+0.011
 microsc+0.010
 gymn+0.010
rou+0.010
Neg logit contributions
 someone-0.007
Maybe-0.007
 one-0.007
 dad-0.007
 next-0.007
Tokenane
Token ID1531
Feature activation+3.07e-06

Pos logit contributions
ocol+0.013
 rib+0.010
 microsc+0.010
 gymn+0.009
 topp+0.009
Neg logit contributions
Maybe-0.011
 someone-0.011
 one-0.011
 dad-0.011
 next-0.011
Token that
Token ID326
Feature activation-6.04e-03

Feature not active on prev token;
no predictions were affected.
Token it
Token ID340
Feature activation-8.26e-03

Pos logit contributions
ocol+0.013
 rib+0.011
 microsc+0.011
 gymn+0.011
rou+0.011
Neg logit contributions
Maybe-0.003
 someone-0.003
 one-0.003
 dad-0.003
 next-0.003
Token was
Token ID373
Feature activation-1.17e-02

Pos logit contributions
ocol+0.015
 rib+0.012
 microsc+0.012
 gymn+0.012
rou+0.011
Neg logit contributions
Maybe-0.007
 someone-0.007
 one-0.007
 dad-0.007
 next-0.007
Token a
Token ID257
Feature activation+6.65e-04

Pos logit contributions
ocol+0.023
 rib+0.020
 microsc+0.019
 gymn+0.018
rou+0.018
Neg logit contributions
 someone-0.008
 one-0.008
Maybe-0.008
 dad-0.008
 next-0.007
Token sign
Token ID1051
Feature activation+3.18e-03

Pos logit contributions
Maybe+0.001
 someone+0.001
 dad+0.001
 one+0.001
Mom+0.001
Neg logit contributions
ocol-0.001
 rib-0.001
 microsc-0.001
 gymn-0.001
rou-0.001
Token of
Token ID286
Feature activation-6.35e-03

Pos logit contributions
Maybe+0.003
 someone+0.002
Mom+0.002
 dad+0.002
 one+0.002
Neg logit contributions
ocol-0.006
 rib-0.005
 microsc-0.005
 gymn-0.005
rou-0.005
Token hope
Token ID2911
Feature activation-2.05e-02

Pos logit contributions
ocol+0.013
 rib+0.011
 microsc+0.011
 gymn+0.010
rou+0.010
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token and
Token ID290
Feature activation-3.03e-03

Pos logit contributions
ocol+0.042
 rib+0.035
 microsc+0.034
rou+0.033
 topp+0.033
Neg logit contributions
 someone-0.014
 one-0.013
Maybe-0.013
 next-0.013
 dad-0.013
Token them
Token ID606
Feature activation+2.53e-02

Pos logit contributions
Maybe+0.020
 dad+0.019
How+0.019
What+0.018
Mom+0.018
Neg logit contributions
ocol-0.036
 rib-0.032
 microsc-0.029
 gymn-0.028
rou-0.028
Token and
Token ID290
Feature activation+5.04e-03

Pos logit contributions
Maybe+0.028
What+0.027
How+0.027
Where+0.026
Turn+0.025
Neg logit contributions
ocol-0.036
 rib-0.032
rou-0.029
 microsc-0.029
hy-0.028
Token makes
Token ID1838
Feature activation+2.85e-03

Pos logit contributions
Maybe+0.004
Mom+0.004
 someone+0.004
 one+0.004
 dad+0.003
Neg logit contributions
ocol-0.009
 rib-0.008
 microsc-0.008
 gymn-0.007
 topp-0.007
Token them
Token ID606
Feature activation+8.68e-03

Pos logit contributions
Maybe+0.002
How+0.002
Mom+0.002
 someone+0.002
 one+0.002
Neg logit contributions
ocol-0.005
 rib-0.004
 microsc-0.004
 gymn-0.004
rou-0.004
Token mad
Token ID8805
Feature activation+2.55e-02

Pos logit contributions
Maybe+0.006
 someone+0.006
How+0.006
Mom+0.006
What+0.006
Neg logit contributions
ocol-0.016
 rib-0.014
 microsc-0.013
rou-0.013
 gymn-0.013
Token.
Token ID13
Feature activation-1.97e-02

Pos logit contributions
Maybe+0.027
How+0.026
Mom+0.026
Turn+0.026
What+0.026
Neg logit contributions
ocol-0.036
 rib-0.032
rou-0.028
 gymn-0.028
 microsc-0.027
Token You
Token ID921
Feature activation-2.29e-02

Pos logit contributions
ocol+0.036
 rib+0.030
 microsc+0.028
rou+0.028
 gymn+0.027
Neg logit contributions
 someone-0.018
Maybe-0.017
Mom-0.017
 one-0.017
 dad-0.017
Token could
Token ID714
Feature activation+3.99e-03

Pos logit contributions
ocol+0.044
 rib+0.035
 microsc+0.034
 gymn+0.034
rou+0.034
Neg logit contributions
Maybe-0.019
 someone-0.019
 one-0.019
 dad-0.018
 everything-0.018
Token have
Token ID423
Feature activation+6.68e-03

Pos logit contributions
Maybe+0.003
 someone+0.003
 dad+0.003
How+0.003
Mom+0.003
Neg logit contributions
ocol-0.007
 rib-0.006
 gymn-0.006
 microsc-0.006
rou-0.006
Token been
Token ID587
Feature activation+1.83e-02

Pos logit contributions
Maybe+0.005
 dad+0.005
Mom+0.005
 someone+0.005
How+0.005
Neg logit contributions
ocol-0.012
 rib-0.011
 topp-0.010
 microsc-0.010
 gymn-0.010
Token hurt
Token ID5938
Feature activation+1.33e-02

Pos logit contributions
Maybe+0.018
Mom+0.016
What+0.015
How+0.014
Dad+0.014
Neg logit contributions
ocol-0.031
 rib-0.026
 topp-0.025
 gymn-0.025
rou-0.025
Token gentle
Token ID10296
Feature activation+6.81e-03

Pos logit contributions
ocol+0.030
 rib+0.025
 microsc+0.024
rou+0.023
 gymn+0.023
Neg logit contributions
 someone-0.011
Maybe-0.011
 one-0.011
 dad-0.011
 next-0.010
Token tap
Token ID9814
Feature activation+3.24e-03

Pos logit contributions
Maybe+0.006
 someone+0.006
How+0.006
Mom+0.006
 dad+0.006
Neg logit contributions
ocol-0.012
 rib-0.010
 microsc-0.009
 gymn-0.009
 topp-0.009
Token,
Token ID11
Feature activation-1.41e-03

Pos logit contributions
Maybe+0.003
 someone+0.003
Mom+0.002
 dad+0.002
How+0.002
Neg logit contributions
ocol-0.006
 rib-0.005
 microsc-0.005
rou-0.005
 gymn-0.005
Token and
Token ID290
Feature activation-1.21e-03

Pos logit contributions
ocol+0.003
 rib+0.002
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
 one-0.001
Mom-0.001
Token spoke
Token ID5158
Feature activation-2.12e-02

Pos logit contributions
ocol+0.002
 rib+0.002
 microsc+0.002
 gymn+0.002
 topp+0.002
Neg logit contributions
Maybe-0.001
 someone-0.001
 one-0.001
 dad-0.001
Mom-0.001
Token softly
Token ID26625
Feature activation-1.82e-02

Pos logit contributions
ocol+0.043
 rib+0.035
pill+0.033
 microsc+0.033
 gol+0.033
Neg logit contributions
 someone-0.016
 one-0.016
 next-0.015
 dad-0.015
 something-0.015
Token to
Token ID284
Feature activation+3.63e-03

Pos logit contributions
ocol+0.038
 rib+0.031
 microsc+0.030
 gymn+0.029
rou+0.029
Neg logit contributions
 one-0.012
 someone-0.011
 next-0.011
 dad-0.011
Maybe-0.011
Token him
Token ID683
Feature activation+7.55e-04

Pos logit contributions
Maybe+0.003
Mom+0.003
 next+0.002
How+0.002
 dad+0.002
Neg logit contributions
ocol-0.007
 rib-0.006
 microsc-0.005
 gymn-0.005
 topp-0.005
Token.
Token ID13
Feature activation-4.37e-04

Pos logit contributions
Maybe+0.001
 someone+0.000
Mom+0.000
 dad+0.000
 one+0.000
Neg logit contributions
ocol-0.002
 rib-0.001
 microsc-0.001
 gymn-0.001
rou-0.001
Token He
Token ID679
Feature activation+2.07e-03

Pos logit contributions
ocol+0.001
 rib+0.001
 microsc+0.001
 gymn+0.001
 topp+0.001
Neg logit contributions
Maybe-0.000
 someone-0.000
 one-0.000
 dad-0.000
 next-0.000
Token said
Token ID531
Feature activation-2.46e-02

Pos logit contributions
 someone+0.002
Maybe+0.002
 one+0.002
 dad+0.002
Mom+0.002
Neg logit contributions
ocol-0.004
 rib-0.003
 microsc-0.003
rou-0.003
 gymn-0.003
Token animals
Token ID4695
Feature activation+1.06e-03

Pos logit contributions
ocol+0.055
 rib+0.047
ifying+0.046
rou+0.045
 gol+0.045
Neg logit contributions
 someone-0.010
 everything-0.010
 one-0.010
Maybe-0.009
 next-0.009
Token like
Token ID588
Feature activation-8.40e-04

Pos logit contributions
Maybe+0.001
 someone+0.001
Mom+0.001
 dad+0.001
 one+0.001
Neg logit contributions
ocol-0.002
 rib-0.002
 microsc-0.002
 gymn-0.002
rou-0.002
Token lions
Token ID31420
Feature activation-7.84e-03

Pos logit contributions
ocol+0.001
 rib+0.001
 microsc+0.001
 gymn+0.001
 topp+0.001
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
Mom-0.001
 one-0.001
Token,
Token ID11
Feature activation-7.46e-03

Pos logit contributions
ocol+0.016
 rib+0.014
 microsc+0.013
 gymn+0.013
rou+0.013
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.005
 dad-0.005
 next-0.005
Token ze
Token ID41271
Feature activation-2.19e-02

Pos logit contributions
ocol+0.012
 rib+0.009
 microsc+0.009
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.008
 someone-0.008
 one-0.008
 dad-0.008
 next-0.008
Tokenbr
Token ID1671
Feature activation-4.64e-02

Pos logit contributions
ocol+0.029
 rib+0.022
 microsc+0.022
rou+0.021
 former+0.021
Neg logit contributions
 someone-0.028
 one-0.027
Maybe-0.027
 next-0.027
 dad-0.026
Tokenas
Token ID292
Feature activation-2.78e-02

Pos logit contributions
ocol+0.081
 microsc+0.069
 rib+0.069
 gymn+0.068
 gol+0.067
Neg logit contributions
They-0.039
 dad-0.039
Mom-0.037
 everything-0.036
 one-0.036
Token,
Token ID11
Feature activation-5.49e-04

Pos logit contributions
ocol+0.063
 rib+0.052
 microsc+0.052
 topp+0.050
 former+0.050
Neg logit contributions
 one-0.014
 next-0.012
 someone-0.011
 dad-0.011
 they-0.011
Token and
Token ID290
Feature activation-1.14e-02

Pos logit contributions
ocol+0.001
 rib+0.001
 microsc+0.001
 gymn+0.001
rou+0.001
Neg logit contributions
Maybe-0.000
 someone-0.000
 dad-0.000
Mom-0.000
 one-0.000
Token gir
Token ID37370
Feature activation-4.23e-02

Pos logit contributions
ocol+0.020
 rib+0.016
 microsc+0.015
rou+0.015
 gymn+0.015
Neg logit contributions
Maybe-0.010
 someone-0.010
 one-0.010
 dad-0.010
 next-0.010
Tokenaff
Token ID2001
Feature activation-2.61e-02

Pos logit contributions
ocol+0.044
 microsc+0.033
ifying+0.032
 former+0.030
pri+0.030
Neg logit contributions
 one-0.063
 they-0.062
 everything-0.062
 someone-0.061
 dad-0.061
Token.
Token ID13
Feature activation-9.79e-03

Pos logit contributions
ocol+0.003
 rib+0.003
 microsc+0.003
 gymn+0.003
rou+0.003
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
 one-0.001
Mom-0.001
Token You
Token ID921
Feature activation-1.41e-02

Pos logit contributions
ocol+0.016
 rib+0.013
 microsc+0.012
rou+0.012
 gymn+0.012
Neg logit contributions
 someone-0.010
Maybe-0.010
 one-0.010
 dad-0.010
Mom-0.010
Token are
Token ID389
Feature activation+6.45e-03

Pos logit contributions
ocol+0.024
 rib+0.019
 microsc+0.018
rou+0.018
 gymn+0.018
Neg logit contributions
Maybe-0.014
 someone-0.014
 one-0.014
 dad-0.014
 next-0.013
Token rude
Token ID22066
Feature activation+8.67e-03

Pos logit contributions
Maybe+0.005
Mom+0.004
 someone+0.004
 dad+0.004
How+0.004
Neg logit contributions
ocol-0.013
 rib-0.011
 microsc-0.010
 gymn-0.010
rou-0.010
Token.
Token ID13
Feature activation-1.03e-02

Pos logit contributions
Maybe+0.007
Mom+0.006
 someone+0.006
How+0.006
What+0.006
Neg logit contributions
ocol-0.015
 rib-0.013
rou-0.013
 microsc-0.012
 gymn-0.012
Token I
Token ID314
Feature activation-4.49e-02

Pos logit contributions
ocol+0.017
 rib+0.014
 microsc+0.013
rou+0.012
 gymn+0.012
Neg logit contributions
 someone-0.011
Maybe-0.011
 one-0.011
 dad-0.010
Mom-0.010
Token do
Token ID466
Feature activation+1.66e-02

Pos logit contributions
ocol+0.097
 microsc+0.079
 rib+0.076
aman+0.073
ksh+0.071
Neg logit contributions
Maybe-0.037
 one-0.034
 someone-0.031
 not-0.031
 turned-0.031
Token not
Token ID407
Feature activation+1.92e-02

Pos logit contributions
Maybe+0.009
Mom+0.008
 dad+0.007
Someone+0.007
 someone+0.006
Neg logit contributions
ocol-0.034
 rib-0.030
 gymn-0.030
rou-0.029
 microsc-0.028
Token want
Token ID765
Feature activation+1.51e-02

Pos logit contributions
Maybe+0.021
Mom+0.019
 dad+0.019
How+0.019
Someone+0.019
Neg logit contributions
ocol-0.029
 rib-0.024
 gol-0.023
 gymn-0.023
hy-0.023
Token to
Token ID284
Feature activation+9.45e-03

Pos logit contributions
Maybe+0.019
Mom+0.018
How+0.017
What+0.017
 dad+0.017
Neg logit contributions
ocol-0.021
 rib-0.018
 microsc-0.016
 gymn-0.016
 topp-0.015
Token play
Token ID711
Feature activation+3.61e-04

Pos logit contributions
Maybe+0.010
Mom+0.009
How+0.009
 someone+0.009
Someone+0.009
Neg logit contributions
ocol-0.015
 rib-0.013
 gymn-0.012
 gol-0.012
 microsc-0.012
Token end
Token ID886
Feature activation+4.35e-03

Pos logit contributions
Maybe+0.017
 someone+0.015
How+0.015
 everything+0.014
Mom+0.014
Neg logit contributions
ocol-0.037
 rib-0.032
 microsc-0.030
 gol-0.029
 gymn-0.029
Token.
Token ID13
Feature activation-8.95e-03

Pos logit contributions
Maybe+0.003
 dad+0.003
 someone+0.003
Mom+0.003
How+0.003
Neg logit contributions
ocol-0.008
 rib-0.007
 microsc-0.007
 gymn-0.007
rou-0.006
Token<|endoftext|>
Token ID50256
Feature activation-9.12e-03

Pos logit contributions
ocol+0.019
 rib+0.016
 microsc+0.015
 gymn+0.015
rou+0.015
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.005
 dad-0.005
Mom-0.005
TokenMom
Token ID29252
Feature activation+1.35e-02

Pos logit contributions
ocol+0.019
 rib+0.016
 microsc+0.015
rou+0.015
 gymn+0.015
Neg logit contributions
Maybe-0.006
 someone-0.006
 one-0.005
 dad-0.005
 next-0.005
Tokenmy
Token ID1820
Feature activation+1.23e-02

Pos logit contributions
Maybe+0.010
 someone+0.010
How+0.009
 somewhere+0.009
Mom+0.009
Neg logit contributions
ocol-0.025
 rib-0.022
rou-0.021
 microsc-0.020
 topp-0.020
Token had
Token ID550
Feature activation-4.64e-02

Pos logit contributions
Maybe+0.009
 someone+0.009
How+0.009
Mom+0.009
What+0.008
Neg logit contributions
ocol-0.022
 rib-0.019
rou-0.018
 microsc-0.018
 gymn-0.018
Token a
Token ID257
Feature activation-8.69e-04

Pos logit contributions
ocol+0.124
oc+0.103
 microsc+0.102
spe+0.102
oke+0.100
Neg logit contributions
 one-0.016
 a-0.014
 dad-0.014
 something-0.013
 there-0.012
Token plan
Token ID1410
Feature activation-2.57e-03

Pos logit contributions
ocol+0.002
 rib+0.001
 microsc+0.001
 gymn+0.001
rou+0.001
Neg logit contributions
Maybe-0.001
 someone-0.001
 one-0.001
Mom-0.001
 next-0.001
Token.
Token ID13
Feature activation-1.29e-02

Pos logit contributions
ocol+0.005
 rib+0.004
 microsc+0.004
 gymn+0.004
rou+0.004
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.001
 dad-0.001
Mom-0.001
Token She
Token ID1375
Feature activation+7.46e-03

Pos logit contributions
ocol+0.025
 rib+0.021
 microsc+0.020
rou+0.020
 gymn+0.020
Neg logit contributions
Maybe-0.009
 someone-0.009
 one-0.009
 dad-0.009
Mom-0.009
Token wanted
Token ID2227
Feature activation+6.05e-03

Pos logit contributions
 someone+0.006
Maybe+0.006
 dad+0.006
 one+0.006
How+0.006
Neg logit contributions
ocol-0.013
 rib-0.011
 microsc-0.010
 gymn-0.010
rou-0.010
Token to
Token ID284
Feature activation+5.27e-03

Pos logit contributions
Maybe+0.006
Mom+0.006
 someone+0.006
How+0.006
 dad+0.006
Neg logit contributions
ocol-0.006
 rib-0.005
 microsc-0.004
 gymn-0.004
rou-0.004
Token hurt
Token ID5938
Feature activation+6.67e-03

Pos logit contributions
Maybe+0.004
Mom+0.004
 someone+0.004
 dad+0.004
 one+0.004
Neg logit contributions
ocol-0.009
 rib-0.008
 gymn-0.008
 microsc-0.008
hy-0.007
Token your
Token ID534
Feature activation+3.31e-03

Pos logit contributions
Maybe+0.006
 dad+0.005
How+0.005
Mom+0.005
 one+0.005
Neg logit contributions
ocol-0.012
 rib-0.010
 microsc-0.009
 gymn-0.009
rou-0.008
Token feelings
Token ID7666
Feature activation+5.96e-03

Pos logit contributions
Maybe+0.003
Mom+0.003
How+0.003
 someone+0.003
What+0.003
Neg logit contributions
ocol-0.006
 rib-0.005
 microsc-0.005
 gymn-0.005
 topp-0.005
Token.
Token ID13
Feature activation-1.86e-02

Pos logit contributions
Maybe+0.004
 someone+0.004
 dad+0.004
Mom+0.004
How+0.004
Neg logit contributions
ocol-0.011
 rib-0.009
 microsc-0.009
 gymn-0.009
rou-0.009
Token I
Token ID314
Feature activation-4.43e-02

Pos logit contributions
ocol+0.038
 rib+0.031
 microsc+0.030
rou+0.030
 gymn+0.029
Neg logit contributions
Maybe-0.014
 someone-0.014
Mom-0.013
 dad-0.013
 one-0.012
Token like
Token ID588
Feature activation-4.87e-03

Pos logit contributions
ocol+0.100
 rib+0.077
 microsc+0.076
pill+0.073
ifying+0.072
Neg logit contributions
Maybe-0.039
 looked-0.035
 turned-0.035
 one-0.034
 not-0.031
Token your
Token ID534
Feature activation+7.15e-04

Pos logit contributions
ocol+0.009
 rib+0.007
 microsc+0.007
 gymn+0.007
rou+0.007
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
Mom-0.004
Token stick
Token ID4859
Feature activation+8.29e-03

Pos logit contributions
Maybe+0.001
 someone+0.001
Mom+0.001
How+0.001
 one+0.001
Neg logit contributions
ocol-0.001
 rib-0.001
 microsc-0.001
 gymn-0.001
 topp-0.001
Token.
Token ID13
Feature activation-1.62e-02

Pos logit contributions
Maybe+0.006
 someone+0.006
Mom+0.006
What+0.006
 dad+0.006
Neg logit contributions
ocol-0.016
 rib-0.013
 microsc-0.013
rou-0.013
 gymn-0.012
Token It
Token ID632
Feature activation-2.13e-02

Pos logit contributions
ocol+0.029
 rib+0.024
 microsc+0.023
rou+0.022
 gymn+0.022
Neg logit contributions
Maybe-0.015
 someone-0.015
 one-0.014
Mom-0.014
 dad-0.014
Token.
Token ID13
Feature activation+1.36e-02

Pos logit contributions
ocol+0.062
 rib+0.051
 microsc+0.050
 topp+0.050
rou+0.049
Neg logit contributions
 one-0.015
 next-0.014
 someone-0.013
 there-0.011
 something-0.011
Token This
Token ID770
Feature activation+4.39e-03

Pos logit contributions
Maybe+0.015
 next+0.014
Mom+0.014
 dad+0.014
 one+0.014
Neg logit contributions
ocol-0.020
 microsc-0.016
 rib-0.015
 gol-0.015
 gymn-0.015
Token worked
Token ID3111
Feature activation-6.86e-03

Pos logit contributions
Maybe+0.005
How+0.005
 someone+0.005
 dad+0.004
Mom+0.004
Neg logit contributions
ocol-0.007
 rib-0.006
 gymn-0.005
 microsc-0.005
 topp-0.005
Token until
Token ID1566
Feature activation+1.37e-02

Pos logit contributions
ocol+0.013
 rib+0.011
 microsc+0.011
 gymn+0.011
rou+0.010
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.005
 dad-0.005
 next-0.004
Token a
Token ID257
Feature activation+9.43e-03

Pos logit contributions
Maybe+0.011
How+0.011
Mom+0.010
What+0.010
 dad+0.009
Neg logit contributions
ocol-0.025
 rib-0.021
 microsc-0.020
 gol-0.019
 gymn-0.019
Token gust
Token ID35253
Feature activation-4.62e-02

Pos logit contributions
Maybe+0.009
 someone+0.008
 dad+0.008
 one+0.008
How+0.008
Neg logit contributions
ocol-0.016
 rib-0.014
 gymn-0.013
 microsc-0.013
rou-0.012
Token of
Token ID286
Feature activation+1.55e-02

Pos logit contributions
ocol+0.082
 former+0.081
 microsc+0.078
 rightful+0.073
 rib+0.073
Neg logit contributions
 one-0.040
 next-0.038
 there-0.036
Maybe-0.035
 they-0.035
Token wind
Token ID2344
Feature activation-1.82e-02

Pos logit contributions
Maybe+0.012
How+0.010
 someone+0.010
 dad+0.009
 they+0.009
Neg logit contributions
ocol-0.030
 rib-0.026
 gymn-0.024
 microsc-0.024
 gol-0.024
Token blew
Token ID17948
Feature activation-1.56e-02

Pos logit contributions
ocol+0.038
 microsc+0.031
 rib+0.031
 gymn+0.030
 topp+0.029
Neg logit contributions
Maybe-0.011
 one-0.011
 someone-0.011
 next-0.011
 dad-0.010
Token the
Token ID262
Feature activation-9.17e-03

Pos logit contributions
ocol+0.032
 rib+0.027
 microsc+0.026
 gymn+0.026
rou+0.026
Neg logit contributions
 someone-0.010
 one-0.010
Maybe-0.010
 next-0.009
 dad-0.009
Token tw
Token ID665
Feature activation-2.05e-02

Pos logit contributions
ocol+0.013
 rib+0.010
 microsc+0.010
 topp+0.009
rou+0.009
Neg logit contributions
Maybe-0.011
 someone-0.011
 one-0.011
 next-0.011
 dad-0.011