TOP ACTIVATIONS
MAX = 3.209

and saw a magical fair yt ale creature called a n
it first ? â M ummy said
tall hills and passed through vibrant forests . After several days
came across rolling hills with lush green grass . The sun
lot of different animals and lush green trees .
Finally Lucy â s mom said ,
Don â t worry , Jack .
, why don â t you take a break
you want ? â Alex wanted the
felt bad and said ," I 'm sorry , let me
! Why don â t you come with me
my friend ? â The chicken said yes and
, if that â s what you want to
I do ? â Her dad replied ,
. They promised to play and eat popcorn together every day
Don â t worry , Mat ty
, please ? â Her mom
he just couldn â t reach it . Then
Don â t worry , I â
â Sam â s eyes lit up and

MIN ACTIVATIONS
MIN = -2.399

says . She hears the noise and comes to the attic
very much . She hugged it every night and played with
Mom hears the noise and comes to the garage
. Lily had a bath tub in her bathroom . She
every night . She hugs it and squee zes it tight
Mom hears the noise . She comes to see
. His name was Joe , and he was wearing red
Mom hears the noise and comes to see what
Mom hears the noise and comes to see what
Mom hears the noise and comes to the room
Mom hears the noise and comes to see what
safe . His parents made him feel so special and loved
Mom heard the noise and came to see what
Mom heard the noise and came to see what
mess . Mom hears the noise and comes to see .
Mom hears the noise and comes to the room
to end . Sarah gave Jack one last squeeze and they
Mom hears the noise . She comes to see
Mom hears the noise and comes to the room
shop . She gave Po ppy some money and said "

INTERVAL 1.807 - 3.209
CONTAINS 0.027%

colourful flowers , through the lush forests deep in thought .
explore the woods and feel close to nature . <|endoftext|> Once
Max saw a big pile of fruit and he wanted to
You â re very welcome Clara .
Let â s not think about the

INTERVAL 0.405 - 1.807
CONTAINS 19.454%

and Ben are bad . He takes the phone from them
The lady opened the box . She was shocked
living room and sit on the couch . Sam opens the
seen the little girl playing the piano came up to her
They can hurt someone . We should only throw round and

INTERVAL -0.997 - 0.405
CONTAINS 78.617%

and they brought the arm chair home together .
block that can snap to other blocks . She thinks it
they reached the other side , the dog ran towards them
came and washed away the whale 's painting . The whale
a time , there was a little boy named Tim my

INTERVAL -2.399 - -0.997
CONTAINS 1.902%

the toy car and thought it was fun .
says . " Wow , Mom , that 's
Sp ide y thought , " I need to prevent
She felt helpless and didn 't know what to do .
you ." <|endoftext|> Anna and Ben are best friends . They

Token and
Token ID290
Feature activation-2.32e-01

Pos logit contributions
ocol+0.013
 rib+0.011
 microsc+0.010
 gymn+0.010
rou+0.010
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.004
 dad-0.004
 next-0.004
Token saw
Token ID2497
Feature activation-7.63e-02

Pos logit contributions
Maybe+0.004
Mom+0.004
 someone+0.004
How+0.004
 dad+0.004
Neg logit contributions
ocol-0.010
 rib-0.009
 microsc-0.008
 gymn-0.008
rou-0.008
Token a
Token ID257
Feature activation-9.40e-02

Pos logit contributions
Maybe+0.001
Mom+0.001
 someone+0.001
How+0.001
 dad+0.001
Neg logit contributions
ocol-0.004
 rib-0.003
 microsc-0.003
rou-0.003
 gymn-0.003
Token magical
Token ID10883
Feature activation+3.14e-01

Pos logit contributions
Maybe+0.002
 someone+0.002
 one+0.002
 dad+0.002
How+0.002
Neg logit contributions
ocol-0.004
 rib-0.004
 microsc-0.003
 gymn-0.003
rou-0.003
Token fair
Token ID3148
Feature activation-6.51e-02

Pos logit contributions
ocol+0.015
 rib+0.012
 microsc+0.012
 gymn+0.011
rou+0.011
Neg logit contributions
Maybe-0.006
 someone-0.006
 one-0.006
 dad-0.005
 next-0.005
Tokenyt
Token ID20760
Feature activation+2.53e+00

Pos logit contributions
Maybe+0.001
 someone+0.001
How+0.001
Mom+0.001
 one+0.001
Neg logit contributions
ocol-0.003
 rib-0.003
 microsc-0.002
 gymn-0.002
rou-0.002
Tokenale
Token ID1000
Feature activation+6.58e-01

Pos logit contributions
ocol+0.111
 microsc+0.111
pill+0.110
 gymn+0.109
 topp+0.108
Neg logit contributions
 one-0.041
 end-0.036
 where-0.036
 back-0.035
 next-0.035
Token creature
Token ID7185
Feature activation+2.26e-01

Pos logit contributions
ocol+0.032
 rib+0.027
 microsc+0.026
 gymn+0.025
rou+0.025
Neg logit contributions
 someone-0.011
 one-0.011
Maybe-0.011
 next-0.011
 dad-0.010
Token called
Token ID1444
Feature activation+7.40e-01

Pos logit contributions
ocol+0.011
 rib+0.009
 microsc+0.009
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.004
 someone-0.004
 dad-0.004
 one-0.004
Mom-0.003
Token a
Token ID257
Feature activation+4.51e-01

Pos logit contributions
ocol+0.037
 rib+0.031
 microsc+0.029
 gymn+0.029
rou+0.029
Neg logit contributions
 someone-0.013
 one-0.012
 dad-0.012
Maybe-0.011
 next-0.011
Token n
Token ID299
Feature activation+7.45e-01

Pos logit contributions
ocol+0.021
 rib+0.018
 microsc+0.017
 gymn+0.016
rou+0.016
Neg logit contributions
 someone-0.009
Maybe-0.009
 one-0.008
 dad-0.008
 next-0.008
Token it
Token ID340
Feature activation-6.68e-01

Pos logit contributions
ocol+0.002
 rib+0.002
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
Mom-0.001
 one-0.001
Token first
Token ID717
Feature activation-3.54e-01

Pos logit contributions
Maybe+0.012
Mom+0.012
 dad+0.012
 someone+0.011
What+0.011
Neg logit contributions
ocol-0.029
 rib-0.026
 gymn-0.025
 microsc-0.023
rou-0.023
Token?
Token ID30
Feature activation+5.09e-01

Pos logit contributions
Maybe+0.006
 someone+0.006
 dad+0.005
Mom+0.005
What+0.005
Neg logit contributions
ocol-0.016
 rib-0.014
 gymn-0.014
 microsc-0.013
rou-0.013
Tokenâ
Token ID22940
Feature activation+5.05e-01

Pos logit contributions
ocol+0.022
 rib+0.018
 microsc+0.017
rou+0.017
 gymn+0.017
Neg logit contributions
Maybe-0.012
 someone-0.011
 one-0.011
 dad-0.011
Mom-0.011
Token
Token ID26391
Feature activation-1.36e-01

Pos logit contributions
ocol+0.016
 rib+0.012
 microsc+0.011
 gymn+0.011
rou+0.010
Neg logit contributions
Maybe-0.017
 someone-0.017
 one-0.017
 next-0.017
 dad-0.017
Token
Token ID8151
Feature activation+2.50e+00

Pos logit contributions
Maybe+0.003
 someone+0.003
 one+0.003
 next+0.002
 dad+0.002
Neg logit contributions
ocol-0.006
 rib-0.005
 microsc-0.005
 gymn-0.005
rou-0.005
Token M
Token ID337
Feature activation+9.64e-01

Pos logit contributions
ocol+0.137
 rib+0.117
 microsc+0.107
 former+0.104
 topp+0.103
Neg logit contributions
 one-0.038
Maybe-0.038
 they-0.037
 someone-0.037
 dad-0.037
Tokenummy
Token ID13513
Feature activation-6.54e-01

Pos logit contributions
ocol+0.037
 rib+0.031
 microsc+0.029
 topp+0.028
ifying+0.027
Neg logit contributions
 dad-0.026
 someone-0.025
 next-0.025
 one-0.024
Maybe-0.024
Token said
Token ID531
Feature activation+9.33e-01

Pos logit contributions
Maybe+0.016
 someone+0.016
How+0.015
What+0.015
Mom+0.015
Neg logit contributions
ocol-0.026
 rib-0.022
 gymn-0.020
rou-0.020
 microsc-0.020
Token
Token ID6184
Feature activation+4.95e-01

Pos logit contributions
ocol+0.043
 rib+0.035
 microsc+0.034
rou+0.033
pill+0.033
Neg logit contributions
 someone-0.020
 one-0.020
 dad-0.019
Maybe-0.018
 next-0.018
Token
Token ID95
Feature activation+6.97e-01

Pos logit contributions
ocol+0.019
 rib+0.015
 microsc+0.014
rou+0.014
 gymn+0.014
Neg logit contributions
Maybe-0.013
 someone-0.013
 one-0.013
 dad-0.013
 next-0.013
Token tall
Token ID7331
Feature activation+1.43e+00

Pos logit contributions
ocol+0.032
 rib+0.027
 microsc+0.026
rou+0.025
 gymn+0.025
Neg logit contributions
 one-0.008
 someone-0.008
Maybe-0.008
 next-0.008
 everything-0.007
Token hills
Token ID18639
Feature activation+1.23e+00

Pos logit contributions
ocol+0.078
rou+0.068
ifying+0.065
 gol+0.064
 microsc+0.064
Neg logit contributions
 one-0.018
 next-0.016
 someone-0.015
 everything-0.015
 looked-0.013
Token and
Token ID290
Feature activation-4.63e-02

Pos logit contributions
ocol+0.073
 rib+0.061
 microsc+0.060
rou+0.058
 topp+0.058
Neg logit contributions
 one-0.013
 next-0.012
 fast-0.010
 someone-0.010
 everything-0.010
Token passed
Token ID3804
Feature activation+9.17e-01

Pos logit contributions
Maybe+0.001
 someone+0.001
 dad+0.001
Mom+0.001
 one+0.001
Neg logit contributions
ocol-0.002
 rib-0.002
 microsc-0.002
 gymn-0.002
rou-0.002
Token through
Token ID832
Feature activation-1.01e-01

Pos logit contributions
ocol+0.045
 rib+0.036
rou+0.035
 microsc+0.035
 gymn+0.035
Neg logit contributions
 someone-0.018
 one-0.018
Maybe-0.017
 everything-0.017
 next-0.017
Token vibrant
Token ID21266
Feature activation+2.47e+00

Pos logit contributions
Maybe+0.002
Mom+0.002
 someone+0.002
 dad+0.002
How+0.002
Neg logit contributions
ocol-0.005
 rib-0.004
 microsc-0.004
 gymn-0.004
 topp-0.004
Token forests
Token ID17039
Feature activation+1.76e+00

Pos logit contributions
ocol+0.137
liv+0.125
oke+0.125
 gol+0.125
rou+0.124
Neg logit contributions
 everything-0.028
 one-0.028
 things-0.022
 water-0.020
 something-0.020
Token.
Token ID13
Feature activation-2.12e-01

Pos logit contributions
ocol+0.111
 rib+0.092
 microsc+0.090
 gol+0.089
 gymn+0.088
Neg logit contributions
 one-0.014
 next-0.013
 someone-0.010
 fast-0.010
 everything-0.009
Token After
Token ID2293
Feature activation-9.40e-01

Pos logit contributions
Maybe+0.005
 dad+0.005
Mom+0.005
 someone+0.005
 next+0.005
Neg logit contributions
ocol-0.008
 rib-0.007
 microsc-0.006
 gymn-0.006
hy-0.006
Token several
Token ID1811
Feature activation-6.00e-01

Pos logit contributions
Maybe+0.018
Mom+0.017
How+0.016
Parent+0.015
 dad+0.014
Neg logit contributions
ocol-0.043
 rib-0.038
 microsc-0.037
 gymn-0.035
pill-0.034
Token days
Token ID1528
Feature activation+1.09e+00

Pos logit contributions
Maybe+0.014
Mom+0.014
 dad+0.013
 someone+0.013
How+0.013
Neg logit contributions
ocol-0.026
 rib-0.022
 gymn-0.020
 microsc-0.020
 topp-0.019
Token came
Token ID1625
Feature activation-2.21e-02

Pos logit contributions
ocol+0.005
 rib+0.004
 microsc+0.004
 gymn+0.004
rou+0.004
Neg logit contributions
Maybe-0.002
 someone-0.002
 dad-0.002
 one-0.002
Mom-0.002
Token across
Token ID1973
Feature activation+1.31e-01

Pos logit contributions
Maybe+0.000
 someone+0.000
 dad+0.000
Mom+0.000
How+0.000
Neg logit contributions
ocol-0.001
 rib-0.001
 microsc-0.001
rou-0.001
 gymn-0.001
Token rolling
Token ID10708
Feature activation+5.79e-01

Pos logit contributions
ocol+0.007
 rib+0.006
 microsc+0.006
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
Mom-0.001
 one-0.001
Token hills
Token ID18639
Feature activation+1.08e+00

Pos logit contributions
ocol+0.029
 rib+0.024
 microsc+0.023
rou+0.023
 gymn+0.023
Neg logit contributions
 someone-0.009
 one-0.009
Maybe-0.009
 next-0.009
 dad-0.009
Token with
Token ID351
Feature activation+1.73e-01

Pos logit contributions
ocol+0.062
 rib+0.052
 microsc+0.051
 topp+0.049
 gymn+0.049
Neg logit contributions
 one-0.012
 next-0.011
 someone-0.010
Maybe-0.010
 fast-0.009
Token lush
Token ID37408
Feature activation+2.46e+00

Pos logit contributions
ocol+0.008
 rib+0.007
 microsc+0.007
 gymn+0.007
rou+0.006
Neg logit contributions
Maybe-0.003
 someone-0.003
 dad-0.003
 one-0.003
Mom-0.003
Token green
Token ID4077
Feature activation+1.80e+00

Pos logit contributions
ocol+0.134
rou+0.121
liv+0.119
ifying+0.119
pri+0.118
Neg logit contributions
 water-0.025
 one-0.025
 everything-0.024
 fast-0.021
 sun-0.020
Token grass
Token ID8701
Feature activation+5.82e-01

Pos logit contributions
ocol+0.099
rou+0.087
pri+0.084
liv+0.084
hy+0.083
Neg logit contributions
 one-0.021
 water-0.019
 everything-0.016
 someone-0.016
 next-0.015
Token.
Token ID13
Feature activation-2.54e-01

Pos logit contributions
ocol+0.031
 rib+0.026
 microsc+0.025
 gymn+0.025
rou+0.024
Neg logit contributions
 one-0.007
Maybe-0.007
 someone-0.007
 next-0.007
 dad-0.007
Token The
Token ID383
Feature activation-4.95e-01

Pos logit contributions
Maybe+0.006
 someone+0.006
 dad+0.006
Mom+0.006
 next+0.006
Neg logit contributions
ocol-0.010
 rib-0.009
 microsc-0.008
pill-0.008
 gymn-0.008
Token sun
Token ID4252
Feature activation+2.42e-02

Pos logit contributions
Maybe+0.011
 someone+0.010
Nothing+0.010
Mom+0.010
How+0.009
Neg logit contributions
ocol-0.021
 rib-0.019
 gol-0.017
 gymn-0.016
 microsc-0.016
Token lot
Token ID1256
Feature activation+4.65e-01

Pos logit contributions
Maybe+0.006
 someone+0.006
 one+0.005
How+0.005
 dad+0.005
Neg logit contributions
ocol-0.013
 rib-0.011
 microsc-0.010
 gymn-0.010
rou-0.010
Token of
Token ID286
Feature activation+6.81e-01

Pos logit contributions
ocol+0.017
 rib+0.013
 microsc+0.012
 gymn+0.012
rou+0.012
Neg logit contributions
Maybe-0.014
 someone-0.014
 one-0.014
 next-0.013
 dad-0.013
Token different
Token ID1180
Feature activation+1.09e+00

Pos logit contributions
ocol+0.033
 rib+0.027
 microsc+0.026
rou+0.026
 gymn+0.026
Neg logit contributions
 someone-0.012
 one-0.012
Maybe-0.012
 dad-0.011
 everything-0.011
Token animals
Token ID4695
Feature activation+1.19e-01

Pos logit contributions
ocol+0.055
 rib+0.044
rou+0.043
 gol+0.043
 microsc+0.042
Neg logit contributions
 one-0.019
 someone-0.019
 everything-0.018
 something-0.017
Maybe-0.017
Token and
Token ID290
Feature activation+4.54e-01

Pos logit contributions
ocol+0.006
 rib+0.005
 microsc+0.005
 gymn+0.005
rou+0.005
Neg logit contributions
Maybe-0.002
 someone-0.002
 dad-0.002
 one-0.002
Mom-0.002
Token lush
Token ID37408
Feature activation+2.46e+00

Pos logit contributions
ocol+0.023
 rib+0.019
 microsc+0.018
 gymn+0.018
rou+0.018
Neg logit contributions
 someone-0.007
Maybe-0.007
 one-0.007
 dad-0.007
 next-0.006
Token green
Token ID4077
Feature activation+1.68e+00

Pos logit contributions
ocol+0.135
rou+0.124
ifying+0.123
pri+0.121
bek+0.120
Neg logit contributions
 one-0.024
 water-0.022
 everything-0.020
 fast-0.017
 next-0.016
Token trees
Token ID7150
Feature activation+9.23e-01

Pos logit contributions
ocol+0.091
rou+0.078
 gol+0.077
pri+0.076
 gymn+0.076
Neg logit contributions
 one-0.021
 someone-0.017
 next-0.016
 everything-0.015
 something-0.015
Token.
Token ID13
Feature activation-2.16e-01

Pos logit contributions
ocol+0.051
 rib+0.043
 microsc+0.042
 gymn+0.041
rou+0.041
Neg logit contributions
 one-0.010
 next-0.010
 someone-0.010
Maybe-0.009
 they-0.009
Token
Token ID198
Feature activation-1.79e-01

Pos logit contributions
Maybe+0.005
 someone+0.004
 next+0.004
 dad+0.004
 one+0.004
Neg logit contributions
ocol-0.009
 rib-0.008
 microsc-0.008
 gymn-0.007
pill-0.007
Token
Token ID198
Feature activation+1.72e-01

Pos logit contributions
 someone+0.003
Maybe+0.003
 one+0.003
 dad+0.003
 next+0.003
Neg logit contributions
ocol-0.008
 rib-0.007
 microsc-0.007
 gymn-0.007
rou-0.007
Token
Token ID198
Feature activation-4.30e-01

Pos logit contributions
Maybe+0.014
 someone+0.014
 dad+0.013
 one+0.013
 turned+0.013
Neg logit contributions
ocol-0.030
 rib-0.024
 microsc-0.024
rou-0.023
 gymn-0.023
TokenFinally
Token ID11158
Feature activation-7.61e-01

Pos logit contributions
 someone+0.008
 dad+0.008
Maybe+0.008
 one+0.008
 somewhere+0.008
Neg logit contributions
ocol-0.019
 rib-0.016
 microsc-0.015
 gymn-0.015
rou-0.015
Token Lucy
Token ID22162
Feature activation+4.76e-01

Pos logit contributions
Maybe+0.020
How+0.019
Mom+0.018
What+0.018
Nothing+0.017
Neg logit contributions
ocol-0.031
 rib-0.025
 microsc-0.023
 gymn-0.023
 topp-0.021
Tokenâ
Token ID22940
Feature activation+5.32e-01

Pos logit contributions
ocol+0.024
 rib+0.020
 microsc+0.019
 gymn+0.019
rou+0.019
Neg logit contributions
Maybe-0.008
 someone-0.007
 one-0.007
 dad-0.007
 next-0.007
Token
Token ID26391
Feature activation-2.42e-01

Pos logit contributions
ocol+0.017
 rib+0.013
 microsc+0.012
 gymn+0.012
rou+0.011
Neg logit contributions
Maybe-0.018
 someone-0.018
 one-0.017
 next-0.017
Mom-0.017
Token
Token ID8151
Feature activation+2.44e+00

Pos logit contributions
Maybe+0.006
 someone+0.006
 one+0.005
 dad+0.005
 next+0.005
Neg logit contributions
ocol-0.010
 rib-0.008
 microsc-0.008
rou-0.008
 gymn-0.008
Tokens
Token ID82
Feature activation-2.35e-01

Pos logit contributions
ocol+0.123
 rib+0.103
 microsc+0.094
 former+0.091
 gymn+0.090
Neg logit contributions
Maybe-0.048
 one-0.048
 they-0.047
 everything-0.047
 someone-0.045
Token mom
Token ID1995
Feature activation+1.28e-02

Pos logit contributions
Maybe+0.004
How+0.003
 someone+0.003
What+0.003
Mom+0.003
Neg logit contributions
ocol-0.012
 rib-0.010
 microsc-0.010
rou-0.009
 gymn-0.009
Token said
Token ID531
Feature activation+6.35e-01

Pos logit contributions
ocol+0.001
 rib+0.001
 microsc+0.000
rou+0.000
 gymn+0.000
Neg logit contributions
Maybe-0.000
 someone-0.000
 one-0.000
 dad-0.000
Mom-0.000
Token,
Token ID11
Feature activation-2.73e-01

Pos logit contributions
ocol+0.033
 rib+0.027
 microsc+0.027
rou+0.026
 gymn+0.026
Neg logit contributions
 someone-0.010
 one-0.010
Maybe-0.009
 dad-0.009
 next-0.009
Token
Token ID6184
Feature activation+3.89e-01

Pos logit contributions
Maybe+0.007
 someone+0.007
Mom+0.007
How+0.007
 one+0.006
Neg logit contributions
ocol-0.010
 rib-0.009
 microsc-0.008
 gymn-0.008
rou-0.008
Token
Token ID129
Feature activation+1.03e+00

Pos logit contributions
Maybe+0.012
 someone+0.011
 one+0.011
 dad+0.011
 next+0.011
Neg logit contributions
ocol-0.022
 rib-0.018
 gymn-0.017
rou-0.017
 microsc-0.017
Token
Token ID241
Feature activation+3.71e-01

Pos logit contributions
ocol+0.039
 rib+0.033
 microsc+0.030
 topp+0.030
ifying+0.028
Neg logit contributions
 someone-0.028
 one-0.027
 next-0.027
 dad-0.027
Maybe-0.027
TokenDon
Token ID3987
Feature activation+9.45e-01

Pos logit contributions
ocol+0.018
 rib+0.015
 microsc+0.014
 gymn+0.014
rou+0.014
Neg logit contributions
Maybe-0.006
 someone-0.006
 one-0.006
 dad-0.006
 next-0.006
Tokenâ
Token ID22940
Feature activation+3.55e-01

Pos logit contributions
ocol+0.048
 rib+0.040
 microsc+0.039
 topp+0.038
rou+0.037
Neg logit contributions
 someone-0.018
Maybe-0.017
 next-0.017
Mom-0.016
 one-0.016
Token
Token ID26391
Feature activation-1.92e-01

Pos logit contributions
ocol+0.011
 rib+0.008
 microsc+0.008
 gymn+0.007
rou+0.007
Neg logit contributions
Maybe-0.013
 someone-0.012
 one-0.012
 dad-0.012
 next-0.012
Token
Token ID8151
Feature activation+2.42e+00

Pos logit contributions
Maybe+0.004
 someone+0.004
 one+0.004
 dad+0.004
 next+0.004
Neg logit contributions
ocol-0.008
 rib-0.007
 gymn-0.006
 microsc-0.006
rou-0.006
Tokent
Token ID83
Feature activation-1.32e-01

Pos logit contributions
ocol+0.120
 rib+0.103
 microsc+0.095
 former+0.090
 topp+0.089
Neg logit contributions
Maybe-0.045
 one-0.045
 someone-0.045
 they-0.044
 next-0.043
Token worry
Token ID5490
Feature activation+2.37e-01

Pos logit contributions
Maybe+0.003
 someone+0.003
 dad+0.003
How+0.003
Mom+0.003
Neg logit contributions
ocol-0.006
 rib-0.005
 microsc-0.004
 gymn-0.004
rou-0.004
Token,
Token ID11
Feature activation+4.10e-01

Pos logit contributions
ocol+0.011
 rib+0.010
 microsc+0.009
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token Jack
Token ID3619
Feature activation+4.10e-01

Pos logit contributions
ocol+0.020
 rib+0.017
 microsc+0.016
 gymn+0.016
rou+0.016
Neg logit contributions
 someone-0.007
Maybe-0.007
 one-0.006
 dad-0.006
 next-0.006
Token.
Token ID13
Feature activation-1.50e-01

Pos logit contributions
ocol+0.019
 rib+0.016
 microsc+0.015
 gymn+0.015
rou+0.015
Neg logit contributions
Maybe-0.008
 someone-0.008
 one-0.007
 dad-0.007
 next-0.007
Token,
Token ID11
Feature activation-2.03e-02

Pos logit contributions
 dad+0.001
 someone+0.001
 one+0.001
Maybe+0.001
 everything+0.001
Neg logit contributions
ocol-0.002
 rib-0.001
 microsc-0.001
 gymn-0.001
 gol-0.001
Token why
Token ID1521
Feature activation+7.41e-01

Pos logit contributions
Maybe+0.000
 someone+0.000
 one+0.000
 dad+0.000
Mom+0.000
Neg logit contributions
ocol-0.001
 rib-0.001
 microsc-0.001
 gymn-0.001
rou-0.001
Token don
Token ID836
Feature activation+1.05e-01

Pos logit contributions
ocol+0.033
 rib+0.026
 microsc+0.025
rou+0.024
 gymn+0.024
Neg logit contributions
 someone-0.017
Maybe-0.016
 one-0.016
 dad-0.016
 everything-0.016
Tokenâ
Token ID22940
Feature activation+3.33e-01

Pos logit contributions
ocol+0.005
 rib+0.004
 microsc+0.004
 gymn+0.003
rou+0.003
Neg logit contributions
Maybe-0.002
 someone-0.002
 dad-0.002
 one-0.002
Mom-0.002
Token
Token ID26391
Feature activation+1.85e-01

Pos logit contributions
ocol+0.011
 rib+0.009
 microsc+0.008
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.011
 someone-0.011
 one-0.010
 dad-0.010
 next-0.010
Token
Token ID8151
Feature activation+2.42e+00

Pos logit contributions
ocol+0.009
 rib+0.007
 microsc+0.007
 gymn+0.007
rou+0.007
Neg logit contributions
Maybe-0.003
 someone-0.003
 one-0.003
 dad-0.003
 next-0.003
Tokent
Token ID83
Feature activation+1.23e-01

Pos logit contributions
ocol+0.118
 rib+0.099
 microsc+0.094
 former+0.086
 topp+0.086
Neg logit contributions
Maybe-0.050
 one-0.048
 they-0.047
 someone-0.047
 everything-0.045
Token you
Token ID345
Feature activation+1.92e-01

Pos logit contributions
ocol+0.006
 rib+0.005
 microsc+0.004
 gymn+0.004
rou+0.004
Neg logit contributions
Maybe-0.002
 someone-0.002
 dad-0.002
 one-0.002
Mom-0.002
Token take
Token ID1011
Feature activation-5.76e-01

Pos logit contributions
ocol+0.008
 rib+0.006
 microsc+0.006
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.005
 someone-0.005
 dad-0.005
 one-0.005
Mom-0.005
Token a
Token ID257
Feature activation-1.13e+00

Pos logit contributions
Maybe+0.009
Mom+0.008
 dad+0.008
How+0.008
 next+0.007
Neg logit contributions
ocol-0.029
 rib-0.026
 gymn-0.023
 microsc-0.023
 topp-0.023
Token break
Token ID2270
Feature activation-1.74e-01

Pos logit contributions
Maybe+0.031
How+0.029
What+0.028
Mom+0.028
 someone+0.028
Neg logit contributions
ocol-0.042
 rib-0.037
 gymn-0.035
 tant-0.033
 microsc-0.033
Token you
Token ID345
Feature activation+4.50e-01

Pos logit contributions
Maybe+0.003
 someone+0.003
 dad+0.003
Mom+0.003
 one+0.003
Neg logit contributions
ocol-0.008
 rib-0.006
 gymn-0.006
 microsc-0.006
 topp-0.006
Token want
Token ID765
Feature activation-2.32e-01

Pos logit contributions
ocol+0.021
 rib+0.017
 microsc+0.017
 gymn+0.016
rou+0.016
Neg logit contributions
Maybe-0.009
 someone-0.009
 one-0.008
 dad-0.008
 next-0.008
Token?
Token ID30
Feature activation+7.60e-02

Pos logit contributions
Maybe+0.006
 someone+0.005
 dad+0.005
Mom+0.005
How+0.005
Neg logit contributions
ocol-0.010
 rib-0.008
 microsc-0.007
 gymn-0.007
 topp-0.007
Tokenâ
Token ID22940
Feature activation+3.46e-01

Pos logit contributions
ocol+0.003
 rib+0.003
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.002
 someone-0.002
 dad-0.002
 one-0.002
 next-0.002
Token
Token ID26391
Feature activation-7.14e-01

Pos logit contributions
ocol+0.012
 rib+0.009
 microsc+0.008
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.011
 someone-0.011
 one-0.011
 dad-0.011
 next-0.011
Token
Token ID8151
Feature activation+2.42e+00

Pos logit contributions
Maybe+0.015
 someone+0.015
 one+0.014
 next+0.014
What+0.014
Neg logit contributions
ocol-0.029
 rib-0.024
 microsc-0.024
 gymn-0.024
opot-0.024
Token
Token ID198
Feature activation+3.15e-01

Pos logit contributions
ocol+0.125
 rib+0.107
 former+0.096
 microsc+0.096
 topp+0.094
Neg logit contributions
 someone-0.043
 one-0.043
 they-0.041
 dad-0.041
Maybe-0.040
Token
Token ID198
Feature activation+4.15e-01

Pos logit contributions
ocol+0.016
 rib+0.014
 microsc+0.013
 gymn+0.013
rou+0.013
Neg logit contributions
Maybe-0.005
 someone-0.004
 one-0.004
 dad-0.004
Mom-0.004
TokenAlex
Token ID15309
Feature activation+1.78e-01

Pos logit contributions
ocol+0.019
 rib+0.016
 microsc+0.015
rou+0.015
 gymn+0.015
Neg logit contributions
Maybe-0.008
 someone-0.008
 one-0.008
Mom-0.008
 dad-0.008
Token wanted
Token ID2227
Feature activation+3.01e-02

Pos logit contributions
ocol+0.008
 rib+0.006
 microsc+0.006
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.004
 someone-0.004
 dad-0.004
 one-0.004
Mom-0.003
Token the
Token ID262
Feature activation-1.90e-01

Pos logit contributions
ocol+0.001
 rib+0.001
 microsc+0.001
 gymn+0.001
 topp+0.001
Neg logit contributions
Maybe-0.001
 someone-0.001
Mom-0.001
 dad-0.001
How-0.001
Token felt
Token ID2936
Feature activation-4.31e-01

Pos logit contributions
ocol+0.003
 rib+0.003
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
 one-0.001
Mom-0.001
Token bad
Token ID2089
Feature activation-1.89e-01

Pos logit contributions
Maybe+0.008
 dad+0.008
Mom+0.007
 someone+0.007
How+0.007
Neg logit contributions
ocol-0.019
 rib-0.016
 microsc-0.015
 gymn-0.015
hy-0.015
Token and
Token ID290
Feature activation+5.92e-01

Pos logit contributions
Maybe+0.004
Mom+0.004
 someone+0.004
 dad+0.004
How+0.004
Neg logit contributions
ocol-0.008
 rib-0.007
 gymn-0.007
 microsc-0.007
rou-0.006
Token said
Token ID531
Feature activation+4.27e-01

Pos logit contributions
ocol+0.030
 rib+0.025
 microsc+0.024
 gymn+0.023
rou+0.023
Neg logit contributions
 someone-0.010
Maybe-0.010
 one-0.009
 dad-0.009
 next-0.009
Token,"
Token ID553
Feature activation+1.84e-02

Pos logit contributions
ocol+0.022
 rib+0.018
 microsc+0.018
 gymn+0.017
rou+0.017
Neg logit contributions
 someone-0.007
Maybe-0.006
 one-0.006
 dad-0.006
 next-0.006
TokenI
Token ID40
Feature activation+2.40e+00

Pos logit contributions
ocol+0.001
 rib+0.001
 microsc+0.001
 gymn+0.001
 topp+0.001
Neg logit contributions
 someone-0.000
Maybe-0.000
 dad-0.000
 one-0.000
Mom-0.000
Token'm
Token ID1101
Feature activation+4.15e-01

Pos logit contributions
ocol+0.134
 microsc+0.108
 rib+0.106
ifying+0.103
pill+0.100
Neg logit contributions
Maybe-0.050
 turned-0.045
 not-0.045
 looked-0.044
 one-0.042
Token sorry
Token ID7926
Feature activation-3.96e-01

Pos logit contributions
ocol+0.023
 rib+0.019
 microsc+0.019
 gymn+0.018
rou+0.018
Neg logit contributions
 someone-0.005
Maybe-0.005
 one-0.005
 dad-0.005
 next-0.004
Token,
Token ID11
Feature activation-5.55e-02

Pos logit contributions
Maybe+0.010
 someone+0.010
How+0.009
 dad+0.009
What+0.009
Neg logit contributions
ocol-0.015
 rib-0.012
 gymn-0.012
rou-0.011
 microsc-0.011
Token let
Token ID1309
Feature activation+2.24e-01

Pos logit contributions
Maybe+0.001
 someone+0.001
 dad+0.001
 one+0.001
Mom+0.001
Neg logit contributions
ocol-0.003
 rib-0.002
 microsc-0.002
 gymn-0.002
rou-0.002
Token me
Token ID502
Feature activation+9.13e-02

Pos logit contributions
ocol+0.010
 rib+0.009
 microsc+0.008
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token!
Token ID0
Feature activation+1.52e-01

Pos logit contributions
Maybe+0.001
 someone+0.001
How+0.001
Mom+0.001
 dad+0.001
Neg logit contributions
ocol-0.003
 rib-0.002
 microsc-0.002
 gymn-0.002
rou-0.002
Token Why
Token ID4162
Feature activation+7.26e-01

Pos logit contributions
ocol+0.007
 rib+0.005
 microsc+0.005
 gymn+0.005
rou+0.005
Neg logit contributions
Maybe-0.003
 someone-0.003
 dad-0.003
 one-0.003
 next-0.003
Token don
Token ID836
Feature activation-1.12e-02

Pos logit contributions
ocol+0.031
 rib+0.024
 microsc+0.023
rou+0.022
 gol+0.022
Neg logit contributions
 someone-0.019
Maybe-0.018
 one-0.018
 dad-0.018
 everything-0.017
Tokenâ
Token ID22940
Feature activation+2.32e-01

Pos logit contributions
Maybe+0.000
 someone+0.000
 dad+0.000
 one+0.000
Mom+0.000
Neg logit contributions
ocol-0.000
 rib-0.000
 microsc-0.000
 gymn-0.000
rou-0.000
Token
Token ID26391
Feature activation+1.11e-01

Pos logit contributions
ocol+0.008
 rib+0.006
 microsc+0.006
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.007
 someone-0.007
 one-0.007
 dad-0.007
 next-0.007
Token
Token ID8151
Feature activation+2.40e+00

Pos logit contributions
ocol+0.005
 rib+0.004
 microsc+0.004
 gymn+0.004
rou+0.004
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
 next-0.002
Tokent
Token ID83
Feature activation+8.16e-02

Pos logit contributions
ocol+0.121
 rib+0.103
 microsc+0.096
 former+0.091
 topp+0.090
Neg logit contributions
Maybe-0.045
 one-0.043
 someone-0.043
 they-0.043
 there-0.041
Token you
Token ID345
Feature activation+3.71e-02

Pos logit contributions
ocol+0.004
 rib+0.003
 microsc+0.003
 gymn+0.003
rou+0.003
Neg logit contributions
Maybe-0.002
 someone-0.001
 dad-0.001
 one-0.001
Mom-0.001
Token come
Token ID1282
Feature activation-2.03e-01

Pos logit contributions
ocol+0.002
 rib+0.001
 microsc+0.001
 gymn+0.001
rou+0.001
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
 one-0.001
How-0.001
Token with
Token ID351
Feature activation-2.26e-01

Pos logit contributions
Maybe+0.004
 someone+0.004
 dad+0.004
Mom+0.004
 one+0.004
Neg logit contributions
ocol-0.009
 rib-0.008
 gymn-0.007
 microsc-0.007
rou-0.007
Token me
Token ID502
Feature activation+3.96e-02

Pos logit contributions
Maybe+0.005
 someone+0.005
How+0.005
Mom+0.005
What+0.004
Neg logit contributions
ocol-0.010
 rib-0.008
 microsc-0.008
 gymn-0.007
rou-0.007
Token my
Token ID616
Feature activation+4.56e-01

Pos logit contributions
Maybe+0.009
 someone+0.008
How+0.007
 dad+0.007
Mom+0.007
Neg logit contributions
ocol-0.018
 rib-0.014
rou-0.014
 gymn-0.014
 microsc-0.013
Token friend
Token ID1545
Feature activation+9.39e-02

Pos logit contributions
ocol+0.020
 rib+0.017
 microsc+0.016
 gymn+0.015
rou+0.015
Neg logit contributions
 someone-0.010
Maybe-0.010
 one-0.010
 dad-0.009
 next-0.009
Token?
Token ID30
Feature activation-3.02e-02

Pos logit contributions
ocol+0.004
 rib+0.004
 microsc+0.004
 gymn+0.003
rou+0.003
Neg logit contributions
Maybe-0.002
 someone-0.002
 dad-0.002
 one-0.002
Mom-0.002
Tokenâ
Token ID22940
Feature activation+6.49e-01

Pos logit contributions
Maybe+0.001
 someone+0.001
 dad+0.001
 one+0.001
 next+0.001
Neg logit contributions
ocol-0.001
 rib-0.001
 microsc-0.001
 gymn-0.001
rou-0.001
Token
Token ID26391
Feature activation-4.25e-01

Pos logit contributions
ocol+0.021
 rib+0.017
 microsc+0.016
 gymn+0.014
rou+0.014
Neg logit contributions
Maybe-0.022
 someone-0.021
 next-0.021
 one-0.021
Mom-0.021
Token
Token ID8151
Feature activation+2.40e+00

Pos logit contributions
Maybe+0.010
 someone+0.009
 one+0.009
 next+0.009
 dad+0.009
Neg logit contributions
ocol-0.018
 rib-0.014
 gymn-0.014
 microsc-0.014
rou-0.014
Token The
Token ID383
Feature activation-1.95e-01

Pos logit contributions
ocol+0.129
 rib+0.111
 microsc+0.100
 topp+0.100
 former+0.099
Neg logit contributions
 someone-0.037
 one-0.037
 dad-0.036
 they-0.036
 everything-0.035
Token chicken
Token ID9015
Feature activation-5.52e-01

Pos logit contributions
Maybe+0.005
 someone+0.004
Mom+0.004
 dad+0.004
 everything+0.004
Neg logit contributions
ocol-0.008
 rib-0.007
 gymn-0.006
 microsc-0.006
rou-0.006
Token said
Token ID531
Feature activation+9.88e-01

Pos logit contributions
Maybe+0.013
 someone+0.012
Mom+0.012
How+0.012
What+0.012
Neg logit contributions
ocol-0.023
 rib-0.019
 microsc-0.018
 gymn-0.018
rou-0.017
Token yes
Token ID3763
Feature activation+7.57e-01

Pos logit contributions
ocol+0.048
 rib+0.040
 microsc+0.040
rou+0.038
 topp+0.038
Neg logit contributions
 someone-0.018
 one-0.017
 dad-0.016
Maybe-0.016
 next-0.016
Token and
Token ID290
Feature activation+2.93e-01

Pos logit contributions
ocol+0.040
 rib+0.033
 microsc+0.032
pill+0.031
 topp+0.030
Neg logit contributions
 someone-0.012
 one-0.012
Maybe-0.011
 dad-0.011
 next-0.011
Token,
Token ID11
Feature activation+6.57e-02

Pos logit contributions
Maybe+0.013
How+0.012
 someone+0.012
What+0.012
Mom+0.012
Neg logit contributions
ocol-0.023
 rib-0.020
 gymn-0.019
rou-0.018
hy-0.018
Token if
Token ID611
Feature activation+3.61e-01

Pos logit contributions
ocol+0.003
 rib+0.003
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
 one-0.001
Mom-0.001
Token that
Token ID326
Feature activation-3.34e-01

Pos logit contributions
ocol+0.018
 rib+0.015
 microsc+0.014
 gymn+0.014
rou+0.014
Neg logit contributions
 someone-0.006
Maybe-0.006
 one-0.006
 dad-0.006
 next-0.006
Tokenâ
Token ID22940
Feature activation+2.08e-01

Pos logit contributions
Maybe+0.007
Mom+0.007
 dad+0.007
 someone+0.007
What+0.007
Neg logit contributions
ocol-0.014
 rib-0.012
 gymn-0.011
 microsc-0.011
 topp-0.011
Token
Token ID26391
Feature activation-2.72e-01

Pos logit contributions
ocol+0.006
 rib+0.005
 microsc+0.004
 gymn+0.004
rou+0.004
Neg logit contributions
Maybe-0.008
 someone-0.007
 dad-0.007
 one-0.007
 next-0.007
Token
Token ID8151
Feature activation+2.39e+00

Pos logit contributions
Maybe+0.006
 someone+0.006
 dad+0.006
 one+0.006
 next+0.006
Neg logit contributions
ocol-0.011
 rib-0.009
 gymn-0.009
 microsc-0.009
rou-0.009
Tokens
Token ID82
Feature activation-5.51e-01

Pos logit contributions
ocol+0.114
 rib+0.099
 microsc+0.090
 former+0.090
 topp+0.085
Neg logit contributions
 they-0.048
Maybe-0.047
 one-0.047
 someone-0.047
 there-0.047
Token what
Token ID644
Feature activation+6.19e-01

Pos logit contributions
Maybe+0.013
Mom+0.012
How+0.011
What+0.011
 someone+0.011
Neg logit contributions
ocol-0.023
 rib-0.020
 gymn-0.018
rou-0.018
 microsc-0.018
Token you
Token ID345
Feature activation+4.62e-01

Pos logit contributions
ocol+0.028
 rib+0.023
 microsc+0.022
rou+0.021
 gymn+0.021
Neg logit contributions
 someone-0.013
Maybe-0.013
 one-0.013
 dad-0.013
 next-0.012
Token want
Token ID765
Feature activation+6.47e-02

Pos logit contributions
ocol+0.020
 rib+0.016
 microsc+0.015
 gymn+0.015
rou+0.015
Neg logit contributions
Maybe-0.011
 someone-0.010
 one-0.010
 dad-0.010
 next-0.010
Token to
Token ID284
Feature activation+5.23e-01

Pos logit contributions
ocol+0.003
 rib+0.002
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
 one-0.001
Mom-0.001
Token I
Token ID314
Feature activation+9.36e-01

Pos logit contributions
ocol+0.029
 rib+0.024
 microsc+0.023
rou+0.023
 gymn+0.023
Neg logit contributions
 someone-0.007
Maybe-0.006
 one-0.006
 dad-0.006
 next-0.006
Token do
Token ID466
Feature activation-1.24e-01

Pos logit contributions
ocol+0.043
 rib+0.035
 microsc+0.033
pill+0.032
 topp+0.032
Neg logit contributions
 someone-0.021
 one-0.021
Maybe-0.020
 dad-0.020
 next-0.020
Token?
Token ID30
Feature activation-9.86e-02

Pos logit contributions
Maybe+0.002
 someone+0.002
 dad+0.002
Mom+0.002
 one+0.002
Neg logit contributions
ocol-0.006
 rib-0.005
 gymn-0.005
 microsc-0.005
rou-0.004
Tokenâ
Token ID22940
Feature activation+2.40e-01

Pos logit contributions
Maybe+0.002
 someone+0.002
 dad+0.002
 one+0.002
 next+0.002
Neg logit contributions
ocol-0.004
 rib-0.003
 microsc-0.003
 gymn-0.003
rou-0.003
Token
Token ID26391
Feature activation-6.29e-01

Pos logit contributions
ocol+0.007
 rib+0.005
 microsc+0.005
 gymn+0.005
rou+0.005
Neg logit contributions
Maybe-0.009
 someone-0.008
 one-0.008
 dad-0.008
 next-0.008
Token
Token ID8151
Feature activation+2.38e+00

Pos logit contributions
Maybe+0.014
 someone+0.014
 one+0.013
 next+0.013
 somewhere+0.013
Neg logit contributions
ocol-0.025
 rib-0.021
 microsc-0.020
 gymn-0.020
rou-0.020
Token Her
Token ID2332
Feature activation+2.95e-01

Pos logit contributions
ocol+0.119
 rib+0.102
 topp+0.091
 former+0.091
 microsc+0.090
Neg logit contributions
 someone-0.047
 one-0.046
 dad-0.046
 they-0.046
 there-0.044
Token dad
Token ID9955
Feature activation-3.67e-01

Pos logit contributions
ocol+0.014
 rib+0.012
 microsc+0.011
 gymn+0.011
rou+0.011
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.005
 dad-0.005
 next-0.005
Token replied
Token ID8712
Feature activation+6.51e-01

Pos logit contributions
Maybe+0.009
 someone+0.008
Mom+0.008
How+0.008
 dad+0.008
Neg logit contributions
ocol-0.015
 rib-0.013
 microsc-0.012
rou-0.012
 gymn-0.012
Token,
Token ID11
Feature activation+9.17e-02

Pos logit contributions
ocol+0.033
 rib+0.028
 microsc+0.027
 gymn+0.026
rou+0.026
Neg logit contributions
 someone-0.010
 one-0.010
Maybe-0.009
 dad-0.009
 next-0.009
Token
Token ID6184
Feature activation+3.45e-01

Pos logit contributions
ocol+0.004
 rib+0.003
 microsc+0.003
 gymn+0.003
rou+0.003
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
Mom-0.002
Token.
Token ID13
Feature activation-2.67e-01

Pos logit contributions
ocol+0.004
 rib+0.003
 microsc+0.003
 gymn+0.003
rou+0.003
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
 one-0.001
Mom-0.001
Token They
Token ID1119
Feature activation+6.91e-01

Pos logit contributions
Maybe+0.005
 someone+0.005
 one+0.005
 dad+0.005
 next+0.005
Neg logit contributions
ocol-0.012
 rib-0.010
 microsc-0.010
 gymn-0.009
pill-0.009
Token promised
Token ID8072
Feature activation+4.96e-01

Pos logit contributions
ocol+0.033
 rib+0.026
 microsc+0.025
 gymn+0.025
rou+0.024
Neg logit contributions
Maybe-0.014
 one-0.013
 dad-0.013
 someone-0.013
 everything-0.013
Token to
Token ID284
Feature activation+9.15e-01

Pos logit contributions
ocol+0.021
 rib+0.016
 microsc+0.015
rou+0.015
 topp+0.015
Neg logit contributions
 someone-0.013
Maybe-0.013
 one-0.013
 dad-0.013
 next-0.013
Token play
Token ID711
Feature activation+3.64e-01

Pos logit contributions
ocol+0.046
 rib+0.038
 microsc+0.037
pill+0.037
rou+0.035
Neg logit contributions
 one-0.015
 someone-0.015
 dad-0.015
Mom-0.015
Maybe-0.015
Token and
Token ID290
Feature activation+2.38e+00

Pos logit contributions
ocol+0.019
 rib+0.016
 microsc+0.015
rou+0.015
 gymn+0.015
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.005
 dad-0.005
 next-0.005
Token eat
Token ID4483
Feature activation+4.35e-01

Pos logit contributions
ocol+0.158
pill+0.126
rou+0.125
ifying+0.123
aman+0.123
Neg logit contributions
 they-0.033
 not-0.030
 one-0.027
 look-0.025
 looked-0.025
Token popcorn
Token ID38914
Feature activation+2.73e-02

Pos logit contributions
ocol+0.022
 rib+0.019
 microsc+0.018
rou+0.017
 gymn+0.017
Neg logit contributions
 someone-0.007
Maybe-0.007
 one-0.006
 dad-0.006
 next-0.006
Token together
Token ID1978
Feature activation+1.81e-01

Pos logit contributions
ocol+0.001
 rib+0.001
 microsc+0.001
 gymn+0.001
rou+0.001
Neg logit contributions
Maybe-0.000
 someone-0.000
 dad-0.000
Mom-0.000
 one-0.000
Token every
Token ID790
Feature activation-1.02e+00

Pos logit contributions
ocol+0.009
 rib+0.008
 microsc+0.007
 gymn+0.007
rou+0.007
Neg logit contributions
Maybe-0.003
 someone-0.003
 dad-0.003
 one-0.003
Mom-0.002
Token day
Token ID1110
Feature activation+1.48e-01

Pos logit contributions
Maybe+0.026
Nothing+0.024
 someone+0.024
What+0.023
Someone+0.023
Neg logit contributions
ocol-0.036
 topp-0.031
 gymn-0.031
 rib-0.030
pri-0.029
Token
Token ID129
Feature activation+1.05e+00

Pos logit contributions
Maybe+0.012
 someone+0.012
 one+0.011
 next+0.011
What+0.011
Neg logit contributions
ocol-0.019
 rib-0.016
 microsc-0.016
rou-0.015
 gymn-0.015
Token
Token ID241
Feature activation+3.58e-01

Pos logit contributions
ocol+0.040
 rib+0.034
 topp+0.031
 microsc+0.031
ifying+0.030
Neg logit contributions
 someone-0.028
 next-0.027
 dad-0.027
 one-0.027
Maybe-0.026
TokenDon
Token ID3987
Feature activation+1.10e+00

Pos logit contributions
ocol+0.018
 rib+0.015
 microsc+0.014
 gymn+0.014
rou+0.014
Neg logit contributions
Maybe-0.006
 someone-0.006
 one-0.006
 dad-0.006
 next-0.005
Tokenâ
Token ID22940
Feature activation+4.48e-01

Pos logit contributions
ocol+0.059
 rib+0.049
 microsc+0.048
 topp+0.047
rou+0.045
Neg logit contributions
 someone-0.019
 next-0.019
Maybe-0.019
Mom-0.018
 one-0.017
Token
Token ID26391
Feature activation-1.42e-01

Pos logit contributions
ocol+0.014
 rib+0.011
 microsc+0.010
rou+0.010
 gymn+0.010
Neg logit contributions
Maybe-0.015
 someone-0.015
 one-0.015
 next-0.015
 dad-0.014
Token
Token ID8151
Feature activation+2.37e+00

Pos logit contributions
Maybe+0.003
 someone+0.003
 one+0.003
 dad+0.003
 next+0.003
Neg logit contributions
ocol-0.006
 rib-0.005
 microsc-0.005
 gymn-0.005
rou-0.005
Tokent
Token ID83
Feature activation-4.20e-01

Pos logit contributions
ocol+0.120
 rib+0.104
 microsc+0.093
 former+0.093
 topp+0.090
Neg logit contributions
 someone-0.044
 they-0.044
Maybe-0.043
 one-0.043
Mom-0.042
Token worry
Token ID5490
Feature activation+2.24e-01

Pos logit contributions
Maybe+0.010
How+0.009
 dad+0.009
Mom+0.009
What+0.008
Neg logit contributions
ocol-0.018
 rib-0.014
 gymn-0.014
 microsc-0.014
hy-0.014
Token,
Token ID11
Feature activation+2.27e-01

Pos logit contributions
ocol+0.011
 rib+0.009
 microsc+0.009
 gymn+0.009
rou+0.008
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.003
Token Mat
Token ID6550
Feature activation+1.35e+00

Pos logit contributions
ocol+0.011
 rib+0.009
 microsc+0.009
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.003
 dad-0.003
Mom-0.003
Tokenty
Token ID774
Feature activation+1.07e-01

Pos logit contributions
ocol+0.061
 rib+0.050
 microsc+0.048
 topp+0.047
ifying+0.046
Neg logit contributions
 next-0.031
 someone-0.030
 one-0.029
 dad-0.029
Maybe-0.028
Token,
Token ID11
Feature activation+1.94e-02

Pos logit contributions
ocol+0.022
 rib+0.019
 microsc+0.018
 gymn+0.017
rou+0.017
Neg logit contributions
 someone-0.007
Maybe-0.007
 one-0.007
 dad-0.007
 next-0.006
Token please
Token ID3387
Feature activation-6.48e-01

Pos logit contributions
ocol+0.001
 rib+0.001
 microsc+0.001
 gymn+0.001
rou+0.001
Neg logit contributions
Maybe-0.000
 someone-0.000
 one-0.000
 dad-0.000
Mom-0.000
Token?
Token ID30
Feature activation+2.74e-01

Pos logit contributions
Maybe+0.013
 someone+0.011
Mom+0.011
Someone+0.011
What+0.011
Neg logit contributions
ocol-0.030
 rib-0.025
 gymn-0.024
 microsc-0.023
 topp-0.022
Tokenâ
Token ID22940
Feature activation+3.29e-01

Pos logit contributions
ocol+0.011
 rib+0.009
 microsc+0.009
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.007
 someone-0.007
 one-0.006
 dad-0.006
 next-0.006
Token
Token ID26391
Feature activation-4.61e-01

Pos logit contributions
ocol+0.010
 rib+0.008
 microsc+0.007
 gymn+0.007
rou+0.007
Neg logit contributions
Maybe-0.011
 someone-0.011
 one-0.011
 dad-0.011
 next-0.011
Token
Token ID8151
Feature activation+2.37e+00

Pos logit contributions
 someone+0.011
Maybe+0.011
 one+0.010
 next+0.010
 somewhere+0.010
Neg logit contributions
ocol-0.018
 rib-0.016
 microsc-0.015
 gymn-0.015
rou-0.014
Token
Token ID220
Feature activation+2.94e-01

Pos logit contributions
ocol+0.128
 rib+0.111
 microsc+0.099
 former+0.099
 topp+0.099
Neg logit contributions
 one-0.038
 dad-0.038
 someone-0.037
 mom-0.036
 they-0.035
Token
Token ID198
Feature activation+3.33e-01

Pos logit contributions
ocol+0.015
 rib+0.013
 microsc+0.012
 gymn+0.012
rou+0.012
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token
Token ID198
Feature activation+4.39e-01

Pos logit contributions
ocol+0.017
 rib+0.015
 microsc+0.014
 gymn+0.014
rou+0.014
Neg logit contributions
Maybe-0.005
 someone-0.005
 dad-0.004
 one-0.004
Mom-0.004
TokenHer
Token ID9360
Feature activation+3.98e-01

Pos logit contributions
ocol+0.022
 rib+0.018
 microsc+0.018
rou+0.017
 topp+0.017
Neg logit contributions
Maybe-0.008
 someone-0.007
Mom-0.007
 one-0.007
 dad-0.007
Token mom
Token ID1995
Feature activation-2.00e-01

Pos logit contributions
ocol+0.025
 rib+0.022
 microsc+0.021
 gymn+0.021
rou+0.021
Neg logit contributions
 someone-0.001
Maybe-0.001
 dad-0.001
 one-0.001
 next-0.001
Token he
Token ID339
Feature activation-5.29e-01

Pos logit contributions
ocol+0.029
 rib+0.024
 microsc+0.023
 gymn+0.022
rou+0.022
Neg logit contributions
Maybe-0.013
 one-0.013
 someone-0.013
 dad-0.012
 something-0.012
Token just
Token ID655
Feature activation+3.67e-01

Pos logit contributions
Maybe+0.011
 someone+0.010
Mom+0.010
How+0.010
What+0.010
Neg logit contributions
ocol-0.023
 rib-0.019
 microsc-0.018
 gymn-0.018
rou-0.018
Token couldn
Token ID3521
Feature activation+8.45e-01

Pos logit contributions
ocol+0.017
 rib+0.014
 microsc+0.013
 gymn+0.013
rou+0.013
Neg logit contributions
Maybe-0.007
 someone-0.007
 one-0.007
 dad-0.007
 next-0.006
Tokenâ
Token ID22940
Feature activation+6.48e-01

Pos logit contributions
ocol+0.040
 rib+0.033
 microsc+0.032
rou+0.031
 topp+0.030
Neg logit contributions
 someone-0.018
 one-0.017
Maybe-0.017
 next-0.016
 dad-0.015
Token
Token ID26391
Feature activation+9.74e-02

Pos logit contributions
ocol+0.022
 rib+0.017
 microsc+0.016
rou+0.015
 topp+0.015
Neg logit contributions
Maybe-0.021
 someone-0.021
 one-0.021
 next-0.020
Mom-0.020
Token
Token ID8151
Feature activation+2.36e+00

Pos logit contributions
ocol+0.004
 rib+0.004
 microsc+0.003
 gymn+0.003
rou+0.003
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
 next-0.002
Tokent
Token ID83
Feature activation-7.12e-01

Pos logit contributions
ocol+0.117
 rib+0.096
 microsc+0.093
 topp+0.088
 gymn+0.086
Neg logit contributions
 one-0.047
Maybe-0.046
 someone-0.045
 they-0.045
 next-0.043
Token reach
Token ID3151
Feature activation-6.25e-01

Pos logit contributions
Maybe+0.015
How+0.014
Mom+0.014
 dad+0.014
 someone+0.013
Neg logit contributions
ocol-0.031
 rib-0.025
 gymn-0.024
hy-0.024
 microsc-0.023
Token it
Token ID340
Feature activation-7.79e-01

Pos logit contributions
Maybe+0.012
Mom+0.011
How+0.011
 dad+0.011
 someone+0.010
Neg logit contributions
ocol-0.029
 rib-0.025
 microsc-0.023
 gymn-0.023
rou-0.022
Token.
Token ID13
Feature activation+3.06e-01

Pos logit contributions
Maybe+0.016
Mom+0.015
 dad+0.015
How+0.015
What+0.014
Neg logit contributions
ocol-0.033
 rib-0.029
 gymn-0.027
 microsc-0.027
rou-0.026
Token Then
Token ID3244
Feature activation-1.73e-01

Pos logit contributions
ocol+0.014
 rib+0.012
 microsc+0.011
 gymn+0.011
rou+0.011
Neg logit contributions
Maybe-0.006
 someone-0.006
 one-0.006
 dad-0.006
 next-0.006
Token
Token ID129
Feature activation+9.27e-01

Pos logit contributions
Maybe+0.023
 someone+0.023
 one+0.022
What+0.022
Mom+0.022
Neg logit contributions
ocol-0.033
 rib-0.027
opot-0.027
 microsc-0.027
oc-0.026
Token
Token ID241
Feature activation+7.59e-01

Pos logit contributions
ocol+0.036
 rib+0.029
 microsc+0.027
 topp+0.027
ifying+0.026
Neg logit contributions
 someone-0.025
 one-0.025
 next-0.025
 dad-0.024
Maybe-0.024
TokenDon
Token ID3987
Feature activation+1.14e+00

Pos logit contributions
ocol+0.037
 rib+0.031
 microsc+0.030
 gymn+0.028
rou+0.028
Neg logit contributions
Maybe-0.015
 someone-0.014
 one-0.013
 dad-0.013
Mom-0.012
Tokenâ
Token ID22940
Feature activation+3.39e-01

Pos logit contributions
ocol+0.060
 rib+0.050
 microsc+0.050
 topp+0.047
rou+0.047
Neg logit contributions
 next-0.019
 someone-0.019
Maybe-0.019
 one-0.018
Mom-0.017
Token
Token ID26391
Feature activation-3.35e-01

Pos logit contributions
ocol+0.012
 rib+0.009
 microsc+0.009
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.010
 someone-0.010
 one-0.010
 dad-0.010
 next-0.010
Token
Token ID8151
Feature activation+2.36e+00

Pos logit contributions
Maybe+0.007
 someone+0.007
 one+0.007
Mom+0.006
 dad+0.006
Neg logit contributions
ocol-0.014
 rib-0.012
 microsc-0.011
 gymn-0.011
 gol-0.011
Tokent
Token ID83
Feature activation-5.19e-01

Pos logit contributions
ocol+0.119
 rib+0.107
 former+0.100
 microsc+0.095
 gymn+0.093
Neg logit contributions
 someone-0.041
 they-0.040
 one-0.040
Mom-0.040
 there-0.039
Token worry
Token ID5490
Feature activation+6.18e-01

Pos logit contributions
Maybe+0.012
How+0.012
Mom+0.011
What+0.011
 dad+0.011
Neg logit contributions
ocol-0.021
 rib-0.017
 microsc-0.017
 gymn-0.017
 gol-0.016
Token,
Token ID11
Feature activation+4.99e-01

Pos logit contributions
ocol+0.032
 rib+0.027
 microsc+0.026
 gymn+0.025
rou+0.025
Neg logit contributions
 someone-0.010
 dad-0.009
 one-0.009
Maybe-0.009
 next-0.008
Token I
Token ID314
Feature activation+6.72e-01

Pos logit contributions
ocol+0.026
 rib+0.022
 microsc+0.021
 gymn+0.021
rou+0.020
Neg logit contributions
 someone-0.007
Maybe-0.007
 one-0.007
 dad-0.007
 next-0.006
Tokenâ
Token ID22940
Feature activation+1.28e-01

Pos logit contributions
ocol+0.029
 rib+0.023
 microsc+0.022
 topp+0.021
 gymn+0.021
Neg logit contributions
Maybe-0.017
 someone-0.016
 one-0.016
 dad-0.016
 next-0.015
Tokenâ
Token ID22940
Feature activation+2.04e-01

Pos logit contributions
ocol+0.022
 rib+0.018
 microsc+0.017
rou+0.017
 gymn+0.017
Neg logit contributions
Maybe-0.013
 someone-0.013
 one-0.012
 dad-0.012
Mom-0.012
Token
Token ID26391
Feature activation-2.71e-01

Pos logit contributions
ocol+0.007
 rib+0.005
 microsc+0.005
 gymn+0.004
rou+0.004
Neg logit contributions
Maybe-0.007
 someone-0.007
 dad-0.007
 one-0.007
 next-0.006
Token Sam
Token ID3409
Feature activation+2.65e-01

Pos logit contributions
Maybe+0.006
 someone+0.006
 one+0.006
 next+0.006
 dad+0.006
Neg logit contributions
ocol-0.011
 rib-0.009
 microsc-0.009
rou-0.009
 gymn-0.009
Tokenâ
Token ID22940
Feature activation+4.84e-01

Pos logit contributions
ocol+0.012
 rib+0.010
 microsc+0.009
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.005
 someone-0.005
 dad-0.005
 one-0.005
 next-0.005
Token
Token ID26391
Feature activation-2.19e-01

Pos logit contributions
ocol+0.016
 rib+0.013
 microsc+0.012
 gymn+0.011
rou+0.011
Neg logit contributions
Maybe-0.016
 someone-0.016
 one-0.015
 next-0.015
 dad-0.015
Token
Token ID8151
Feature activation+2.36e+00

Pos logit contributions
Maybe+0.005
 someone+0.005
 one+0.005
 dad+0.005
 next+0.005
Neg logit contributions
ocol-0.009
 rib-0.007
 gymn-0.007
 microsc-0.007
rou-0.007
Tokens
Token ID82
Feature activation-1.20e-01

Pos logit contributions
ocol+0.114
 rib+0.095
 microsc+0.089
 former+0.085
 gymn+0.083
Neg logit contributions
Maybe-0.050
 one-0.048
 someone-0.047
 everything-0.046
 they-0.046
Token eyes
Token ID2951
Feature activation+1.40e-01

Pos logit contributions
Maybe+0.003
 someone+0.003
How+0.002
Mom+0.002
 one+0.002
Neg logit contributions
ocol-0.005
 rib-0.004
 microsc-0.004
 gymn-0.004
rou-0.004
Token lit
Token ID6578
Feature activation+1.04e+00

Pos logit contributions
ocol+0.006
 rib+0.005
 microsc+0.005
 gymn+0.004
rou+0.004
Neg logit contributions
Maybe-0.003
 someone-0.003
 dad-0.003
 one-0.003
Mom-0.003
Token up
Token ID510
Feature activation+7.43e-01

Pos logit contributions
ocol+0.057
 rib+0.048
 microsc+0.046
pill+0.046
opot+0.046
Neg logit contributions
 one-0.013
 someone-0.013
 something-0.012
 next-0.012
 everything-0.011
Token and
Token ID290
Feature activation-7.32e-02

Pos logit contributions
ocol+0.037
 rib+0.032
 microsc+0.030
 gymn+0.029
rou+0.029
Neg logit contributions
 someone-0.012
 one-0.011
Maybe-0.011
 dad-0.011
 next-0.011
Token says
Token ID1139
Feature activation+1.35e-01

Pos logit contributions
 someone+0.012
Maybe+0.012
 one+0.011
 dad+0.011
 somewhere+0.011
Neg logit contributions
ocol-0.016
 rib-0.013
 microsc-0.012
 gymn-0.012
rou-0.012
Token.
Token ID13
Feature activation-9.44e-01

Pos logit contributions
ocol+0.007
 rib+0.006
 microsc+0.006
 gymn+0.005
rou+0.005
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
Mom-0.002
Token She
Token ID1375
Feature activation+1.50e-01

Pos logit contributions
 somewhere+0.024
 someone+0.023
Maybe+0.023
ths+0.023
 one+0.023
Neg logit contributions
ocol-0.028
 rib-0.026
 gymn-0.024
 microsc-0.024
 tant-0.022
Token hears
Token ID23180
Feature activation-9.43e-01

Pos logit contributions
ocol+0.007
 rib+0.006
 microsc+0.005
 gymn+0.005
rou+0.005
Neg logit contributions
Maybe-0.003
 someone-0.003
 dad-0.003
 one-0.003
Mom-0.003
Token the
Token ID262
Feature activation+3.52e-01

Pos logit contributions
Maybe+0.024
How+0.021
Nothing+0.020
Mom+0.020
cil+0.020
Neg logit contributions
ocol-0.038
 rib-0.032
 microsc-0.029
 gymn-0.028
 tant-0.027
Token noise
Token ID7838
Feature activation-2.40e+00

Pos logit contributions
ocol+0.018
 rib+0.015
 microsc+0.014
 gymn+0.014
rou+0.014
Neg logit contributions
Maybe-0.005
 someone-0.005
 dad-0.005
 one-0.005
 next-0.005
Token and
Token ID290
Feature activation-8.71e-01

Pos logit contributions
cil+0.092
 Mysteries+0.080
Someone+0.079
 Rays+0.078
 gentlemen+0.077
Neg logit contributions
ocol-0.053
 gymn-0.043
 rib-0.041
oke-0.041
 tant-0.040
Token comes
Token ID2058
Feature activation-2.25e-01

Pos logit contributions
Mom+0.021
Maybe+0.020
Dad+0.019
 someone+0.019
 somewhere+0.018
Neg logit contributions
ocol-0.034
 rib-0.031
 gymn-0.028
 microsc-0.027
rou-0.026
Token to
Token ID284
Feature activation+5.72e-01

Pos logit contributions
Maybe+0.006
 someone+0.006
How+0.006
 one+0.006
 dad+0.006
Neg logit contributions
ocol-0.008
 rib-0.007
 microsc-0.006
 gymn-0.006
rou-0.006
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
ocol+0.030
 rib+0.025
rou+0.025
 microsc+0.025
pill+0.024
Neg logit contributions
 someone-0.008
Maybe-0.008
 dad-0.008
 one-0.008
 next-0.007
Token attic
Token ID46113
Feature activation+3.06e-01

Pos logit contributions
ocol+0.053
 rib+0.042
pill+0.040
 microsc+0.040
opot+0.040
Neg logit contributions
 someone-0.018
 dad-0.018
 next-0.017
Maybe-0.017
 one-0.017
Token very
Token ID845
Feature activation-1.31e+00

Pos logit contributions
ocol+0.009
 rib+0.007
 microsc+0.007
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.006
 someone-0.006
 one-0.006
 dad-0.006
Mom-0.006
Token much
Token ID881
Feature activation+5.21e-01

Pos logit contributions
How+0.030
Maybe+0.030
What+0.029
Where+0.027
 someone+0.025
Neg logit contributions
ocol-0.060
hy-0.046
 microsc-0.045
 rib-0.045
rou-0.044
Token.
Token ID13
Feature activation+9.83e-01

Pos logit contributions
ocol+0.028
 rib+0.024
 microsc+0.023
 gymn+0.022
rou+0.022
Neg logit contributions
 someone-0.006
Maybe-0.006
 one-0.006
 dad-0.006
 next-0.006
Token She
Token ID1375
Feature activation-2.42e-01

Pos logit contributions
ocol+0.051
 rib+0.043
rou+0.040
 microsc+0.040
 topp+0.040
Neg logit contributions
 one-0.016
 someone-0.015
Maybe-0.015
 dad-0.014
 everything-0.013
Token hugged
Token ID41130
Feature activation-1.27e-01

Pos logit contributions
 someone+0.006
Maybe+0.006
 dad+0.005
Mom+0.005
How+0.005
Neg logit contributions
ocol-0.010
 rib-0.008
 microsc-0.008
 gymn-0.008
rou-0.007
Token it
Token ID340
Feature activation-2.39e+00

Pos logit contributions
Maybe+0.003
 someone+0.002
How+0.002
Mom+0.002
 next+0.002
Neg logit contributions
ocol-0.006
 rib-0.005
 microsc-0.005
 gymn-0.004
 topp-0.004
Token every
Token ID790
Feature activation-7.71e-01

Pos logit contributions
Nothing+0.076
Where+0.075
What+0.074
How+0.073
Maybe+0.072
Neg logit contributions
 rib-0.064
 anx-0.057
 tre-0.054
 stuffed-0.054
rou-0.054
Token night
Token ID1755
Feature activation+2.50e-01

Pos logit contributions
Maybe+0.017
 someone+0.015
How+0.015
They+0.015
Nothing+0.015
Neg logit contributions
ocol-0.033
 rib-0.028
 gymn-0.026
 topp-0.025
 gol-0.025
Token and
Token ID290
Feature activation+4.18e-01

Pos logit contributions
ocol+0.012
 rib+0.010
 microsc+0.009
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.004
 dad-0.004
Mom-0.004
Token played
Token ID2826
Feature activation+5.71e-01

Pos logit contributions
ocol+0.020
 rib+0.017
 microsc+0.016
 gymn+0.016
rou+0.016
Neg logit contributions
Maybe-0.007
 someone-0.007
 one-0.007
 dad-0.007
 next-0.007
Token with
Token ID351
Feature activation+1.06e-01

Pos logit contributions
ocol+0.031
 rib+0.026
 microsc+0.025
rou+0.025
ifying+0.024
Neg logit contributions
 someone-0.008
 one-0.007
Maybe-0.007
 next-0.007
 dad-0.007
Token
Token ID198
Feature activation-7.89e-01

Pos logit contributions
Maybe+0.006
 one+0.006
 next+0.006
 someone+0.006
 dad+0.006
Neg logit contributions
ocol-0.015
 rib-0.012
 microsc-0.012
 gymn-0.012
 gol-0.011
Token
Token ID198
Feature activation-7.51e-01

Pos logit contributions
 one+0.016
Maybe+0.016
 someone+0.015
How+0.015
 dad+0.015
Neg logit contributions
ocol-0.031
 microsc-0.025
 rib-0.025
rou-0.024
 gymn-0.024
TokenMom
Token ID29252
Feature activation-5.20e-01

Pos logit contributions
 someone+0.012
 one+0.012
 dad+0.011
Maybe+0.011
 look+0.011
Neg logit contributions
ocol-0.033
 rib-0.028
 microsc-0.027
 gol-0.027
rou-0.026
Token hears
Token ID23180
Feature activation-7.99e-01

Pos logit contributions
 someone+0.011
Maybe+0.011
 one+0.010
 dad+0.010
 somewhere+0.010
Neg logit contributions
ocol-0.022
 rib-0.019
rou-0.018
 microsc-0.018
 gymn-0.018
Token the
Token ID262
Feature activation-1.68e-01

Pos logit contributions
Maybe+0.019
How+0.017
Mom+0.016
Nothing+0.016
cil+0.016
Neg logit contributions
ocol-0.034
 rib-0.029
 microsc-0.026
 gymn-0.025
 gol-0.024
Token noise
Token ID7838
Feature activation-2.38e+00

Pos logit contributions
Maybe+0.003
 someone+0.003
How+0.003
 one+0.003
Mom+0.003
Neg logit contributions
ocol-0.008
 rib-0.007
 microsc-0.006
 gymn-0.006
 topp-0.006
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
cil+0.089
Someone+0.081
Maybe+0.080
 Mysteries+0.079
Mother+0.079
Neg logit contributions
ocol-0.052
 gymn-0.042
oke-0.042
 rib-0.040
opot-0.036
Token comes
Token ID2058
Feature activation+4.53e-02

Pos logit contributions
Mom+0.024
Maybe+0.024
cil+0.023
 someone+0.023
 somewhere+0.022
Neg logit contributions
ocol-0.039
 rib-0.035
 gymn-0.031
 microsc-0.030
pill-0.029
Token to
Token ID284
Feature activation+6.57e-01

Pos logit contributions
ocol+0.002
 rib+0.001
 microsc+0.001
 gymn+0.001
rou+0.001
Neg logit contributions
Maybe-0.001
 someone-0.001
 one-0.001
 dad-0.001
Mom-0.001
Token the
Token ID262
Feature activation-3.37e-03

Pos logit contributions
ocol+0.037
 rib+0.031
rou+0.030
 microsc+0.030
pill+0.030
Neg logit contributions
 someone-0.008
Maybe-0.008
 dad-0.007
 one-0.007
 next-0.007
Token garage
Token ID15591
Feature activation-4.17e-01

Pos logit contributions
Maybe+0.000
 someone+0.000
 one+0.000
 dad+0.000
Mom+0.000
Neg logit contributions
ocol-0.000
 rib-0.000
 microsc-0.000
 gymn-0.000
rou-0.000
Token.
Token ID13
Feature activation+1.23e-01

Pos logit contributions
Maybe+0.001
 someone+0.001
 dad+0.001
 one+0.001
How+0.001
Neg logit contributions
ocol-0.003
 rib-0.003
 microsc-0.003
rou-0.003
 gymn-0.003
Token Lily
Token ID20037
Feature activation-1.88e-01

Pos logit contributions
ocol+0.006
 rib+0.005
 microsc+0.005
 gymn+0.005
rou+0.004
Neg logit contributions
Maybe-0.002
 someone-0.002
 dad-0.002
 one-0.002
 next-0.002
Token had
Token ID550
Feature activation+6.01e-01

Pos logit contributions
 someone+0.004
Maybe+0.004
 dad+0.004
 one+0.004
How+0.004
Neg logit contributions
ocol-0.007
 rib-0.006
 microsc-0.006
 gymn-0.006
rou-0.006
Token a
Token ID257
Feature activation+5.12e-01

Pos logit contributions
ocol+0.038
 rib+0.032
 microsc+0.031
rou+0.031
 gymn+0.031
Neg logit contributions
 someone-0.003
Maybe-0.003
 one-0.003
 dad-0.003
Mom-0.002
Token bath
Token ID7837
Feature activation-8.73e-01

Pos logit contributions
ocol+0.025
 rib+0.020
 microsc+0.020
rou+0.019
 gymn+0.019
Neg logit contributions
 someone-0.009
 dad-0.009
Maybe-0.009
 one-0.009
 next-0.008
Tokentub
Token ID37995
Feature activation-2.36e+00

Pos logit contributions
Maybe+0.023
What+0.022
 someone+0.022
How+0.022
Mom+0.021
Neg logit contributions
ocol-0.030
 rib-0.026
 gymn-0.024
rou-0.024
 gol-0.023
Token in
Token ID287
Feature activation-4.98e-01

Pos logit contributions
Maybe+0.072
Someone+0.068
What+0.067
Mother+0.066
Parent+0.066
Neg logit contributions
ocol-0.075
 rib-0.065
hy-0.062
 tre-0.061
rou-0.059
Token her
Token ID607
Feature activation-3.62e-02

Pos logit contributions
Maybe+0.012
How+0.011
 dad+0.011
Nothing+0.011
ths+0.011
Neg logit contributions
ocol-0.020
 rib-0.017
 microsc-0.016
 gymn-0.015
 topp-0.014
Token bathroom
Token ID12436
Feature activation-1.20e+00

Pos logit contributions
Maybe+0.001
 someone+0.001
 one+0.001
How+0.001
What+0.001
Neg logit contributions
ocol-0.002
 rib-0.001
 microsc-0.001
 gymn-0.001
rou-0.001
Token.
Token ID13
Feature activation+2.90e-01

Pos logit contributions
Maybe+0.030
What+0.029
Someone+0.027
Nothing+0.027
How+0.026
Neg logit contributions
ocol-0.045
 rib-0.040
 gymn-0.038
rou-0.038
 topp-0.037
Token She
Token ID1375
Feature activation+1.03e-01

Pos logit contributions
ocol+0.014
 rib+0.011
 microsc+0.011
 gymn+0.011
rou+0.011
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.005
 dad-0.005
Mom-0.005
Token every
Token ID790
Feature activation-1.33e+00

Pos logit contributions
Maybe+0.009
How+0.008
Mom+0.007
 someone+0.007
 dad+0.007
Neg logit contributions
ocol-0.018
 rib-0.015
 microsc-0.014
 gymn-0.013
pill-0.013
Token night
Token ID1755
Feature activation+1.94e-01

Pos logit contributions
Maybe+0.035
Nothing+0.032
How+0.031
What+0.031
Mom+0.030
Neg logit contributions
ocol-0.050
 rib-0.042
 gymn-0.039
ifying-0.037
 topp-0.037
Token.
Token ID13
Feature activation+5.80e-01

Pos logit contributions
ocol+0.010
 rib+0.008
 microsc+0.008
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.003
 someone-0.003
 dad-0.003
 one-0.003
Mom-0.002
Token She
Token ID1375
Feature activation-2.66e-01

Pos logit contributions
ocol+0.029
 rib+0.024
 microsc+0.023
rou+0.023
 gol+0.022
Neg logit contributions
 someone-0.010
 one-0.010
Maybe-0.010
 dad-0.009
 everything-0.009
Token hugs
Token ID40657
Feature activation-2.58e-01

Pos logit contributions
 someone+0.007
Maybe+0.007
Mom+0.007
 dad+0.007
What+0.006
Neg logit contributions
ocol-0.010
 rib-0.009
 microsc-0.008
 gymn-0.008
rou-0.008
Token it
Token ID340
Feature activation-2.33e+00

Pos logit contributions
Maybe+0.005
 someone+0.004
How+0.004
Mom+0.004
Nothing+0.004
Neg logit contributions
ocol-0.012
 rib-0.010
 gymn-0.009
 microsc-0.009
rou-0.009
Token and
Token ID290
Feature activation-2.89e-01

Pos logit contributions
Nothing+0.079
Where+0.077
What+0.077
Maybe+0.077
How+0.075
Neg logit contributions
 rib-0.059
 stuffed-0.053
 tre-0.049
 gymn-0.047
rou-0.046
Token squee
Token ID13170
Feature activation-1.40e-01

Pos logit contributions
Maybe+0.007
 someone+0.006
Mom+0.006
How+0.006
 next+0.006
Neg logit contributions
ocol-0.012
 rib-0.011
 microsc-0.010
 gymn-0.009
 topp-0.009
Tokenzes
Token ID12271
Feature activation+1.48e-01

Pos logit contributions
Maybe+0.003
 someone+0.003
 one+0.003
 dad+0.003
 next+0.003
Neg logit contributions
ocol-0.006
 rib-0.005
 microsc-0.005
 gymn-0.005
 topp-0.004
Token it
Token ID340
Feature activation-1.66e+00

Pos logit contributions
ocol+0.008
 rib+0.007
 microsc+0.007
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.002
 someone-0.002
 dad-0.002
 one-0.002
Mom-0.001
Token tight
Token ID5381
Feature activation+6.09e-01

Pos logit contributions
Dad+0.046
Someone+0.045
Maybe+0.045
Nothing+0.045
What+0.044
Neg logit contributions
 rib-0.054
ocol-0.049
 gymn-0.046
 topp-0.042
hy-0.042
Token
Token ID198
Feature activation-5.62e-01

Pos logit contributions
Maybe+0.009
 someone+0.009
 one+0.009
 next+0.009
 somewhere+0.008
Neg logit contributions
ocol-0.019
 rib-0.016
 microsc-0.016
 gymn-0.015
 topp-0.015
Token
Token ID198
Feature activation-4.13e-01

Pos logit contributions
Maybe+0.011
 one+0.011
 someone+0.011
 dad+0.010
How+0.010
Neg logit contributions
ocol-0.023
 rib-0.019
 microsc-0.019
 gymn-0.019
rou-0.018
TokenMom
Token ID29252
Feature activation-5.50e-01

Pos logit contributions
 someone+0.007
Maybe+0.007
 dad+0.007
 one+0.007
 somewhere+0.006
Neg logit contributions
ocol-0.019
 rib-0.016
 microsc-0.015
 gymn-0.015
ifying-0.015
Token hears
Token ID23180
Feature activation-4.85e-01

Pos logit contributions
 someone+0.013
Maybe+0.012
 dad+0.011
 one+0.011
 somewhere+0.011
Neg logit contributions
ocol-0.023
 rib-0.020
rou-0.019
 microsc-0.018
 gymn-0.018
Token the
Token ID262
Feature activation+3.58e-01

Pos logit contributions
Maybe+0.010
How+0.009
Mom+0.009
 someone+0.009
Nothing+0.009
Neg logit contributions
ocol-0.022
 rib-0.018
 microsc-0.017
 gymn-0.017
 topp-0.016
Token noise
Token ID7838
Feature activation-2.29e+00

Pos logit contributions
ocol+0.019
 rib+0.016
 microsc+0.015
 gymn+0.015
rou+0.015
Neg logit contributions
 someone-0.005
Maybe-0.005
 dad-0.005
 one-0.005
 next-0.005
Token.
Token ID13
Feature activation-5.94e-01

Pos logit contributions
cil+0.084
Someone+0.078
Maybe+0.077
Mother+0.076
 Mysteries+0.075
Neg logit contributions
ocol-0.053
 gymn-0.043
oke-0.041
 rib-0.040
opot-0.037
Token She
Token ID1375
Feature activation-3.34e-01

Pos logit contributions
Maybe+0.015
 somewhere+0.015
 someone+0.014
How+0.014
 one+0.014
Neg logit contributions
ocol-0.021
 rib-0.018
 microsc-0.017
 gymn-0.016
pill-0.016
Token comes
Token ID2058
Feature activation-7.34e-02

Pos logit contributions
 someone+0.007
Maybe+0.007
 dad+0.006
Mom+0.006
 one+0.006
Neg logit contributions
ocol-0.015
 rib-0.013
 gymn-0.012
 microsc-0.012
rou-0.012
Token to
Token ID284
Feature activation+1.63e-01

Pos logit contributions
Maybe+0.002
 someone+0.002
How+0.002
 one+0.002
Mom+0.002
Neg logit contributions
ocol-0.002
 rib-0.002
 microsc-0.002
 gymn-0.002
rou-0.002
Token see
Token ID766
Feature activation-1.14e+00

Pos logit contributions
ocol+0.008
 rib+0.007
 microsc+0.007
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.003
 someone-0.002
 one-0.002
 dad-0.002
 next-0.002
Token.
Token ID13
Feature activation-2.73e-01

Pos logit contributions
ocol+0.017
 rib+0.014
 microsc+0.014
 gymn+0.013
rou+0.013
Neg logit contributions
 someone-0.005
Maybe-0.005
 one-0.005
 dad-0.005
 next-0.004
Token His
Token ID2399
Feature activation-6.50e-02

Pos logit contributions
Maybe+0.008
 someone+0.007
 somewhere+0.007
 next+0.007
How+0.007
Neg logit contributions
ocol-0.009
 rib-0.008
 microsc-0.007
 gymn-0.007
 topp-0.007
Token name
Token ID1438
Feature activation-8.85e-01

Pos logit contributions
Maybe+0.001
 someone+0.001
How+0.001
 one+0.001
What+0.001
Neg logit contributions
ocol-0.003
 rib-0.003
 microsc-0.002
 gymn-0.002
rou-0.002
Token was
Token ID373
Feature activation+6.98e-01

Pos logit contributions
How+0.024
Maybe+0.024
 gentlemen+0.024
 dad+0.024
Nothing+0.023
Neg logit contributions
ocol-0.028
 rib-0.024
 gymn-0.022
hy-0.021
 topp-0.021
Token Joe
Token ID5689
Feature activation+4.39e-01

Pos logit contributions
ocol+0.031
 rib+0.026
 microsc+0.024
rou+0.023
 gymn+0.023
Neg logit contributions
 someone-0.016
 dad-0.016
 one-0.015
Maybe-0.015
 everything-0.015
Token,
Token ID11
Feature activation-2.29e+00

Pos logit contributions
ocol+0.023
 rib+0.020
 microsc+0.019
 gymn+0.019
rou+0.018
Neg logit contributions
 someone-0.006
Maybe-0.006
 one-0.006
 dad-0.006
 next-0.005
Token and
Token ID290
Feature activation-5.97e-01

Pos logit contributions
 halftime+0.062
Days+0.060
Parent+0.057
ths+0.056
Mom+0.056
Neg logit contributions
 rib-0.065
ocol-0.063
rou-0.062
 gymn-0.061
 such-0.059
Token he
Token ID339
Feature activation+2.00e-02

Pos logit contributions
Maybe+0.012
Mom+0.011
 somewhere+0.010
How+0.010
 someone+0.010
Neg logit contributions
ocol-0.025
 rib-0.022
 microsc-0.022
 gymn-0.020
rou-0.020
Token was
Token ID373
Feature activation+4.04e-01

Pos logit contributions
ocol+0.001
 rib+0.001
 microsc+0.001
 gymn+0.001
rou+0.001
Neg logit contributions
Maybe-0.000
 someone-0.000
 dad-0.000
 one-0.000
Mom-0.000
Token wearing
Token ID5762
Feature activation+1.33e-01

Pos logit contributions
ocol+0.021
 rib+0.018
 microsc+0.017
 gymn+0.016
 topp+0.016
Neg logit contributions
 someone-0.006
Maybe-0.006
 one-0.006
 dad-0.006
 next-0.006
Token red
Token ID2266
Feature activation+5.16e-01

Pos logit contributions
ocol+0.007
 rib+0.006
 microsc+0.006
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.002
 someone-0.001
 dad-0.001
 one-0.001
Mom-0.001
Token
Token ID198
Feature activation-7.83e-01

Pos logit contributions
Maybe+0.005
 one+0.005
 someone+0.005
 dad+0.005
 next+0.005
Neg logit contributions
ocol-0.012
 rib-0.011
 microsc-0.010
 gymn-0.010
 gol-0.010
Token
Token ID198
Feature activation-6.74e-01

Pos logit contributions
Maybe+0.017
 one+0.016
 someone+0.016
How+0.016
 looked+0.015
Neg logit contributions
ocol-0.030
 rib-0.025
 microsc-0.025
 gymn-0.024
rou-0.024
TokenMom
Token ID29252
Feature activation-4.14e-01

Pos logit contributions
 somewhere+0.010
Maybe+0.010
 someone+0.010
 looked+0.010
 one+0.010
Neg logit contributions
ocol-0.031
 rib-0.025
 microsc-0.025
 gymn-0.025
 gol-0.025
Token hears
Token ID23180
Feature activation-7.83e-01

Pos logit contributions
 someone+0.009
Maybe+0.008
 dad+0.008
 one+0.008
 somewhere+0.007
Neg logit contributions
ocol-0.018
 rib-0.015
rou-0.015
 gymn-0.014
 microsc-0.014
Token the
Token ID262
Feature activation+3.43e-01

Pos logit contributions
Maybe+0.019
How+0.017
Mom+0.016
Nothing+0.016
cil+0.016
Neg logit contributions
ocol-0.033
 rib-0.028
 microsc-0.026
 gymn-0.024
pill-0.023
Token noise
Token ID7838
Feature activation-2.29e+00

Pos logit contributions
ocol+0.018
 rib+0.015
 microsc+0.015
 gymn+0.014
rou+0.014
Neg logit contributions
 someone-0.005
Maybe-0.005
 dad-0.004
 one-0.004
 next-0.004
Token and
Token ID290
Feature activation-9.61e-01

Pos logit contributions
cil+0.084
Someone+0.075
Mother+0.075
Maybe+0.074
 gentlemen+0.074
Neg logit contributions
ocol-0.055
 gymn-0.044
oke-0.043
 rib-0.042
opot-0.037
Token comes
Token ID2058
Feature activation+3.04e-01

Pos logit contributions
Mom+0.023
Maybe+0.022
 someone+0.022
cil+0.021
 somewhere+0.021
Neg logit contributions
ocol-0.038
 rib-0.034
 gymn-0.030
 microsc-0.030
pill-0.028
Token to
Token ID284
Feature activation+5.53e-01

Pos logit contributions
ocol+0.012
 rib+0.010
 microsc+0.009
 gymn+0.009
rou+0.009
Neg logit contributions
 someone-0.008
Maybe-0.008
 dad-0.008
 one-0.008
 next-0.008
Token see
Token ID766
Feature activation-9.89e-01

Pos logit contributions
ocol+0.030
 rib+0.026
 microsc+0.025
rou+0.025
pill+0.024
Neg logit contributions
 someone-0.007
Maybe-0.007
 dad-0.006
 one-0.006
 next-0.006
Token what
Token ID644
Feature activation-3.93e-01

Pos logit contributions
Maybe+0.017
ths+0.016
Someone+0.016
cil+0.016
Days+0.016
Neg logit contributions
ocol-0.046
 rib-0.039
 microsc-0.037
rou-0.033
 gymn-0.033
Token
Token ID198
Feature activation-8.03e-01

Pos logit contributions
Maybe+0.005
 one+0.005
 dad+0.005
 someone+0.005
 next+0.005
Neg logit contributions
ocol-0.010
 rib-0.009
 microsc-0.008
 gymn-0.008
 gol-0.008
Token
Token ID198
Feature activation-8.37e-01

Pos logit contributions
 one+0.017
Maybe+0.017
 someone+0.016
 dad+0.016
 forth+0.016
Neg logit contributions
ocol-0.031
 microsc-0.025
 rib-0.024
 gymn-0.024
rou-0.024
TokenMom
Token ID29252
Feature activation-5.24e-01

Pos logit contributions
 dad+0.017
 look+0.017
 somewhere+0.017
 someone+0.017
 one+0.017
Neg logit contributions
ocol-0.032
 rib-0.026
 microsc-0.026
 gol-0.025
oc-0.025
Token hears
Token ID23180
Feature activation-5.39e-01

Pos logit contributions
 someone+0.011
Maybe+0.011
 one+0.010
 dad+0.010
Mom+0.010
Neg logit contributions
ocol-0.022
 rib-0.019
rou-0.018
 microsc-0.018
 gymn-0.018
Token the
Token ID262
Feature activation+1.07e+00

Pos logit contributions
Maybe+0.012
How+0.011
Mom+0.010
Nothing+0.010
 someone+0.010
Neg logit contributions
ocol-0.024
 rib-0.020
 microsc-0.018
 gymn-0.017
 gol-0.017
Token noise
Token ID7838
Feature activation-2.28e+00

Pos logit contributions
ocol+0.066
 rib+0.053
ifying+0.052
pill+0.052
 microsc+0.052
Neg logit contributions
 dad-0.011
 next-0.011
 someone-0.010
 one-0.009
 mom-0.009
Token and
Token ID290
Feature activation-1.04e+00

Pos logit contributions
cil+0.087
Maybe+0.079
Someone+0.079
How+0.078
 gentlemen+0.077
Neg logit contributions
ocol-0.050
oke-0.041
 gymn-0.038
 rib-0.037
opot-0.033
Token comes
Token ID2058
Feature activation+2.47e-01

Pos logit contributions
Mom+0.025
Maybe+0.024
 someone+0.024
cil+0.023
 somewhere+0.023
Neg logit contributions
ocol-0.040
 rib-0.036
 microsc-0.031
 gymn-0.031
pill-0.029
Token to
Token ID284
Feature activation+7.04e-01

Pos logit contributions
ocol+0.010
 rib+0.008
 microsc+0.007
 gymn+0.007
rou+0.007
Neg logit contributions
Maybe-0.007
 someone-0.007
 one-0.007
 dad-0.007
 next-0.006
Token see
Token ID766
Feature activation-6.21e-01

Pos logit contributions
ocol+0.039
 rib+0.033
rou+0.032
 microsc+0.032
pill+0.032
Neg logit contributions
 someone-0.009
Maybe-0.008
 dad-0.008
 one-0.008
 next-0.007
Token what
Token ID644
Feature activation-4.88e-02

Pos logit contributions
Maybe+0.009
cil+0.008
How+0.008
 dad+0.007
Mom+0.007
Neg logit contributions
ocol-0.031
 rib-0.026
 microsc-0.025
rou-0.023
 gymn-0.023
Token
Token ID198
Feature activation-5.10e-01

Pos logit contributions
Maybe+0.004
 one+0.004
 someone+0.004
 next+0.004
 somewhere+0.004
Neg logit contributions
ocol-0.010
 rib-0.009
 microsc-0.009
 gymn-0.008
 gol-0.008
Token
Token ID198
Feature activation-3.75e-01

Pos logit contributions
Maybe+0.010
 one+0.009
 someone+0.009
 dad+0.009
How+0.009
Neg logit contributions
ocol-0.022
 rib-0.018
 microsc-0.018
 gymn-0.018
rou-0.017
TokenMom
Token ID29252
Feature activation-5.90e-01

Pos logit contributions
Maybe+0.006
 someone+0.006
 one+0.006
 dad+0.006
 somewhere+0.006
Neg logit contributions
ocol-0.017
 rib-0.014
 microsc-0.014
 gol-0.013
 gymn-0.013
Token hears
Token ID23180
Feature activation-6.74e-01

Pos logit contributions
 someone+0.013
Maybe+0.012
 one+0.012
 dad+0.012
 somewhere+0.011
Neg logit contributions
ocol-0.025
 rib-0.021
rou-0.020
 microsc-0.020
 gymn-0.020
Token the
Token ID262
Feature activation+4.71e-01

Pos logit contributions
Maybe+0.016
How+0.014
Nothing+0.013
 next+0.013
cil+0.013
Neg logit contributions
ocol-0.029
 rib-0.024
 microsc-0.022
 gymn-0.021
 topp-0.020
Token noise
Token ID7838
Feature activation-2.26e+00

Pos logit contributions
ocol+0.026
 rib+0.022
 microsc+0.021
 gymn+0.020
rou+0.020
Neg logit contributions
 someone-0.006
Maybe-0.006
 dad-0.006
 next-0.006
 one-0.005
Token and
Token ID290
Feature activation-9.02e-01

Pos logit contributions
cil+0.086
Someone+0.078
How+0.077
Maybe+0.077
Mother+0.077
Neg logit contributions
ocol-0.049
 gymn-0.041
 rib-0.040
oke-0.038
hy-0.034
Token comes
Token ID2058
Feature activation+1.37e-01

Pos logit contributions
Mom+0.021
Maybe+0.021
cil+0.021
 someone+0.020
ble+0.020
Neg logit contributions
ocol-0.035
 rib-0.032
 gymn-0.028
 microsc-0.027
rou-0.026
Token to
Token ID284
Feature activation+5.60e-01

Pos logit contributions
ocol+0.005
 rib+0.004
 microsc+0.004
 gymn+0.004
rou+0.004
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
Mom-0.004
Token the
Token ID262
Feature activation+1.08e+00

Pos logit contributions
ocol+0.031
 rib+0.026
 microsc+0.025
rou+0.025
pill+0.025
Neg logit contributions
 someone-0.007
Maybe-0.007
 dad-0.007
 one-0.006
 next-0.006
Token room
Token ID2119
Feature activation-1.94e-01

Pos logit contributions
ocol+0.061
 rib+0.048
pill+0.046
 gol+0.046
rou+0.045
Neg logit contributions
 someone-0.019
 next-0.019
 dad-0.017
Maybe-0.017
Mom-0.017
Token
Token ID198
Feature activation-7.03e-01

Pos logit contributions
Maybe+0.009
 someone+0.009
 one+0.008
 next+0.008
 somewhere+0.008
Neg logit contributions
ocol-0.019
 rib-0.016
 gymn-0.015
 microsc-0.015
pill-0.014
Token
Token ID198
Feature activation-2.84e-01

Pos logit contributions
 one+0.014
Maybe+0.014
 someone+0.013
 dad+0.013
How+0.013
Neg logit contributions
ocol-0.028
 rib-0.023
 microsc-0.023
 gymn-0.023
pri-0.021
TokenMom
Token ID29252
Feature activation-7.50e-01

Pos logit contributions
 someone+0.004
Maybe+0.004
 dad+0.004
 one+0.004
 somewhere+0.004
Neg logit contributions
ocol-0.014
 rib-0.012
 microsc-0.011
 gymn-0.011
ifying-0.011
Token hears
Token ID23180
Feature activation-5.31e-01

Pos logit contributions
 someone+0.018
Maybe+0.017
 somewhere+0.016
 one+0.016
 dad+0.016
Neg logit contributions
ocol-0.031
 rib-0.026
rou-0.025
 microsc-0.024
 gymn-0.024
Token the
Token ID262
Feature activation-8.63e-02

Pos logit contributions
Maybe+0.011
Mom+0.009
How+0.009
 someone+0.009
 next+0.009
Neg logit contributions
ocol-0.024
 rib-0.020
 microsc-0.019
 gymn-0.018
 topp-0.017
Token noise
Token ID7838
Feature activation-2.26e+00

Pos logit contributions
Maybe+0.002
 someone+0.002
How+0.001
 one+0.001
Mom+0.001
Neg logit contributions
ocol-0.004
 rib-0.003
 microsc-0.003
 gymn-0.003
rou-0.003
Token and
Token ID290
Feature activation-1.14e+00

Pos logit contributions
cil+0.079
Someone+0.074
Maybe+0.073
Mother+0.073
How+0.073
Neg logit contributions
ocol-0.057
 gymn-0.047
 rib-0.046
oke-0.042
hy-0.041
Token comes
Token ID2058
Feature activation+7.40e-02

Pos logit contributions
Mom+0.031
ble+0.030
cil+0.029
 somewhere+0.029
Maybe+0.029
Neg logit contributions
ocol-0.040
 rib-0.038
 gymn-0.032
 microsc-0.031
 spec-0.030
Token to
Token ID284
Feature activation+2.02e-01

Pos logit contributions
ocol+0.003
 rib+0.002
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
Mom-0.002
Token see
Token ID766
Feature activation-1.27e+00

Pos logit contributions
ocol+0.011
 rib+0.009
 microsc+0.009
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
 next-0.002
Token what
Token ID644
Feature activation-4.78e-01

Pos logit contributions
 forth+0.020
ths+0.019
cil+0.019
Days+0.018
Years+0.018
Neg logit contributions
ocol-0.058
 rib-0.050
 microsc-0.046
 gymn-0.043
ifying-0.042
Token safe
Token ID3338
Feature activation+1.75e-01

Pos logit contributions
Maybe+0.016
Mom+0.015
 someone+0.013
Parent+0.013
Dad+0.013
Neg logit contributions
ocol-0.034
 rib-0.029
rou-0.028
 gymn-0.027
 microsc-0.026
Token.
Token ID13
Feature activation+1.83e-01

Pos logit contributions
ocol+0.009
 rib+0.007
 microsc+0.007
 gymn+0.007
rou+0.007
Neg logit contributions
Maybe-0.003
 someone-0.003
 dad-0.003
 one-0.003
Mom-0.003
Token His
Token ID2399
Feature activation-6.64e-01

Pos logit contributions
ocol+0.008
 rib+0.006
 microsc+0.006
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token parents
Token ID3397
Feature activation-1.03e-01

Pos logit contributions
Maybe+0.013
How+0.012
What+0.011
Nothing+0.010
Where+0.010
Neg logit contributions
ocol-0.031
 rib-0.027
 microsc-0.026
 gymn-0.025
 former-0.024
Token made
Token ID925
Feature activation+3.82e-01

Pos logit contributions
 someone+0.002
Maybe+0.002
 dad+0.002
How+0.002
 one+0.002
Neg logit contributions
ocol-0.004
 rib-0.004
 microsc-0.003
rou-0.003
 gymn-0.003
Token him
Token ID683
Feature activation-2.26e+00

Pos logit contributions
ocol+0.018
 rib+0.015
 microsc+0.014
 gymn+0.014
rou+0.013
Neg logit contributions
 someone-0.008
Maybe-0.008
 one-0.007
 dad-0.007
 next-0.007
Token feel
Token ID1254
Feature activation-1.74e-01

Pos logit contributions
Mom+0.049
Where+0.049
How+0.047
Maybe+0.047
What+0.047
Neg logit contributions
ocol-0.085
 rib-0.081
rou-0.078
hy-0.075
 gymn-0.073
Token so
Token ID523
Feature activation+2.16e-01

Pos logit contributions
Maybe+0.003
 someone+0.003
 dad+0.003
Mom+0.003
 one+0.003
Neg logit contributions
ocol-0.008
 rib-0.007
rou-0.007
 microsc-0.007
 gymn-0.007
Token special
Token ID2041
Feature activation+4.23e-01

Pos logit contributions
ocol+0.011
 rib+0.009
 microsc+0.009
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.004
 someone-0.003
 one-0.003
 dad-0.003
Mom-0.003
Token and
Token ID290
Feature activation+7.55e-01

Pos logit contributions
ocol+0.021
 rib+0.018
 microsc+0.017
 gymn+0.017
rou+0.017
Neg logit contributions
 someone-0.007
Maybe-0.007
 one-0.006
 dad-0.006
 next-0.006
Token loved
Token ID6151
Feature activation+7.07e-01

Pos logit contributions
ocol+0.039
 rib+0.033
 microsc+0.031
 gymn+0.031
pill+0.030
Neg logit contributions
 someone-0.012
Maybe-0.011
 one-0.011
 dad-0.011
 next-0.011
Token
Token ID198
Feature activation-8.16e-01

Pos logit contributions
Maybe+0.000
 someone+0.000
 one+0.000
 dad+0.000
 next+0.000
Neg logit contributions
ocol-0.000
 rib-0.000
 microsc-0.000
 gymn-0.000
 gol-0.000
Token
Token ID198
Feature activation-6.95e-01

Pos logit contributions
 one+0.018
Maybe+0.017
 someone+0.017
How+0.017
 forth+0.017
Neg logit contributions
ocol-0.031
 microsc-0.025
 rib-0.025
rou-0.024
 gymn-0.024
TokenMom
Token ID29252
Feature activation-3.50e-01

Pos logit contributions
 someone+0.012
 one+0.012
 dad+0.012
 looked+0.012
 somewhere+0.012
Neg logit contributions
ocol-0.029
 rib-0.024
 microsc-0.023
rou-0.023
oc-0.023
Token heard
Token ID2982
Feature activation-8.01e-01

Pos logit contributions
 someone+0.007
Maybe+0.007
 one+0.006
 dad+0.006
 somewhere+0.006
Neg logit contributions
ocol-0.015
 rib-0.013
rou-0.012
 microsc-0.012
 gymn-0.012
Token the
Token ID262
Feature activation-2.73e-01

Pos logit contributions
Maybe+0.020
cil+0.017
How+0.017
Someone+0.017
Nothing+0.017
Neg logit contributions
ocol-0.033
 rib-0.028
 microsc-0.025
 gymn-0.024
 gol-0.023
Token noise
Token ID7838
Feature activation-2.26e+00

Pos logit contributions
Maybe+0.005
 someone+0.005
How+0.005
 one+0.005
Mom+0.005
Neg logit contributions
ocol-0.012
 rib-0.011
 microsc-0.010
 gymn-0.010
 topp-0.010
Token and
Token ID290
Feature activation-1.12e+00

Pos logit contributions
cil+0.084
Maybe+0.075
How+0.075
 gentlemen+0.074
Mother+0.074
Neg logit contributions
ocol-0.054
oke-0.042
 gymn-0.042
 rib-0.041
hy-0.036
Token came
Token ID1625
Feature activation-7.54e-02

Pos logit contributions
cil+0.026
ble+0.025
Maybe+0.025
 somewhere+0.025
How+0.024
Neg logit contributions
ocol-0.040
 rib-0.038
 gymn-0.032
 microsc-0.032
oke-0.030
Token to
Token ID284
Feature activation+6.49e-01

Pos logit contributions
Maybe+0.002
 someone+0.002
 one+0.002
 dad+0.002
Mom+0.002
Neg logit contributions
ocol-0.003
 rib-0.002
 microsc-0.002
 gymn-0.002
rou-0.002
Token see
Token ID766
Feature activation-1.01e+00

Pos logit contributions
ocol+0.037
 rib+0.031
rou+0.030
 microsc+0.030
pill+0.030
Neg logit contributions
 someone-0.008
Maybe-0.007
 dad-0.007
 one-0.007
 next-0.006
Token what
Token ID644
Feature activation-1.67e-01

Pos logit contributions
cil+0.016
Maybe+0.015
Someone+0.015
ble+0.015
ths+0.015
Neg logit contributions
ocol-0.047
 rib-0.039
 microsc-0.038
 gymn-0.034
rou-0.034
Token
Token ID198
Feature activation-7.80e-01

Pos logit contributions
Maybe+0.005
 one+0.004
 next+0.004
 someone+0.004
 dad+0.004
Neg logit contributions
ocol-0.011
 rib-0.009
 microsc-0.009
 gymn-0.009
 gol-0.009
Token
Token ID198
Feature activation-5.91e-01

Pos logit contributions
 one+0.017
Maybe+0.017
How+0.016
 looked+0.015
 someone+0.015
Neg logit contributions
ocol-0.031
 rib-0.025
 microsc-0.025
 gymn-0.024
rou-0.024
TokenMom
Token ID29252
Feature activation-6.81e-01

Pos logit contributions
Maybe+0.010
 one+0.010
 looked+0.010
 someone+0.010
 dad+0.010
Neg logit contributions
ocol-0.026
 rib-0.021
 microsc-0.021
 gol-0.020
ifying-0.020
Token heard
Token ID2982
Feature activation-7.65e-01

Pos logit contributions
 someone+0.015
 one+0.014
Maybe+0.014
 dad+0.013
 somewhere+0.013
Neg logit contributions
ocol-0.027
 rib-0.024
rou-0.023
 microsc-0.022
 gymn-0.022
Token the
Token ID262
Feature activation+2.50e-01

Pos logit contributions
Maybe+0.018
How+0.016
cil+0.015
Mom+0.015
 next+0.015
Neg logit contributions
ocol-0.033
 rib-0.028
 microsc-0.025
 gymn-0.024
pill-0.023
Token noise
Token ID7838
Feature activation-2.24e+00

Pos logit contributions
ocol+0.013
 rib+0.011
 microsc+0.010
 gymn+0.010
rou+0.010
Neg logit contributions
Maybe-0.004
 someone-0.004
 dad-0.003
 one-0.003
 next-0.003
Token and
Token ID290
Feature activation-1.24e+00

Pos logit contributions
cil+0.082
 gentlemen+0.075
Mother+0.074
Someone+0.074
Maybe+0.074
Neg logit contributions
ocol-0.053
 gymn-0.043
 rib-0.042
oke-0.039
hy-0.037
Token came
Token ID1625
Feature activation-1.70e-01

Pos logit contributions
cil+0.031
ble+0.030
Maybe+0.029
 somewhere+0.029
 one+0.028
Neg logit contributions
ocol-0.042
 rib-0.040
 microsc-0.033
 gymn-0.033
rou-0.031
Token to
Token ID284
Feature activation+5.68e-01

Pos logit contributions
Maybe+0.005
 someone+0.005
 one+0.004
 dad+0.004
How+0.004
Neg logit contributions
ocol-0.006
 rib-0.005
 microsc-0.005
 gymn-0.005
rou-0.005
Token see
Token ID766
Feature activation-1.20e+00

Pos logit contributions
ocol+0.032
 rib+0.027
 microsc+0.026
rou+0.026
 topp+0.026
Neg logit contributions
 someone-0.006
Maybe-0.006
 dad-0.006
 one-0.006
 next-0.005
Token what
Token ID644
Feature activation-3.04e-01

Pos logit contributions
ths+0.023
Maybe+0.021
Days+0.021
cil+0.021
Years+0.021
Neg logit contributions
ocol-0.055
 rib-0.045
 microsc-0.045
rou-0.039
 gymn-0.039
Token mess
Token ID2085
Feature activation-3.33e-02

Pos logit contributions
Maybe+0.004
 someone+0.004
How+0.004
 dad+0.004
 one+0.004
Neg logit contributions
ocol-0.007
 rib-0.006
 microsc-0.006
 gymn-0.006
 topp-0.006
Token.
Token ID13
Feature activation-5.63e-01

Pos logit contributions
Maybe+0.001
 someone+0.001
 dad+0.001
 one+0.000
Mom+0.000
Neg logit contributions
ocol-0.002
 rib-0.001
 microsc-0.001
 gymn-0.001
rou-0.001
Token Mom
Token ID11254
Feature activation-5.90e-01

Pos logit contributions
Maybe+0.012
 next+0.012
 one+0.012
 somewhere+0.012
 forth+0.011
Neg logit contributions
ocol-0.022
 microsc-0.019
 rib-0.018
 gol-0.017
 gymn-0.017
Token hears
Token ID23180
Feature activation-8.46e-01

Pos logit contributions
 someone+0.014
Maybe+0.013
 dad+0.013
 one+0.013
 somewhere+0.012
Neg logit contributions
ocol-0.024
 rib-0.020
rou-0.020
 gymn-0.019
 topp-0.019
Token the
Token ID262
Feature activation+2.75e-01

Pos logit contributions
Maybe+0.020
How+0.018
Mom+0.017
Nothing+0.017
cil+0.017
Neg logit contributions
ocol-0.036
 rib-0.030
 microsc-0.028
 gymn-0.026
rou-0.025
Token noise
Token ID7838
Feature activation-2.24e+00

Pos logit contributions
ocol+0.014
 rib+0.012
 microsc+0.011
 gymn+0.011
rou+0.011
Neg logit contributions
Maybe-0.004
 someone-0.004
 dad-0.004
 one-0.004
 next-0.004
Token and
Token ID290
Feature activation-8.04e-01

Pos logit contributions
cil+0.082
Someone+0.075
Maybe+0.074
 gentlemen+0.074
 Mysteries+0.073
Neg logit contributions
ocol-0.051
oke-0.041
 gymn-0.039
 rib-0.039
rou-0.036
Token comes
Token ID2058
Feature activation+1.95e-01

Pos logit contributions
Mom+0.018
Maybe+0.018
 someone+0.017
cil+0.017
 somewhere+0.016
Neg logit contributions
ocol-0.033
 rib-0.030
 gymn-0.026
 microsc-0.026
rou-0.025
Token to
Token ID284
Feature activation+7.04e-01

Pos logit contributions
ocol+0.008
 rib+0.006
 microsc+0.006
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.005
 dad-0.005
Mom-0.005
Token see
Token ID766
Feature activation-6.62e-01

Pos logit contributions
ocol+0.039
 rib+0.032
rou+0.032
 microsc+0.032
pill+0.032
Neg logit contributions
 someone-0.009
 dad-0.009
Maybe-0.009
 one-0.008
 next-0.008
Token.
Token ID13
Feature activation-5.33e-01

Pos logit contributions
Maybe+0.011
cil+0.010
Someone+0.009
 looked+0.009
How+0.009
Neg logit contributions
ocol-0.032
 rib-0.027
 microsc-0.025
rou-0.024
 gymn-0.023
Token
Token ID198
Feature activation-3.00e-01

Pos logit contributions
Maybe+0.010
 one+0.009
 someone+0.009
 next+0.009
 dad+0.009
Neg logit contributions
ocol-0.021
 rib-0.018
 microsc-0.017
 gymn-0.017
pill-0.016
Token
Token ID198
Feature activation-2.64e-01

Pos logit contributions
Maybe+0.005
 one+0.005
 someone+0.005
 dad+0.005
How+0.005
Neg logit contributions
ocol-0.014
 rib-0.011
 microsc-0.011
 gymn-0.011
rou-0.011
TokenMom
Token ID29252
Feature activation-7.90e-01

Pos logit contributions
 someone+0.004
 dad+0.004
Maybe+0.004
 one+0.004
 somewhere+0.004
Neg logit contributions
ocol-0.012
 rib-0.010
 microsc-0.010
 gymn-0.010
rou-0.010
Token hears
Token ID23180
Feature activation-9.38e-01

Pos logit contributions
 someone+0.019
Maybe+0.017
 one+0.017
 dad+0.017
 something+0.016
Neg logit contributions
ocol-0.031
 rib-0.027
rou-0.026
 microsc-0.025
 gymn-0.025
Token the
Token ID262
Feature activation+1.32e-01

Pos logit contributions
Maybe+0.023
How+0.020
cil+0.020
Mom+0.020
Nothing+0.019
Neg logit contributions
ocol-0.040
 rib-0.032
 microsc-0.030
 gymn-0.028
pill-0.028
Token noise
Token ID7838
Feature activation-2.24e+00

Pos logit contributions
ocol+0.007
 rib+0.006
 microsc+0.005
 gymn+0.005
rou+0.005
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
Mom-0.002
Token and
Token ID290
Feature activation-1.07e+00

Pos logit contributions
cil+0.079
Someone+0.073
Maybe+0.072
Mother+0.071
How+0.070
Neg logit contributions
ocol-0.058
oke-0.044
 gymn-0.044
 rib-0.041
opot-0.040
Token comes
Token ID2058
Feature activation+2.05e-01

Pos logit contributions
Mom+0.027
Maybe+0.026
 someone+0.026
cil+0.025
 somewhere+0.025
Neg logit contributions
ocol-0.040
 rib-0.036
 gymn-0.031
 microsc-0.031
pill-0.030
Token to
Token ID284
Feature activation+5.63e-01

Pos logit contributions
ocol+0.008
 rib+0.006
 microsc+0.006
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.006
 someone-0.005
 one-0.005
 dad-0.005
 next-0.005
Token the
Token ID262
Feature activation+5.13e-01

Pos logit contributions
ocol+0.032
 rib+0.027
rou+0.027
 microsc+0.027
 topp+0.026
Neg logit contributions
 someone-0.006
Maybe-0.006
 dad-0.005
 one-0.005
 next-0.005
Token room
Token ID2119
Feature activation-2.37e-01

Pos logit contributions
ocol+0.025
 rib+0.020
 microsc+0.019
rou+0.019
 gymn+0.018
Neg logit contributions
 someone-0.010
Maybe-0.010
 dad-0.010
 one-0.010
 next-0.010
Token to
Token ID284
Feature activation+2.27e-01

Pos logit contributions
ocol+0.035
 rib+0.028
 microsc+0.027
rou+0.026
pill+0.026
Neg logit contributions
 someone-0.016
 one-0.015
Maybe-0.015
 dad-0.015
 next-0.014
Token end
Token ID886
Feature activation+1.97e-01

Pos logit contributions
ocol+0.011
 rib+0.010
 microsc+0.009
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.003
 dad-0.003
 next-0.003
Token.
Token ID13
Feature activation-3.18e-01

Pos logit contributions
ocol+0.010
 rib+0.008
 microsc+0.008
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.003
 someone-0.003
 dad-0.003
 one-0.003
Mom-0.003
Token Sarah
Token ID10490
Feature activation+5.87e-02

Pos logit contributions
Maybe+0.006
 someone+0.006
 next+0.006
 dad+0.006
 one+0.006
Neg logit contributions
ocol-0.013
 microsc-0.011
 rib-0.011
 gymn-0.010
rou-0.010
Token gave
Token ID2921
Feature activation-4.17e-02

Pos logit contributions
ocol+0.003
 rib+0.002
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.001
 someone-0.001
 one-0.001
 dad-0.001
How-0.001
Token Jack
Token ID3619
Feature activation-2.24e+00

Pos logit contributions
Maybe+0.001
 someone+0.001
How+0.001
Mom+0.001
 next+0.001
Neg logit contributions
ocol-0.002
 rib-0.002
 microsc-0.001
 gymn-0.001
rou-0.001
Token one
Token ID530
Feature activation-7.29e-01

Pos logit contributions
How+0.045
 knock+0.045
Maybe+0.044
Where+0.044
 Blood+0.044
Neg logit contributions
 rib-0.083
ocol-0.080
rou-0.076
 tre-0.073
 such-0.073
Token last
Token ID938
Feature activation-8.90e-01

Pos logit contributions
Maybe+0.015
 someone+0.014
How+0.014
What+0.014
Mom+0.014
Neg logit contributions
ocol-0.030
 rib-0.027
rou-0.025
 gymn-0.024
 topp-0.024
Token squeeze
Token ID21229
Feature activation+3.26e-01

Pos logit contributions
Maybe+0.022
How+0.022
Mom+0.021
What+0.021
 someone+0.021
Neg logit contributions
ocol-0.034
 rib-0.030
rou-0.026
 gymn-0.026
hy-0.026
Token and
Token ID290
Feature activation-2.25e-01

Pos logit contributions
ocol+0.016
 rib+0.014
 microsc+0.013
 gymn+0.013
rou+0.013
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.005
 dad-0.005
 next-0.005
Token they
Token ID484
Feature activation+3.38e-01

Pos logit contributions
Maybe+0.004
 someone+0.004
Mom+0.004
How+0.004
What+0.003
Neg logit contributions
ocol-0.010
 rib-0.009
 microsc-0.009
 gymn-0.008
rou-0.008
Token
Token ID198
Feature activation-7.78e-01

Pos logit contributions
Maybe+0.007
 someone+0.006
 one+0.006
 next+0.006
 dad+0.006
Neg logit contributions
ocol-0.016
 rib-0.014
 microsc-0.013
 gymn-0.013
ifying-0.012
Token
Token ID198
Feature activation-7.33e-01

Pos logit contributions
Maybe+0.016
 one+0.015
 someone+0.015
 dad+0.015
How+0.015
Neg logit contributions
ocol-0.031
 rib-0.025
 microsc-0.025
 gymn-0.025
rou-0.024
TokenMom
Token ID29252
Feature activation-6.19e-01

Pos logit contributions
Maybe+0.012
 someone+0.011
 look+0.011
 somewhere+0.011
 dad+0.011
Neg logit contributions
ocol-0.032
 rib-0.027
ifying-0.026
 microsc-0.026
 gymn-0.026
Token hears
Token ID23180
Feature activation-4.11e-01

Pos logit contributions
 someone+0.014
Maybe+0.013
 one+0.012
 dad+0.012
 somewhere+0.012
Neg logit contributions
ocol-0.026
 rib-0.022
rou-0.021
 microsc-0.021
 gymn-0.021
Token the
Token ID262
Feature activation+4.23e-01

Pos logit contributions
Maybe+0.008
 someone+0.007
How+0.007
Mom+0.007
 next+0.007
Neg logit contributions
ocol-0.019
 rib-0.016
 microsc-0.015
 gymn-0.014
 topp-0.014
Token noise
Token ID7838
Feature activation-2.24e+00

Pos logit contributions
ocol+0.023
 rib+0.019
 microsc+0.018
 gymn+0.018
rou+0.018
Neg logit contributions
 someone-0.006
Maybe-0.006
 dad-0.006
 one-0.005
 next-0.005
Token.
Token ID13
Feature activation-7.42e-01

Pos logit contributions
cil+0.078
Maybe+0.073
Someone+0.072
How+0.072
Mother+0.072
Neg logit contributions
ocol-0.056
 gymn-0.046
 rib-0.045
oke-0.043
hy-0.040
Token She
Token ID1375
Feature activation-1.78e-01

Pos logit contributions
 somewhere+0.019
 someone+0.018
 one+0.018
Maybe+0.018
How+0.018
Neg logit contributions
ocol-0.024
 rib-0.021
 microsc-0.020
 gymn-0.019
pill-0.018
Token comes
Token ID2058
Feature activation-9.02e-02

Pos logit contributions
 someone+0.003
Maybe+0.003
 one+0.003
 dad+0.003
Mom+0.003
Neg logit contributions
ocol-0.008
 rib-0.007
 microsc-0.007
 gymn-0.006
rou-0.006
Token to
Token ID284
Feature activation+3.14e-01

Pos logit contributions
Maybe+0.003
 someone+0.002
How+0.002
 one+0.002
 dad+0.002
Neg logit contributions
ocol-0.003
 rib-0.003
 microsc-0.002
 gymn-0.002
rou-0.002
Token see
Token ID766
Feature activation-1.19e+00

Pos logit contributions
ocol+0.016
 rib+0.014
 microsc+0.013
rou+0.013
 gymn+0.013
Neg logit contributions
Maybe-0.005
 someone-0.005
 dad-0.004
 one-0.004
 next-0.004
Token
Token ID198
Feature activation-1.06e+00

Pos logit contributions
 one+0.006
 someone+0.006
 dad+0.006
Maybe+0.006
 next+0.006
Neg logit contributions
ocol-0.014
 rib-0.012
 microsc-0.012
 gymn-0.012
pill-0.011
Token
Token ID198
Feature activation-1.07e+00

Pos logit contributions
 one+0.025
 forth+0.025
Maybe+0.024
 looked+0.024
How+0.024
Neg logit contributions
ocol-0.037
 microsc-0.030
 rib-0.029
 gymn-0.029
rou-0.028
TokenMom
Token ID29252
Feature activation-4.45e-01

Pos logit contributions
 look+0.021
 someone+0.020
 one+0.020
 somewhere+0.020
 looked+0.020
Neg logit contributions
ocol-0.043
 microsc-0.036
 rib-0.034
 gymn-0.033
ifying-0.033
Token hears
Token ID23180
Feature activation-8.22e-01

Pos logit contributions
 someone+0.010
Maybe+0.009
 dad+0.009
 one+0.009
 somewhere+0.009
Neg logit contributions
ocol-0.019
 rib-0.016
rou-0.015
 microsc-0.015
 gymn-0.015
Token the
Token ID262
Feature activation+2.22e-01

Pos logit contributions
Maybe+0.019
How+0.017
Mom+0.016
Nothing+0.016
 somewhere+0.016
Neg logit contributions
ocol-0.035
 rib-0.029
 microsc-0.027
 gymn-0.026
pill-0.025
Token noise
Token ID7838
Feature activation-2.24e+00

Pos logit contributions
ocol+0.011
 rib+0.010
 microsc+0.009
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.003
 someone-0.003
 one-0.003
 dad-0.003
 next-0.003
Token and
Token ID290
Feature activation-8.10e-01

Pos logit contributions
cil+0.082
Someone+0.073
Maybe+0.072
 gentlemen+0.072
Mother+0.071
Neg logit contributions
ocol-0.055
 gymn-0.044
 rib-0.042
oke-0.042
rou-0.036
Token comes
Token ID2058
Feature activation+2.61e-01

Pos logit contributions
Mom+0.019
Maybe+0.018
 someone+0.018
 somewhere+0.017
cil+0.017
Neg logit contributions
ocol-0.033
 rib-0.030
 gymn-0.026
 microsc-0.026
pill-0.025
Token to
Token ID284
Feature activation+5.10e-01

Pos logit contributions
ocol+0.010
 rib+0.008
 microsc+0.008
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.007
 someone-0.007
 one-0.007
 dad-0.007
 next-0.006
Token the
Token ID262
Feature activation+7.43e-01

Pos logit contributions
ocol+0.028
 rib+0.023
 microsc+0.023
rou+0.023
pill+0.022
Neg logit contributions
 someone-0.007
Maybe-0.006
 dad-0.006
 one-0.006
 next-0.006
Token room
Token ID2119
Feature activation-2.93e-01

Pos logit contributions
ocol+0.038
 rib+0.030
 microsc+0.028
opot+0.028
 gol+0.028
Neg logit contributions
 someone-0.014
 next-0.014
Maybe-0.014
 dad-0.014
 one-0.014
Token shop
Token ID6128
Feature activation+2.81e-01

Pos logit contributions
ocol+0.026
 rib+0.022
 microsc+0.021
rou+0.020
 gymn+0.020
Neg logit contributions
 someone-0.009
Maybe-0.009
 one-0.009
 dad-0.009
 next-0.009
Token.
Token ID13
Feature activation-3.76e-01

Pos logit contributions
ocol+0.014
 rib+0.012
 microsc+0.011
 gymn+0.011
rou+0.011
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token She
Token ID1375
Feature activation-3.28e-01

Pos logit contributions
Maybe+0.008
 someone+0.007
 dad+0.007
 next+0.007
Mom+0.007
Neg logit contributions
ocol-0.016
 rib-0.014
 microsc-0.013
 gymn-0.013
pill-0.012
Token gave
Token ID2921
Feature activation-4.70e-01

Pos logit contributions
 someone+0.007
Maybe+0.007
How+0.006
What+0.006
 one+0.006
Neg logit contributions
ocol-0.014
 rib-0.012
 gymn-0.011
rou-0.011
 microsc-0.011
Token Po
Token ID7695
Feature activation-6.45e-01

Pos logit contributions
Maybe+0.012
How+0.011
 dad+0.010
Mom+0.010
 someone+0.010
Neg logit contributions
ocol-0.019
 rib-0.016
 microsc-0.015
 gymn-0.014
pill-0.014
Tokenppy
Token ID14097
Feature activation-2.24e+00

Pos logit contributions
Maybe+0.019
 someone+0.018
 dad+0.018
Mom+0.018
How+0.018
Neg logit contributions
ocol-0.023
 rib-0.019
rou-0.017
 microsc-0.017
pill-0.016
Token some
Token ID617
Feature activation+9.46e-01

Pos logit contributions
Maybe+0.048
Nothing+0.045
How+0.044
 knock+0.044
 forth+0.044
Neg logit contributions
 rib-0.084
ocol-0.082
 microsc-0.073
ifying-0.069
rou-0.068
Token money
Token ID1637
Feature activation-3.14e-01

Pos logit contributions
ocol+0.044
 microsc+0.036
rou+0.036
 rib+0.036
 gymn+0.035
Neg logit contributions
 someone-0.018
 one-0.018
 dad-0.017
 something-0.016
 next-0.016
Token and
Token ID290
Feature activation-6.06e-01

Pos logit contributions
Maybe+0.008
 someone+0.007
Mom+0.007
How+0.007
What+0.007
Neg logit contributions
ocol-0.013
 rib-0.011
 microsc-0.010
rou-0.009
 gymn-0.009
Token said
Token ID531
Feature activation+7.61e-01

Pos logit contributions
Maybe+0.011
Mom+0.010
How+0.010
What+0.010
Nothing+0.009
Neg logit contributions
ocol-0.027
 rib-0.025
 microsc-0.023
 gymn-0.022
 topp-0.021
Token "
Token ID366
Feature activation-5.19e-01

Pos logit contributions
ocol+0.041
 rib+0.034
 microsc+0.033
rou+0.033
 gol+0.032
Neg logit contributions
 someone-0.011
 one-0.010
 dad-0.010
Maybe-0.009
 everything-0.009
Token colourful
Token ID44799
Feature activation+1.56e+00

Pos logit contributions
ocol+0.032
 rib+0.026
 microsc+0.025
 gymn+0.025
rou+0.025
Neg logit contributions
 someone-0.012
 one-0.011
Maybe-0.011
 dad-0.011
 next-0.011
Token flowers
Token ID12734
Feature activation+5.27e-01

Pos logit contributions
ocol+0.082
rou+0.070
 gol+0.069
pri+0.068
ksh+0.067
Neg logit contributions
 one-0.021
 someone-0.019
 everything-0.018
 something-0.017
 water-0.017
Token,
Token ID11
Feature activation+4.67e-01

Pos logit contributions
ocol+0.027
 rib+0.023
 microsc+0.022
 gymn+0.022
rou+0.022
Neg logit contributions
 someone-0.007
Maybe-0.007
 one-0.007
 dad-0.007
 next-0.007
Token through
Token ID832
Feature activation-1.82e-01

Pos logit contributions
ocol+0.023
 rib+0.019
 microsc+0.018
 gymn+0.018
rou+0.018
Neg logit contributions
Maybe-0.008
 someone-0.008
 one-0.008
 dad-0.007
 next-0.007
Token the
Token ID262
Feature activation+6.34e-01

Pos logit contributions
Maybe+0.003
 someone+0.003
 dad+0.003
Mom+0.003
How+0.003
Neg logit contributions
ocol-0.008
 rib-0.007
 microsc-0.007
 gymn-0.007
 topp-0.006
Token lush
Token ID37408
Feature activation+2.18e+00

Pos logit contributions
ocol+0.033
 rib+0.027
 microsc+0.027
 gol+0.026
 topp+0.026
Neg logit contributions
 someone-0.009
 one-0.009
Maybe-0.009
 next-0.008
 dad-0.008
Token forests
Token ID17039
Feature activation+1.41e+00

Pos logit contributions
ocol+0.120
rou+0.106
opot+0.103
 gol+0.103
ksh+0.102
Neg logit contributions
 one-0.022
 everything-0.020
 water-0.020
 someone-0.019
 sun-0.017
Token deep
Token ID2769
Feature activation+1.03e+00

Pos logit contributions
ocol+0.082
 rib+0.068
 microsc+0.067
 gymn+0.066
 gol+0.065
Neg logit contributions
 one-0.016
 next-0.014
 someone-0.013
 there-0.012
 everything-0.012
Token in
Token ID287
Feature activation-5.42e-01

Pos logit contributions
ocol+0.054
rou+0.046
 rib+0.046
 microsc+0.045
 gymn+0.045
Neg logit contributions
 one-0.013
 someone-0.012
 next-0.012
 everything-0.011
Maybe-0.011
Token thought
Token ID1807
Feature activation+5.01e-01

Pos logit contributions
Maybe+0.013
 dad+0.012
How+0.011
 someone+0.011
Nothing+0.011
Neg logit contributions
ocol-0.023
 rib-0.019
 microsc-0.018
 gymn-0.017
ifying-0.017
Token.
Token ID13
Feature activation-1.59e-01

Pos logit contributions
ocol+0.023
 rib+0.019
 microsc+0.018
rou+0.018
 gymn+0.018
Neg logit contributions
 someone-0.010
Maybe-0.010
 one-0.010
 dad-0.009
 next-0.009
Token explore
Token ID7301
Feature activation+4.54e-01

Pos logit contributions
ocol+0.036
 rib+0.030
 microsc+0.029
rou+0.028
pill+0.028
Neg logit contributions
 someone-0.012
Maybe-0.012
 one-0.012
 dad-0.012
 next-0.011
Token the
Token ID262
Feature activation+2.56e-01

Pos logit contributions
ocol+0.023
 rib+0.019
 microsc+0.019
rou+0.018
 gymn+0.018
Neg logit contributions
 someone-0.007
Maybe-0.007
 one-0.007
 dad-0.006
 next-0.006
Token woods
Token ID16479
Feature activation+4.24e-01

Pos logit contributions
ocol+0.012
 rib+0.010
 microsc+0.010
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.005
 dad-0.004
 next-0.004
Token and
Token ID290
Feature activation+7.01e-01

Pos logit contributions
ocol+0.021
 rib+0.018
 microsc+0.017
 gymn+0.017
rou+0.016
Neg logit contributions
Maybe-0.007
 someone-0.007
 one-0.006
 dad-0.006
 next-0.006
Token feel
Token ID1254
Feature activation+1.27e-01

Pos logit contributions
ocol+0.036
 rib+0.030
 microsc+0.028
rou+0.028
 gymn+0.028
Neg logit contributions
 someone-0.011
Maybe-0.011
 one-0.011
 dad-0.011
 next-0.010
Token close
Token ID1969
Feature activation+2.08e+00

Pos logit contributions
ocol+0.006
 rib+0.005
 microsc+0.005
 gymn+0.005
rou+0.005
Neg logit contributions
Maybe-0.002
 someone-0.002
 dad-0.002
 one-0.002
Mom-0.002
Token to
Token ID284
Feature activation+5.92e-02

Pos logit contributions
ocol+0.108
pill+0.091
 microsc+0.087
 gymn+0.087
 gol+0.087
Neg logit contributions
 one-0.041
 next-0.037
 there-0.036
 where-0.035
 dad-0.034
Token nature
Token ID3450
Feature activation+8.74e-01

Pos logit contributions
ocol+0.003
 rib+0.002
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.001
 someone-0.001
Mom-0.001
 dad-0.001
 one-0.001
Token.
Token ID13
Feature activation-1.95e-01

Pos logit contributions
ocol+0.048
 rib+0.039
 microsc+0.038
rou+0.037
 gymn+0.037
Neg logit contributions
 one-0.011
 someone-0.011
 next-0.010
Maybe-0.010
 dad-0.010
Token<|endoftext|>
Token ID50256
Feature activation+2.20e-01

Pos logit contributions
Maybe+0.004
 someone+0.004
 next+0.004
 dad+0.004
 one+0.004
Neg logit contributions
ocol-0.008
 rib-0.007
 microsc-0.007
 gymn-0.006
pill-0.006
TokenOnce
Token ID7454
Feature activation+9.69e-01

Pos logit contributions
ocol+0.011
 rib+0.009
 microsc+0.009
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.004
 someone-0.004
 dad-0.003
 one-0.003
 next-0.003
Token Max
Token ID5436
Feature activation-2.51e-01

Pos logit contributions
Maybe+0.006
 someone+0.005
 dad+0.005
Mom+0.005
How+0.005
Neg logit contributions
ocol-0.010
 rib-0.009
 gymn-0.008
 microsc-0.008
 topp-0.008
Token saw
Token ID2497
Feature activation-5.43e-01

Pos logit contributions
Maybe+0.005
 someone+0.005
Mom+0.005
 dad+0.005
How+0.005
Neg logit contributions
ocol-0.010
 rib-0.009
 microsc-0.009
 gymn-0.008
 gol-0.008
Token a
Token ID257
Feature activation+2.92e-01

Pos logit contributions
Maybe+0.009
Mom+0.008
cil+0.008
How+0.008
 next+0.007
Neg logit contributions
ocol-0.024
 rib-0.021
 microsc-0.021
 topp-0.020
 gymn-0.020
Token big
Token ID1263
Feature activation+1.51e-01

Pos logit contributions
ocol+0.015
 rib+0.012
 microsc+0.012
 gymn+0.012
rou+0.011
Neg logit contributions
Maybe-0.005
 someone-0.005
 dad-0.004
 one-0.004
 next-0.004
Token pile
Token ID14540
Feature activation-3.43e-01

Pos logit contributions
ocol+0.007
 rib+0.006
 microsc+0.006
 gymn+0.006
rou+0.005
Neg logit contributions
Maybe-0.003
 someone-0.003
 dad-0.003
 one-0.003
Mom-0.003
Token of
Token ID286
Feature activation+1.83e+00

Pos logit contributions
Maybe+0.012
 someone+0.012
 dad+0.011
How+0.011
Mom+0.011
Neg logit contributions
ocol-0.010
 rib-0.008
 microsc-0.007
rou-0.007
 gymn-0.007
Token fruit
Token ID8234
Feature activation-3.87e-01

Pos logit contributions
ocol+0.095
spe+0.086
ifying+0.085
rou+0.083
 former+0.082
Neg logit contributions
 water-0.026
 someone-0.025
 one-0.024
 something-0.024
 everything-0.023
Token and
Token ID290
Feature activation-3.10e-01

Pos logit contributions
Maybe+0.007
 someone+0.007
 dad+0.007
Mom+0.007
How+0.006
Neg logit contributions
ocol-0.018
 rib-0.015
 microsc-0.014
 topp-0.014
 gymn-0.014
Token he
Token ID339
Feature activation-2.38e-01

Pos logit contributions
Maybe+0.006
Mom+0.006
 next+0.006
 someone+0.006
 somewhere+0.006
Neg logit contributions
ocol-0.013
 rib-0.012
 microsc-0.011
 topp-0.011
 gymn-0.011
Token wanted
Token ID2227
Feature activation-8.49e-01

Pos logit contributions
Maybe+0.005
 someone+0.005
 somewhere+0.005
How+0.005
 dad+0.005
Neg logit contributions
ocol-0.009
 rib-0.008
 microsc-0.008
 gymn-0.008
rou-0.007
Token to
Token ID284
Feature activation+4.97e-01

Pos logit contributions
Maybe+0.034
Mom+0.033
Parent+0.032
Mother+0.032
Nothing+0.032
Neg logit contributions
ocol-0.018
 rib-0.014
 microsc-0.014
 topp-0.014
 gymn-0.012
Token
Token ID129
Feature activation+8.59e-01

Pos logit contributions
 someone+0.011
Maybe+0.011
 one+0.010
 next+0.010
 dad+0.010
Neg logit contributions
ocol-0.016
 rib-0.013
rou-0.012
 microsc-0.012
 gymn-0.012
Token
Token ID241
Feature activation+7.14e-01

Pos logit contributions
ocol+0.033
 rib+0.027
 microsc+0.025
 topp+0.024
ifying+0.023
Neg logit contributions
 someone-0.024
 next-0.023
 one-0.023
Maybe-0.023
 dad-0.023
TokenYou
Token ID1639
Feature activation-4.41e-02

Pos logit contributions
ocol+0.034
 rib+0.028
 microsc+0.027
 gymn+0.026
rou+0.025
Neg logit contributions
Maybe-0.014
 someone-0.014
 one-0.013
 next-0.013
 dad-0.013
Tokenâ
Token ID22940
Feature activation+9.09e-02

Pos logit contributions
Maybe+0.001
 someone+0.001
 dad+0.001
Mom+0.001
 next+0.001
Neg logit contributions
ocol-0.002
 rib-0.001
 microsc-0.001
 gymn-0.001
rou-0.001
Token
Token ID26391
Feature activation-3.99e-01

Pos logit contributions
ocol+0.003
 rib+0.002
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.003
 someone-0.003
 dad-0.003
 one-0.003
Mom-0.003
Token
Token ID8151
Feature activation+2.00e+00

Pos logit contributions
Maybe+0.009
 someone+0.009
 one+0.009
 dad+0.009
 next+0.009
Neg logit contributions
ocol-0.016
 rib-0.013
 microsc-0.013
 gymn-0.013
rou-0.012
Tokenre
Token ID260
Feature activation-3.77e-01

Pos logit contributions
ocol+0.093
 rib+0.078
 former+0.072
 microsc+0.070
 topp+0.069
Neg logit contributions
 one-0.042
 someone-0.042
 dad-0.041
Mom-0.041
 next-0.041
Token very
Token ID845
Feature activation-1.31e-01

Pos logit contributions
Maybe+0.009
 someone+0.009
Mom+0.009
 dad+0.009
How+0.009
Neg logit contributions
ocol-0.015
 rib-0.012
 gymn-0.011
 microsc-0.011
rou-0.011
Token welcome
Token ID7062
Feature activation-2.53e-01

Pos logit contributions
Maybe+0.003
 someone+0.003
 dad+0.002
Mom+0.002
 one+0.002
Neg logit contributions
ocol-0.006
 rib-0.005
 microsc-0.005
 gymn-0.005
rou-0.005
Token Clara
Token ID27443
Feature activation+1.14e-02

Pos logit contributions
Maybe+0.006
 someone+0.006
How+0.005
 dad+0.005
Mom+0.005
Neg logit contributions
ocol-0.011
 rib-0.009
 microsc-0.008
 gymn-0.008
rou-0.008
Token.
Token ID13
Feature activation-1.84e-02

Pos logit contributions
ocol+0.000
 rib+0.000
 microsc+0.000
 gymn+0.000
rou+0.000
Neg logit contributions
Maybe-0.000
 someone-0.000
 dad-0.000
 one-0.000
Mom-0.000
Token
Token ID129
Feature activation+1.78e+00

Pos logit contributions
ocol+0.055
 rib+0.046
 microsc+0.043
 topp+0.041
 gymn+0.040
Neg logit contributions
 someone-0.025
 one-0.024
 dad-0.024
Maybe-0.024
 next-0.023
Token
Token ID241
Feature activation+5.40e-02

Pos logit contributions
ocol+0.072
 rib+0.062
 microsc+0.057
 topp+0.054
ifying+0.053
Neg logit contributions
 next-0.047
 one-0.045
 someone-0.045
Mom-0.045
 dad-0.043
TokenLet
Token ID5756
Feature activation+5.91e-01

Pos logit contributions
ocol+0.003
 rib+0.002
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
 one-0.001
 next-0.001
Tokenâ
Token ID22940
Feature activation-2.01e-01

Pos logit contributions
ocol+0.025
 rib+0.020
 microsc+0.020
rou+0.019
 gymn+0.018
Neg logit contributions
 someone-0.015
Maybe-0.014
 one-0.014
 dad-0.014
 next-0.013
Token
Token ID26391
Feature activation-2.93e-01

Pos logit contributions
Maybe+0.007
 someone+0.007
 dad+0.007
 one+0.007
 everything+0.007
Neg logit contributions
ocol-0.006
 rib-0.004
 gymn-0.004
 gol-0.004
 microsc-0.004
Token
Token ID8151
Feature activation+2.02e+00

Pos logit contributions
Maybe+0.007
 someone+0.007
 one+0.006
 dad+0.006
 next+0.006
Neg logit contributions
ocol-0.012
 rib-0.010
 gymn-0.009
 microsc-0.009
rou-0.009
Tokens
Token ID82
Feature activation-4.07e-01

Pos logit contributions
ocol+0.093
 rib+0.076
 microsc+0.071
 gymn+0.066
 topp+0.066
Neg logit contributions
Maybe-0.047
 one-0.044
 someone-0.044
 they-0.044
 everything-0.043
Token not
Token ID407
Feature activation-6.61e-01

Pos logit contributions
Maybe+0.008
 someone+0.007
How+0.007
Mom+0.007
What+0.007
Neg logit contributions
ocol-0.018
 rib-0.015
 microsc-0.015
 gymn-0.014
rou-0.014
Token think
Token ID892
Feature activation+1.56e-01

Pos logit contributions
Maybe+0.015
Mom+0.015
How+0.014
 dad+0.013
 someone+0.013
Neg logit contributions
ocol-0.027
 rib-0.023
 gymn-0.022
 gol-0.021
 microsc-0.021
Token about
Token ID546
Feature activation+1.78e-01

Pos logit contributions
ocol+0.008
 rib+0.007
 microsc+0.006
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.003
 someone-0.002
 dad-0.002
 one-0.002
Mom-0.002
Token the
Token ID262
Feature activation-1.63e-01

Pos logit contributions
ocol+0.009
 rib+0.008
 microsc+0.008
 gymn+0.007
rou+0.007
Neg logit contributions
Maybe-0.002
 someone-0.002
 dad-0.002
 one-0.002
Mom-0.002
Token and
Token ID290
Feature activation-2.55e-01

Pos logit contributions
Maybe+0.009
 someone+0.008
How+0.008
What+0.008
Mom+0.008
Neg logit contributions
ocol-0.018
 rib-0.015
 microsc-0.014
rou-0.014
 gymn-0.014
Token Ben
Token ID3932
Feature activation-3.94e-01

Pos logit contributions
Maybe+0.005
Mom+0.005
 someone+0.004
 somewhere+0.004
How+0.004
Neg logit contributions
ocol-0.012
 rib-0.010
 microsc-0.009
 gymn-0.009
 topp-0.009
Token are
Token ID389
Feature activation-2.35e-01

Pos logit contributions
Maybe+0.012
Mom+0.012
How+0.012
 someone+0.012
What+0.012
Neg logit contributions
ocol-0.013
 rib-0.010
 microsc-0.009
 gymn-0.009
rou-0.009
Token bad
Token ID2089
Feature activation-7.87e-02

Pos logit contributions
Maybe+0.004
Mom+0.004
How+0.004
 someone+0.004
What+0.003
Neg logit contributions
ocol-0.011
 rib-0.009
 microsc-0.009
 gymn-0.009
rou-0.009
Token.
Token ID13
Feature activation+5.72e-01

Pos logit contributions
Maybe+0.001
 someone+0.001
Mom+0.001
How+0.001
 dad+0.001
Neg logit contributions
ocol-0.004
 rib-0.003
 microsc-0.003
 gymn-0.003
rou-0.003
Token He
Token ID679
Feature activation+6.19e-01

Pos logit contributions
ocol+0.029
 rib+0.024
 microsc+0.023
rou+0.023
 gymn+0.022
Neg logit contributions
 someone-0.011
Maybe-0.010
 dad-0.010
 one-0.010
 next-0.009
Token takes
Token ID2753
Feature activation-5.50e-01

Pos logit contributions
ocol+0.029
 rib+0.023
 microsc+0.022
 gymn+0.021
rou+0.021
Neg logit contributions
Maybe-0.013
 someone-0.013
 one-0.012
 dad-0.012
 next-0.012
Token the
Token ID262
Feature activation+6.36e-02

Pos logit contributions
Maybe+0.011
How+0.010
Nothing+0.010
Mom+0.010
What+0.009
Neg logit contributions
ocol-0.025
 rib-0.021
 gymn-0.019
 topp-0.019
 microsc-0.019
Token phone
Token ID3072
Feature activation-3.25e-01

Pos logit contributions
ocol+0.003
 rib+0.002
 microsc+0.002
 gymn+0.002
 topp+0.002
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
 one-0.001
Mom-0.001
Token from
Token ID422
Feature activation-3.96e-01

Pos logit contributions
Maybe+0.007
 someone+0.006
How+0.006
Mom+0.006
What+0.006
Neg logit contributions
ocol-0.014
 rib-0.012
 microsc-0.011
 gymn-0.011
rou-0.011
Token them
Token ID606
Feature activation+1.37e-01

Pos logit contributions
Maybe+0.009
Nothing+0.008
How+0.008
Mom+0.007
 someone+0.007
Neg logit contributions
ocol-0.017
 rib-0.014
 gymn-0.014
 microsc-0.013
 topp-0.013
Token\n
Token ID198
Feature activation-6.44e-01

Pos logit contributions
 one+0.013
Maybe+0.013
How+0.013
 someone+0.013
 forth+0.013
Neg logit contributions
ocol-0.025
 rib-0.023
 gymn-0.021
 microsc-0.021
 gol-0.020
Token\n
Token ID198
Feature activation-5.82e-01

Pos logit contributions
Maybe+0.013
 one+0.013
How+0.012
 looked+0.012
 someone+0.012
Neg logit contributions
ocol-0.026
 rib-0.022
 microsc-0.021
 gymn-0.020
 gol-0.020
TokenThe
Token ID464
Feature activation-5.52e-01

Pos logit contributions
 looked+0.010
 one+0.010
 someone+0.010
 everything+0.010
Maybe+0.010
Neg logit contributions
ocol-0.025
 rib-0.022
 gol-0.020
 microsc-0.020
 gymn-0.020
Token lady
Token ID10846
Feature activation-1.51e-01

Pos logit contributions
Maybe+0.013
How+0.012
 someone+0.012
cil+0.011
Nothing+0.011
Neg logit contributions
ocol-0.022
 rib-0.020
 gymn-0.017
 topp-0.017
 microsc-0.017
Token opened
Token ID4721
Feature activation+1.24e+00

Pos logit contributions
Maybe+0.003
 someone+0.003
 dad+0.003
 one+0.003
How+0.003
Neg logit contributions
ocol-0.007
 rib-0.006
 microsc-0.005
 gymn-0.005
rou-0.005
Token the
Token ID262
Feature activation+8.70e-01

Pos logit contributions
ocol+0.071
pill+0.060
rou+0.060
 former+0.059
 gol+0.059
Neg logit contributions
 one-0.017
 someone-0.013
 dad-0.013
 everything-0.013
 something-0.012
Token box
Token ID3091
Feature activation+7.29e-01

Pos logit contributions
ocol+0.049
rou+0.042
 rib+0.041
pill+0.041
 gol+0.041
Neg logit contributions
 one-0.008
 someone-0.008
 dad-0.008
 next-0.007
Maybe-0.007
Token.
Token ID13
Feature activation-5.38e-01

Pos logit contributions
ocol+0.041
 rib+0.034
 microsc+0.033
 gymn+0.032
rou+0.032
Neg logit contributions
 one-0.009
 someone-0.009
Maybe-0.009
 next-0.008
 dad-0.008
Token She
Token ID1375
Feature activation-3.29e-02

Pos logit contributions
 next+0.013
Maybe+0.013
 one+0.013
Mom+0.013
 dad+0.013
Neg logit contributions
ocol-0.020
 rib-0.017
 microsc-0.017
 gymn-0.015
pill-0.015
Token was
Token ID373
Feature activation+7.65e-02

Pos logit contributions
 someone+0.001
Maybe+0.001
 one+0.001
 dad+0.001
Mom+0.001
Neg logit contributions
ocol-0.002
 rib-0.001
 microsc-0.001
 gymn-0.001
rou-0.001
Token shocked
Token ID11472
Feature activation-2.63e-01

Pos logit contributions
ocol+0.004
 rib+0.003
 microsc+0.003
 gymn+0.003
rou+0.003
Neg logit contributions
Maybe-0.001
 someone-0.001
 one-0.001
 dad-0.001
Mom-0.001
Token living
Token ID2877
Feature activation+2.45e-01

Pos logit contributions
ocol+0.011
 rib+0.009
 microsc+0.008
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
Mom-0.004
Token room
Token ID2119
Feature activation-2.63e-01

Pos logit contributions
ocol+0.013
 rib+0.011
 microsc+0.011
 gymn+0.011
rou+0.011
Neg logit contributions
Maybe-0.003
 someone-0.003
 one-0.003
 dad-0.003
 next-0.003
Token and
Token ID290
Feature activation+7.17e-01

Pos logit contributions
Maybe+0.005
 someone+0.005
Mom+0.004
How+0.004
What+0.004
Neg logit contributions
ocol-0.012
 rib-0.010
 microsc-0.010
 gymn-0.010
rou-0.010
Token sit
Token ID1650
Feature activation+2.42e-01

Pos logit contributions
ocol+0.039
 rib+0.033
rou+0.031
 microsc+0.031
pill+0.031
Neg logit contributions
 someone-0.010
 dad-0.009
Maybe-0.009
 one-0.009
 everything-0.008
Token on
Token ID319
Feature activation-2.96e-02

Pos logit contributions
ocol+0.012
 rib+0.010
 microsc+0.010
 gymn+0.010
rou+0.010
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.003
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
Maybe+0.001
 someone+0.001
How+0.001
Mom+0.000
 dad+0.000
Neg logit contributions
ocol-0.001
 rib-0.001
 microsc-0.001
 gymn-0.001
 topp-0.001
Token couch
Token ID18507
Feature activation+6.12e-01

Pos logit contributions
ocol+0.050
 gol+0.039
rou+0.038
pill+0.038
 rib+0.038
Neg logit contributions
 next-0.020
 someone-0.019
 one-0.019
 dad-0.019
 fast-0.018
Token.
Token ID13
Feature activation-1.74e-01

Pos logit contributions
ocol+0.033
 rib+0.028
 microsc+0.027
 gymn+0.026
rou+0.026
Neg logit contributions
 someone-0.008
 one-0.008
Maybe-0.007
 dad-0.007
 next-0.007
Token Sam
Token ID3409
Feature activation+3.17e-01

Pos logit contributions
Maybe+0.003
 someone+0.003
 next+0.003
Mom+0.003
 one+0.003
Neg logit contributions
ocol-0.008
 rib-0.007
 microsc-0.006
 gymn-0.006
 gol-0.006
Token opens
Token ID9808
Feature activation+2.68e-01

Pos logit contributions
ocol+0.014
 rib+0.012
 microsc+0.011
 gymn+0.011
rou+0.011
Neg logit contributions
Maybe-0.006
 someone-0.006
 one-0.006
 dad-0.006
 next-0.006
Token the
Token ID262
Feature activation+1.35e+00

Pos logit contributions
ocol+0.013
 rib+0.011
 microsc+0.011
 gymn+0.010
rou+0.010
Neg logit contributions
Maybe-0.005
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token seen
Token ID1775
Feature activation-3.33e-01

Pos logit contributions
ocol+0.031
 rib+0.026
 microsc+0.026
rou+0.025
 gymn+0.025
Neg logit contributions
 someone-0.005
Maybe-0.005
 one-0.005
 dad-0.004
 everything-0.004
Token the
Token ID262
Feature activation+3.95e-01

Pos logit contributions
Maybe+0.005
Mom+0.004
How+0.004
 next+0.004
 dad+0.004
Neg logit contributions
ocol-0.017
 rib-0.015
 microsc-0.014
 gymn-0.014
 topp-0.013
Token little
Token ID1310
Feature activation+2.45e-01

Pos logit contributions
ocol+0.016
 rib+0.013
 microsc+0.012
 gymn+0.012
rou+0.012
Neg logit contributions
Maybe-0.010
 someone-0.010
 one-0.010
 dad-0.010
 next-0.010
Token girl
Token ID2576
Feature activation-6.12e-02

Pos logit contributions
ocol+0.013
 rib+0.011
 microsc+0.011
 gymn+0.010
rou+0.010
Neg logit contributions
Maybe-0.003
 someone-0.003
 one-0.003
 dad-0.003
 next-0.003
Token playing
Token ID2712
Feature activation+5.36e-01

Pos logit contributions
Maybe+0.001
 someone+0.001
Mom+0.001
 dad+0.001
 one+0.001
Neg logit contributions
ocol-0.003
 rib-0.002
 microsc-0.002
 gymn-0.002
rou-0.002
Token the
Token ID262
Feature activation+5.53e-01

Pos logit contributions
ocol+0.028
 rib+0.023
 microsc+0.023
rou+0.022
 topp+0.022
Neg logit contributions
 someone-0.008
Maybe-0.008
 one-0.008
 dad-0.007
 next-0.007
Token piano
Token ID19132
Feature activation-1.27e-01

Pos logit contributions
ocol+0.023
 rib+0.018
 microsc+0.017
rou+0.017
 topp+0.016
Neg logit contributions
 someone-0.014
Maybe-0.014
 one-0.014
 next-0.014
 dad-0.014
Token came
Token ID1625
Feature activation+1.79e-01

Pos logit contributions
Maybe+0.002
 someone+0.002
 dad+0.002
Mom+0.002
How+0.002
Neg logit contributions
ocol-0.006
 rib-0.005
 microsc-0.005
 gymn-0.005
rou-0.005
Token up
Token ID510
Feature activation+6.72e-01

Pos logit contributions
ocol+0.008
 rib+0.007
 microsc+0.006
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.004
 someone-0.004
 dad-0.003
 one-0.003
Mom-0.003
Token to
Token ID284
Feature activation+3.63e-01

Pos logit contributions
ocol+0.035
 rib+0.029
 microsc+0.028
pill+0.027
 gymn+0.027
Neg logit contributions
 one-0.011
 someone-0.010
 next-0.010
Maybe-0.010
 dad-0.010
Token her
Token ID607
Feature activation+7.42e-01

Pos logit contributions
ocol+0.018
 rib+0.015
 microsc+0.015
rou+0.014
 gymn+0.014
Neg logit contributions
 someone-0.006
Maybe-0.006
 one-0.006
 dad-0.005
 next-0.005
Token They
Token ID1119
Feature activation+5.15e-01

Pos logit contributions
ocol+0.014
 rib+0.011
 microsc+0.011
 gymn+0.011
rou+0.011
Neg logit contributions
Maybe-0.006
 someone-0.006
 one-0.006
 dad-0.006
 next-0.005
Token can
Token ID460
Feature activation+2.47e-01

Pos logit contributions
ocol+0.021
 rib+0.016
 microsc+0.015
 gymn+0.015
rou+0.015
Neg logit contributions
Maybe-0.014
 someone-0.013
 one-0.013
 dad-0.013
 next-0.013
Token hurt
Token ID5938
Feature activation-4.66e-01

Pos logit contributions
ocol+0.012
 rib+0.010
 microsc+0.010
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
 next-0.004
Token someone
Token ID2130
Feature activation-1.49e-01

Pos logit contributions
Maybe+0.010
Mom+0.009
 dad+0.009
How+0.009
 someone+0.009
Neg logit contributions
ocol-0.020
 rib-0.017
 microsc-0.016
 gymn-0.015
rou-0.015
Token.
Token ID13
Feature activation+3.69e-01

Pos logit contributions
Maybe+0.003
 someone+0.003
Mom+0.003
 dad+0.003
 one+0.003
Neg logit contributions
ocol-0.007
 rib-0.006
 microsc-0.005
 gymn-0.005
rou-0.005
Token We
Token ID775
Feature activation+1.53e+00

Pos logit contributions
ocol+0.018
 rib+0.015
 microsc+0.014
 gymn+0.014
rou+0.014
Neg logit contributions
Maybe-0.007
 someone-0.007
 one-0.006
 dad-0.006
Mom-0.006
Token should
Token ID815
Feature activation-4.44e-02

Pos logit contributions
ocol+0.074
 microsc+0.056
 rib+0.056
rou+0.054
 gymn+0.054
Neg logit contributions
Maybe-0.036
 one-0.035
 looked-0.032
 someone-0.032
 not-0.032
Token only
Token ID691
Feature activation+4.81e-01

Pos logit contributions
Maybe+0.001
 someone+0.001
 dad+0.001
 one+0.001
How+0.001
Neg logit contributions
ocol-0.002
 rib-0.002
 microsc-0.002
 gymn-0.002
rou-0.002
Token throw
Token ID3714
Feature activation-4.69e-01

Pos logit contributions
ocol+0.023
 rib+0.019
 microsc+0.018
rou+0.018
 gymn+0.018
Neg logit contributions
 someone-0.009
Maybe-0.009
 one-0.008
 dad-0.008
 next-0.008
Token round
Token ID2835
Feature activation+5.22e-01

Pos logit contributions
Maybe+0.008
Mom+0.008
 dad+0.008
 someone+0.008
How+0.007
Neg logit contributions
ocol-0.023
 rib-0.020
 microsc-0.018
 gymn-0.017
 topp-0.017
Token and
Token ID290
Feature activation+3.48e-01

Pos logit contributions
ocol+0.026
 rib+0.022
 microsc+0.021
 gymn+0.021
rou+0.021
Neg logit contributions
 someone-0.008
Maybe-0.008
 one-0.007
 dad-0.007
 next-0.007
Token and
Token ID290
Feature activation+2.40e-01

Pos logit contributions
ocol+0.024
 rib+0.020
 microsc+0.020
 gymn+0.019
rou+0.019
Neg logit contributions
 someone-0.006
Maybe-0.006
 one-0.006
 dad-0.006
 next-0.005
Token they
Token ID484
Feature activation+3.53e-01

Pos logit contributions
ocol+0.013
 rib+0.011
 microsc+0.011
 gymn+0.011
rou+0.011
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
 next-0.002
Token brought
Token ID3181
Feature activation-9.76e-02

Pos logit contributions
ocol+0.020
 rib+0.017
 microsc+0.017
 gymn+0.016
rou+0.016
Neg logit contributions
Maybe-0.003
 someone-0.003
 one-0.003
 dad-0.003
Mom-0.003
Token the
Token ID262
Feature activation+4.06e-01

Pos logit contributions
Maybe+0.002
 someone+0.001
How+0.001
Nothing+0.001
Mom+0.001
Neg logit contributions
ocol-0.005
 rib-0.004
 microsc-0.004
 topp-0.004
 gymn-0.004
Token arm
Token ID3211
Feature activation-3.07e-01

Pos logit contributions
ocol+0.020
 rib+0.016
 microsc+0.016
 gymn+0.015
rou+0.015
Neg logit contributions
Maybe-0.007
 someone-0.007
 one-0.007
 dad-0.007
 next-0.007
Tokenchair
Token ID16337
Feature activation-9.34e-04

Pos logit contributions
Maybe+0.009
 someone+0.009
How+0.009
 dad+0.009
What+0.009
Neg logit contributions
ocol-0.010
 rib-0.008
 microsc-0.007
hy-0.007
rou-0.007
Token home
Token ID1363
Feature activation+1.58e-01

Pos logit contributions
Maybe+0.000
 someone+0.000
Mom+0.000
 dad+0.000
How+0.000
Neg logit contributions
ocol-0.000
 rib-0.000
 microsc-0.000
rou-0.000
 gymn-0.000
Token together
Token ID1978
Feature activation+2.47e-01

Pos logit contributions
ocol+0.008
 rib+0.007
 microsc+0.006
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.003
 someone-0.002
 dad-0.002
 one-0.002
Mom-0.002
Token.
Token ID13
Feature activation+2.05e-01

Pos logit contributions
ocol+0.013
 rib+0.011
 microsc+0.010
 gymn+0.010
rou+0.010
Neg logit contributions
Maybe-0.004
 someone-0.004
 dad-0.003
 one-0.003
 next-0.003
Token\n
Token ID198
Feature activation+1.58e-02

Pos logit contributions
ocol+0.010
 rib+0.009
 microsc+0.008
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.003
 someone-0.003
 dad-0.003
 one-0.003
 next-0.003
Token\n
Token ID198
Feature activation-3.86e-02

Pos logit contributions
ocol+0.001
 rib+0.001
 microsc+0.001
 gymn+0.001
rou+0.001
Neg logit contributions
Maybe-0.000
 someone-0.000
 dad-0.000
 one-0.000
 next-0.000
Token block
Token ID2512
Feature activation-6.31e-01

Pos logit contributions
ocol+0.013
 rib+0.011
 microsc+0.011
 gymn+0.010
rou+0.010
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.004
 dad-0.004
 next-0.004
Token that
Token ID326
Feature activation-1.04e+00

Pos logit contributions
Maybe+0.013
 someone+0.012
What+0.012
Mom+0.011
How+0.011
Neg logit contributions
ocol-0.027
 rib-0.024
 gymn-0.022
hy-0.022
rou-0.022
Token can
Token ID460
Feature activation-3.18e-01

Pos logit contributions
 somewhere+0.017
What+0.017
Maybe+0.016
Where+0.016
 someone+0.016
Neg logit contributions
ocol-0.045
 rib-0.043
 microsc-0.039
 gymn-0.039
pill-0.037
Token snap
Token ID11495
Feature activation-3.87e-02

Pos logit contributions
Maybe+0.006
 someone+0.006
 dad+0.006
How+0.006
Mom+0.006
Neg logit contributions
ocol-0.014
 rib-0.012
 gymn-0.011
 microsc-0.011
 topp-0.011
Token to
Token ID284
Feature activation-7.62e-01

Pos logit contributions
Maybe+0.001
 someone+0.001
 dad+0.001
Mom+0.001
 one+0.001
Neg logit contributions
ocol-0.002
 rib-0.002
 microsc-0.002
 gymn-0.001
rou-0.001
Token other
Token ID584
Feature activation-2.56e-01

Pos logit contributions
Maybe+0.016
Nothing+0.014
Someone+0.014
How+0.013
 next+0.013
Neg logit contributions
ocol-0.034
 rib-0.029
 gymn-0.027
 microsc-0.026
 topp-0.025
Token blocks
Token ID7021
Feature activation-8.32e-02

Pos logit contributions
Maybe+0.006
 someone+0.005
Mom+0.005
 dad+0.005
How+0.005
Neg logit contributions
ocol-0.011
 rib-0.009
 microsc-0.009
 gymn-0.009
 topp-0.008
Token.
Token ID13
Feature activation+2.48e-03

Pos logit contributions
Maybe+0.001
 someone+0.001
 dad+0.001
Mom+0.001
 one+0.001
Neg logit contributions
ocol-0.004
 rib-0.003
 microsc-0.003
 gymn-0.003
rou-0.003
Token She
Token ID1375
Feature activation+2.96e-01

Pos logit contributions
ocol+0.000
 rib+0.000
 microsc+0.000
 gymn+0.000
rou+0.000
Neg logit contributions
Maybe-0.000
 someone-0.000
 one-0.000
 dad-0.000
 next-0.000
Token thinks
Token ID6834
Feature activation-6.38e-01

Pos logit contributions
ocol+0.013
 rib+0.011
 microsc+0.010
 gymn+0.010
rou+0.010
Neg logit contributions
Maybe-0.007
 someone-0.007
 one-0.006
 dad-0.006
 next-0.006
Token it
Token ID340
Feature activation-8.65e-01

Pos logit contributions
Maybe+0.011
How+0.010
Mom+0.009
What+0.009
Where+0.008
Neg logit contributions
ocol-0.032
 rib-0.028
 microsc-0.026
 gymn-0.026
 topp-0.025
Token they
Token ID484
Feature activation+3.39e-01

Pos logit contributions
Maybe+0.004
How+0.004
 someone+0.003
Mom+0.003
What+0.003
Neg logit contributions
ocol-0.016
 rib-0.015
 gymn-0.014
 microsc-0.014
 gol-0.013
Token reached
Token ID4251
Feature activation-4.12e-01

Pos logit contributions
ocol+0.018
 rib+0.015
 microsc+0.014
 gymn+0.014
rou+0.014
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.004
 dad-0.004
 next-0.004
Token the
Token ID262
Feature activation+1.02e-01

Pos logit contributions
Maybe+0.010
Mom+0.009
How+0.009
What+0.009
Nothing+0.009
Neg logit contributions
ocol-0.017
 rib-0.014
 gymn-0.013
 microsc-0.013
rou-0.012
Token other
Token ID584
Feature activation-3.65e-01

Pos logit contributions
ocol+0.005
 rib+0.004
 microsc+0.004
 gymn+0.004
rou+0.004
Neg logit contributions
Maybe-0.002
 someone-0.001
 dad-0.001
 one-0.001
Mom-0.001
Token side
Token ID1735
Feature activation+6.63e-02

Pos logit contributions
Maybe+0.005
 someone+0.004
How+0.004
 dad+0.004
Mom+0.004
Neg logit contributions
ocol-0.018
 rib-0.016
 gymn-0.015
 microsc-0.015
 gol-0.014
Token,
Token ID11
Feature activation+1.43e-01

Pos logit contributions
ocol+0.003
 rib+0.002
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
How-0.001
Mom-0.001
Token the
Token ID262
Feature activation-3.90e-01

Pos logit contributions
ocol+0.008
 rib+0.007
 microsc+0.007
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
 one-0.001
Mom-0.001
Token dog
Token ID3290
Feature activation+1.68e-01

Pos logit contributions
Maybe+0.008
How+0.008
 someone+0.008
What+0.007
Nothing+0.007
Neg logit contributions
ocol-0.016
 rib-0.014
 gymn-0.013
 gol-0.013
 microsc-0.013
Token ran
Token ID4966
Feature activation-7.55e-02

Pos logit contributions
ocol+0.008
 rib+0.007
 microsc+0.006
 gymn+0.006
rou+0.006
Neg logit contributions
Maybe-0.003
 someone-0.003
 dad-0.003
 one-0.003
Mom-0.003
Token towards
Token ID3371
Feature activation-9.98e-01

Pos logit contributions
Maybe+0.001
 someone+0.001
 dad+0.001
Mom+0.001
How+0.001
Neg logit contributions
ocol-0.003
 rib-0.003
 microsc-0.003
 gymn-0.003
rou-0.003
Token them
Token ID606
Feature activation-4.57e-01

Pos logit contributions
Maybe+0.024
Nothing+0.023
How+0.022
Someone+0.021
Mom+0.021
Neg logit contributions
ocol-0.038
 rib-0.033
 microsc-0.030
 gymn-0.030
 topp-0.028
Token came
Token ID1625
Feature activation+8.08e-02

Pos logit contributions
ocol+0.054
 microsc+0.045
 rib+0.044
 gymn+0.043
rou+0.042
Neg logit contributions
Maybe-0.014
 someone-0.014
 one-0.014
 next-0.013
 turned-0.013
Token and
Token ID290
Feature activation-1.88e-01

Pos logit contributions
ocol+0.004
 rib+0.003
 microsc+0.003
 gymn+0.003
rou+0.003
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
 one-0.001
Mom-0.001
Token washed
Token ID18989
Feature activation+4.11e-01

Pos logit contributions
Maybe+0.003
 someone+0.003
Mom+0.003
 dad+0.003
How+0.003
Neg logit contributions
ocol-0.009
 rib-0.008
 microsc-0.007
 gymn-0.007
rou-0.007
Token away
Token ID1497
Feature activation+2.80e-01

Pos logit contributions
ocol+0.020
 rib+0.017
 microsc+0.016
 gymn+0.016
rou+0.016
Neg logit contributions
 someone-0.007
Maybe-0.007
 one-0.007
 dad-0.007
 next-0.006
Token the
Token ID262
Feature activation-1.47e-01

Pos logit contributions
ocol+0.014
 rib+0.011
 microsc+0.011
 gymn+0.011
rou+0.011
Neg logit contributions
Maybe-0.005
 someone-0.005
 one-0.004
 dad-0.004
 next-0.004
Token whale
Token ID22206
Feature activation-5.54e-01

Pos logit contributions
Maybe+0.003
 someone+0.003
How+0.003
Mom+0.003
 one+0.003
Neg logit contributions
ocol-0.007
 rib-0.006
 microsc-0.005
 gymn-0.005
 topp-0.005
Token's
Token ID338
Feature activation+6.09e-01

Pos logit contributions
Maybe+0.013
Mom+0.013
 someone+0.013
How+0.012
What+0.012
Neg logit contributions
ocol-0.022
 rib-0.019
 microsc-0.018
rou-0.017
 gymn-0.017
Token painting
Token ID12036
Feature activation-1.05e-01

Pos logit contributions
ocol+0.027
 rib+0.022
 microsc+0.021
 gymn+0.021
rou+0.021
Neg logit contributions
 someone-0.013
 dad-0.013
 one-0.012
Maybe-0.012
 next-0.012
Token.
Token ID13
Feature activation+2.75e-01

Pos logit contributions
Maybe+0.002
 someone+0.002
 dad+0.002
Mom+0.002
How+0.002
Neg logit contributions
ocol-0.005
 rib-0.004
 microsc-0.004
 gymn-0.004
rou-0.004
Token The
Token ID383
Feature activation-1.09e+00

Pos logit contributions
ocol+0.011
 rib+0.009
 microsc+0.009
 gymn+0.008
rou+0.008
Neg logit contributions
Maybe-0.007
 someone-0.007
 one-0.007
 dad-0.006
 next-0.006
Token whale
Token ID22206
Feature activation+3.22e-01

Pos logit contributions
Maybe+0.029
How+0.027
iffs+0.026
They+0.026
Nothing+0.026
Neg logit contributions
ocol-0.042
 rib-0.035
 topp-0.033
 microsc-0.032
 gol-0.032
Token a
Token ID257
Feature activation+3.65e-01

Pos logit contributions
ocol+0.019
 rib+0.017
 microsc+0.016
 gymn+0.016
rou+0.016
Neg logit contributions
Maybe-0.002
 someone-0.002
 one-0.002
 dad-0.002
 next-0.002
Token time
Token ID640
Feature activation+5.08e-01

Pos logit contributions
ocol+0.016
 rib+0.013
 microsc+0.012
 gymn+0.012
rou+0.012
Neg logit contributions
Maybe-0.008
 someone-0.008
 one-0.008
 dad-0.008
 next-0.008
Token,
Token ID11
Feature activation+1.07e-01

Pos logit contributions
ocol+0.026
 rib+0.022
 microsc+0.021
rou+0.021
 gymn+0.021
Neg logit contributions
 someone-0.007
Maybe-0.007
 one-0.007
 dad-0.007
 next-0.007
Token there
Token ID612
Feature activation+9.10e-01

Pos logit contributions
ocol+0.007
 rib+0.006
 microsc+0.005
 gymn+0.005
rou+0.005
Neg logit contributions
Maybe-0.001
 someone-0.000
 dad-0.000
 one-0.000
Mom-0.000
Token was
Token ID373
Feature activation+8.40e-01

Pos logit contributions
ocol+0.049
 rib+0.041
 microsc+0.039
rou+0.038
 former+0.038
Neg logit contributions
 someone-0.015
Maybe-0.014
 one-0.014
 everything-0.013
 dad-0.013
Token a
Token ID257
Feature activation-2.22e-01

Pos logit contributions
ocol+0.055
 rib+0.047
 microsc+0.046
rou+0.045
 gymn+0.045
Neg logit contributions
 one-0.003
 someone-0.003
Maybe-0.002
 dad-0.002
 something-0.002
Token little
Token ID1310
Feature activation-6.63e-01

Pos logit contributions
Maybe+0.005
 someone+0.005
 one+0.004
How+0.004
 somewhere+0.004
Neg logit contributions
ocol-0.010
 rib-0.008
 gymn-0.008
 microsc-0.008
rou-0.007
Token boy
Token ID2933
Feature activation-4.49e-01

Pos logit contributions
Maybe+0.014
 someone+0.013
How+0.013
What+0.012
 somewhere+0.012
Neg logit contributions
ocol-0.030
 rib-0.026
 gol-0.024
 microsc-0.023
 gymn-0.023
Token named
Token ID3706
Feature activation+8.83e-01

Pos logit contributions
Maybe+0.011
 someone+0.011
How+0.010
What+0.010
 dad+0.010
Neg logit contributions
ocol-0.018
 rib-0.014
 microsc-0.014
opot-0.014
rou-0.014
Token Tim
Token ID5045
Feature activation-4.02e-01

Pos logit contributions
ocol+0.043
 rib+0.036
pill+0.034
rou+0.033
 former+0.033
Neg logit contributions
 someone-0.019
 dad-0.019
 one-0.019
 next-0.018
 mom-0.018
Tokenmy
Token ID1820
Feature activation-2.23e-01

Pos logit contributions
Maybe+0.007
 someone+0.007
 dad+0.007
How+0.007
Nothing+0.006
Neg logit contributions
ocol-0.018
 rib-0.015
rou-0.015
 microsc-0.014
 gymn-0.014
Token the
Token ID262
Feature activation+6.15e-01

Pos logit contributions
Maybe+0.019
Nothing+0.018
 dad+0.018
Someone+0.017
Mom+0.017
Neg logit contributions
ocol-0.031
 rib-0.026
 microsc-0.024
 gymn-0.024
 gol-0.022
Token toy
Token ID13373
Feature activation-1.27e-01

Pos logit contributions
ocol+0.032
 rib+0.026
 microsc+0.025
pill+0.025
rou+0.025
Neg logit contributions
 one-0.010
 someone-0.010
 next-0.010
Maybe-0.010
 dad-0.010
Token car
Token ID1097
Feature activation-5.62e-01

Pos logit contributions
Maybe+0.002
 someone+0.002
Mom+0.002
 dad+0.002
How+0.002
Neg logit contributions
ocol-0.006
 rib-0.005
 gymn-0.005
 microsc-0.005
 gol-0.005
Token and
Token ID290
Feature activation-3.74e-01

Pos logit contributions
Maybe+0.012
Mom+0.011
 someone+0.011
How+0.010
What+0.010
Neg logit contributions
ocol-0.024
 rib-0.021
 gymn-0.020
 microsc-0.020
rou-0.019
Token thought
Token ID1807
Feature activation+4.72e-01

Pos logit contributions
Maybe+0.007
Mom+0.007
 someone+0.007
 somewhere+0.007
 next+0.007
Neg logit contributions
ocol-0.015
 rib-0.014
 microsc-0.013
 gymn-0.013
 topp-0.012
Token it
Token ID340
Feature activation-1.13e+00

Pos logit contributions
ocol+0.025
 rib+0.021
 microsc+0.021
rou+0.020
 gymn+0.020
Neg logit contributions
 someone-0.006
 one-0.006
Maybe-0.006
 dad-0.006
 next-0.005
Token was
Token ID373
Feature activation-6.70e-01

Pos logit contributions
 dad+0.037
Mom+0.037
 someone+0.036
Mother+0.036
Dad+0.036
Neg logit contributions
 rib-0.029
 gymn-0.027
ocol-0.025
 microsc-0.024
 topp-0.023
Token fun
Token ID1257
Feature activation-1.73e-01

Pos logit contributions
Maybe+0.014
Mom+0.013
Parent+0.012
Nothing+0.012
Someone+0.012
Neg logit contributions
ocol-0.028
 rib-0.025
 microsc-0.024
 gymn-0.024
 topp-0.022
Token.
Token ID13
Feature activation-3.60e-01

Pos logit contributions
Maybe+0.004
Mom+0.004
 someone+0.004
 dad+0.004
How+0.004
Neg logit contributions
ocol-0.007
 rib-0.006
 microsc-0.006
 gymn-0.006
rou-0.006
Token\n
Token ID198
Feature activation-5.50e-01

Pos logit contributions
Maybe+0.007
 someone+0.007
 dad+0.007
Mom+0.007
 one+0.007
Neg logit contributions
ocol-0.015
 rib-0.013
 microsc-0.012
 gymn-0.012
pill-0.012
Token\n
Token ID198
Feature activation-3.47e-01

Pos logit contributions
Maybe+0.010
 someone+0.010
 dad+0.010
 one+0.010
How+0.010
Neg logit contributions
ocol-0.024
 rib-0.020
 microsc-0.019
 gymn-0.019
rou-0.019
Token says
Token ID1139
Feature activation+3.42e-01

Pos logit contributions
ocol+0.007
 rib+0.006
 microsc+0.005
 gymn+0.005
rou+0.005
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.004
 dad-0.004
Mom-0.004
Token.
Token ID13
Feature activation-3.22e-01

Pos logit contributions
ocol+0.018
 rib+0.015
 microsc+0.015
 gymn+0.014
rou+0.014
Neg logit contributions
 someone-0.005
Maybe-0.005
 one-0.004
 dad-0.004
 next-0.004
Token\n
Token ID198
Feature activation-1.47e-01

Pos logit contributions
Maybe+0.006
 someone+0.006
 one+0.006
 next+0.005
 somewhere+0.005
Neg logit contributions
ocol-0.014
 rib-0.012
 microsc-0.012
 gymn-0.011
pill-0.011
Token\n
Token ID198
Feature activation+2.79e-01

Pos logit contributions
 someone+0.003
Maybe+0.002
 one+0.002
 dad+0.002
How+0.002
Neg logit contributions
ocol-0.007
 rib-0.006
 microsc-0.006
 gymn-0.005
rou-0.005
Token"
Token ID1
Feature activation-1.51e-01

Pos logit contributions
ocol+0.015
 rib+0.013
 microsc+0.012
 gymn+0.012
rou+0.012
Neg logit contributions
Maybe-0.004
 someone-0.003
 dad-0.003
 one-0.003
Mom-0.003
TokenWow
Token ID22017
Feature activation-1.14e+00

Pos logit contributions
 someone+0.003
 one+0.003
 dad+0.003
 everything+0.003
 somewhere+0.003
Neg logit contributions
ocol-0.007
 rib-0.006
 gymn-0.005
 microsc-0.005
 gol-0.005
Token,
Token ID11
Feature activation+3.78e-01

Pos logit contributions
Maybe+0.034
How+0.032
Where+0.031
What+0.031
Mom+0.030
Neg logit contributions
ocol-0.041
 rib-0.035
 microsc-0.030
 gymn-0.030
rou-0.029
Token Mom
Token ID11254
Feature activation-8.41e-03

Pos logit contributions
ocol+0.020
 rib+0.017
 microsc+0.016
 gymn+0.016
rou+0.016
Neg logit contributions
 someone-0.005
Maybe-0.005
 dad-0.005
 one-0.005
 next-0.004
Token,
Token ID11
Feature activation+3.70e-01

Pos logit contributions
Maybe+0.000
 someone+0.000
 dad+0.000
How+0.000
Mom+0.000
Neg logit contributions
ocol-0.000
 rib-0.000
 microsc-0.000
rou-0.000
 gymn-0.000
Token that
Token ID326
Feature activation-8.67e-05

Pos logit contributions
ocol+0.020
 rib+0.017
 microsc+0.016
 gymn+0.016
rou+0.016
Neg logit contributions
 someone-0.005
Maybe-0.005
 dad-0.004
 one-0.004
 next-0.004
Token's
Token ID338
Feature activation-2.95e-01

Pos logit contributions
Maybe+0.000
 someone+0.000
 dad+0.000
Mom+0.000
How+0.000
Neg logit contributions
ocol-0.000
 rib-0.000
 microsc-0.000
 gymn-0.000
 topp-0.000
Token\n
Token ID198
Feature activation-6.01e-01

Pos logit contributions
Maybe+0.013
 dad+0.013
 one+0.013
 someone+0.013
How+0.013
Neg logit contributions
ocol-0.027
 rib-0.022
 microsc-0.022
 gymn-0.022
pri-0.021
TokenSp
Token ID4561
Feature activation+4.94e-02

Pos logit contributions
 dad+0.015
 someone+0.014
 one+0.014
 somewhere+0.013
Maybe+0.013
Neg logit contributions
ocol-0.023
 microsc-0.020
 rib-0.019
 gymn-0.019
rou-0.018
Tokenide
Token ID485
Feature activation-2.08e-01

Pos logit contributions
ocol+0.002
 rib+0.001
 microsc+0.001
rou+0.001
 gymn+0.001
Neg logit contributions
 someone-0.002
Maybe-0.002
 one-0.002
 dad-0.002
 next-0.002
Tokeny
Token ID88
Feature activation-7.71e-01

Pos logit contributions
Maybe+0.006
 someone+0.006
 dad+0.006
How+0.006
Mom+0.006
Neg logit contributions
ocol-0.007
 rib-0.006
 microsc-0.006
rou-0.005
 gymn-0.005
Token thought
Token ID1807
Feature activation+7.39e-01

Pos logit contributions
Maybe+0.017
What+0.016
How+0.016
 someone+0.016
Mom+0.016
Neg logit contributions
ocol-0.029
 rib-0.026
 microsc-0.024
 gymn-0.024
rou-0.024
Token,
Token ID11
Feature activation-1.05e+00

Pos logit contributions
ocol+0.032
 rib+0.025
 gol+0.024
ifying+0.024
 microsc+0.024
Neg logit contributions
 someone-0.018
 one-0.017
 dad-0.017
 everything-0.017
 next-0.016
Token "
Token ID366
Feature activation-3.48e-01

Pos logit contributions
Maybe+0.024
Mom+0.024
 dad+0.023
cil+0.022
Nothing+0.022
Neg logit contributions
ocol-0.037
 rib-0.036
 gymn-0.032
 microsc-0.031
pill-0.029
TokenI
Token ID40
Feature activation+9.84e-01

Pos logit contributions
 dad+0.006
 someone+0.005
 one+0.005
 what+0.005
 somewhere+0.005
Neg logit contributions
ocol-0.016
 rib-0.014
 gymn-0.014
 microsc-0.013
pill-0.013
Token need
Token ID761
Feature activation+9.04e-02

Pos logit contributions
ocol+0.048
 rib+0.040
 microsc+0.037
 topp+0.036
pill+0.036
Neg logit contributions
Maybe-0.020
 one-0.020
 someone-0.019
 looked-0.018
 there-0.018
Token to
Token ID284
Feature activation-3.13e-01

Pos logit contributions
ocol+0.004
 rib+0.003
 microsc+0.003
 gymn+0.003
rou+0.003
Neg logit contributions
Maybe-0.002
 someone-0.002
 dad-0.002
Mom-0.002
 one-0.002
Token prevent
Token ID2948
Feature activation-7.25e-01

Pos logit contributions
Maybe+0.006
 someone+0.006
How+0.006
 dad+0.006
What+0.006
Neg logit contributions
ocol-0.014
 rib-0.012
 gymn-0.011
 microsc-0.011
 topp-0.011
Token She
Token ID1375
Feature activation+6.45e-02

Pos logit contributions
ocol+0.004
 rib+0.004
 microsc+0.003
 gymn+0.003
rou+0.003
Neg logit contributions
Maybe-0.002
 someone-0.002
 dad-0.002
 one-0.002
 next-0.002
Token felt
Token ID2936
Feature activation-6.52e-01

Pos logit contributions
ocol+0.003
 rib+0.003
 microsc+0.002
 gymn+0.002
rou+0.002
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
 one-0.001
 next-0.001
Token helpless
Token ID21144
Feature activation-3.07e-01

Pos logit contributions
Maybe+0.013
Mom+0.012
How+0.012
 dad+0.012
cil+0.011
Neg logit contributions
ocol-0.030
 rib-0.024
 microsc-0.023
 gymn-0.022
ifying-0.022
Token and
Token ID290
Feature activation+3.98e-02

Pos logit contributions
Maybe+0.006
Mom+0.006
How+0.005
 someone+0.005
 dad+0.005
Neg logit contributions
ocol-0.014
 rib-0.012
rou-0.011
 gymn-0.011
 microsc-0.011
Token didn
Token ID1422
Feature activation+1.31e-01

Pos logit contributions
ocol+0.002
 rib+0.001
 microsc+0.001
 gymn+0.001
rou+0.001
Neg logit contributions
Maybe-0.001
 someone-0.001
 dad-0.001
 one-0.001
Mom-0.001
Token't
Token ID470
Feature activation-1.02e+00

Pos logit contributions
ocol+0.006
 rib+0.005
 microsc+0.004
 gymn+0.004
rou+0.004
Neg logit contributions
Maybe-0.003
 someone-0.003
 dad-0.003
 one-0.003
 next-0.003
Token know
Token ID760
Feature activation-5.73e-01

Pos logit contributions
Mom+0.028
Maybe+0.027
 dad+0.027
How+0.027
ths+0.026
Neg logit contributions
ocol-0.035
 rib-0.029
 gymn-0.029
 microsc-0.027
free-0.027
Token what
Token ID644
Feature activation+3.12e-01

Pos logit contributions
Maybe+0.007
Mom+0.006
How+0.005
 dad+0.005
Dad+0.005
Neg logit contributions
ocol-0.031
 rib-0.027
 microsc-0.026
rou-0.025
 gymn-0.025
Token to
Token ID284
Feature activation+5.58e-01

Pos logit contributions
ocol+0.012
 rib+0.009
 microsc+0.009
rou+0.008
 gymn+0.008
Neg logit contributions
 someone-0.009
Maybe-0.009
 one-0.009
 dad-0.008
 next-0.008
Token do
Token ID466
Feature activation+2.62e-02

Pos logit contributions
ocol+0.022
 rib+0.017
 microsc+0.016
pill+0.016
rou+0.016
Neg logit contributions
 someone-0.016
Maybe-0.016
 dad-0.016
 one-0.015
 next-0.015
Token.
Token ID13
Feature activation-2.52e-01

Pos logit contributions
ocol+0.001
 rib+0.001
 microsc+0.001
 gymn+0.001
rou+0.001
Neg logit contributions
Maybe-0.000
 someone-0.000
 dad-0.000
 one-0.000
Mom-0.000
Token you
Token ID345
Feature activation-3.04e-01

Pos logit contributions
ocol+0.010
 rib+0.008
 microsc+0.008
 gymn+0.007
rou+0.007
Neg logit contributions
Maybe-0.004
 someone-0.004
 dad-0.004
 one-0.004
Mom-0.004
Token."
Token ID526
Feature activation-5.52e-01

Pos logit contributions
Maybe+0.007
 someone+0.006
 dad+0.006
How+0.006
What+0.006
Neg logit contributions
ocol-0.013
 rib-0.011
 gymn-0.010
 microsc-0.010
rou-0.010
Token<|endoftext|>
Token ID50256
Feature activation+2.24e-01

Pos logit contributions
Maybe+0.012
 someone+0.011
 one+0.011
ble+0.011
 somewhere+0.011
Neg logit contributions
ocol-0.021
 rib-0.018
 microsc-0.017
 gymn-0.017
pill-0.017
TokenAnna
Token ID31160
Feature activation-8.46e-02

Pos logit contributions
ocol+0.011
 rib+0.009
 microsc+0.009
 gymn+0.009
rou+0.009
Neg logit contributions
Maybe-0.004
 someone-0.004
 one-0.003
 dad-0.003
 next-0.003
Token and
Token ID290
Feature activation-8.44e-02

Pos logit contributions
 someone+0.001
Maybe+0.001
 dad+0.001
 one+0.001
How+0.001
Neg logit contributions
ocol-0.004
 rib-0.003
 microsc-0.003
 gymn-0.003
rou-0.003
Token Ben
Token ID3932
Feature activation-1.03e+00

Pos logit contributions
Maybe+0.002
 someone+0.002
How+0.002
 next+0.002
 one+0.002
Neg logit contributions
ocol-0.004
 rib-0.003
 microsc-0.003
 gymn-0.003
 gol-0.003
Token are
Token ID389
Feature activation+4.12e-01

Pos logit contributions
 someone+0.022
 somewhere+0.020
How+0.020
Nothing+0.019
 next+0.019
Neg logit contributions
ocol-0.043
 rib-0.038
 microsc-0.034
opot-0.034
 topp-0.034
Token best
Token ID1266
Feature activation+1.35e+00

Pos logit contributions
ocol+0.019
 rib+0.016
 microsc+0.015
 gymn+0.015
 gol+0.014
Neg logit contributions
 dad-0.008
 someone-0.008
 one-0.008
Maybe-0.008
 next-0.008
Token friends
Token ID2460
Feature activation-5.26e-01

Pos logit contributions
ocol+0.080
ifying+0.069
spe+0.068
 rib+0.068
 microsc+0.068
Neg logit contributions
 dad-0.014
 mom-0.012
 one-0.010
 next-0.009
 someone-0.008
Token.
Token ID13
Feature activation+2.06e-02

Pos logit contributions
Maybe+0.011
 someone+0.010
How+0.010
What+0.010
Nothing+0.010
Neg logit contributions
ocol-0.022
 rib-0.019
rou-0.018
 microsc-0.017
 gymn-0.017
Token They
Token ID1119
Feature activation-3.71e-02

Pos logit contributions
ocol+0.001
 rib+0.001
 microsc+0.001
 gymn+0.001
 topp+0.001
Neg logit contributions
Maybe-0.000
 someone-0.000
 dad-0.000
 one-0.000
 next-0.000