TOP ACTIVATIONS
MAX = 5.445

They pick up the cr ay ons . They draw a
he would share his cr ay ons with Lily . He
. He drops the ash tr ay . It breaks .
Can we share the cr ay on ?" They
Lily accidentally dropped the ash tr ay and it broke into
picked up her pink cr ay on , and started to
cleaned up the broken ash tr ay and Lily learned to
apologized for breaking the cr ay on and promised to be
throw away the broken ash tr ay and soon forgotten about
. She writes with cr ay ons , pens , pencil
Tim used his cr ay ons to draw on the
. She grabbed her cr ay ons and started to fill
would take out a j igsaw puzzle and try to solve
, it caused the ash tr ay to scatter all over
. She brought her cr ay ons and a big paper
. They have many cr ay ons and papers . They
draw and shared the cr ay ons . They were very
It was a big j igsaw puzzle and she wanted to
. He wants his cr ay ons back .
to fight over the cr ay ons . They yelled and

MIN ACTIVATIONS
MIN = -5.017

the little girl and the healthy goose spun together all around
The little girl and the healthy goose were both very tired
after that . Whenever the chip m unk needed help ,
and ran around as the salty water tick led their toes
because the frog and the chip m unk had promised not
their faces happy and their tongues glad . "
, like mint and rose mary . They liked to see
like carrots and peas and lettuce . One day
ish to surrender until the rad ish felt exhausted .
make the fish and the cor als sad ."
leave !" Suddenly , the chip mun ks were free again
didn 't stop until their stomach s were full .
for his knee and his elbow . He let him ride
make the fish and the cor als sad . They wanted
and lady bugs and cater pill ars ." Tim
has cheese and ham and lettuce . It looks y ummy
you ". The bad ger was very surprised .
she asked herself if these berries were safe to eat .
reached her hand out and tasted one . It was so
sweets with his fingers and tasted each one until he finally

INTERVAL 2.829 - 5.445
CONTAINS 0.046%

sad and disappointed . No matter how hard he worked ,
her backpack but it was nowhere to be found .
says this is a fun way to play with the red
, Lily and Emma shared their toys and candy together .
to replace it with a new one ."

INTERVAL 0.214 - 2.829
CONTAINS 39.862%

car practiced going fast together . They raced and raced until
soon . Sara had a beautiful white veil that she would
a disappointed look , she said , " I told you
ill heard Tom 's cry and ran to help him .
He told Jill it was a special dinosaur . He said

INTERVAL -2.402 - 0.214
CONTAINS 59.758%

idea what it was . The machine suddenly replaced the blue
animal trying to get attention . From then on , the
smile . It said , " Thank you , thank you
bed . The princess was so happy . She
losing her glove , so she asked her mom for help

INTERVAL -5.017 - -2.402
CONTAINS 0.335%

of soup with noodles and vegetables . Tom looks at Lily
The girl felt happy and amazed . This
bring him two delicious w affles . John loved
She wanted to know why the river had flooded .
It opened its mouth and swallowed the trash . Lily and
Token They
Token ID1119
Feature activation+4.19e-01

Pos logit contributions
oys+0.000
 globe+0.000
 handshake+0.000
 compartment+0.000
fs+0.000
Neg logit contributions
please-0.000
 gor-0.000
 chickens-0.000
 ge-0.000
 bored-0.000
Token pick
Token ID2298
Feature activation-7.24e-02

Pos logit contributions
oys+0.008
 handshake+0.008
 box+0.008
 globe+0.007
fs+0.007
Neg logit contributions
please-0.016
 gor-0.015
 chickens-0.015
 pigs-0.014
 ge-0.014
Token up
Token ID510
Feature activation+6.05e-01

Pos logit contributions
please+0.002
 chickens+0.002
 gor+0.002
 ge+0.002
 pigs+0.002
Neg logit contributions
oys-0.002
fs-0.002
 compartment-0.002
kin-0.002
 handshake-0.002
Token the
Token ID262
Feature activation+1.02e+00

Pos logit contributions
oys+0.019
fs+0.018
 compartment+0.017
kin+0.017
 handshake+0.017
Neg logit contributions
please-0.017
 chickens-0.016
 gor-0.016
 ge-0.015
 pigs-0.015
Token cr
Token ID1067
Feature activation+2.95e+00

Pos logit contributions
oys+0.022
fs+0.020
 compartment+0.019
kin+0.018
 handshake+0.018
Neg logit contributions
please-0.038
 chickens-0.036
 gor-0.036
 ge-0.035
 pigs-0.034
Tokenay
Token ID323
Feature activation+5.44e+00

Pos logit contributions
 seek+0.062
 handshake+0.057
fs+0.056
oys+0.055
chery+0.054
Neg logit contributions
please-0.100
 North-0.091
 chickens-0.089
der-0.089
 gor-0.088
Tokenons
Token ID684
Feature activation+5.18e-02

Pos logit contributions
 seek+0.193
acle+0.184
suit+0.179
belt+0.176
 locker+0.175
Neg logit contributions
please-0.107
 tur-0.099
 chickens-0.099
opl-0.096
 gor-0.094
Token.
Token ID13
Feature activation-3.66e-02

Pos logit contributions
oys+0.002
fs+0.002
 compartment+0.001
 handshake+0.001
 globe+0.001
Neg logit contributions
please-0.001
 chickens-0.001
 gor-0.001
 ge-0.001
 pigs-0.001
Token They
Token ID1119
Feature activation+4.69e-01

Pos logit contributions
please+0.001
 gor+0.001
 chickens+0.001
 ge+0.001
 bored+0.001
Neg logit contributions
oys-0.001
 handshake-0.001
 globe-0.001
 compartment-0.001
fs-0.001
Token draw
Token ID3197
Feature activation+8.17e-01

Pos logit contributions
oys+0.009
 handshake+0.009
 globe+0.009
 box+0.009
fs+0.009
Neg logit contributions
please-0.018
 gor-0.016
 chickens-0.016
 ge-0.016
 North-0.016
Token a
Token ID257
Feature activation+1.64e+00

Pos logit contributions
oys+0.022
fs+0.021
 compartment+0.020
kin+0.020
 handshake+0.020
Neg logit contributions
please-0.026
 chickens-0.024
 gor-0.024
 ge-0.023
 pigs-0.022
Token he
Token ID339
Feature activation-1.35e+00

Pos logit contributions
oys+0.003
fs+0.003
kin+0.003
 globe+0.003
 handshake+0.003
Neg logit contributions
please-0.008
 gor-0.007
 chickens-0.007
 pasture-0.007
 North-0.007
Token would
Token ID561
Feature activation-4.63e-01

Pos logit contributions
please+0.046
 gor+0.043
 chickens+0.043
 ge+0.042
 pigs+0.040
Neg logit contributions
oys-0.033
fs-0.031
 compartment-0.030
 globe-0.029
 handshake-0.029
Token share
Token ID2648
Feature activation-4.81e-01

Pos logit contributions
please+0.016
 gor+0.015
 chickens+0.015
 ge+0.015
 pigs+0.014
Neg logit contributions
oys-0.011
fs-0.010
 handshake-0.010
 compartment-0.010
 globe-0.010
Token his
Token ID465
Feature activation+2.57e+00

Pos logit contributions
please+0.014
 gor+0.013
 chickens+0.013
 ge+0.013
 pigs+0.012
Neg logit contributions
oys-0.014
fs-0.013
 compartment-0.013
kin-0.013
 handshake-0.012
Token cr
Token ID1067
Feature activation+3.17e+00

Pos logit contributions
oys+0.065
fs+0.061
 compartment+0.058
lock+0.052
 globe+0.052
Neg logit contributions
please-0.084
 chickens-0.084
 ge-0.083
 gor-0.082
 pigs-0.078
Tokenay
Token ID323
Feature activation+5.14e+00

Pos logit contributions
 compartment+0.069
 globe+0.067
oys+0.067
 torch+0.066
 seek+0.066
Neg logit contributions
please-0.107
 North-0.085
 chickens-0.084
 gor-0.083
opl-0.082
Tokenons
Token ID684
Feature activation+3.66e-01

Pos logit contributions
 compartment+0.166
suit+0.162
belt+0.156
acle+0.152
 locker+0.150
Neg logit contributions
please-0.142
 chickens-0.115
 tur-0.114
 ge-0.105
opl-0.104
Token with
Token ID351
Feature activation+2.62e-01

Pos logit contributions
oys+0.010
fs+0.010
 compartment+0.010
 handshake+0.009
 globe+0.009
Neg logit contributions
please-0.011
 chickens-0.010
 gor-0.010
 ge-0.010
 bored-0.010
Token Lily
Token ID20037
Feature activation-9.91e-02

Pos logit contributions
oys+0.007
fs+0.007
kin+0.006
 globe+0.006
 handshake+0.006
Neg logit contributions
please-0.008
 gor-0.008
 chickens-0.008
 ge-0.007
 North-0.007
Token.
Token ID13
Feature activation-2.31e-01

Pos logit contributions
please+0.004
 gor+0.004
 ge+0.003
 North+0.003
 chickens+0.003
Neg logit contributions
oys-0.002
 box-0.002
kin-0.002
fs-0.002
 globe-0.002
Token He
Token ID679
Feature activation+7.89e-01

Pos logit contributions
please+0.007
 gor+0.006
 chickens+0.006
 ge+0.006
 pigs+0.006
Neg logit contributions
oys-0.007
fs-0.006
 compartment-0.006
 globe-0.006
kin-0.006
Token.
Token ID13
Feature activation-8.36e-01

Pos logit contributions
please+0.052
 gor+0.047
 North+0.046
 ge+0.045
 chickens+0.045
Neg logit contributions
oys-0.027
fs-0.024
kin-0.023
 handshake-0.022
 box-0.022
Token He
Token ID679
Feature activation-2.51e-01

Pos logit contributions
please+0.029
 chickens+0.027
 gor+0.026
 ge+0.025
 pigs+0.024
Neg logit contributions
oys-0.021
fs-0.019
 compartment-0.019
 globe-0.018
 handshake-0.018
Token drops
Token ID10532
Feature activation+7.95e-02

Pos logit contributions
please+0.009
 chickens+0.008
 gor+0.008
 pigs+0.007
 ge+0.007
Neg logit contributions
oys-0.006
fs-0.006
 compartment-0.006
 handshake-0.006
 box-0.006
Token the
Token ID262
Feature activation+1.15e+00

Pos logit contributions
oys+0.002
fs+0.002
 compartment+0.002
 handshake+0.002
 globe+0.002
Neg logit contributions
please-0.002
 chickens-0.002
 gor-0.002
 ge-0.002
 pigs-0.002
Token ash
Token ID12530
Feature activation+8.25e-01

Pos logit contributions
oys+0.027
fs+0.025
 globe+0.025
kin+0.025
 compartment+0.025
Neg logit contributions
please-0.042
 gor-0.037
 chickens-0.036
 ge-0.035
 bored-0.035
Tokentr
Token ID2213
Feature activation+4.99e+00

Pos logit contributions
oys+0.020
fs+0.018
 globe+0.018
 compartment+0.018
kin+0.017
Neg logit contributions
please-0.030
 gor-0.027
 chickens-0.027
 ge-0.026
 pigs-0.025
Tokenay
Token ID323
Feature activation+4.57e+00

Pos logit contributions
oys+0.090
fs+0.086
chery+0.086
 handshake+0.086
 seek+0.083
Neg logit contributions
please-0.185
 gor-0.175
 bored-0.169
 chickens-0.167
 North-0.167
Token.
Token ID13
Feature activation-6.64e-01

Pos logit contributions
oys+0.126
 handshake+0.121
fs+0.120
 compartment+0.118
kin+0.116
Neg logit contributions
please-0.135
 gor-0.127
 chickens-0.125
 ge-0.118
 pigs-0.117
Token It
Token ID632
Feature activation+1.35e+00

Pos logit contributions
please+0.022
 chickens+0.020
 gor+0.020
 ge+0.019
 pigs+0.018
Neg logit contributions
oys-0.018
fs-0.017
 compartment-0.016
 globe-0.016
kin-0.016
Token breaks
Token ID9457
Feature activation+1.86e-01

Pos logit contributions
oys+0.027
fs+0.027
 compartment+0.026
 box+0.026
 globe+0.026
Neg logit contributions
please-0.052
 chickens-0.047
 gor-0.046
 ge-0.045
 pigs-0.044
Token.
Token ID13
Feature activation-3.38e-01

Pos logit contributions
oys+0.005
 compartment+0.004
fs+0.004
 globe+0.004
kin+0.004
Neg logit contributions
please-0.007
 gor-0.006
 chickens-0.006
 ge-0.006
 pasture-0.005
Token Can
Token ID1680
Feature activation+7.85e-01

Pos logit contributions
fs+0.003
lf+0.003
 compartment+0.003
kin+0.003
oys+0.003
Neg logit contributions
please-0.003
 chickens-0.003
 ge-0.003
 gor-0.003
 bored-0.002
Token we
Token ID356
Feature activation-5.87e-01

Pos logit contributions
 compartment+0.027
suit+0.026
fs+0.025
oys+0.025
 torch+0.025
Neg logit contributions
please-0.018
 bored-0.018
 asks-0.017
 decides-0.016
 going-0.016
Token share
Token ID2648
Feature activation-4.06e-01

Pos logit contributions
please+0.020
 gor+0.019
 chickens+0.019
 ge+0.018
 pigs+0.018
Neg logit contributions
oys-0.014
fs-0.014
 compartment-0.013
 handshake-0.013
 globe-0.013
Token the
Token ID262
Feature activation+2.56e+00

Pos logit contributions
please+0.011
 chickens+0.010
 ge+0.010
 gor+0.010
 bored+0.009
Neg logit contributions
oys-0.013
fs-0.013
kin-0.012
 compartment-0.012
 handshake-0.012
Token cr
Token ID1067
Feature activation+2.96e+00

Pos logit contributions
oys+0.062
fs+0.060
belt+0.056
han+0.054
 handshake+0.053
Neg logit contributions
please-0.087
 chickens-0.081
 ge-0.080
 bored-0.079
 gor-0.076
Tokenay
Token ID323
Feature activation+4.98e+00

Pos logit contributions
oys+0.053
fs+0.052
 compartment+0.051
 handshake+0.051
 torch+0.048
Neg logit contributions
please-0.114
 North-0.097
 gor-0.096
 chickens-0.095
 bored-0.091
Tokenon
Token ID261
Feature activation+1.70e+00

Pos logit contributions
belt+0.138
 compartment+0.137
 torch+0.135
suit+0.134
acle+0.128
Neg logit contributions
please-0.150
 tur-0.125
 ge-0.117
 chickens-0.113
opl-0.111
Token?"
Token ID1701
Feature activation-6.66e-01

Pos logit contributions
fs+0.051
oys+0.051
 compartment+0.049
 torch+0.046
chery+0.046
Neg logit contributions
please-0.048
 bored-0.040
 chickens-0.040
 ge-0.039
 gor-0.039
Token
Token ID198
Feature activation+1.55e-01

Pos logit contributions
please+0.023
 gor+0.022
 chickens+0.022
 ge+0.021
 pigs+0.020
Neg logit contributions
oys-0.016
fs-0.015
 handshake-0.014
 compartment-0.014
 globe-0.014
Token
Token ID198
Feature activation+2.72e-01

Pos logit contributions
oys+0.005
fs+0.005
 compartment+0.004
kin+0.004
 globe+0.004
Neg logit contributions
please-0.004
 chickens-0.004
 gor-0.004
 ge-0.004
 pigs-0.004
TokenThey
Token ID2990
Feature activation+7.02e-01

Pos logit contributions
fs+0.007
oys+0.007
 compartment+0.007
 globe+0.007
 handshake+0.007
Neg logit contributions
please-0.009
 ge-0.008
 chickens-0.008
 gor-0.007
 pigs-0.007
Token Lily
Token ID20037
Feature activation+1.97e-01

Pos logit contributions
oys+0.005
fs+0.005
 compartment+0.005
 handshake+0.004
 globe+0.004
Neg logit contributions
please-0.006
 chickens-0.006
 gor-0.006
 ge-0.006
 bored-0.006
Token accidentally
Token ID14716
Feature activation-1.37e+00

Pos logit contributions
oys+0.003
 box+0.003
 belt+0.003
lf+0.003
 compartment+0.003
Neg logit contributions
 gor-0.008
please-0.008
 chickens-0.007
 ge-0.007
 pigs-0.007
Token dropped
Token ID5710
Feature activation-1.88e-01

Pos logit contributions
please+0.043
 gor+0.041
 chickens+0.039
 ge+0.038
 North+0.037
Neg logit contributions
oys-0.037
fs-0.035
 compartment-0.034
kin-0.034
 handshake-0.033
Token the
Token ID262
Feature activation+2.11e+00

Pos logit contributions
please+0.007
 gor+0.006
 chickens+0.006
 ge+0.006
 North+0.005
Neg logit contributions
oys-0.005
 box-0.004
 compartment-0.004
fs-0.004
 globe-0.004
Token ash
Token ID12530
Feature activation+2.53e+00

Pos logit contributions
oys+0.067
fs+0.066
 handshake+0.062
kin+0.062
 compartment+0.060
Neg logit contributions
please-0.055
 gor-0.053
 chickens-0.052
 ge-0.050
 pigs-0.048
Tokentr
Token ID2213
Feature activation+4.97e+00

Pos logit contributions
oys+0.049
fs+0.047
 handshake+0.043
kin+0.043
 compartment+0.043
Neg logit contributions
please-0.097
 gor-0.092
 chickens-0.091
 ge-0.088
 pigs-0.086
Tokenay
Token ID323
Feature activation+3.29e+00

Pos logit contributions
oys+0.099
chery+0.098
 seek+0.095
 handshake+0.094
fs+0.094
Neg logit contributions
please-0.171
 gor-0.159
 bored-0.158
 North-0.156
der-0.151
Token and
Token ID290
Feature activation+4.77e-01

Pos logit contributions
oys+0.091
fs+0.086
 handshake+0.085
 compartment+0.084
kin+0.083
Neg logit contributions
please-0.099
 chickens-0.091
 gor-0.090
 ge-0.086
 pigs-0.084
Token it
Token ID340
Feature activation+9.27e-02

Pos logit contributions
oys+0.010
fs+0.010
 compartment+0.010
kin+0.009
 globe+0.009
Neg logit contributions
please-0.018
 gor-0.017
 chickens-0.016
 ge-0.016
 pigs-0.015
Token broke
Token ID6265
Feature activation-1.27e-01

Pos logit contributions
fs+0.002
oys+0.002
 compartment+0.002
kin+0.002
 box+0.002
Neg logit contributions
please-0.003
 gor-0.003
 chickens-0.003
 ge-0.003
 North-0.003
Token into
Token ID656
Feature activation+3.81e-02

Pos logit contributions
please+0.005
 gor+0.004
 chickens+0.004
 ge+0.004
 North+0.004
Neg logit contributions
oys-0.003
 compartment-0.003
fs-0.003
 globe-0.003
kin-0.003
Token picked
Token ID6497
Feature activation+2.42e-02

Pos logit contributions
oys+0.012
fs+0.011
 compartment+0.011
 handshake+0.011
lf+0.011
Neg logit contributions
please-0.022
 gor-0.021
 chickens-0.020
 ge-0.020
 pigs-0.020
Token up
Token ID510
Feature activation+4.03e-01

Pos logit contributions
oys+0.001
fs+0.001
 compartment+0.001
kin+0.001
 handshake+0.001
Neg logit contributions
please-0.001
 chickens-0.001
 gor-0.001
 ge-0.001
 pigs-0.001
Token her
Token ID607
Feature activation+8.20e-01

Pos logit contributions
oys+0.014
fs+0.013
 globe+0.013
 compartment+0.013
kin+0.012
Neg logit contributions
please-0.010
 chickens-0.010
 gor-0.009
 ge-0.009
 bored-0.009
Token pink
Token ID11398
Feature activation+2.05e+00

Pos logit contributions
oys+0.022
fs+0.020
 handshake+0.018
 compartment+0.018
kin+0.018
Neg logit contributions
please-0.027
 chickens-0.026
 gor-0.025
 ge-0.025
 pigs-0.024
Token cr
Token ID1067
Feature activation+2.49e+00

Pos logit contributions
oys+0.052
fs+0.048
 compartment+0.044
kin+0.043
 handshake+0.043
Neg logit contributions
please-0.067
 chickens-0.066
 gor-0.063
 pigs-0.062
 ge-0.062
Tokenay
Token ID323
Feature activation+4.80e+00

Pos logit contributions
 seek+0.051
oys+0.051
 handshake+0.051
fs+0.048
 locker+0.047
Neg logit contributions
please-0.084
 North-0.073
 chickens-0.072
der-0.072
 gor-0.071
Tokenon
Token ID261
Feature activation+1.07e+00

Pos logit contributions
 locker+0.129
 seek+0.127
 handshake+0.126
 compartment+0.125
oys+0.122
Neg logit contributions
please-0.132
 chickens-0.116
 tur-0.110
 gor-0.109
 onions-0.108
Token,
Token ID11
Feature activation-2.16e-01

Pos logit contributions
oys+0.036
fs+0.033
 handshake+0.033
chery+0.031
han+0.031
Neg logit contributions
please-0.024
 chickens-0.023
 gor-0.021
 pigs-0.021
 bored-0.021
Token and
Token ID290
Feature activation+3.75e-01

Pos logit contributions
please+0.007
 gor+0.007
 chickens+0.007
 ge+0.007
 North+0.006
Neg logit contributions
oys-0.005
fs-0.005
 compartment-0.005
 globe-0.005
kin-0.005
Token started
Token ID2067
Feature activation-4.40e-01

Pos logit contributions
oys+0.010
fs+0.010
 compartment+0.010
kin+0.009
 globe+0.009
Neg logit contributions
please-0.012
 gor-0.011
 chickens-0.011
 ge-0.010
 pigs-0.010
Token to
Token ID284
Feature activation-1.11e-01

Pos logit contributions
please+0.015
 gor+0.015
 ge+0.014
 chickens+0.014
 North+0.014
Neg logit contributions
oys-0.009
box-0.009
 box-0.009
 compartment-0.009
lf-0.009
Token cleaned
Token ID20750
Feature activation+4.36e-02

Pos logit contributions
oys+0.039
fs+0.038
 globe+0.036
 handshake+0.036
 compartment+0.036
Neg logit contributions
please-0.036
 chickens-0.033
 ge-0.031
 gor-0.031
 bored-0.030
Token up
Token ID510
Feature activation+5.86e-01

Pos logit contributions
fs+0.001
oys+0.001
 handshake+0.001
chery+0.001
kin+0.001
Neg logit contributions
please-0.001
 chickens-0.001
 gor-0.001
 bored-0.001
 ge-0.001
Token the
Token ID262
Feature activation+1.84e+00

Pos logit contributions
oys+0.020
fs+0.020
 handshake+0.019
 compartment+0.018
 globe+0.018
Neg logit contributions
please-0.014
 chickens-0.013
 gor-0.013
 ge-0.013
 bored-0.012
Token broken
Token ID5445
Feature activation+1.73e+00

Pos logit contributions
oys+0.059
fs+0.058
kin+0.055
 handshake+0.055
 compartment+0.054
Neg logit contributions
please-0.048
 gor-0.044
 chickens-0.044
 ge-0.042
 pigs-0.040
Token ash
Token ID12530
Feature activation+2.85e+00

Pos logit contributions
oys+0.050
fs+0.048
kin+0.046
 handshake+0.045
 compartment+0.045
Neg logit contributions
please-0.051
 gor-0.047
 chickens-0.047
 ge-0.046
 pigs-0.044
Tokentr
Token ID2213
Feature activation+4.76e+00

Pos logit contributions
fs+0.062
oys+0.062
chery+0.059
kin+0.058
 handshake+0.058
Neg logit contributions
please-0.097
 gor-0.093
 chickens-0.090
 North-0.086
 pigs-0.086
Tokenay
Token ID323
Feature activation+3.66e+00

Pos logit contributions
chery+0.095
 seek+0.093
 handshake+0.089
oys+0.089
fs+0.087
Neg logit contributions
please-0.162
 gor-0.153
 bored-0.150
der-0.149
 North-0.148
Token and
Token ID290
Feature activation+3.16e-01

Pos logit contributions
 handshake+0.114
oys+0.113
 locker+0.112
 seek+0.110
 compartment+0.110
Neg logit contributions
please-0.087
 chickens-0.081
 gor-0.079
 bored-0.075
 pigs-0.073
Token Lily
Token ID20037
Feature activation+1.10e-01

Pos logit contributions
oys+0.010
fs+0.010
 globe+0.009
 handshake+0.009
kin+0.009
Neg logit contributions
please-0.009
 chickens-0.008
 ge-0.008
 bored-0.007
 gor-0.007
Token learned
Token ID4499
Feature activation+6.43e-02

Pos logit contributions
oys+0.003
fs+0.003
 compartment+0.002
kin+0.002
 globe+0.002
Neg logit contributions
please-0.004
 gor-0.003
 chickens-0.003
 ge-0.003
 pigs-0.003
Token to
Token ID284
Feature activation+6.82e-01

Pos logit contributions
oys+0.002
fs+0.002
kin+0.002
 compartment+0.002
 globe+0.002
Neg logit contributions
please-0.002
 chickens-0.002
 gor-0.002
 ge-0.002
 bored-0.002
Token apologized
Token ID22893
Feature activation-4.56e-01

Pos logit contributions
oys+0.018
 box+0.016
 globe+0.016
lf+0.016
 compartment+0.016
Neg logit contributions
please-0.041
 gor-0.040
 ge-0.038
 chickens-0.038
 North-0.037
Token for
Token ID329
Feature activation-7.40e-01

Pos logit contributions
please+0.014
 gor+0.013
 chickens+0.013
 ge+0.012
 pigs+0.012
Neg logit contributions
oys-0.013
fs-0.012
 compartment-0.012
 globe-0.012
kin-0.012
Token breaking
Token ID7163
Feature activation+2.10e-01

Pos logit contributions
please+0.019
 gor+0.017
 chickens+0.017
 ge+0.016
 North+0.016
Neg logit contributions
oys-0.025
fs-0.023
kin-0.023
 compartment-0.022
 handshake-0.022
Token the
Token ID262
Feature activation+8.17e-01

Pos logit contributions
oys+0.006
fs+0.006
 compartment+0.006
 handshake+0.006
kin+0.006
Neg logit contributions
please-0.006
 chickens-0.006
 gor-0.006
 ge-0.006
 pigs-0.005
Token cr
Token ID1067
Feature activation+2.82e+00

Pos logit contributions
oys+0.018
fs+0.017
 globe+0.016
 compartment+0.016
 handshake+0.016
Neg logit contributions
please-0.030
 gor-0.028
 chickens-0.027
 ge-0.027
 bored-0.026
Tokenay
Token ID323
Feature activation+4.73e+00

Pos logit contributions
oys+0.048
 handshake+0.046
fs+0.046
 seek+0.046
 globe+0.045
Neg logit contributions
please-0.108
 chickens-0.097
 North-0.095
 gor-0.095
der-0.093
Tokenon
Token ID261
Feature activation+1.39e+00

Pos logit contributions
suit+0.127
 locker+0.124
 seek+0.124
 compartment+0.119
 handshake+0.119
Neg logit contributions
please-0.134
opl-0.117
 chickens-0.112
 North-0.110
 tur-0.110
Token and
Token ID290
Feature activation+2.53e-01

Pos logit contributions
oys+0.040
fs+0.038
 handshake+0.037
 compartment+0.036
 globe+0.035
Neg logit contributions
please-0.039
 chickens-0.036
 gor-0.036
 bored-0.035
 ge-0.034
Token promised
Token ID8072
Feature activation+9.30e-01

Pos logit contributions
oys+0.007
fs+0.007
 compartment+0.006
 handshake+0.006
kin+0.006
Neg logit contributions
please-0.008
 gor-0.007
 chickens-0.007
 ge-0.007
 pigs-0.007
Token to
Token ID284
Feature activation-2.84e-01

Pos logit contributions
fs+0.030
oys+0.029
 globe+0.029
 compartment+0.029
kin+0.029
Neg logit contributions
please-0.024
 gor-0.021
 chickens-0.021
 ge-0.021
 bored-0.019
Token be
Token ID307
Feature activation+5.43e-01

Pos logit contributions
please+0.009
 gor+0.009
 chickens+0.009
 ge+0.008
 pigs+0.008
Neg logit contributions
oys-0.008
fs-0.007
 handshake-0.007
 compartment-0.007
kin-0.007
Token throw
Token ID3714
Feature activation-1.39e-01

Pos logit contributions
oys+0.046
fs+0.045
 compartment+0.042
 globe+0.042
kin+0.041
Neg logit contributions
please-0.045
 gor-0.041
 chickens-0.041
 ge-0.040
 pigs-0.038
Token away
Token ID1497
Feature activation+4.49e-01

Pos logit contributions
please+0.004
 chickens+0.004
 gor+0.004
 ge+0.004
 pigs+0.003
Neg logit contributions
oys-0.004
fs-0.004
kin-0.004
 handshake-0.004
lf-0.004
Token the
Token ID262
Feature activation+2.39e+00

Pos logit contributions
oys+0.014
fs+0.013
kin+0.013
 compartment+0.013
 handshake+0.012
Neg logit contributions
please-0.012
 gor-0.012
 chickens-0.012
 ge-0.011
 bored-0.011
Token broken
Token ID5445
Feature activation+1.48e+00

Pos logit contributions
oys+0.079
fs+0.077
kin+0.074
 handshake+0.073
 compartment+0.072
Neg logit contributions
please-0.061
 gor-0.057
 chickens-0.056
 ge-0.054
 pigs-0.052
Token ash
Token ID12530
Feature activation+2.71e+00

Pos logit contributions
oys+0.040
fs+0.037
 compartment+0.037
 globe+0.036
 handshake+0.036
Neg logit contributions
please-0.048
 chickens-0.044
 gor-0.043
 ge-0.042
 bored-0.040
Tokentr
Token ID2213
Feature activation+4.71e+00

Pos logit contributions
oys+0.053
fs+0.050
kin+0.046
 handshake+0.046
 compartment+0.046
Neg logit contributions
please-0.105
 gor-0.099
 chickens-0.098
 ge-0.094
 pigs-0.093
Tokenay
Token ID323
Feature activation+3.15e+00

Pos logit contributions
chery+0.092
oys+0.091
 seek+0.088
 handshake+0.088
fs+0.087
Neg logit contributions
please-0.162
 gor-0.152
 North-0.147
 bored-0.147
der-0.143
Token and
Token ID290
Feature activation-6.62e-02

Pos logit contributions
oys+0.091
 handshake+0.085
 compartment+0.085
fs+0.084
 locker+0.082
Neg logit contributions
please-0.086
 gor-0.081
 chickens-0.081
 ge-0.076
 pigs-0.074
Token soon
Token ID2582
Feature activation-1.39e+00

Pos logit contributions
please+0.002
 gor+0.002
 chickens+0.002
 ge+0.002
 pigs+0.002
Neg logit contributions
oys-0.002
fs-0.002
 compartment-0.002
 globe-0.002
 handshake-0.002
Token forgotten
Token ID11564
Feature activation-1.91e+00

Pos logit contributions
please+0.050
 gor+0.044
 chickens+0.043
 North+0.041
 pigs+0.041
Neg logit contributions
oys-0.034
fs-0.031
 handshake-0.031
 compartment-0.031
kin-0.030
Token about
Token ID546
Feature activation+6.71e-02

Pos logit contributions
please+0.070
 gor+0.063
 chickens+0.061
 ge+0.059
 North+0.059
Neg logit contributions
oys-0.047
fs-0.043
 compartment-0.042
 handshake-0.042
kin-0.041
Token.
Token ID13
Feature activation-1.82e-01

Pos logit contributions
please+0.015
 ge+0.014
 chickens+0.013
 gor+0.013
 pigs+0.013
Neg logit contributions
oys-0.009
fs-0.009
 handshake-0.008
 compartment-0.008
 box-0.008
Token She
Token ID1375
Feature activation+5.37e-01

Pos logit contributions
please+0.005
 gor+0.005
 chickens+0.005
 ge+0.005
 pigs+0.004
Neg logit contributions
oys-0.006
fs-0.005
 compartment-0.005
 handshake-0.005
 globe-0.005
Token writes
Token ID6797
Feature activation+1.37e+00

Pos logit contributions
 handshake+0.011
 compartment+0.011
oys+0.011
 box+0.011
 globe+0.010
Neg logit contributions
please-0.020
 chickens-0.018
 ge-0.017
 gor-0.017
 pigs-0.017
Token with
Token ID351
Feature activation+1.20e+00

Pos logit contributions
oys+0.037
fs+0.036
 compartment+0.034
 globe+0.034
 handshake+0.034
Neg logit contributions
please-0.044
 gor-0.040
 chickens-0.040
 ge-0.039
 pigs-0.037
Token cr
Token ID1067
Feature activation+2.61e+00

Pos logit contributions
oys+0.034
fs+0.032
 compartment+0.031
 globe+0.030
kin+0.030
Neg logit contributions
please-0.037
 chickens-0.034
 gor-0.034
 ge-0.033
 pigs-0.032
Tokenay
Token ID323
Feature activation+4.68e+00

Pos logit contributions
oys+0.039
fs+0.037
 globe+0.036
 compartment+0.035
kin+0.035
Neg logit contributions
please-0.110
 chickens-0.103
 gor-0.102
 ge-0.098
 North-0.098
Tokenons
Token ID684
Feature activation+6.71e-01

Pos logit contributions
oys+0.092
 compartment+0.091
kin+0.086
 globe+0.086
 seek+0.083
Neg logit contributions
please-0.176
 chickens-0.168
 gor-0.162
 ge-0.159
 pigs-0.155
Token,
Token ID11
Feature activation+3.53e-01

Pos logit contributions
oys+0.018
fs+0.017
 compartment+0.016
 handshake+0.016
 globe+0.016
Neg logit contributions
please-0.021
 chickens-0.020
 gor-0.019
 ge-0.019
 bored-0.018
Token pens
Token ID29707
Feature activation+3.08e-01

Pos logit contributions
oys+0.008
fs+0.008
 compartment+0.008
 globe+0.007
kin+0.007
Neg logit contributions
please-0.012
 chickens-0.012
 gor-0.011
 ge-0.011
 pigs-0.011
Token,
Token ID11
Feature activation+8.78e-01

Pos logit contributions
oys+0.007
fs+0.007
 compartment+0.007
 globe+0.007
 handshake+0.007
Neg logit contributions
please-0.011
 chickens-0.010
 gor-0.010
 ge-0.009
 pigs-0.009
Token pencil
Token ID21613
Feature activation+1.66e-01

Pos logit contributions
oys+0.022
fs+0.020
 compartment+0.020
 handshake+0.019
 globe+0.019
Neg logit contributions
please-0.030
 chickens-0.029
 gor-0.028
 ge-0.027
 pigs-0.027
Token
Token ID198
Feature activation+4.13e-01

Pos logit contributions
oys+0.006
fs+0.005
 compartment+0.005
kin+0.005
 globe+0.005
Neg logit contributions
please-0.005
 gor-0.005
 chickens-0.005
 ge-0.005
 North-0.004
TokenTim
Token ID14967
Feature activation+1.51e+00

Pos logit contributions
 compartment+0.013
oys+0.012
fs+0.012
 globe+0.012
 handshake+0.012
Neg logit contributions
please-0.012
 gor-0.010
 ge-0.010
 chickens-0.010
 pasture-0.009
Token used
Token ID973
Feature activation+1.08e-01

Pos logit contributions
oys+0.031
 box+0.028
 belt+0.027
lf+0.026
 compartment+0.026
Neg logit contributions
please-0.054
 gor-0.054
 chickens-0.051
 North-0.050
 ge-0.050
Token his
Token ID465
Feature activation+2.10e+00

Pos logit contributions
oys+0.003
fs+0.003
 compartment+0.003
 globe+0.003
kin+0.003
Neg logit contributions
please-0.003
 gor-0.003
 chickens-0.003
 ge-0.003
 pigs-0.003
Token cr
Token ID1067
Feature activation+2.77e+00

Pos logit contributions
oys+0.061
fs+0.054
 compartment+0.052
 handshake+0.050
 globe+0.049
Neg logit contributions
please-0.063
 chickens-0.062
 gor-0.060
 ge-0.059
 bored-0.058
Tokenay
Token ID323
Feature activation+4.68e+00

Pos logit contributions
oys+0.050
 globe+0.048
fs+0.047
 seek+0.047
 compartment+0.046
Neg logit contributions
please-0.108
 chickens-0.096
 gor-0.094
 North-0.092
 pigs-0.091
Tokenons
Token ID684
Feature activation+3.95e-01

Pos logit contributions
 compartment+0.107
suit+0.105
oys+0.103
 locker+0.102
belt+0.101
Neg logit contributions
please-0.165
 chickens-0.146
 gor-0.137
 ge-0.137
 tur-0.134
Token to
Token ID284
Feature activation+8.22e-01

Pos logit contributions
oys+0.013
fs+0.012
 compartment+0.012
 globe+0.012
 handshake+0.011
Neg logit contributions
please-0.011
 chickens-0.010
 gor-0.009
 ge-0.009
 bored-0.009
Token draw
Token ID3197
Feature activation+4.30e-01

Pos logit contributions
oys+0.020
fs+0.019
 compartment+0.018
kin+0.018
 handshake+0.018
Neg logit contributions
please-0.029
 gor-0.027
 chickens-0.027
 ge-0.026
 pigs-0.025
Token on
Token ID319
Feature activation+9.28e-01

Pos logit contributions
oys+0.011
fs+0.010
 compartment+0.010
 globe+0.010
 handshake+0.010
Neg logit contributions
please-0.014
 gor-0.013
 chickens-0.013
 ge-0.013
 pigs-0.012
Token the
Token ID262
Feature activation+7.83e-01

Pos logit contributions
oys+0.022
 globe+0.022
 compartment+0.021
 box+0.021
fs+0.021
Neg logit contributions
please-0.033
 chickens-0.029
 ge-0.028
 gor-0.028
 bored-0.027
Token.
Token ID13
Feature activation+2.66e-01

Pos logit contributions
oys+0.031
fs+0.029
 handshake+0.028
kin+0.028
 globe+0.028
Neg logit contributions
please-0.024
 gor-0.022
 chickens-0.022
 bored-0.021
 North-0.020
Token She
Token ID1375
Feature activation+8.51e-01

Pos logit contributions
oys+0.007
fs+0.007
kin+0.006
 compartment+0.006
 globe+0.006
Neg logit contributions
please-0.009
 gor-0.008
 chickens-0.008
 ge-0.008
 bored-0.007
Token grabbed
Token ID13176
Feature activation+2.68e-01

Pos logit contributions
oys+0.022
fs+0.020
 compartment+0.020
kin+0.019
 handshake+0.019
Neg logit contributions
please-0.028
 gor-0.026
 chickens-0.026
 ge-0.025
 pigs-0.024
Token her
Token ID607
Feature activation+2.18e+00

Pos logit contributions
oys+0.010
 handshake+0.009
kin+0.009
fs+0.009
belt+0.008
Neg logit contributions
 chickens-0.006
please-0.006
 gor-0.006
 bored-0.006
 pigs-0.005
Token cr
Token ID1067
Feature activation+2.59e+00

Pos logit contributions
oys+0.066
fs+0.057
 handshake+0.056
chery+0.053
eway+0.053
Neg logit contributions
 chickens-0.060
please-0.060
 bored-0.059
 gor-0.056
 pigs-0.054
Tokenay
Token ID323
Feature activation+4.67e+00

Pos logit contributions
 globe+0.053
 handshake+0.053
oys+0.053
 seek+0.053
kin+0.050
Neg logit contributions
please-0.090
 chickens-0.076
 gor-0.076
 North-0.076
der-0.073
Tokenons
Token ID684
Feature activation+5.84e-01

Pos logit contributions
 seek+0.148
 compartment+0.142
suit+0.141
 handshake+0.141
acle+0.138
Neg logit contributions
please-0.111
 tur-0.092
 chickens-0.092
 going-0.087
opl-0.087
Token and
Token ID290
Feature activation+9.65e-01

Pos logit contributions
oys+0.020
fs+0.020
 handshake+0.019
 globe+0.018
kin+0.018
Neg logit contributions
please-0.012
 chickens-0.012
 bored-0.012
 pigs-0.011
 ge-0.010
Token started
Token ID2067
Feature activation-4.80e-01

Pos logit contributions
oys+0.030
fs+0.029
 globe+0.028
 handshake+0.028
 compartment+0.027
Neg logit contributions
please-0.027
 chickens-0.025
 gor-0.024
 ge-0.023
 pigs-0.022
Token to
Token ID284
Feature activation+2.14e-01

Pos logit contributions
please+0.016
 gor+0.015
 chickens+0.015
 ge+0.015
 North+0.014
Neg logit contributions
oys-0.011
fs-0.011
 compartment-0.010
box-0.010
 handshake-0.010
Token fill
Token ID6070
Feature activation-1.45e-02

Pos logit contributions
oys+0.004
fs+0.004
 compartment+0.004
 globe+0.004
 handshake+0.004
Neg logit contributions
please-0.008
 chickens-0.008
 gor-0.008
 ge-0.008
 pigs-0.007
Token would
Token ID561
Feature activation-4.54e-01

Pos logit contributions
please+0.025
 ge+0.024
 gor+0.024
 chickens+0.024
 pigs+0.023
Neg logit contributions
 box-0.019
lf-0.019
oys-0.019
 belt-0.018
box-0.018
Token take
Token ID1011
Feature activation-1.02e-02

Pos logit contributions
please+0.017
 ge+0.016
 chickens+0.016
 gor+0.016
 pigs+0.015
Neg logit contributions
oys-0.009
lf-0.008
 box-0.008
 globe-0.008
fs-0.008
Token out
Token ID503
Feature activation+6.40e-02

Pos logit contributions
please+0.000
 gor+0.000
 chickens+0.000
 ge+0.000
 pigs+0.000
Neg logit contributions
oys-0.000
fs-0.000
 compartment-0.000
 globe-0.000
 handshake-0.000
Token a
Token ID257
Feature activation+5.92e-01

Pos logit contributions
oys+0.002
fs+0.002
 compartment+0.002
 globe+0.002
kin+0.002
Neg logit contributions
please-0.002
 chickens-0.002
 gor-0.002
 ge-0.002
 pigs-0.002
Token j
Token ID474
Feature activation+3.34e-02

Pos logit contributions
oys+0.016
fs+0.014
kin+0.013
 handshake+0.013
 compartment+0.013
Neg logit contributions
please-0.019
 gor-0.018
 chickens-0.018
 bored-0.018
 pigs-0.017
Tokenigsaw
Token ID45636
Feature activation+4.66e+00

Pos logit contributions
oys+0.001
 globe+0.001
fs+0.001
 compartment+0.001
cher+0.001
Neg logit contributions
please-0.001
 colony-0.001
 bored-0.001
 gor-0.001
 chickens-0.001
Token puzzle
Token ID15027
Feature activation+6.85e-01

Pos logit contributions
oys+0.132
 handshake+0.116
fs+0.115
kin+0.112
belt+0.111
Neg logit contributions
 gor-0.129
 chickens-0.124
please-0.124
 bored-0.114
 tur-0.113
Token and
Token ID290
Feature activation-2.85e-01

Pos logit contributions
oys+0.021
fs+0.020
 globe+0.018
 handshake+0.018
kin+0.018
Neg logit contributions
please-0.019
 chickens-0.018
 gor-0.018
 bored-0.016
 pigs-0.016
Token try
Token ID1949
Feature activation-3.96e-01

Pos logit contributions
please+0.010
 chickens+0.010
 gor+0.009
 ge+0.009
 pigs+0.009
Neg logit contributions
oys-0.006
fs-0.006
 compartment-0.006
 handshake-0.006
 globe-0.006
Token to
Token ID284
Feature activation+1.50e-01

Pos logit contributions
please+0.015
 ge+0.015
 gor+0.014
 chickens+0.014
 pigs+0.013
Neg logit contributions
fs-0.008
 compartment-0.007
 globe-0.007
 box-0.007
oys-0.007
Token solve
Token ID8494
Feature activation-1.40e+00

Pos logit contributions
oys+0.003
fs+0.003
 compartment+0.003
 globe+0.003
 handshake+0.003
Neg logit contributions
please-0.006
 gor-0.005
 chickens-0.005
 ge-0.005
 pigs-0.005
Token,
Token ID11
Feature activation-6.80e-01

Pos logit contributions
please+0.015
 gor+0.014
 chickens+0.014
 ge+0.013
 North+0.013
Neg logit contributions
oys-0.010
fs-0.009
 globe-0.009
 compartment-0.008
kin-0.008
Token it
Token ID340
Feature activation+2.75e-01

Pos logit contributions
please+0.023
 gor+0.020
 chickens+0.020
 ge+0.020
 pigs+0.019
Neg logit contributions
oys-0.017
fs-0.017
 compartment-0.016
 globe-0.016
kin-0.016
Token caused
Token ID4073
Feature activation+3.65e-01

Pos logit contributions
fs+0.006
 compartment+0.006
oys+0.006
kin+0.006
 box+0.006
Neg logit contributions
please-0.010
 gor-0.009
 chickens-0.009
 ge-0.008
 pigs-0.008
Token the
Token ID262
Feature activation+3.07e-01

Pos logit contributions
oys+0.009
fs+0.008
 compartment+0.007
 handshake+0.007
 globe+0.007
Neg logit contributions
please-0.013
 chickens-0.012
 gor-0.012
 ge-0.011
 bored-0.011
Token ash
Token ID12530
Feature activation+5.58e-01

Pos logit contributions
oys+0.009
fs+0.008
 globe+0.008
 handshake+0.008
kin+0.008
Neg logit contributions
please-0.009
 gor-0.008
 chickens-0.008
 ge-0.008
 North-0.008
Tokentr
Token ID2213
Feature activation+4.66e+00

Pos logit contributions
oys+0.013
fs+0.012
 globe+0.012
 compartment+0.012
 box+0.012
Neg logit contributions
please-0.020
 gor-0.018
 chickens-0.018
 ge-0.018
 pigs-0.017
Tokenay
Token ID323
Feature activation+3.42e+00

Pos logit contributions
oys+0.077
fs+0.071
 handshake+0.068
 globe+0.065
kin+0.065
Neg logit contributions
please-0.188
 gor-0.177
 chickens-0.171
 bored-0.171
 North-0.169
Token to
Token ID284
Feature activation+6.56e-01

Pos logit contributions
oys+0.086
fs+0.081
 compartment+0.076
 globe+0.076
 handshake+0.075
Neg logit contributions
please-0.116
 gor-0.107
 chickens-0.107
 ge-0.104
 pigs-0.100
Token scatter
Token ID41058
Feature activation-5.54e-01

Pos logit contributions
oys+0.017
kin+0.016
fs+0.016
lf+0.016
 globe+0.016
Neg logit contributions
please-0.022
 chickens-0.020
 gor-0.019
 bored-0.019
 ge-0.019
Token all
Token ID477
Feature activation+1.04e+00

Pos logit contributions
please+0.020
 gor+0.018
 chickens+0.017
 ge+0.017
 pigs+0.016
Neg logit contributions
oys-0.014
fs-0.013
 compartment-0.013
 globe-0.012
 box-0.012
Token over
Token ID625
Feature activation+2.60e-01

Pos logit contributions
oys+0.027
fs+0.025
 compartment+0.025
 globe+0.025
 handshake+0.024
Neg logit contributions
please-0.035
 chickens-0.033
 gor-0.032
 ge-0.031
 pigs-0.030
Token.
Token ID13
Feature activation-4.98e-01

Pos logit contributions
please+0.028
 gor+0.026
 chickens+0.025
 ge+0.025
 North+0.024
Neg logit contributions
oys-0.020
fs-0.019
 compartment-0.018
 globe-0.018
 handshake-0.018
Token She
Token ID1375
Feature activation+1.47e-01

Pos logit contributions
please+0.015
 gor+0.013
 chickens+0.013
 ge+0.013
 North+0.012
Neg logit contributions
oys-0.015
 compartment-0.014
 handshake-0.013
fs-0.013
 globe-0.013
Token brought
Token ID3181
Feature activation+8.49e-01

Pos logit contributions
oys+0.004
fs+0.004
 compartment+0.004
 handshake+0.004
kin+0.004
Neg logit contributions
please-0.004
 gor-0.004
 chickens-0.004
 ge-0.004
 pigs-0.004
Token her
Token ID607
Feature activation+9.27e-01

Pos logit contributions
oys+0.026
fs+0.025
 compartment+0.024
kin+0.024
 globe+0.024
Neg logit contributions
please-0.024
 chickens-0.022
 gor-0.022
 ge-0.021
 pigs-0.021
Token cr
Token ID1067
Feature activation+2.50e+00

Pos logit contributions
oys+0.023
fs+0.022
 compartment+0.021
 globe+0.021
 handshake+0.021
Neg logit contributions
please-0.031
 gor-0.029
 chickens-0.029
 ge-0.028
 North-0.027
Tokenay
Token ID323
Feature activation+4.66e+00

Pos logit contributions
oys+0.048
 compartment+0.047
fs+0.046
 globe+0.046
 seek+0.046
Neg logit contributions
please-0.092
 chickens-0.081
 gor-0.080
 North-0.078
 pigs-0.077
Tokenons
Token ID684
Feature activation+2.93e-01

Pos logit contributions
 compartment+0.132
belt+0.125
 seek+0.123
suit+0.122
 locker+0.120
Neg logit contributions
please-0.142
 chickens-0.127
 ge-0.121
 tur-0.118
 gor-0.115
Token and
Token ID290
Feature activation+8.01e-01

Pos logit contributions
fs+0.010
oys+0.009
 compartment+0.009
lf+0.009
 handshake+0.009
Neg logit contributions
please-0.007
 chickens-0.007
 ge-0.006
 gor-0.006
 pigs-0.006
Token a
Token ID257
Feature activation+9.36e-01

Pos logit contributions
oys+0.019
fs+0.018
 compartment+0.017
 handshake+0.017
kin+0.017
Neg logit contributions
please-0.028
 gor-0.026
 chickens-0.026
 ge-0.025
 North-0.024
Token big
Token ID1263
Feature activation+1.02e+00

Pos logit contributions
oys+0.022
fs+0.021
 compartment+0.020
 globe+0.019
 handshake+0.019
Neg logit contributions
please-0.033
 chickens-0.031
 gor-0.031
 ge-0.030
 pigs-0.029
Token paper
Token ID3348
Feature activation+9.12e-01

Pos logit contributions
oys+0.024
fs+0.023
 compartment+0.021
 handshake+0.020
kin+0.020
Neg logit contributions
please-0.036
 chickens-0.034
 gor-0.034
 ge-0.033
 pigs-0.032
Token.
Token ID13
Feature activation-2.53e-01

Pos logit contributions
please+0.001
 chickens+0.001
 gor+0.001
 ge+0.001
 North+0.001
Neg logit contributions
oys-0.001
 compartment-0.001
 handshake-0.001
kin-0.001
 globe-0.001
Token They
Token ID1119
Feature activation-2.25e-01

Pos logit contributions
please+0.009
 chickens+0.008
 gor+0.008
 ge+0.008
 North+0.007
Neg logit contributions
oys-0.006
fs-0.006
 globe-0.005
 handshake-0.005
 compartment-0.005
Token have
Token ID423
Feature activation+5.41e-01

Pos logit contributions
please+0.008
 gor+0.007
 chickens+0.007
 North+0.007
 pigs+0.007
Neg logit contributions
 box-0.005
 handshake-0.005
oys-0.004
kin-0.004
fs-0.004
Token many
Token ID867
Feature activation+6.50e-01

Pos logit contributions
oys+0.010
box+0.009
 box+0.009
 handshake+0.009
 card+0.009
Neg logit contributions
please-0.021
 gor-0.020
 North-0.019
 pasture-0.019
 chickens-0.018
Token cr
Token ID1067
Feature activation+2.97e+00

Pos logit contributions
oys+0.013
fs+0.013
 compartment+0.012
 handshake+0.012
 globe+0.011
Neg logit contributions
please-0.025
 gor-0.023
 chickens-0.023
 ge-0.022
 North-0.022
Tokenay
Token ID323
Feature activation+4.63e+00

Pos logit contributions
oys+0.048
 globe+0.047
 compartment+0.047
fs+0.046
 handshake+0.045
Neg logit contributions
please-0.120
 chickens-0.109
 gor-0.107
 pigs-0.103
 North-0.103
Tokenons
Token ID684
Feature activation+9.28e-01

Pos logit contributions
 compartment+0.092
oys+0.091
kin+0.086
 globe+0.086
fs+0.084
Neg logit contributions
please-0.176
 chickens-0.162
 gor-0.157
 ge-0.155
 pigs-0.151
Token and
Token ID290
Feature activation+6.01e-01

Pos logit contributions
oys+0.025
fs+0.024
 compartment+0.023
 handshake+0.022
 globe+0.022
Neg logit contributions
please-0.030
 chickens-0.027
 gor-0.027
 ge-0.026
 pigs-0.025
Token papers
Token ID9473
Feature activation+3.63e-02

Pos logit contributions
oys+0.012
fs+0.011
 handshake+0.011
 compartment+0.011
kin+0.011
Neg logit contributions
please-0.023
 gor-0.022
 chickens-0.021
 ge-0.021
 North-0.020
Token.
Token ID13
Feature activation-3.63e-01

Pos logit contributions
oys+0.001
fs+0.001
 compartment+0.001
 handshake+0.001
 globe+0.001
Neg logit contributions
please-0.001
 gor-0.001
 chickens-0.001
 ge-0.001
 pigs-0.001
Token They
Token ID1119
Feature activation+5.69e-01

Pos logit contributions
please+0.012
 gor+0.010
 chickens+0.010
 ge+0.010
 bored+0.010
Neg logit contributions
oys-0.010
fs-0.009
 compartment-0.009
 handshake-0.009
 globe-0.009
Token draw
Token ID3197
Feature activation+2.95e-01

Pos logit contributions
please+0.034
 chickens+0.033
 gor+0.033
 ge+0.031
 pigs+0.031
Neg logit contributions
 globe-0.016
 handshake-0.016
oys-0.015
 box-0.015
 seek-0.015
Token and
Token ID290
Feature activation-4.05e-01

Pos logit contributions
oys+0.008
fs+0.008
 compartment+0.007
kin+0.007
 handshake+0.007
Neg logit contributions
please-0.009
 chickens-0.009
 gor-0.009
 ge-0.008
 pigs-0.008
Token shared
Token ID4888
Feature activation-1.44e+00

Pos logit contributions
please+0.013
 gor+0.012
 chickens+0.012
 ge+0.012
 pigs+0.011
Neg logit contributions
oys-0.011
fs-0.010
 compartment-0.010
kin-0.009
 globe-0.009
Token the
Token ID262
Feature activation+2.08e+00

Pos logit contributions
please+0.041
 gor+0.038
 chickens+0.038
 ge+0.037
 pigs+0.035
Neg logit contributions
oys-0.044
fs-0.042
 compartment-0.040
kin-0.040
 globe-0.039
Token cr
Token ID1067
Feature activation+2.07e+00

Pos logit contributions
fs+0.063
oys+0.061
belt+0.060
 compartment+0.057
kin+0.056
Neg logit contributions
please-0.059
 chickens-0.057
 ge-0.055
 bored-0.053
 pigs-0.052
Tokenay
Token ID323
Feature activation+4.62e+00

Pos logit contributions
 compartment+0.044
 locker+0.043
oys+0.042
 handshake+0.042
fs+0.042
Neg logit contributions
please-0.071
 chickens-0.062
 gor-0.062
 North-0.060
 pigs-0.059
Tokenons
Token ID684
Feature activation+1.85e-01

Pos logit contributions
belt+0.147
 compartment+0.141
suit+0.140
 locker+0.137
 torch+0.131
Neg logit contributions
please-0.119
 tur-0.108
 chickens-0.105
 ge-0.104
 bees-0.101
Token.
Token ID13
Feature activation+1.69e-01

Pos logit contributions
oys+0.006
fs+0.005
 compartment+0.005
 handshake+0.005
belt+0.005
Neg logit contributions
 chickens-0.005
please-0.005
 gor-0.005
 bored-0.004
 ge-0.004
Token They
Token ID1119
Feature activation+6.13e-01

Pos logit contributions
oys+0.005
 globe+0.004
 compartment+0.004
fs+0.004
 handshake+0.004
Neg logit contributions
please-0.006
 chickens-0.005
 gor-0.005
 ge-0.005
 pigs-0.005
Token were
Token ID547
Feature activation-3.95e-01

Pos logit contributions
 globe+0.013
kin+0.013
 box+0.013
fs+0.013
 handshake+0.012
Neg logit contributions
please-0.022
 gor-0.020
 chickens-0.020
 pigs-0.019
 North-0.019
Token very
Token ID845
Feature activation-3.78e-01

Pos logit contributions
please+0.013
 gor+0.012
 chickens+0.012
 North+0.011
 ge+0.011
Neg logit contributions
kin-0.009
oys-0.009
 globe-0.009
fs-0.009
 handshake-0.008
Token It
Token ID632
Feature activation+2.03e+00

Pos logit contributions
please+0.039
 gor+0.037
 ge+0.035
 chickens+0.034
 North+0.034
Neg logit contributions
oys-0.028
 Kate-0.028
 compartment-0.027
igsaw-0.026
kin-0.026
Token was
Token ID373
Feature activation+1.30e+00

Pos logit contributions
oys+0.044
fs+0.041
 compartment+0.039
 handshake+0.038
 globe+0.038
Neg logit contributions
please-0.076
 chickens-0.071
 gor-0.071
 ge-0.068
 pigs-0.066
Token a
Token ID257
Feature activation+1.48e+00

Pos logit contributions
oys+0.031
fs+0.030
kin+0.029
 compartment+0.029
 handshake+0.028
Neg logit contributions
please-0.045
 gor-0.041
 chickens-0.040
 ge-0.040
 pigs-0.037
Token big
Token ID1263
Feature activation+1.46e+00

Pos logit contributions
oys+0.042
fs+0.040
 compartment+0.038
 handshake+0.038
 globe+0.038
Neg logit contributions
please-0.045
 gor-0.041
 chickens-0.041
 ge-0.040
 pigs-0.038
Token j
Token ID474
Feature activation+4.68e-01

Pos logit contributions
oys+0.039
fs+0.038
 compartment+0.034
 handshake+0.034
kin+0.034
Neg logit contributions
please-0.047
 chickens-0.044
 gor-0.044
 ge-0.042
 pigs-0.041
Tokenigsaw
Token ID45636
Feature activation+4.62e+00

Pos logit contributions
oys+0.013
fs+0.013
 globe+0.012
 compartment+0.012
 handshake+0.012
Neg logit contributions
please-0.013
 gor-0.012
 chickens-0.012
 bored-0.012
 colony-0.012
Token puzzle
Token ID15027
Feature activation+1.69e+00

Pos logit contributions
oys+0.126
fs+0.108
 handshake+0.108
chery+0.100
lock+0.100
Neg logit contributions
 gor-0.136
please-0.135
 chickens-0.132
 pigs-0.121
 particularly-0.121
Token and
Token ID290
Feature activation+2.42e-01

Pos logit contributions
oys+0.048
fs+0.045
 handshake+0.043
 globe+0.042
 compartment+0.041
Neg logit contributions
please-0.048
 chickens-0.047
 gor-0.046
 bored-0.042
 pigs-0.042
Token she
Token ID673
Feature activation-2.95e-01

Pos logit contributions
oys+0.006
fs+0.006
 compartment+0.005
kin+0.005
 handshake+0.005
Neg logit contributions
please-0.008
 gor-0.008
 chickens-0.008
 ge-0.007
 pigs-0.007
Token wanted
Token ID2227
Feature activation+1.22e+00

Pos logit contributions
please+0.009
 gor+0.009
 chickens+0.009
 ge+0.008
 pigs+0.008
Neg logit contributions
oys-0.008
fs-0.007
 compartment-0.007
kin-0.007
 globe-0.007
Token to
Token ID284
Feature activation+7.74e-01

Pos logit contributions
oys+0.030
fs+0.029
 compartment+0.028
 handshake+0.028
 globe+0.027
Neg logit contributions
please-0.041
 gor-0.038
 chickens-0.038
 ge-0.037
 pigs-0.035
Token.
Token ID13
Feature activation-3.64e-02

Pos logit contributions
oys+0.000
fs+0.000
 compartment+0.000
kin+0.000
 globe+0.000
Neg logit contributions
please-0.000
 gor-0.000
 chickens-0.000
 ge-0.000
 bored-0.000
Token He
Token ID679
Feature activation+4.23e-01

Pos logit contributions
please+0.001
 gor+0.001
 chickens+0.001
 ge+0.001
 pigs+0.001
Neg logit contributions
oys-0.001
fs-0.001
 compartment-0.001
 globe-0.001
 handshake-0.001
Token wants
Token ID3382
Feature activation+1.31e+00

Pos logit contributions
oys+0.010
fs+0.009
 box+0.009
 compartment+0.009
 handshake+0.009
Neg logit contributions
please-0.015
 chickens-0.013
 gor-0.013
 pigs-0.013
 ge-0.013
Token his
Token ID465
Feature activation+1.85e+00

Pos logit contributions
oys+0.035
fs+0.031
 handshake+0.031
 compartment+0.030
 globe+0.030
Neg logit contributions
please-0.042
 gor-0.039
 chickens-0.039
 ge-0.037
 North-0.036
Token cr
Token ID1067
Feature activation+2.89e+00

Pos logit contributions
oys+0.032
fs+0.029
 compartment+0.027
 globe+0.026
 handshake+0.026
Neg logit contributions
please-0.077
 gor-0.073
 chickens-0.073
 ge-0.071
 pigs-0.069
Tokenay
Token ID323
Feature activation+4.61e+00

Pos logit contributions
oys+0.054
 compartment+0.052
fs+0.052
 handshake+0.051
 globe+0.050
Neg logit contributions
please-0.110
 gor-0.094
 chickens-0.094
 North-0.093
 pigs-0.090
Tokenons
Token ID684
Feature activation+9.08e-02

Pos logit contributions
 compartment+0.109
oys+0.102
 locker+0.099
suit+0.098
belt+0.097
Neg logit contributions
please-0.166
 chickens-0.138
 tur-0.135
 gor-0.134
 ge-0.131
Token back
Token ID736
Feature activation+4.93e-01

Pos logit contributions
oys+0.003
fs+0.003
 compartment+0.002
kin+0.002
 handshake+0.002
Neg logit contributions
please-0.003
 gor-0.003
 chickens-0.002
 ge-0.002
 pigs-0.002
Token.
Token ID13
Feature activation-2.23e-01

Pos logit contributions
oys+0.014
fs+0.014
 compartment+0.013
kin+0.013
 globe+0.013
Neg logit contributions
please-0.015
 gor-0.014
 chickens-0.013
 ge-0.013
 pigs-0.012
Token
Token ID198
Feature activation-5.78e-01

Pos logit contributions
please+0.006
 gor+0.005
 chickens+0.005
 ge+0.005
 bored+0.005
Neg logit contributions
fs-0.007
oys-0.007
kin-0.007
 globe-0.007
 compartment-0.007
Token
Token ID198
Feature activation-2.84e-01

Pos logit contributions
please+0.013
 gor+0.011
 ge+0.011
 chickens+0.011
 North+0.011
Neg logit contributions
oys-0.020
fs-0.020
kin-0.020
 globe-0.019
 compartment-0.019
Token to
Token ID284
Feature activation-3.32e-03

Pos logit contributions
please+0.028
 gor+0.026
 chickens+0.026
 ge+0.025
 North+0.024
Neg logit contributions
oys-0.024
fs-0.022
 handshake-0.022
 compartment-0.022
 globe-0.021
Token fight
Token ID1907
Feature activation+1.03e+00

Pos logit contributions
please+0.000
 gor+0.000
 chickens+0.000
 ge+0.000
 North+0.000
Neg logit contributions
oys-0.000
 handshake-0.000
fs-0.000
 compartment-0.000
 globe-0.000
Token over
Token ID625
Feature activation+4.82e-01

Pos logit contributions
fs+0.035
oys+0.034
kin+0.034
 globe+0.033
 compartment+0.033
Neg logit contributions
please-0.024
 bored-0.022
 gor-0.021
 chickens-0.021
 ge-0.020
Token the
Token ID262
Feature activation+7.11e-01

Pos logit contributions
oys+0.015
fs+0.015
 compartment+0.014
kin+0.014
 globe+0.014
Neg logit contributions
please-0.013
 chickens-0.012
 gor-0.012
 ge-0.012
 pigs-0.011
Token cr
Token ID1067
Feature activation+2.53e+00

Pos logit contributions
oys+0.016
fs+0.015
 compartment+0.014
 globe+0.014
 handshake+0.014
Neg logit contributions
please-0.026
 gor-0.024
 chickens-0.024
 ge-0.023
 pigs-0.022
Tokenay
Token ID323
Feature activation+4.61e+00

Pos logit contributions
 seek+0.048
fs+0.047
 globe+0.046
 handshake+0.045
oys+0.045
Neg logit contributions
please-0.088
der-0.079
 chickens-0.079
 gor-0.078
 North-0.078
Tokenons
Token ID684
Feature activation-2.21e-01

Pos logit contributions
suit+0.139
belt+0.138
 seek+0.133
 locker+0.132
 compartment+0.131
Neg logit contributions
please-0.111
opl-0.105
 tur-0.100
vious-0.097
 going-0.096
Token.
Token ID13
Feature activation-1.61e-01

Pos logit contributions
please+0.007
 gor+0.006
 chickens+0.006
 ge+0.006
 North+0.006
Neg logit contributions
oys-0.006
fs-0.006
 compartment-0.006
kin-0.006
 globe-0.005
Token They
Token ID1119
Feature activation+9.11e-01

Pos logit contributions
please+0.006
 chickens+0.005
 gor+0.005
 ge+0.005
 pigs+0.005
Neg logit contributions
oys-0.004
 handshake-0.004
 compartment-0.004
 globe-0.004
fs-0.004
Token yelled
Token ID22760
Feature activation+1.90e-01

Pos logit contributions
oys+0.022
fs+0.021
 compartment+0.020
 handshake+0.019
 globe+0.019
Neg logit contributions
please-0.031
 gor-0.029
 chickens-0.029
 North-0.028
 ge-0.027
Token and
Token ID290
Feature activation+6.88e-01

Pos logit contributions
oys+0.006
fs+0.006
 handshake+0.006
 compartment+0.006
kin+0.006
Neg logit contributions
please-0.005
 gor-0.005
 chickens-0.005
 ge-0.004
 North-0.004
Token the
Token ID262
Feature activation-2.66e+00

Pos logit contributions
oys+0.014
fs+0.013
kin+0.013
 globe+0.013
 box+0.012
Neg logit contributions
please-0.023
 gor-0.020
 chickens-0.020
 North-0.019
 ge-0.019
Token little
Token ID1310
Feature activation-2.02e+00

Pos logit contributions
please+0.061
 gor+0.051
 North+0.048
 bored+0.047
 chickens+0.046
Neg logit contributions
oys-0.092
kin-0.088
 box-0.087
fs-0.087
 note-0.087
Token girl
Token ID2576
Feature activation+3.37e-02

Pos logit contributions
please+0.048
 gor+0.042
 chickens+0.040
 North+0.039
 bored+0.038
Neg logit contributions
oys-0.071
kin-0.065
fs-0.065
 globe-0.065
 handshake-0.064
Token and
Token ID290
Feature activation+2.02e-01

Pos logit contributions
oys+0.001
fs+0.001
 compartment+0.001
 handshake+0.001
kin+0.001
Neg logit contributions
please-0.001
 gor-0.001
 chickens-0.001
 ge-0.001
 pigs-0.001
Token the
Token ID262
Feature activation-2.47e+00

Pos logit contributions
kin+0.004
oys+0.004
lf+0.004
 box+0.004
 globe+0.004
Neg logit contributions
please-0.007
 North-0.006
 gor-0.006
 bored-0.006
act-0.006
Token healthy
Token ID5448
Feature activation-5.02e+00

Pos logit contributions
please+0.057
 gor+0.050
 North+0.046
 bored+0.044
 chickens+0.043
Neg logit contributions
oys-0.083
 box-0.082
fs-0.081
 globe-0.080
 note-0.080
Token goose
Token ID37246
Feature activation-1.42e+00

Pos logit contributions
please+0.104
 gor+0.088
 bored+0.084
 North+0.082
 chickens+0.082
Neg logit contributions
oys-0.188
fs-0.179
kin-0.175
 handshake-0.174
 globe-0.173
Token spun
Token ID26843
Feature activation-2.63e-01

Pos logit contributions
please+0.045
 gor+0.042
 chickens+0.041
 North+0.039
 ge+0.039
Neg logit contributions
fs-0.036
oys-0.036
 handshake-0.036
 box-0.034
 compartment-0.034
Token together
Token ID1978
Feature activation+1.95e-01

Pos logit contributions
please+0.008
 chickens+0.007
 gor+0.007
 ge+0.007
 bored+0.007
Neg logit contributions
oys-0.008
fs-0.007
 compartment-0.007
kin-0.007
 handshake-0.007
Token all
Token ID477
Feature activation+9.60e-01

Pos logit contributions
oys+0.005
fs+0.005
 compartment+0.005
 globe+0.005
 handshake+0.005
Neg logit contributions
please-0.006
 gor-0.006
 chickens-0.006
 ge-0.006
 pigs-0.005
Token around
Token ID1088
Feature activation+5.34e-01

Pos logit contributions
oys+0.026
fs+0.025
 compartment+0.024
 globe+0.024
kin+0.024
Neg logit contributions
please-0.031
 chickens-0.028
 gor-0.028
 ge-0.027
 pigs-0.026
TokenThe
Token ID464
Feature activation-2.18e+00

Pos logit contributions
oys+0.005
fs+0.005
kin+0.004
 globe+0.004
 handshake+0.004
Neg logit contributions
please-0.008
 gor-0.007
 chickens-0.007
 bored-0.007
 ge-0.007
Token little
Token ID1310
Feature activation-1.25e+00

Pos logit contributions
please+0.057
 gor+0.049
 North+0.045
 chickens+0.044
 bored+0.043
Neg logit contributions
oys-0.068
kin-0.067
fs-0.067
 globe-0.067
 box-0.067
Token girl
Token ID2576
Feature activation+7.61e-01

Pos logit contributions
please+0.038
 gor+0.034
 chickens+0.033
 North+0.032
 ge+0.031
Neg logit contributions
oys-0.036
fs-0.033
 globe-0.033
kin-0.033
 compartment-0.032
Token and
Token ID290
Feature activation+1.46e-01

Pos logit contributions
oys+0.015
fs+0.014
 compartment+0.014
 handshake+0.014
kin+0.013
Neg logit contributions
please-0.029
 gor-0.028
 chickens-0.027
 ge-0.026
 pigs-0.026
Token the
Token ID262
Feature activation-2.34e+00

Pos logit contributions
kin+0.003
 globe+0.003
 box+0.003
oys+0.003
fs+0.003
Neg logit contributions
please-0.005
 North-0.005
 bored-0.004
 gor-0.004
der-0.004
Token healthy
Token ID5448
Feature activation-4.81e+00

Pos logit contributions
please+0.055
 gor+0.049
 North+0.045
 chickens+0.043
 harming+0.041
Neg logit contributions
 box-0.077
oys-0.077
fs-0.077
 globe-0.077
kin-0.076
Token goose
Token ID37246
Feature activation-1.01e+00

Pos logit contributions
please+0.102
 gor+0.085
 bored+0.080
 North+0.080
 chickens+0.078
Neg logit contributions
oys-0.179
fs-0.170
 globe-0.165
kin-0.165
 handshake-0.165
Token were
Token ID547
Feature activation-3.39e-01

Pos logit contributions
please+0.034
 gor+0.032
 chickens+0.031
 pigs+0.029
 North+0.029
Neg logit contributions
fs-0.024
oys-0.024
 handshake-0.024
 box-0.023
 globe-0.023
Token both
Token ID1111
Feature activation+1.03e+00

Pos logit contributions
please+0.011
 gor+0.010
 chickens+0.010
 pigs+0.009
 ge+0.009
Neg logit contributions
oys-0.009
kin-0.009
 handshake-0.008
fs-0.008
 compartment-0.008
Token very
Token ID845
Feature activation-2.60e-01

Pos logit contributions
oys+0.029
kin+0.028
 compartment+0.028
 handshake+0.027
 globe+0.027
Neg logit contributions
please-0.032
 gor-0.029
 chickens-0.028
 ge-0.027
 pigs-0.026
Token tired
Token ID10032
Feature activation-9.26e-01

Pos logit contributions
please+0.008
 gor+0.007
 chickens+0.007
 pigs+0.007
 ge+0.007
Neg logit contributions
oys-0.007
fs-0.007
kin-0.007
 handshake-0.007
 compartment-0.007
Token after
Token ID706
Feature activation-3.55e-01

Pos logit contributions
oys+0.003
fs+0.003
 compartment+0.003
 handshake+0.003
kin+0.003
Neg logit contributions
please-0.003
 gor-0.003
 chickens-0.003
 ge-0.003
 bored-0.003
Token that
Token ID326
Feature activation-7.59e-02

Pos logit contributions
please+0.012
 gor+0.010
 chickens+0.010
 ge+0.010
 North+0.010
Neg logit contributions
fs-0.009
oys-0.009
lf-0.009
kin-0.009
 globe-0.008
Token.
Token ID13
Feature activation+2.63e-01

Pos logit contributions
please+0.003
 gor+0.003
 chickens+0.003
 ge+0.003
 North+0.002
Neg logit contributions
oys-0.002
 box-0.002
 globe-0.001
fs-0.001
 compartment-0.001
Token Whenever
Token ID21326
Feature activation-4.41e-02

Pos logit contributions
oys+0.008
fs+0.007
 compartment+0.007
 handshake+0.007
 globe+0.007
Neg logit contributions
please-0.008
 chickens-0.007
 gor-0.007
 ge-0.007
 bored-0.007
Token the
Token ID262
Feature activation-2.18e+00

Pos logit contributions
please+0.002
 gor+0.002
 chickens+0.002
 North+0.001
 ge+0.001
Neg logit contributions
oys-0.001
fs-0.001
kin-0.001
 handshake-0.001
 compartment-0.001
Token chip
Token ID11594
Feature activation-4.68e+00

Pos logit contributions
please+0.068
 gor+0.063
 North+0.062
 harming+0.057
act+0.057
Neg logit contributions
oys-0.056
 box-0.054
 note-0.051
 card-0.049
kin-0.049
Tokenm
Token ID76
Feature activation-1.87e+00

Pos logit contributions
 North+0.133
please+0.130
 gor+0.123
 harming+0.120
 chickens+0.111
Neg logit contributions
oys-0.135
fs-0.127
kin-0.126
 box-0.123
 globe-0.119
Tokenunk
Token ID2954
Feature activation-5.92e-01

Pos logit contributions
please+0.064
 gor+0.061
 North+0.061
 chickens+0.056
 bored+0.055
Neg logit contributions
oys-0.048
kin-0.042
fs-0.038
box-0.038
 box-0.037
Token needed
Token ID2622
Feature activation+6.61e-01

Pos logit contributions
please+0.021
 gor+0.019
 chickens+0.019
 ge+0.018
 North+0.018
Neg logit contributions
oys-0.012
 box-0.012
fs-0.012
 handshake-0.012
 globe-0.012
Token help
Token ID1037
Feature activation-1.28e-01

Pos logit contributions
oys+0.018
fs+0.017
 globe+0.016
kin+0.016
 compartment+0.016
Neg logit contributions
please-0.021
 chickens-0.020
 gor-0.020
 ge-0.019
 pigs-0.018
Token,
Token ID11
Feature activation-7.10e-01

Pos logit contributions
please+0.005
 gor+0.004
 chickens+0.004
 ge+0.004
 pigs+0.004
Neg logit contributions
oys-0.003
fs-0.003
 compartment-0.003
 globe-0.003
 handshake-0.003
Token and
Token ID290
Feature activation+1.48e-02

Pos logit contributions
please+0.036
 gor+0.033
 chickens+0.033
 ge+0.032
 pigs+0.031
Neg logit contributions
oys-0.027
fs-0.025
 compartment-0.024
 globe-0.024
 handshake-0.024
Token ran
Token ID4966
Feature activation-8.91e-01

Pos logit contributions
oys+0.000
fs+0.000
 compartment+0.000
 globe+0.000
 handshake+0.000
Neg logit contributions
please-0.000
 chickens-0.000
 gor-0.000
 ge-0.000
 pigs-0.000
Token around
Token ID1088
Feature activation+4.88e-01

Pos logit contributions
please+0.024
 gor+0.023
 chickens+0.023
 ge+0.022
 bored+0.021
Neg logit contributions
oys-0.027
fs-0.027
kin-0.026
 compartment-0.026
 globe-0.025
Token as
Token ID355
Feature activation-2.57e-01

Pos logit contributions
oys+0.011
 globe+0.011
 compartment+0.010
fs+0.010
 handshake+0.010
Neg logit contributions
please-0.017
 gor-0.016
 chickens-0.015
 ge-0.015
 North-0.015
Token the
Token ID262
Feature activation-2.04e+00

Pos logit contributions
please+0.009
 gor+0.009
 chickens+0.009
 ge+0.008
 pigs+0.008
Neg logit contributions
oys-0.006
fs-0.006
 compartment-0.005
 globe-0.005
kin-0.005
Token salty
Token ID36021
Feature activation-4.56e+00

Pos logit contributions
please+0.045
 chickens+0.040
 gor+0.040
 ge+0.039
 pigs+0.036
Neg logit contributions
oys-0.075
fs-0.073
 compartment-0.070
 globe-0.069
 handshake-0.069
Token water
Token ID1660
Feature activation-2.26e+00

Pos logit contributions
please+0.131
 gor+0.117
 chickens+0.115
 ge+0.107
 pigs+0.106
Neg logit contributions
oys-0.140
fs-0.133
kin-0.132
 globe-0.128
 compartment-0.127
Token tick
Token ID4378
Feature activation-2.42e+00

Pos logit contributions
please+0.068
 gor+0.062
 chickens+0.060
 North+0.057
 pigs+0.055
Neg logit contributions
oys-0.065
fs-0.064
kin-0.063
 handshake-0.061
 compartment-0.061
Tokenled
Token ID992
Feature activation-7.17e-01

Pos logit contributions
please+0.100
 gor+0.093
 North+0.090
 chickens+0.090
 bored+0.087
Neg logit contributions
oys-0.043
fs-0.043
kin-0.041
suit-0.035
 handshake-0.035
Token their
Token ID511
Feature activation-7.57e-01

Pos logit contributions
please+0.022
 gor+0.020
 chickens+0.020
 ge+0.019
 pigs+0.018
Neg logit contributions
oys-0.021
fs-0.020
 compartment-0.019
 globe-0.019
 handshake-0.019
Token toes
Token ID23932
Feature activation-1.16e+00

Pos logit contributions
please+0.024
 chickens+0.023
 gor+0.023
 ge+0.022
 pigs+0.021
Neg logit contributions
oys-0.020
fs-0.019
 compartment-0.018
 globe-0.018
 handshake-0.018
Token because
Token ID780
Feature activation+5.12e-01

Pos logit contributions
oys+0.011
fs+0.010
 globe+0.010
 compartment+0.010
 handshake+0.010
Neg logit contributions
please-0.017
 gor-0.016
 chickens-0.016
 ge-0.015
 pigs-0.015
Token the
Token ID262
Feature activation-9.92e-01

Pos logit contributions
oys+0.011
fs+0.010
 globe+0.010
kin+0.010
 compartment+0.010
Neg logit contributions
please-0.019
 chickens-0.018
 gor-0.017
 ge-0.017
 bored-0.017
Token frog
Token ID21264
Feature activation-1.57e+00

Pos logit contributions
please+0.033
 gor+0.031
 North+0.030
der+0.028
 pasture+0.028
Neg logit contributions
 box-0.024
 note-0.023
 card-0.022
 label-0.022
 belt-0.021
Token and
Token ID290
Feature activation-4.42e-02

Pos logit contributions
please+0.058
 gor+0.055
 chickens+0.053
 North+0.053
 ge+0.052
Neg logit contributions
oys-0.030
 globe-0.030
fs-0.029
kin-0.029
 handshake-0.028
Token the
Token ID262
Feature activation-1.28e+00

Pos logit contributions
please+0.002
 North+0.001
 pasture+0.001
act+0.001
 chickens+0.001
Neg logit contributions
 globe-0.001
 box-0.001
oys-0.001
kin-0.001
 handshake-0.001
Token chip
Token ID11594
Feature activation-4.53e+00

Pos logit contributions
please+0.041
 gor+0.039
 North+0.037
der+0.036
 pasture+0.035
Neg logit contributions
 box-0.032
 note-0.030
 card-0.028
 label-0.027
 present-0.027
Tokenm
Token ID76
Feature activation-1.63e+00

Pos logit contributions
please+0.128
 North+0.128
 gor+0.120
 harming+0.110
 pasture+0.109
Neg logit contributions
oys-0.127
fs-0.124
 box-0.120
 globe-0.119
kin-0.118
Tokenunk
Token ID2954
Feature activation-1.09e+00

Pos logit contributions
please+0.056
 gor+0.054
 North+0.052
 chickens+0.049
 bored+0.048
Neg logit contributions
oys-0.040
kin-0.037
fs-0.035
box-0.033
 handshake-0.033
Token had
Token ID550
Feature activation+2.24e-01

Pos logit contributions
please+0.039
 gor+0.036
 chickens+0.034
 North+0.034
 ge+0.033
Neg logit contributions
oys-0.024
fs-0.023
 handshake-0.023
 globe-0.023
 box-0.022
Token promised
Token ID8072
Feature activation+1.37e+00

Pos logit contributions
 handshake+0.005
oys+0.004
 box+0.004
 seek+0.004
fs+0.004
Neg logit contributions
please-0.008
 gor-0.008
 North-0.007
 chickens-0.007
 pigs-0.007
Token not
Token ID407
Feature activation-4.24e-02

Pos logit contributions
oys+0.040
fs+0.038
 compartment+0.036
 handshake+0.036
 globe+0.036
Neg logit contributions
please-0.041
 gor-0.038
 chickens-0.038
 ge-0.036
 pigs-0.035
Token their
Token ID511
Feature activation+8.54e-01

Pos logit contributions
oys+0.012
fs+0.012
 handshake+0.011
 compartment+0.011
kin+0.011
Neg logit contributions
please-0.014
 gor-0.013
 chickens-0.013
 ge-0.012
 North-0.012
Token faces
Token ID6698
Feature activation-8.85e-01

Pos logit contributions
oys+0.023
 globe+0.022
fs+0.022
kin+0.021
 handshake+0.021
Neg logit contributions
please-0.027
 gor-0.024
 chickens-0.024
 ge-0.023
 pigs-0.022
Token happy
Token ID3772
Feature activation+5.91e-01

Pos logit contributions
please+0.033
 gor+0.030
 ge+0.029
 chickens+0.028
 colony+0.028
Neg logit contributions
oys-0.018
 globe-0.017
kin-0.017
 handshake-0.017
 note-0.017
Token and
Token ID290
Feature activation-2.21e-01

Pos logit contributions
oys+0.014
 globe+0.013
 handshake+0.012
kin+0.012
 compartment+0.012
Neg logit contributions
please-0.022
 gor-0.020
 chickens-0.019
 ge-0.019
 pasture-0.019
Token their
Token ID511
Feature activation-3.37e-01

Pos logit contributions
please+0.006
 chickens+0.006
 gor+0.006
 ge+0.006
 bored+0.005
Neg logit contributions
oys-0.007
fs-0.006
 compartment-0.006
 globe-0.006
 handshake-0.006
Token tongues
Token ID39413
Feature activation-4.52e+00

Pos logit contributions
please+0.012
 gor+0.011
 chickens+0.011
 ge+0.010
 pigs+0.010
Neg logit contributions
oys-0.008
 globe-0.007
fs-0.007
kin-0.007
 handshake-0.007
Token glad
Token ID9675
Feature activation-1.11e+00

Pos logit contributions
please+0.162
 gor+0.159
 colony+0.149
 chickens+0.148
Fre+0.147
Neg logit contributions
kin-0.095
 note-0.090
 globe-0.088
 card-0.086
 with-0.085
Token.
Token ID13
Feature activation+3.98e-01

Pos logit contributions
please+0.048
 gor+0.045
 pasture+0.044
 colony+0.043
 chickens+0.043
Neg logit contributions
oys-0.018
 box-0.016
 present-0.016
 note-0.016
 globe-0.015
Token
Token ID198
Feature activation-3.54e-01

Pos logit contributions
oys+0.011
fs+0.010
 handshake+0.009
 compartment+0.009
 globe+0.009
Neg logit contributions
please-0.013
 gor-0.012
 chickens-0.012
 ge-0.012
 bored-0.011
Token
Token ID198
Feature activation-2.42e-01

Pos logit contributions
please+0.011
 gor+0.010
 chickens+0.010
 ge+0.010
 pigs+0.009
Neg logit contributions
oys-0.010
fs-0.009
 compartment-0.009
kin-0.009
 globe-0.009
Token"
Token ID1
Feature activation-5.56e-01

Pos logit contributions
please+0.007
 ge+0.007
 chickens+0.007
 gor+0.006
 pigs+0.006
Neg logit contributions
 compartment-0.007
oys-0.007
fs-0.006
 handshake-0.006
 globe-0.006
Token,
Token ID11
Feature activation-1.55e+00

Pos logit contributions
fs+0.009
oys+0.009
kin+0.008
 box+0.008
 globe+0.008
Neg logit contributions
please-0.016
 gor-0.015
 chickens-0.014
 ge-0.014
 North-0.014
Token like
Token ID588
Feature activation-2.64e-01

Pos logit contributions
please+0.056
 gor+0.051
 North+0.049
 chickens+0.046
 ge+0.046
Neg logit contributions
oys-0.032
fs-0.030
 globe-0.030
 compartment-0.030
kin-0.029
Token mint
Token ID21043
Feature activation-2.59e+00

Pos logit contributions
please+0.009
 North+0.007
 gor+0.007
vious+0.007
 regretted+0.007
Neg logit contributions
oys-0.006
 box-0.006
fs-0.006
 card-0.006
 label-0.006
Token and
Token ID290
Feature activation-1.42e+00

Pos logit contributions
please+0.090
 gor+0.085
 North+0.085
 known+0.084
 ge+0.078
Neg logit contributions
fs-0.060
oys-0.059
kin-0.057
 box-0.056
 globe-0.054
Token rose
Token ID8278
Feature activation-2.93e+00

Pos logit contributions
please+0.046
 gor+0.039
vious+0.039
 North+0.038
 bored+0.037
Neg logit contributions
oys-0.032
 label-0.031
 box-0.031
 globe-0.030
 contents-0.029
Tokenmary
Token ID6874
Feature activation-4.48e+00

Pos logit contributions
please+0.093
 North+0.085
 regretted+0.085
 gor+0.083
 decides+0.083
Neg logit contributions
fs-0.070
oys-0.069
 box-0.066
kin-0.066
bes-0.065
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
please+0.184
 known+0.174
 North+0.172
eling+0.166
 harming+0.165
Neg logit contributions
 box-0.068
fs-0.068
 globe-0.067
suit-0.067
kin-0.066
Token They
Token ID1119
Feature activation-3.33e-01

Pos logit contributions
please+0.038
 chickens+0.035
 gor+0.034
 harming+0.033
 ge+0.032
Neg logit contributions
oys-0.025
fs-0.022
 compartment-0.021
 handshake-0.021
 globe-0.021
Token liked
Token ID8288
Feature activation-8.53e-01

Pos logit contributions
 gor+0.011
 North+0.011
please+0.011
opl+0.011
 asleep+0.010
Neg logit contributions
oys-0.007
 box-0.007
kin-0.007
 believe-0.006
f-0.006
Token to
Token ID284
Feature activation-8.80e-01

Pos logit contributions
vious+0.036
ivery+0.036
please+0.036
opl+0.036
eling+0.035
Neg logit contributions
 with-0.007
 box-0.007
 present-0.006
 card-0.006
oys-0.005
Token see
Token ID766
Feature activation-8.09e-01

Pos logit contributions
please+0.032
 gor+0.032
 chickens+0.030
 North+0.030
 ge+0.029
Neg logit contributions
oys-0.017
 handshake-0.016
kin-0.016
 seek-0.016
fs-0.015
Token like
Token ID588
Feature activation-8.10e-02

Pos logit contributions
please+0.027
 gor+0.025
 North+0.023
 chickens+0.023
 ge+0.023
Neg logit contributions
oys-0.018
fs-0.017
 handshake-0.017
 globe-0.016
kin-0.016
Token carrots
Token ID34397
Feature activation-2.75e+00

Pos logit contributions
please+0.003
vious+0.003
arre+0.003
 regretted+0.002
ained+0.002
Neg logit contributions
 box-0.002
 card-0.002
 necklace-0.002
 globe-0.002
 present-0.002
Token and
Token ID290
Feature activation-1.58e+00

Pos logit contributions
please+0.123
 gor+0.113
 ge+0.108
 North+0.108
 known+0.105
Neg logit contributions
 box-0.042
kin-0.039
oys-0.039
 globe-0.038
fs-0.037
Token peas
Token ID22589
Feature activation-3.73e+00

Pos logit contributions
please+0.047
ained+0.043
 bored+0.041
vious+0.040
 gor+0.039
Neg logit contributions
oys-0.037
 seek-0.036
 box-0.036
 card-0.035
 necklace-0.035
Token and
Token ID290
Feature activation-1.61e+00

Pos logit contributions
please+0.158
 North+0.146
 gor+0.145
 Australian+0.140
 ge+0.140
Neg logit contributions
kin-0.061
 box-0.058
oys-0.056
 globe-0.053
fs-0.052
Token lettuce
Token ID39406
Feature activation-4.44e+00

Pos logit contributions
please+0.049
ained+0.045
 North+0.041
 gor+0.041
 Australian+0.041
Neg logit contributions
 box-0.038
 shovel-0.037
 card-0.036
 seek-0.036
 necklace-0.036
Token.
Token ID13
Feature activation-6.11e-01

Pos logit contributions
please+0.189
 North+0.170
 gor+0.169
 known+0.167
eling+0.164
Neg logit contributions
 box-0.076
kin-0.074
oys-0.069
fs-0.068
 globe-0.067
Token
Token ID198
Feature activation-5.82e-01

Pos logit contributions
please+0.023
 gor+0.021
 chickens+0.021
 ge+0.021
 North+0.020
Neg logit contributions
oys-0.013
fs-0.011
 globe-0.011
 compartment-0.011
 box-0.011
Token
Token ID198
Feature activation-2.04e-01

Pos logit contributions
please+0.017
 gor+0.016
 chickens+0.016
 ge+0.015
 pigs+0.015
Neg logit contributions
oys-0.017
fs-0.016
 compartment-0.015
kin-0.015
 globe-0.015
TokenOne
Token ID3198
Feature activation-6.16e-01

Pos logit contributions
please+0.005
 gor+0.004
 chickens+0.004
 ge+0.004
 pigs+0.004
Neg logit contributions
oys-0.007
fs-0.007
 compartment-0.007
 globe-0.007
 handshake-0.007
Token day
Token ID1110
Feature activation+9.88e-01

Pos logit contributions
please+0.020
 gor+0.018
 chickens+0.018
 ge+0.017
 pigs+0.017
Neg logit contributions
oys-0.017
fs-0.015
kin-0.015
 globe-0.015
 handshake-0.015
Tokenish
Token ID680
Feature activation-2.16e+00

Pos logit contributions
 gor+0.119
 North+0.116
please+0.113
 chickens+0.110
 bored+0.109
Neg logit contributions
oys-0.081
 box-0.079
lock-0.074
 globe-0.074
box-0.071
Token to
Token ID284
Feature activation-8.32e-01

Pos logit contributions
please+0.076
 gor+0.068
 chickens+0.066
 ge+0.065
 North+0.065
Neg logit contributions
oys-0.052
fs-0.049
 handshake-0.048
 box-0.046
kin-0.046
Token surrender
Token ID16908
Feature activation-1.23e+00

Pos logit contributions
please+0.030
 gor+0.027
 chickens+0.027
 ge+0.026
 pigs+0.026
Neg logit contributions
oys-0.019
fs-0.018
 handshake-0.017
kin-0.017
 compartment-0.017
Token until
Token ID1566
Feature activation-2.42e+00

Pos logit contributions
please+0.043
 gor+0.040
 chickens+0.038
 North+0.037
 ge+0.036
Neg logit contributions
oys-0.031
fs-0.028
kin-0.028
 compartment-0.027
 handshake-0.027
Token the
Token ID262
Feature activation-2.54e+00

Pos logit contributions
please+0.099
 gor+0.088
 chickens+0.088
 pigs+0.083
 pasture+0.083
Neg logit contributions
kin-0.042
 box-0.040
oys-0.040
 globe-0.040
fs-0.039
Token rad
Token ID2511
Feature activation-4.44e+00

Pos logit contributions
please+0.068
 gor+0.063
 pasture+0.060
eling+0.059
ained+0.058
Neg logit contributions
 box-0.075
 note-0.070
 present-0.068
 message-0.065
 globe-0.065
Tokenish
Token ID680
Feature activation-2.94e+00

Pos logit contributions
 gor+0.158
 North+0.152
please+0.150
 chickens+0.145
 bored+0.143
Neg logit contributions
 box-0.089
oys-0.079
box-0.077
lock-0.077
 globe-0.075
Token felt
Token ID2936
Feature activation-3.14e+00

Pos logit contributions
please+0.101
 gor+0.094
 chickens+0.092
 North+0.089
 pigs+0.088
Neg logit contributions
kin-0.067
fs-0.067
oys-0.066
 handshake-0.064
 compartment-0.064
Token exhausted
Token ID19064
Feature activation-1.71e+00

Pos logit contributions
please+0.116
 gor+0.107
 chickens+0.102
 North+0.099
 pigs+0.098
Neg logit contributions
kin-0.065
 compartment-0.063
 handshake-0.059
 note-0.058
 box-0.058
Token.
Token ID13
Feature activation+2.06e-01

Pos logit contributions
please+0.066
 gor+0.062
 ge+0.059
 chickens+0.059
 pasture+0.056
Neg logit contributions
oys-0.038
fs-0.032
 box-0.032
 compartment-0.031
kin-0.031
Token
Token ID220
Feature activation-1.54e+00

Pos logit contributions
oys+0.005
 compartment+0.004
fs+0.004
 globe+0.004
 handshake+0.004
Neg logit contributions
please-0.007
 chickens-0.007
 gor-0.007
 ge-0.007
 North-0.006
Token make
Token ID787
Feature activation-1.48e+00

Pos logit contributions
please+0.044
 gor+0.039
 chickens+0.037
 North+0.036
 ge+0.036
Neg logit contributions
oys-0.027
 handshake-0.026
 box-0.025
 compartment-0.025
 globe-0.025
Token the
Token ID262
Feature activation-1.73e+00

Pos logit contributions
please+0.056
 gor+0.049
 pasture+0.047
 North+0.046
 chickens+0.046
Neg logit contributions
 box-0.028
fs-0.027
 handshake-0.027
oys-0.027
 note-0.027
Token fish
Token ID5916
Feature activation-1.90e+00

Pos logit contributions
please+0.059
 gor+0.049
 chickens+0.047
 pasture+0.047
 ge+0.046
Neg logit contributions
 globe-0.045
 box-0.044
kin-0.042
 handshake-0.042
 note-0.041
Token and
Token ID290
Feature activation-1.68e+00

Pos logit contributions
please+0.064
 gor+0.057
 chickens+0.054
 North+0.053
 ge+0.052
Neg logit contributions
kin-0.047
oys-0.047
 globe-0.046
 note-0.045
 box-0.045
Token the
Token ID262
Feature activation-1.64e+00

Pos logit contributions
please+0.058
 gor+0.049
 North+0.046
 pasture+0.046
 chickens+0.046
Neg logit contributions
 handshake-0.040
oys-0.039
 box-0.039
 globe-0.039
 compartment-0.038
Token cor
Token ID1162
Feature activation-4.43e+00

Pos logit contributions
please+0.052
 gor+0.043
 chickens+0.041
 pasture+0.041
 ge+0.040
Neg logit contributions
 globe-0.045
 box-0.045
kin-0.044
 handshake-0.044
oys-0.044
Tokenals
Token ID874
Feature activation-3.17e+00

Pos logit contributions
please+0.155
 known+0.137
 North+0.133
ained+0.133
 gor+0.131
Neg logit contributions
cher-0.093
suit-0.092
 box-0.091
 globe-0.091
acle-0.082
Token sad
Token ID6507
Feature activation-2.18e+00

Pos logit contributions
please+0.122
 gor+0.111
 chickens+0.104
 ge+0.103
 anth+0.103
Neg logit contributions
kin-0.065
 box-0.064
fs-0.063
 globe-0.062
 handshake-0.061
Token."
Token ID526
Feature activation-1.22e-01

Pos logit contributions
please+0.082
 pasture+0.070
 gor+0.069
 known+0.068
 Australian+0.066
Neg logit contributions
oys-0.048
kin-0.046
 handshake-0.046
 box-0.045
 globe-0.044
Token
Token ID198
Feature activation+2.61e-01

Pos logit contributions
please+0.004
 gor+0.003
 chickens+0.003
 ge+0.003
 pigs+0.003
Neg logit contributions
oys-0.003
fs-0.003
 compartment-0.003
kin-0.003
 globe-0.003
Token
Token ID198
Feature activation+4.69e-01

Pos logit contributions
oys+0.008
fs+0.008
kin+0.007
 compartment+0.007
 globe+0.007
Neg logit contributions
please-0.007
 gor-0.007
 chickens-0.007
 ge-0.007
 pigs-0.006
Token leave
Token ID2666
Feature activation+1.31e-01

Pos logit contributions
please+0.028
 gor+0.026
 bored+0.025
 chickens+0.024
 ge+0.024
Neg logit contributions
oys-0.016
 handshake-0.015
fs-0.015
kin-0.015
 box-0.015
Token!"
Token ID2474
Feature activation-4.17e-01

Pos logit contributions
oys+0.003
fs+0.003
 compartment+0.003
 globe+0.003
 box+0.003
Neg logit contributions
please-0.004
 gor-0.004
 chickens-0.004
 ge-0.004
 bored-0.004
Token Suddenly
Token ID24975
Feature activation+1.83e-01

Pos logit contributions
please+0.014
 gor+0.013
 chickens+0.013
 ge+0.013
 bored+0.012
Neg logit contributions
oys-0.010
fs-0.010
 compartment-0.009
 handshake-0.009
 globe-0.009
Token,
Token ID11
Feature activation-3.86e-01

Pos logit contributions
oys+0.004
fs+0.004
 compartment+0.004
 handshake+0.004
 globe+0.004
Neg logit contributions
please-0.007
 gor-0.006
 chickens-0.006
 ge-0.006
 pigs-0.006
Token the
Token ID262
Feature activation-1.27e+00

Pos logit contributions
please+0.013
 gor+0.012
 chickens+0.011
 ge+0.011
 North+0.011
Neg logit contributions
oys-0.010
fs-0.009
 globe-0.009
 compartment-0.009
kin-0.009
Token chip
Token ID11594
Feature activation-4.39e+00

Pos logit contributions
please+0.045
 gor+0.042
 North+0.039
 bored+0.038
 ge+0.038
Neg logit contributions
 box-0.029
kin-0.027
 note-0.026
oys-0.026
 card-0.026
Tokenmun
Token ID6199
Feature activation-2.62e+00

Pos logit contributions
please+0.126
 North+0.119
 gor+0.116
 bored+0.108
 harming+0.107
Neg logit contributions
oys-0.126
fs-0.123
kin-0.121
 globe-0.116
 box-0.116
Tokenks
Token ID591
Feature activation-7.33e-01

Pos logit contributions
please+0.109
 gor+0.101
 North+0.101
 chickens+0.096
 ge+0.094
Neg logit contributions
oys-0.044
kin-0.042
fs-0.040
 compartment-0.039
 handshake-0.036
Token were
Token ID547
Feature activation-1.20e+00

Pos logit contributions
please+0.024
 gor+0.023
 North+0.021
 chickens+0.021
 ge+0.021
Neg logit contributions
oys-0.019
fs-0.018
kin-0.017
lf-0.017
 compartment-0.016
Token free
Token ID1479
Feature activation+4.66e-01

Pos logit contributions
please+0.037
 gor+0.034
 decides+0.034
 North+0.032
act+0.031
Neg logit contributions
kin-0.031
fs-0.029
oys-0.029
 compartment-0.028
box-0.028
Token again
Token ID757
Feature activation-1.15e+00

Pos logit contributions
oys+0.009
 box+0.008
 compartment+0.008
box+0.008
 globe+0.008
Neg logit contributions
please-0.019
 gor-0.017
 ge-0.016
 chickens-0.016
 North-0.016
Token didn
Token ID1422
Feature activation+1.93e-01

Pos logit contributions
please+0.029
 gor+0.029
 North+0.028
 chickens+0.027
 pasture+0.027
Neg logit contributions
kin-0.024
fs-0.023
lf-0.022
oys-0.022
 seek-0.022
Token't
Token ID470
Feature activation-5.72e-01

Pos logit contributions
oys+0.004
fs+0.004
 globe+0.004
kin+0.004
 compartment+0.004
Neg logit contributions
please-0.007
 chickens-0.007
 gor-0.007
 ge-0.007
 pigs-0.006
Token stop
Token ID2245
Feature activation-1.95e-01

Pos logit contributions
please+0.017
 chickens+0.017
 gor+0.017
 North+0.017
 pasture+0.016
Neg logit contributions
oys-0.015
 seek-0.015
 believe-0.014
kin-0.014
 handshake-0.013
Token until
Token ID1566
Feature activation-2.37e+00

Pos logit contributions
please+0.007
 gor+0.006
vious+0.006
 chickens+0.006
 pasture+0.006
Neg logit contributions
oys-0.004
 with-0.004
 note-0.004
 box-0.003
 globe-0.003
Token their
Token ID511
Feature activation-1.33e+00

Pos logit contributions
please+0.094
 gor+0.086
 chickens+0.085
 ge+0.080
 pigs+0.080
Neg logit contributions
oys-0.049
kin-0.045
fs-0.045
 compartment-0.044
 handshake-0.044
Token stomach
Token ID11384
Feature activation-4.32e+00

Pos logit contributions
please+0.046
 gor+0.042
 chickens+0.042
 ge+0.041
 pigs+0.039
Neg logit contributions
oys-0.033
fs-0.030
 handshake-0.029
 compartment-0.029
 globe-0.029
Tokens
Token ID82
Feature activation-1.63e+00

Pos logit contributions
please+0.151
 gor+0.141
 chickens+0.134
 ge+0.132
 North+0.131
Neg logit contributions
oys-0.103
fs-0.099
 compartment-0.096
 box-0.095
kin-0.095
Token were
Token ID547
Feature activation-1.61e+00

Pos logit contributions
please+0.054
 gor+0.048
 chickens+0.047
 ge+0.047
 North+0.045
Neg logit contributions
oys-0.043
kin-0.041
fs-0.040
 compartment-0.040
 globe-0.040
Token full
Token ID1336
Feature activation-5.26e-01

Pos logit contributions
please+0.057
 gor+0.052
 chickens+0.051
 ge+0.050
 decides+0.050
Neg logit contributions
kin-0.040
 compartment-0.035
 box-0.035
acle-0.034
box-0.033
Token.
Token ID13
Feature activation+2.45e-01

Pos logit contributions
please+0.020
 gor+0.018
 chickens+0.018
 ge+0.018
 North+0.017
Neg logit contributions
oys-0.011
 compartment-0.010
fs-0.010
 box-0.010
kin-0.010
Token
Token ID198
Feature activation-2.03e-01

Pos logit contributions
oys+0.007
fs+0.007
kin+0.007
 globe+0.007
 compartment+0.006
Neg logit contributions
please-0.007
 chickens-0.007
 gor-0.006
 ge-0.006
 bored-0.006
Token for
Token ID329
Feature activation-3.84e-01

Pos logit contributions
please+0.020
 chickens+0.019
 gor+0.019
 ge+0.018
 pigs+0.017
Neg logit contributions
oys-0.025
fs-0.024
 compartment-0.024
 handshake-0.023
lf-0.023
Token his
Token ID465
Feature activation+9.71e-01

Pos logit contributions
please+0.011
 gor+0.010
 chickens+0.010
 ge+0.010
 pigs+0.009
Neg logit contributions
oys-0.011
fs-0.011
 compartment-0.010
kin-0.010
 handshake-0.010
Token knee
Token ID10329
Feature activation-2.25e+00

Pos logit contributions
oys+0.016
fs+0.015
 compartment+0.014
 globe+0.014
 handshake+0.014
Neg logit contributions
please-0.041
 gor-0.038
 chickens-0.038
 ge-0.037
 pigs-0.036
Token and
Token ID290
Feature activation+1.06e-01

Pos logit contributions
please+0.077
 gor+0.072
 chickens+0.071
 ge+0.069
 pigs+0.066
Neg logit contributions
oys-0.057
fs-0.052
 handshake-0.050
 compartment-0.050
kin-0.049
Token his
Token ID465
Feature activation+5.06e-01

Pos logit contributions
oys+0.003
fs+0.003
 compartment+0.003
 globe+0.002
 handshake+0.002
Neg logit contributions
please-0.003
 chickens-0.003
 gor-0.003
 ge-0.003
 pigs-0.003
Token elbow
Token ID22662
Feature activation-4.30e+00

Pos logit contributions
oys+0.009
fs+0.008
 globe+0.008
 compartment+0.008
 handshake+0.007
Neg logit contributions
please-0.021
 gor-0.020
 chickens-0.020
 ge-0.019
 pigs-0.019
Token.
Token ID13
Feature activation+3.90e-01

Pos logit contributions
 gor+0.156
please+0.155
 chickens+0.151
 North+0.146
 ge+0.144
Neg logit contributions
oys-0.092
fs-0.087
 handshake-0.083
kin-0.083
 compartment-0.080
Token He
Token ID679
Feature activation+6.43e-01

Pos logit contributions
oys+0.010
fs+0.009
 compartment+0.009
 globe+0.009
 handshake+0.009
Neg logit contributions
please-0.013
 chickens-0.012
 gor-0.012
 ge-0.012
 bored-0.011
Token let
Token ID1309
Feature activation-8.66e-02

Pos logit contributions
oys+0.016
fs+0.015
 compartment+0.014
kin+0.014
 handshake+0.014
Neg logit contributions
please-0.022
 gor-0.020
 chickens-0.020
 ge-0.020
 pigs-0.019
Token him
Token ID683
Feature activation+2.44e-01

Pos logit contributions
please+0.002
 gor+0.002
 chickens+0.002
 ge+0.002
 pigs+0.002
Neg logit contributions
oys-0.003
fs-0.003
 compartment-0.002
 globe-0.002
 handshake-0.002
Token ride
Token ID6594
Feature activation+9.08e-01

Pos logit contributions
oys+0.006
fs+0.006
 compartment+0.005
 globe+0.005
 handshake+0.005
Neg logit contributions
please-0.008
 gor-0.008
 chickens-0.008
 ge-0.007
 pigs-0.007
Token make
Token ID787
Feature activation-9.57e-01

Pos logit contributions
oys+0.004
 handshake+0.003
 note+0.003
 seek+0.003
 believe+0.003
Neg logit contributions
 chickens-0.006
please-0.006
 gor-0.006
 ge-0.006
 tur-0.006
Token the
Token ID262
Feature activation-1.89e+00

Pos logit contributions
please+0.036
 gor+0.030
 chickens+0.030
 North+0.030
 ge+0.029
Neg logit contributions
oys-0.019
 handshake-0.019
 card-0.018
 box-0.018
 note-0.018
Token fish
Token ID5916
Feature activation-1.34e+00

Pos logit contributions
please+0.063
 gor+0.055
 chickens+0.053
 pasture+0.051
 ge+0.050
Neg logit contributions
 globe-0.050
 box-0.048
oys-0.048
 handshake-0.047
kin-0.046
Token and
Token ID290
Feature activation-8.28e-01

Pos logit contributions
please+0.043
 gor+0.038
 chickens+0.037
 North+0.036
 ge+0.035
Neg logit contributions
oys-0.036
 note-0.034
kin-0.034
 globe-0.034
 handshake-0.034
Token the
Token ID262
Feature activation-2.05e+00

Pos logit contributions
please+0.029
 gor+0.025
 North+0.024
 chickens+0.024
 pasture+0.023
Neg logit contributions
oys-0.019
 handshake-0.019
 compartment-0.018
 box-0.018
 globe-0.018
Token cor
Token ID1162
Feature activation-4.29e+00

Pos logit contributions
please+0.062
 gor+0.055
 chickens+0.052
 North+0.050
 pasture+0.050
Neg logit contributions
oys-0.058
 globe-0.057
 box-0.056
 handshake-0.055
kin-0.055
Tokenals
Token ID874
Feature activation-2.83e+00

Pos logit contributions
please+0.151
 gor+0.131
 North+0.131
 known+0.129
ained+0.123
Neg logit contributions
 globe-0.092
cher-0.090
 box-0.089
suit-0.088
acle-0.083
Token sad
Token ID6507
Feature activation-2.06e+00

Pos logit contributions
please+0.101
 gor+0.094
 chickens+0.089
 North+0.087
 ge+0.085
Neg logit contributions
kin-0.067
fs-0.066
 handshake-0.063
 box-0.062
 globe-0.061
Token.
Token ID13
Feature activation-2.50e-02

Pos logit contributions
please+0.082
 gor+0.070
 pasture+0.068
 North+0.068
 chickens+0.067
Neg logit contributions
oys-0.044
kin-0.041
 handshake-0.040
 box-0.039
 globe-0.038
Token They
Token ID1119
Feature activation+5.84e-01

Pos logit contributions
please+0.001
 chickens+0.001
 gor+0.001
 ge+0.001
 North+0.001
Neg logit contributions
oys-0.001
 handshake-0.001
 compartment-0.001
 globe-0.001
fs-0.001
Token wanted
Token ID2227
Feature activation+9.76e-01

Pos logit contributions
oys+0.012
fs+0.012
 box+0.011
 compartment+0.011
kin+0.011
Neg logit contributions
please-0.021
 gor-0.020
 chickens-0.019
 North-0.019
 pigs-0.018
Token and
Token ID290
Feature activation-1.74e+00

Pos logit contributions
please+0.079
 gor+0.073
 North+0.070
 anth+0.068
 ge+0.067
Neg logit contributions
oys-0.032
 box-0.032
kin-0.032
fs-0.031
 globe-0.031
Token lady
Token ID10846
Feature activation-4.84e-01

Pos logit contributions
please+0.050
 gor+0.045
 North+0.042
 pasture+0.041
 chickens+0.039
Neg logit contributions
oys-0.048
 handshake-0.048
 seek-0.046
fs-0.046
 box-0.045
Tokenbugs
Token ID32965
Feature activation-3.13e+00

Pos logit contributions
please+0.016
 gor+0.015
 North+0.015
 ge+0.014
 chickens+0.014
Neg logit contributions
oys-0.011
 handshake-0.010
fs-0.010
kin-0.010
 globe-0.010
Token and
Token ID290
Feature activation-1.92e+00

Pos logit contributions
please+0.125
 gor+0.113
 North+0.109
 anth+0.108
eling+0.107
Neg logit contributions
oys-0.056
 box-0.055
kin-0.055
 handshake-0.054
 globe-0.053
Token cater
Token ID20825
Feature activation-3.58e+00

Pos logit contributions
please+0.056
 gor+0.049
 North+0.048
 pasture+0.047
abi+0.045
Neg logit contributions
 box-0.049
 handshake-0.049
oys-0.049
 seek-0.048
 card-0.047
Tokenpill
Token ID27215
Feature activation-4.28e+00

Pos logit contributions
 North+0.097
 known+0.097
 gor+0.096
please+0.093
 starting+0.092
Neg logit contributions
fs-0.087
of-0.084
ars-0.083
sc-0.083
bes-0.080
Tokenars
Token ID945
Feature activation-6.96e-01

Pos logit contributions
please+0.234
 gor+0.225
 North+0.224
 chickens+0.219
 ge+0.216
Neg logit contributions
fs-0.017
oys-0.009
kin-0.009
 handshake-0.007
ars-0.005
Token."
Token ID526
Feature activation-8.51e-02

Pos logit contributions
please+0.027
 gor+0.025
 colony+0.024
 North+0.024
 anth+0.024
Neg logit contributions
oys-0.014
fs-0.013
kin-0.013
 globe-0.012
 handshake-0.012
Token
Token ID198
Feature activation+3.75e-01

Pos logit contributions
please+0.003
 chickens+0.003
 gor+0.003
 ge+0.003
 North+0.003
Neg logit contributions
oys-0.002
fs-0.002
 handshake-0.002
 globe-0.002
 compartment-0.002
Token
Token ID198
Feature activation+5.44e-01

Pos logit contributions
oys+0.010
fs+0.010
 compartment+0.009
kin+0.009
 handshake+0.009
Neg logit contributions
please-0.012
 gor-0.011
 chickens-0.011
 ge-0.011
 pigs-0.010
TokenTim
Token ID14967
Feature activation+1.52e+00

Pos logit contributions
oys+0.009
fs+0.009
 compartment+0.008
kin+0.008
 handshake+0.008
Neg logit contributions
please-0.023
 gor-0.021
 chickens-0.021
 ge-0.021
 pigs-0.020
Token has
Token ID468
Feature activation+1.34e+00

Pos logit contributions
oys+0.025
 globe+0.024
fs+0.024
 compartment+0.023
 box+0.022
Neg logit contributions
please-0.062
 chickens-0.057
 gor-0.056
 ge-0.055
 North-0.053
Token cheese
Token ID9891
Feature activation-1.16e+00

Pos logit contributions
oys+0.025
 box+0.024
 card+0.023
fs+0.022
 present+0.022
Neg logit contributions
please-0.052
 gor-0.046
 colony-0.045
 bored-0.045
 pasture-0.044
Token and
Token ID290
Feature activation+2.80e-01

Pos logit contributions
please+0.043
 gor+0.039
 chickens+0.038
 ge+0.038
 North+0.037
Neg logit contributions
oys-0.027
fs-0.025
kin-0.024
 globe-0.024
 compartment-0.024
Token ham
Token ID8891
Feature activation-2.91e+00

Pos logit contributions
oys+0.008
fs+0.008
 globe+0.008
kin+0.008
 compartment+0.007
Neg logit contributions
please-0.008
 gor-0.008
 chickens-0.007
 ge-0.007
 bored-0.007
Token and
Token ID290
Feature activation-4.20e-01

Pos logit contributions
please+0.108
 gor+0.098
 chickens+0.095
 North+0.094
 ge+0.093
Neg logit contributions
oys-0.069
kin-0.063
 globe-0.060
fs-0.060
 compartment-0.058
Token lettuce
Token ID39406
Feature activation-4.27e+00

Pos logit contributions
please+0.013
 gor+0.012
 chickens+0.011
 North+0.011
 bored+0.011
Neg logit contributions
oys-0.011
fs-0.011
kin-0.011
 globe-0.010
 box-0.010
Token.
Token ID13
Feature activation-1.22e+00

Pos logit contributions
please+0.177
 gor+0.159
 North+0.157
 known+0.151
 pasture+0.150
Neg logit contributions
oys-0.084
kin-0.081
 box-0.080
 globe-0.074
fs-0.074
Token It
Token ID632
Feature activation+1.32e+00

Pos logit contributions
please+0.045
 gor+0.040
 chickens+0.039
 harming+0.038
 North+0.037
Neg logit contributions
oys-0.028
 compartment-0.025
fs-0.024
 globe-0.024
 handshake-0.024
Token looks
Token ID3073
Feature activation-4.69e-01

Pos logit contributions
 globe+0.021
oys+0.021
 box+0.020
fs+0.020
 compartment+0.020
Neg logit contributions
please-0.055
 chickens-0.050
 gor-0.049
 ge-0.048
 North-0.047
Token y
Token ID331
Feature activation-1.56e-01

Pos logit contributions
please+0.016
 gor+0.014
 chickens+0.014
 North+0.014
 colony+0.014
Neg logit contributions
fs-0.010
oys-0.010
 box-0.010
 present-0.010
kin-0.010
Tokenummy
Token ID13513
Feature activation-7.52e-01

Pos logit contributions
please+0.004
 gor+0.004
 chickens+0.004
 ge+0.004
 pigs+0.004
Neg logit contributions
oys-0.005
fs-0.005
 compartment-0.004
 handshake-0.004
 globe-0.004
Token you
Token ID345
Feature activation-7.04e-01

Pos logit contributions
please+0.059
 chickens+0.054
 gor+0.054
 ge+0.053
 pigs+0.051
Neg logit contributions
oys-0.036
fs-0.034
 compartment-0.033
 globe-0.032
 handshake-0.032
Token".
Token ID1911
Feature activation+5.58e-01

Pos logit contributions
please+0.024
 gor+0.021
 chickens+0.021
 ge+0.020
 North+0.020
Neg logit contributions
oys-0.019
fs-0.016
 handshake-0.016
kin-0.015
 box-0.015
Token
Token ID198
Feature activation+2.99e-01

Pos logit contributions
oys+0.015
fs+0.015
kin+0.014
 compartment+0.014
 handshake+0.014
Neg logit contributions
please-0.018
 chickens-0.016
 gor-0.016
 ge-0.015
 bored-0.015
Token
Token ID198
Feature activation+5.22e-01

Pos logit contributions
oys+0.009
fs+0.008
 compartment+0.008
 globe+0.008
kin+0.008
Neg logit contributions
please-0.009
 chickens-0.008
 gor-0.008
 ge-0.008
 bored-0.007
TokenThe
Token ID464
Feature activation-3.00e+00

Pos logit contributions
oys+0.011
fs+0.010
 compartment+0.010
 globe+0.010
 handshake+0.010
Neg logit contributions
please-0.020
 chickens-0.018
 gor-0.018
 ge-0.018
 bored-0.017
Token bad
Token ID2089
Feature activation-4.26e+00

Pos logit contributions
please+0.077
 chickens+0.070
 gor+0.070
 ge+0.067
 bored+0.064
Neg logit contributions
oys-0.101
fs-0.097
 compartment-0.093
 globe-0.093
 handshake-0.092
Tokenger
Token ID1362
Feature activation-1.97e-03

Pos logit contributions
please+0.160
 gor+0.151
 chickens+0.151
 ge+0.148
 bored+0.142
Neg logit contributions
oys-0.088
fs-0.085
 compartment-0.081
 globe-0.080
 handshake-0.078
Token was
Token ID373
Feature activation-1.41e-01

Pos logit contributions
please+0.000
 gor+0.000
 chickens+0.000
 ge+0.000
 pigs+0.000
Neg logit contributions
oys-0.000
fs-0.000
 compartment-0.000
 handshake-0.000
kin-0.000
Token very
Token ID845
Feature activation+9.77e-02

Pos logit contributions
please+0.004
 chickens+0.004
 gor+0.004
 pigs+0.004
 ge+0.004
Neg logit contributions
oys-0.004
kin-0.004
fs-0.004
 compartment-0.004
 handshake-0.004
Token surprised
Token ID6655
Feature activation+3.85e-01

Pos logit contributions
oys+0.003
kin+0.003
fs+0.003
 handshake+0.003
 compartment+0.003
Neg logit contributions
please-0.003
 chickens-0.002
 gor-0.002
 pigs-0.002
 North-0.002
Token.
Token ID13
Feature activation+7.55e-01

Pos logit contributions
fs+0.011
oys+0.011
kin+0.011
 globe+0.011
 compartment+0.010
Neg logit contributions
please-0.011
 gor-0.010
 chickens-0.010
 ge-0.010
 bored-0.009
Token she
Token ID673
Feature activation-1.27e+00

Pos logit contributions
please+0.025
 gor+0.023
 chickens+0.022
 ge+0.022
 North+0.021
Neg logit contributions
oys-0.016
fs-0.016
 compartment-0.014
kin-0.014
 globe-0.014
Token asked
Token ID1965
Feature activation-2.38e-01

Pos logit contributions
please+0.037
 gor+0.035
 chickens+0.034
 ge+0.034
 North+0.032
Neg logit contributions
oys-0.036
fs-0.035
kin-0.033
 globe-0.033
 handshake-0.033
Token herself
Token ID5223
Feature activation-1.16e+00

Pos logit contributions
please+0.008
 North+0.007
 gor+0.007
 ge+0.007
 pasture+0.007
Neg logit contributions
oys-0.006
fs-0.005
lf-0.005
 box-0.005
kin-0.005
Token if
Token ID611
Feature activation-4.65e-01

Pos logit contributions
please+0.044
 gor+0.038
 ge+0.038
 North+0.037
 chickens+0.037
Neg logit contributions
oys-0.025
fs-0.023
 handshake-0.022
kin-0.022
 globe-0.022
Token these
Token ID777
Feature activation-2.78e+00

Pos logit contributions
please+0.017
 gor+0.015
 chickens+0.015
 North+0.015
 pasture+0.015
Neg logit contributions
oys-0.010
fs-0.010
 handshake-0.009
kin-0.009
 box-0.009
Token berries
Token ID36322
Feature activation-4.25e+00

Pos logit contributions
please+0.088
 North+0.079
 gor+0.071
 bored+0.069
 ge+0.068
Neg logit contributions
oys-0.073
fs-0.071
 box-0.069
 handshake-0.064
 contents-0.063
Token were
Token ID547
Feature activation-2.00e+00

Pos logit contributions
please+0.151
 gor+0.137
 North+0.133
 chickens+0.132
 ge+0.128
Neg logit contributions
fs-0.100
oys-0.100
kin-0.093
 globe-0.090
 handshake-0.090
Token safe
Token ID3338
Feature activation+6.91e-02

Pos logit contributions
please+0.064
 North+0.060
 gor+0.058
 decides+0.054
 ge+0.053
Neg logit contributions
oys-0.047
fs-0.046
kin-0.045
 box-0.044
 seek-0.043
Token to
Token ID284
Feature activation-9.61e-01

Pos logit contributions
oys+0.002
fs+0.001
 compartment+0.001
kin+0.001
 globe+0.001
Neg logit contributions
please-0.003
 gor-0.002
 chickens-0.002
 ge-0.002
 North-0.002
Token eat
Token ID4483
Feature activation-2.67e+00

Pos logit contributions
please+0.034
 North+0.032
 gor+0.032
 bored+0.030
 chickens+0.030
Neg logit contributions
 seek-0.018
fs-0.017
oys-0.017
 handshake-0.016
 box-0.016
Token.
Token ID13
Feature activation+1.63e-01

Pos logit contributions
please+0.095
 gor+0.086
 North+0.085
 chickens+0.081
 ge+0.081
Neg logit contributions
fs-0.061
oys-0.059
kin-0.058
 box-0.054
 globe-0.053
Token reached
Token ID4251
Feature activation-1.53e+00

Pos logit contributions
please+0.043
 gor+0.042
 chickens+0.041
 ge+0.039
 North+0.039
Neg logit contributions
oys-0.039
 compartment-0.037
 handshake-0.037
 box-0.037
fs-0.037
Token her
Token ID607
Feature activation-8.27e-02

Pos logit contributions
please+0.051
 gor+0.046
 chickens+0.046
 ge+0.044
 pigs+0.043
Neg logit contributions
oys-0.039
fs-0.037
 compartment-0.037
 handshake-0.036
 globe-0.036
Token hand
Token ID1021
Feature activation-5.26e-01

Pos logit contributions
please+0.003
 gor+0.003
 chickens+0.003
 ge+0.003
 pigs+0.003
Neg logit contributions
oys-0.002
fs-0.002
 compartment-0.002
kin-0.002
 globe-0.002
Token out
Token ID503
Feature activation+1.10e-01

Pos logit contributions
please+0.016
 chickens+0.015
 gor+0.014
 ge+0.014
 pigs+0.014
Neg logit contributions
oys-0.015
fs-0.015
 globe-0.014
 compartment-0.014
kin-0.014
Token and
Token ID290
Feature activation-2.89e-01

Pos logit contributions
oys+0.003
fs+0.002
 globe+0.002
kin+0.002
 compartment+0.002
Neg logit contributions
please-0.004
 chickens-0.004
 gor-0.004
 ge-0.003
 pigs-0.003
Token tasted
Token ID29187
Feature activation-4.24e+00

Pos logit contributions
please+0.009
 chickens+0.008
 ge+0.008
 gor+0.008
 bored+0.007
Neg logit contributions
oys-0.009
fs-0.008
 globe-0.008
kin-0.008
 handshake-0.008
Token one
Token ID530
Feature activation+2.20e-01

Pos logit contributions
please+0.209
 gor+0.183
 chickens+0.178
 North+0.177
 pasture+0.176
Neg logit contributions
 present-0.034
 box-0.034
 with-0.033
kin-0.032
fs-0.032
Token.
Token ID13
Feature activation-6.23e-01

Pos logit contributions
oys+0.007
fs+0.006
 handshake+0.006
 globe+0.006
 compartment+0.006
Neg logit contributions
please-0.006
 gor-0.006
 chickens-0.006
 ge-0.006
 pigs-0.005
Token It
Token ID632
Feature activation+1.04e+00

Pos logit contributions
please+0.021
 gor+0.020
 chickens+0.019
 ge+0.019
 North+0.018
Neg logit contributions
oys-0.016
 compartment-0.015
fs-0.014
 Kate-0.014
 handshake-0.014
Token was
Token ID373
Feature activation+7.10e-01

Pos logit contributions
oys+0.019
fs+0.018
 compartment+0.017
kin+0.017
 globe+0.017
Neg logit contributions
please-0.042
 gor-0.039
 chickens-0.039
 ge-0.037
 North-0.037
Token so
Token ID523
Feature activation+6.68e-01

Pos logit contributions
kin+0.015
oys+0.014
fs+0.014
 compartment+0.013
 box+0.013
Neg logit contributions
please-0.027
 gor-0.024
 chickens-0.023
 ge-0.023
 North-0.023
Token sweets
Token ID42402
Feature activation-1.52e+00

Pos logit contributions
please+0.008
 chickens+0.007
 gor+0.007
 ge+0.007
 pigs+0.007
Neg logit contributions
oys-0.005
fs-0.005
 compartment-0.005
 globe-0.005
 box-0.005
Token with
Token ID351
Feature activation+1.49e-01

Pos logit contributions
please+0.054
 gor+0.050
 chickens+0.050
 ge+0.048
 pigs+0.047
Neg logit contributions
oys-0.036
fs-0.034
 compartment-0.033
 globe-0.032
kin-0.032
Token his
Token ID465
Feature activation+8.80e-01

Pos logit contributions
oys+0.005
fs+0.004
 compartment+0.004
kin+0.004
 globe+0.004
Neg logit contributions
please-0.004
 gor-0.004
 chickens-0.004
 ge-0.004
 pigs-0.003
Token fingers
Token ID9353
Feature activation-1.21e+00

Pos logit contributions
oys+0.022
fs+0.021
 compartment+0.020
 globe+0.019
 handshake+0.019
Neg logit contributions
please-0.030
 gor-0.028
 chickens-0.028
 ge-0.027
 pigs-0.026
Token and
Token ID290
Feature activation-3.41e-01

Pos logit contributions
please+0.041
 gor+0.038
 chickens+0.037
 ge+0.036
 pigs+0.035
Neg logit contributions
oys-0.031
fs-0.029
 compartment-0.028
 globe-0.027
 handshake-0.027
Token tasted
Token ID29187
Feature activation-4.23e+00

Pos logit contributions
please+0.011
 gor+0.010
 chickens+0.010
 ge+0.010
 North+0.009
Neg logit contributions
oys-0.009
fs-0.009
 compartment-0.008
 handshake-0.008
kin-0.008
Token each
Token ID1123
Feature activation+3.12e-01

Pos logit contributions
please+0.200
 gor+0.177
 North+0.176
 chickens+0.176
 colony+0.176
Neg logit contributions
 with-0.036
fs-0.034
 present-0.031
 note-0.029
 box-0.029
Token one
Token ID530
Feature activation-2.14e-01

Pos logit contributions
oys+0.010
fs+0.009
 compartment+0.009
 handshake+0.009
 globe+0.009
Neg logit contributions
please-0.009
 gor-0.008
 chickens-0.008
 ge-0.008
 bored-0.007
Token until
Token ID1566
Feature activation-2.44e+00

Pos logit contributions
please+0.006
 gor+0.005
 chickens+0.005
 ge+0.005
 bored+0.005
Neg logit contributions
oys-0.007
fs-0.007
 handshake-0.006
 globe-0.006
kin-0.006
Token he
Token ID339
Feature activation-1.86e+00

Pos logit contributions
please+0.101
 gor+0.094
 chickens+0.093
 pasture+0.089
 North+0.089
Neg logit contributions
kin-0.041
oys-0.041
fs-0.039
 compartment-0.037
 box-0.036
Token finally
Token ID3443
Feature activation-1.58e+00

Pos logit contributions
please+0.066
 chickens+0.065
 gor+0.065
 pigs+0.062
 North+0.061
Neg logit contributions
kin-0.040
oys-0.038
fs-0.038
 compartment-0.037
lf-0.036
Token sad
Token ID6507
Feature activation+3.33e-02

Pos logit contributions
oys+0.021
kin+0.020
fs+0.020
 handshake+0.019
 note+0.019
Neg logit contributions
please-0.018
 gor-0.017
 chickens-0.017
 North-0.016
 ge-0.016
Token and
Token ID290
Feature activation+6.68e-01

Pos logit contributions
oys+0.001
 compartment+0.001
fs+0.001
 box+0.001
 globe+0.001
Neg logit contributions
please-0.001
 gor-0.001
 chickens-0.001
 ge-0.001
 pigs-0.001
Token disappointed
Token ID11679
Feature activation-4.82e-01

Pos logit contributions
oys+0.017
kin+0.017
fs+0.017
lf+0.017
 box+0.016
Neg logit contributions
please-0.021
 chickens-0.019
 gor-0.019
 pasture-0.018
 pigs-0.018
Token.
Token ID13
Feature activation+3.14e-01

Pos logit contributions
please+0.019
 chickens+0.017
 gor+0.017
 ge+0.016
 North+0.016
Neg logit contributions
oys-0.011
fs-0.010
 compartment-0.009
kin-0.009
 globe-0.009
Token No
Token ID1400
Feature activation-1.13e+00

Pos logit contributions
oys+0.008
 compartment+0.007
fs+0.007
 globe+0.007
 handshake+0.006
Neg logit contributions
please-0.011
 chickens-0.010
 gor-0.010
 ge-0.010
 pigs-0.009
Token matter
Token ID2300
Feature activation+3.09e+00

Pos logit contributions
please+0.030
 chickens+0.027
 gor+0.027
 ge+0.026
 pigs+0.024
Neg logit contributions
oys-0.038
fs-0.036
 globe-0.035
kin-0.035
 compartment-0.035
Token how
Token ID703
Feature activation+3.19e-01

Pos logit contributions
oys+0.097
fs+0.094
kin+0.091
 handshake+0.090
 compartment+0.090
Neg logit contributions
please-0.079
 gor-0.078
 chickens-0.077
 pigs-0.072
 ge-0.071
Token hard
Token ID1327
Feature activation-1.42e+00

Pos logit contributions
oys+0.008
fs+0.008
kin+0.007
 box+0.007
 globe+0.007
Neg logit contributions
please-0.011
 gor-0.010
 chickens-0.010
 ge-0.010
 North-0.010
Token he
Token ID339
Feature activation-5.12e-01

Pos logit contributions
please+0.060
 gor+0.055
 chickens+0.053
 pasture+0.052
 North+0.052
Neg logit contributions
oys-0.024
fs-0.022
kin-0.022
lf-0.020
 globe-0.020
Token worked
Token ID3111
Feature activation-2.03e-01

Pos logit contributions
please+0.016
 gor+0.015
 chickens+0.015
 ge+0.014
 pigs+0.014
Neg logit contributions
oys-0.014
fs-0.013
 compartment-0.013
 globe-0.013
kin-0.013
Token,
Token ID11
Feature activation-7.76e-01

Pos logit contributions
please+0.007
 gor+0.006
 chickens+0.006
 ge+0.006
 bored+0.006
Neg logit contributions
oys-0.005
fs-0.005
 compartment-0.004
 handshake-0.004
 globe-0.004
Token her
Token ID607
Feature activation+4.32e-01

Pos logit contributions
 gor+0.003
 chickens+0.003
 bored+0.002
 North+0.002
 pigs+0.002
Neg logit contributions
oys-0.006
fs-0.006
cher-0.006
 handshake-0.006
ouse-0.006
Token backpack
Token ID22526
Feature activation+1.19e-01

Pos logit contributions
oys+0.014
fs+0.013
eway+0.012
cher+0.012
 handshake+0.012
Neg logit contributions
 bored-0.012
 lonely-0.012
 chickens-0.011
please-0.011
 miserable-0.011
Token but
Token ID475
Feature activation+1.46e-01

Pos logit contributions
oys+0.003
fs+0.003
 globe+0.003
 handshake+0.003
kin+0.003
Neg logit contributions
please-0.003
 chickens-0.003
 gor-0.003
 bored-0.003
 ge-0.003
Token it
Token ID340
Feature activation+1.23e-01

Pos logit contributions
oys+0.005
 handshake+0.005
fs+0.005
 globe+0.004
 compartment+0.004
Neg logit contributions
please-0.003
 bored-0.003
 chickens-0.003
 gor-0.003
 starting-0.003
Token was
Token ID373
Feature activation+5.46e-01

Pos logit contributions
oys+0.003
 handshake+0.003
fs+0.003
 globe+0.003
 compartment+0.003
Neg logit contributions
please-0.004
 bored-0.004
 chickens-0.004
 gor-0.004
 ge-0.003
Token nowhere
Token ID12062
Feature activation+3.36e+00

Pos logit contributions
oys+0.011
fs+0.010
kin+0.010
 compartment+0.010
 box+0.009
Neg logit contributions
please-0.021
 gor-0.020
 chickens-0.020
 ge-0.020
 pigs-0.019
Token to
Token ID284
Feature activation+1.28e+00

Pos logit contributions
fs+0.140
oys+0.132
kin+0.130
bes+0.126
 globe+0.124
Neg logit contributions
please-0.065
 bored-0.052
 chickens-0.051
 going-0.048
 gor-0.047
Token be
Token ID307
Feature activation+5.42e-01

Pos logit contributions
oys+0.037
fs+0.036
 globe+0.034
 compartment+0.033
 handshake+0.032
Neg logit contributions
please-0.039
 gor-0.035
 ge-0.033
 chickens-0.033
 North-0.033
Token found
Token ID1043
Feature activation-8.07e-02

Pos logit contributions
oys+0.015
fs+0.014
 globe+0.014
 compartment+0.014
 handshake+0.013
Neg logit contributions
please-0.017
 chickens-0.016
 gor-0.015
 ge-0.015
 bored-0.015
Token.
Token ID13
Feature activation-1.79e-01

Pos logit contributions
please+0.002
 gor+0.002
 chickens+0.002
 ge+0.002
 pigs+0.002
Neg logit contributions
oys-0.002
fs-0.002
 compartment-0.002
kin-0.002
 globe-0.002
Token
Token ID198
Feature activation-2.22e-01

Pos logit contributions
please+0.005
 gor+0.005
 chickens+0.005
 ge+0.005
 bored+0.005
Neg logit contributions
oys-0.005
fs-0.005
 globe-0.005
kin-0.005
 handshake-0.005
Token says
Token ID1139
Feature activation+1.01e+00

Pos logit contributions
oys+0.016
fs+0.015
 compartment+0.014
 handshake+0.014
 globe+0.014
Neg logit contributions
please-0.019
 gor-0.018
 chickens-0.017
 ge-0.017
 pigs-0.016
Token this
Token ID428
Feature activation+1.71e-01

Pos logit contributions
oys+0.019
fs+0.018
 handshake+0.016
kin+0.016
 globe+0.016
Neg logit contributions
please-0.041
 gor-0.038
 pasture-0.037
 chickens-0.037
 North-0.037
Token is
Token ID318
Feature activation+3.43e-01

Pos logit contributions
oys+0.004
fs+0.004
kin+0.004
 compartment+0.004
 globe+0.004
Neg logit contributions
please-0.006
 chickens-0.006
 gor-0.006
 ge-0.005
 North-0.005
Token a
Token ID257
Feature activation+1.41e+00

Pos logit contributions
oys+0.008
fs+0.007
kin+0.007
 compartment+0.007
 handshake+0.007
Neg logit contributions
please-0.012
 chickens-0.011
 gor-0.011
 ge-0.011
 pigs-0.011
Token fun
Token ID1257
Feature activation+2.61e+00

Pos logit contributions
oys+0.042
fs+0.041
 compartment+0.038
kin+0.038
 handshake+0.038
Neg logit contributions
please-0.041
 gor-0.037
 chickens-0.037
 ge-0.036
 North-0.034
Token way
Token ID835
Feature activation+3.15e+00

Pos logit contributions
fs+0.077
cher+0.069
kin+0.067
oys+0.067
bes+0.066
Neg logit contributions
please-0.069
 gor-0.065
 North-0.063
 ge-0.063
 going-0.063
Token to
Token ID284
Feature activation+3.09e-01

Pos logit contributions
oys+0.099
fs+0.099
kin+0.098
 compartment+0.095
 globe+0.094
Neg logit contributions
please-0.079
 chickens-0.074
 gor-0.073
 ge-0.070
 bored-0.069
Token play
Token ID711
Feature activation+1.30e+00

Pos logit contributions
oys+0.007
fs+0.007
 compartment+0.007
 handshake+0.007
 globe+0.006
Neg logit contributions
please-0.011
 chickens-0.010
 gor-0.010
 ge-0.010
 pigs-0.009
Token with
Token ID351
Feature activation+1.54e+00

Pos logit contributions
oys+0.025
fs+0.022
 compartment+0.020
 handshake+0.020
 globe+0.020
Neg logit contributions
please-0.051
 chickens-0.049
 gor-0.048
 ge-0.047
 bored-0.046
Token the
Token ID262
Feature activation+1.47e+00

Pos logit contributions
oys+0.042
fs+0.040
 compartment+0.038
kin+0.038
 globe+0.038
Neg logit contributions
please-0.050
 gor-0.045
 chickens-0.044
 ge-0.043
 North-0.042
Token red
Token ID2266
Feature activation+2.06e+00

Pos logit contributions
oys+0.042
fs+0.040
 compartment+0.038
kin+0.038
 handshake+0.038
Neg logit contributions
please-0.045
 gor-0.042
 chickens-0.042
 ge-0.040
 pigs-0.039
Token,
Token ID11
Feature activation+6.17e-01

Pos logit contributions
oys+0.026
fs+0.025
 handshake+0.024
 compartment+0.024
 globe+0.024
Neg logit contributions
please-0.023
 chickens-0.023
 gor-0.022
 bored-0.022
 pigs-0.021
Token Lily
Token ID20037
Feature activation+3.75e-01

Pos logit contributions
oys+0.013
fs+0.012
kin+0.011
 compartment+0.011
 globe+0.011
Neg logit contributions
please-0.024
 gor-0.022
 chickens-0.022
 ge-0.021
 North-0.021
Token and
Token ID290
Feature activation+8.49e-01

Pos logit contributions
oys+0.008
 box+0.008
lf+0.008
 costume+0.008
kin+0.007
Neg logit contributions
 gor-0.013
 ge-0.012
 pigs-0.012
 chickens-0.012
please-0.012
Token Emma
Token ID18966
Feature activation-1.04e+00

Pos logit contributions
oys+0.015
fs+0.014
 compartment+0.013
 globe+0.013
kin+0.012
Neg logit contributions
please-0.035
 gor-0.033
 chickens-0.033
 ge-0.033
 pigs-0.031
Token shared
Token ID4888
Feature activation-8.36e-01

Pos logit contributions
 gor+0.033
please+0.032
 ge+0.031
 chickens+0.031
 pasture+0.030
Neg logit contributions
 globe-0.024
 box-0.024
oys-0.023
kin-0.022
 costume-0.022
Token their
Token ID511
Feature activation+3.20e+00

Pos logit contributions
please+0.023
 chickens+0.022
 gor+0.022
 ge+0.021
 pigs+0.020
Neg logit contributions
oys-0.026
fs-0.025
 compartment-0.024
kin-0.024
 globe-0.024
Token toys
Token ID14958
Feature activation+7.73e-01

Pos logit contributions
oys+0.090
fs+0.086
 compartment+0.083
kin+0.081
 handshake+0.079
Neg logit contributions
 gor-0.097
please-0.097
 chickens-0.095
 ge-0.091
 pigs-0.090
Token and
Token ID290
Feature activation+5.00e-01

Pos logit contributions
oys+0.023
fs+0.022
 compartment+0.022
cher+0.020
 torch+0.020
Neg logit contributions
please-0.021
 chickens-0.020
 gor-0.020
 ge-0.019
 bored-0.019
Token candy
Token ID18550
Feature activation-1.64e+00

Pos logit contributions
oys+0.015
fs+0.015
 compartment+0.014
kin+0.014
 globe+0.014
Neg logit contributions
please-0.014
 gor-0.013
 chickens-0.013
 ge-0.012
 pigs-0.012
Token together
Token ID1978
Feature activation-3.28e-01

Pos logit contributions
please+0.065
 gor+0.061
 chickens+0.061
 ge+0.059
 pigs+0.057
Neg logit contributions
oys-0.032
fs-0.029
 compartment-0.028
 handshake-0.027
 globe-0.027
Token.
Token ID13
Feature activation+1.09e+00

Pos logit contributions
please+0.013
 North+0.012
 gor+0.012
 ge+0.012
 chickens+0.012
Neg logit contributions
 globe-0.006
kin-0.006
fs-0.006
oys-0.006
f-0.005
Token to
Token ID284
Feature activation+3.57e-01

Pos logit contributions
oys+0.029
fs+0.028
 compartment+0.027
 handshake+0.027
 globe+0.026
Neg logit contributions
please-0.035
 gor-0.032
 chickens-0.032
 ge-0.031
 bored-0.030
Token replace
Token ID6330
Feature activation+6.25e-01

Pos logit contributions
oys+0.009
fs+0.008
 handshake+0.008
 compartment+0.008
kin+0.008
Neg logit contributions
please-0.012
 gor-0.012
 chickens-0.011
 ge-0.011
 pigs-0.011
Token it
Token ID340
Feature activation+5.56e-01

Pos logit contributions
oys+0.017
fs+0.016
kin+0.016
 compartment+0.016
 handshake+0.015
Neg logit contributions
please-0.020
 chickens-0.018
 gor-0.018
 ge-0.018
 pigs-0.017
Token with
Token ID351
Feature activation+9.12e-01

Pos logit contributions
oys+0.014
fs+0.013
 compartment+0.012
 handshake+0.012
 globe+0.012
Neg logit contributions
please-0.019
 chickens-0.017
 gor-0.017
 ge-0.017
 pigs-0.016
Token a
Token ID257
Feature activation+1.38e+00

Pos logit contributions
oys+0.023
fs+0.022
 compartment+0.021
 globe+0.021
 handshake+0.021
Neg logit contributions
please-0.031
 gor-0.028
 chickens-0.028
 ge-0.027
 pigs-0.026
Token new
Token ID649
Feature activation+3.04e+00

Pos logit contributions
oys+0.032
fs+0.030
 compartment+0.029
 globe+0.029
 handshake+0.028
Neg logit contributions
please-0.049
 gor-0.046
 chickens-0.046
 ge-0.044
 pigs-0.043
Token one
Token ID530
Feature activation+9.86e-01

Pos logit contributions
oys+0.066
fs+0.063
kin+0.057
 compartment+0.055
 handshake+0.054
Neg logit contributions
please-0.112
 chickens-0.107
 gor-0.106
 ge-0.102
 pigs-0.101
Token."
Token ID526
Feature activation+8.22e-02

Pos logit contributions
oys+0.031
fs+0.030
 compartment+0.029
 globe+0.029
kin+0.029
Neg logit contributions
please-0.027
 gor-0.025
 chickens-0.024
 ge-0.024
 pigs-0.022
Token
Token ID220
Feature activation-1.41e+00

Pos logit contributions
oys+0.002
fs+0.002
 compartment+0.002
 globe+0.002
 handshake+0.002
Neg logit contributions
please-0.003
 gor-0.003
 chickens-0.003
 ge-0.003
 North-0.002
Token
Token ID198
Feature activation+4.33e-02

Pos logit contributions
please+0.047
 gor+0.044
 chickens+0.044
 ge+0.042
 pigs+0.041
Neg logit contributions
oys-0.036
fs-0.034
 compartment-0.032
 globe-0.032
 handshake-0.032
Token
Token ID198
Feature activation+5.60e-01

Pos logit contributions
oys+0.001
fs+0.001
 compartment+0.001
 globe+0.001
kin+0.001
Neg logit contributions
please-0.001
 chickens-0.001
 gor-0.001
 ge-0.001
 pigs-0.001
Token car
Token ID1097
Feature activation+8.77e-01

Pos logit contributions
oys+0.022
fs+0.021
kin+0.020
 compartment+0.019
 handshake+0.019
Neg logit contributions
please-0.035
 chickens-0.034
 gor-0.034
 ge-0.033
 pigs-0.032
Token practiced
Token ID19893
Feature activation-3.58e-01

Pos logit contributions
oys+0.025
fs+0.024
 compartment+0.023
 handshake+0.023
 globe+0.023
Neg logit contributions
please-0.026
 gor-0.024
 chickens-0.024
 ge-0.023
 pigs-0.022
Token going
Token ID1016
Feature activation+4.01e-03

Pos logit contributions
please+0.012
 chickens+0.011
 gor+0.011
 ge+0.011
 pigs+0.010
Neg logit contributions
oys-0.009
fs-0.009
 globe-0.009
 handshake-0.009
 compartment-0.008
Token fast
Token ID3049
Feature activation+6.82e-01

Pos logit contributions
oys+0.000
fs+0.000
 globe+0.000
 handshake+0.000
 compartment+0.000
Neg logit contributions
please-0.000
 chickens-0.000
 gor-0.000
 ge-0.000
 pigs-0.000
Token together
Token ID1978
Feature activation-2.72e-01

Pos logit contributions
oys+0.013
fs+0.012
 box+0.012
 globe+0.012
 handshake+0.012
Neg logit contributions
please-0.027
 ge-0.025
 chickens-0.024
 gor-0.024
 pigs-0.023
Token.
Token ID13
Feature activation+5.33e-01

Pos logit contributions
please+0.010
 chickens+0.009
 ge+0.009
 gor+0.009
 pigs+0.009
Neg logit contributions
oys-0.006
fs-0.006
 globe-0.006
 compartment-0.006
 handshake-0.006
Token They
Token ID1119
Feature activation+4.74e-01

Pos logit contributions
oys+0.012
 globe+0.010
 compartment+0.010
 handshake+0.010
 Kate+0.010
Neg logit contributions
please-0.020
 ge-0.018
 chickens-0.018
 gor-0.018
eling-0.017
Token raced
Token ID32897
Feature activation+1.27e+00

Pos logit contributions
oys+0.012
fs+0.012
 globe+0.011
kin+0.011
 compartment+0.011
Neg logit contributions
please-0.015
 chickens-0.014
 gor-0.014
 ge-0.014
 pigs-0.013
Token and
Token ID290
Feature activation+1.38e-01

Pos logit contributions
oys+0.040
fs+0.039
kin+0.039
 compartment+0.037
 handshake+0.036
Neg logit contributions
please-0.034
 gor-0.032
 chickens-0.031
 ge-0.030
 bored-0.029
Token raced
Token ID32897
Feature activation+8.44e-01

Pos logit contributions
oys+0.004
fs+0.004
 compartment+0.004
kin+0.004
 handshake+0.004
Neg logit contributions
please-0.004
 gor-0.004
 chickens-0.004
 ge-0.004
 bored-0.003
Token until
Token ID1566
Feature activation-1.89e+00

Pos logit contributions
oys+0.026
fs+0.025
kin+0.024
 compartment+0.024
 handshake+0.023
Neg logit contributions
please-0.024
 gor-0.022
 chickens-0.022
 ge-0.021
 bored-0.020
Token soon
Token ID2582
Feature activation-9.87e-01

Pos logit contributions
please+0.024
 gor+0.023
 ge+0.022
 chickens+0.022
 North+0.021
Neg logit contributions
oys-0.017
fs-0.016
kin-0.016
 compartment-0.015
 box-0.015
Token.
Token ID13
Feature activation-8.43e-01

Pos logit contributions
please+0.040
 gor+0.037
 ge+0.036
 North+0.035
 chickens+0.035
Neg logit contributions
oys-0.020
fs-0.018
kin-0.018
 box-0.017
 compartment-0.017
Token Sara
Token ID24799
Feature activation+2.68e-01

Pos logit contributions
please+0.031
 chickens+0.028
 harming+0.027
 ge+0.027
 gor+0.027
Neg logit contributions
oys-0.019
 Kate-0.017
 Liz-0.016
 handshake-0.016
 locker-0.016
Token had
Token ID550
Feature activation+5.35e-01

Pos logit contributions
oys+0.005
 handshake+0.005
 globe+0.005
kin+0.005
 compartment+0.005
Neg logit contributions
please-0.010
 gor-0.010
 ge-0.009
 chickens-0.009
 North-0.009
Token a
Token ID257
Feature activation+9.33e-01

Pos logit contributions
oys+0.007
 handshake+0.007
 card+0.007
box+0.007
 box+0.007
Neg logit contributions
please-0.022
 gor-0.021
 ge-0.021
 chickens-0.020
eling-0.020
Token beautiful
Token ID4950
Feature activation+6.93e-01

Pos logit contributions
oys+0.026
fs+0.024
 compartment+0.023
 globe+0.023
kin+0.023
Neg logit contributions
please-0.030
 chickens-0.027
 gor-0.027
 ge-0.026
 pigs-0.025
Token white
Token ID2330
Feature activation+1.24e+00

Pos logit contributions
oys+0.020
fs+0.019
 compartment+0.017
lock+0.017
cher+0.016
Neg logit contributions
please-0.021
 chickens-0.021
 gor-0.020
 ge-0.019
 bored-0.019
Token veil
Token ID30615
Feature activation+2.86e-01

Pos logit contributions
oys+0.032
fs+0.032
chery+0.028
 compartment+0.028
cher+0.027
Neg logit contributions
 chickens-0.040
please-0.038
 gor-0.037
 pigs-0.037
 ge-0.035
Token that
Token ID326
Feature activation-1.33e-01

Pos logit contributions
oys+0.007
fs+0.007
 compartment+0.006
 globe+0.006
kin+0.006
Neg logit contributions
please-0.010
 gor-0.009
 chickens-0.009
 ge-0.009
 pigs-0.009
Token she
Token ID673
Feature activation-1.23e+00

Pos logit contributions
please+0.004
 gor+0.004
 chickens+0.004
 ge+0.004
 pigs+0.004
Neg logit contributions
oys-0.004
fs-0.003
 compartment-0.003
kin-0.003
 globe-0.003
Token would
Token ID561
Feature activation-2.55e-01

Pos logit contributions
please+0.036
 gor+0.035
 chickens+0.034
 ge+0.034
 pigs+0.032
Neg logit contributions
oys-0.034
fs-0.033
kin-0.032
 globe-0.031
 handshake-0.031
Token a
Token ID257
Feature activation+3.84e-01

Pos logit contributions
please+0.005
 chickens+0.005
 gor+0.004
 ge+0.004
 pigs+0.004
Neg logit contributions
oys-0.005
fs-0.005
 compartment-0.004
 globe-0.004
 handshake-0.004
Token disappointed
Token ID11679
Feature activation-9.66e-01

Pos logit contributions
oys+0.013
fs+0.012
bes+0.011
kin+0.011
 compartment+0.011
Neg logit contributions
please-0.010
 chickens-0.009
 ge-0.009
 bored-0.009
 gor-0.009
Token look
Token ID804
Feature activation-1.18e+00

Pos logit contributions
 chickens+0.026
 ge+0.026
 gor+0.026
please+0.026
 lonely+0.024
Neg logit contributions
fs-0.029
oys-0.027
pack-0.025
bes-0.025
 globe-0.025
Token,
Token ID11
Feature activation+6.50e-02

Pos logit contributions
please+0.039
 chickens+0.037
 gor+0.036
 ge+0.035
 pigs+0.034
Neg logit contributions
oys-0.030
fs-0.028
 handshake-0.027
 compartment-0.027
 globe-0.027
Token she
Token ID673
Feature activation-4.17e-01

Pos logit contributions
oys+0.001
fs+0.001
 compartment+0.001
 globe+0.001
kin+0.001
Neg logit contributions
please-0.002
 gor-0.002
 chickens-0.002
 ge-0.002
 North-0.002
Token said
Token ID531
Feature activation+5.84e-01

Pos logit contributions
please+0.014
 chickens+0.012
 gor+0.012
 ge+0.012
 pigs+0.012
Neg logit contributions
oys-0.011
fs-0.011
 compartment-0.010
 globe-0.010
 handshake-0.010
Token,
Token ID11
Feature activation-6.33e-02

Pos logit contributions
oys+0.015
fs+0.015
 compartment+0.014
 globe+0.014
kin+0.014
Neg logit contributions
please-0.019
 chickens-0.018
 gor-0.018
 ge-0.017
 bored-0.017
Token "
Token ID366
Feature activation-7.21e-01

Pos logit contributions
please+0.002
 gor+0.002
 chickens+0.002
 ge+0.002
 North+0.002
Neg logit contributions
oys-0.001
 globe-0.001
fs-0.001
 compartment-0.001
lock-0.001
TokenI
Token ID40
Feature activation-8.97e-01

Pos logit contributions
please+0.019
 chickens+0.016
 ge+0.016
 gor+0.015
 North+0.014
Neg logit contributions
fs-0.025
oys-0.024
kin-0.024
 compartment-0.024
 handshake-0.024
Token told
Token ID1297
Feature activation+3.06e-01

Pos logit contributions
please+0.026
 gor+0.023
 chickens+0.023
 ge+0.022
 pigs+0.022
Neg logit contributions
oys-0.027
fs-0.026
 compartment-0.025
 globe-0.025
 handshake-0.025
Token you
Token ID345
Feature activation+1.71e-01

Pos logit contributions
oys+0.007
fs+0.006
 compartment+0.006
 handshake+0.006
 globe+0.006
Neg logit contributions
please-0.012
 gor-0.011
 chickens-0.011
 ge-0.010
 pigs-0.010
Tokenill
Token ID359
Feature activation+2.16e-01

Pos logit contributions
fs+0.024
 globe+0.023
oys+0.023
 compartment+0.023
 handshake+0.023
Neg logit contributions
please-0.025
 gor-0.023
 ge-0.023
 chickens-0.023
 bored-0.021
Token heard
Token ID2982
Feature activation-5.36e-01

Pos logit contributions
oys+0.005
fs+0.004
 compartment+0.004
lf+0.004
 box+0.004
Neg logit contributions
please-0.008
 gor-0.007
 chickens-0.007
 ge-0.007
 North-0.007
Token Tom
Token ID4186
Feature activation+8.85e-01

Pos logit contributions
please+0.013
 chickens+0.012
 gor+0.012
 ge+0.012
 bored+0.011
Neg logit contributions
oys-0.018
fs-0.018
kin-0.017
 compartment-0.017
 globe-0.017
Token's
Token ID338
Feature activation-9.46e-02

Pos logit contributions
oys+0.025
fs+0.024
kin+0.024
 compartment+0.023
 handshake+0.023
Neg logit contributions
please-0.027
 chickens-0.025
 gor-0.025
 ge-0.024
 bored-0.023
Token cry
Token ID3960
Feature activation-3.41e-01

Pos logit contributions
please+0.003
 bored+0.003
 chickens+0.002
 gor+0.002
 ge+0.002
Neg logit contributions
fs-0.003
oys-0.003
kin-0.003
 compartment-0.003
 globe-0.003
Token and
Token ID290
Feature activation+1.30e+00

Pos logit contributions
please+0.013
 gor+0.012
 ge+0.012
 chickens+0.012
 North+0.011
Neg logit contributions
oys-0.008
fs-0.007
 compartment-0.006
 box-0.006
 globe-0.006
Token ran
Token ID4966
Feature activation-1.82e-01

Pos logit contributions
oys+0.037
fs+0.036
 compartment+0.034
 handshake+0.034
 globe+0.034
Neg logit contributions
please-0.039
 gor-0.036
 chickens-0.036
 ge-0.035
 pigs-0.033
Token to
Token ID284
Feature activation-3.74e-01

Pos logit contributions
please+0.004
 gor+0.004
 chickens+0.004
 bored+0.004
 ge+0.004
Neg logit contributions
kin-0.006
fs-0.006
 globe-0.006
oys-0.006
lf-0.006
Token help
Token ID1037
Feature activation-6.11e-01

Pos logit contributions
please+0.006
 gor+0.006
 ge+0.006
 chickens+0.005
 bored+0.005
Neg logit contributions
fs-0.016
kin-0.016
oys-0.015
ouse-0.015
 globe-0.015
Token him
Token ID683
Feature activation-4.72e-02

Pos logit contributions
please+0.011
 gor+0.009
 ge+0.009
 chickens+0.009
 bored+0.008
Neg logit contributions
fs-0.024
oys-0.024
 globe-0.023
 compartment-0.023
kin-0.023
Token.
Token ID13
Feature activation+5.94e-02

Pos logit contributions
please+0.001
 bored+0.001
 gor+0.001
 chickens+0.001
 ge+0.001
Neg logit contributions
oys-0.002
fs-0.002
kin-0.002
 globe-0.002
 compartment-0.002
Token He
Token ID679
Feature activation+9.99e-01

Pos logit contributions
oys+0.011
 compartment+0.011
fs+0.010
 handshake+0.010
 globe+0.010
Neg logit contributions
please-0.017
 chickens-0.015
 gor-0.015
 ge-0.014
 North-0.014
Token told
Token ID1297
Feature activation+3.72e-01

Pos logit contributions
oys+0.023
fs+0.021
 compartment+0.020
 handshake+0.020
 globe+0.020
Neg logit contributions
please-0.036
 chickens-0.034
 gor-0.034
 ge-0.032
 pigs-0.031
Token Jill
Token ID22772
Feature activation+5.31e-01

Pos logit contributions
oys+0.013
fs+0.012
 compartment+0.012
kin+0.012
 globe+0.012
Neg logit contributions
please-0.009
 gor-0.009
 chickens-0.008
 ge-0.008
 pigs-0.008
Token it
Token ID340
Feature activation-2.17e-01

Pos logit contributions
oys+0.015
fs+0.014
 compartment+0.014
 handshake+0.013
 globe+0.013
Neg logit contributions
please-0.016
 chickens-0.015
 gor-0.015
 ge-0.015
 bored-0.014
Token was
Token ID373
Feature activation-1.46e-01

Pos logit contributions
please+0.008
 gor+0.007
 chickens+0.007
 ge+0.007
 pigs+0.007
Neg logit contributions
oys-0.005
fs-0.004
 compartment-0.004
kin-0.004
 globe-0.004
Token a
Token ID257
Feature activation+3.28e-01

Pos logit contributions
please+0.005
 gor+0.005
 chickens+0.005
 ge+0.004
 North+0.004
Neg logit contributions
oys-0.004
fs-0.003
kin-0.003
 compartment-0.003
 handshake-0.003
Token special
Token ID2041
Feature activation+3.42e-01

Pos logit contributions
oys+0.008
fs+0.008
 compartment+0.007
 globe+0.007
 handshake+0.007
Neg logit contributions
please-0.011
 gor-0.010
 chickens-0.010
 ge-0.010
 pigs-0.010
Token dinosaur
Token ID28257
Feature activation-2.50e-01

Pos logit contributions
fs+0.008
oys+0.008
bes+0.007
kin+0.007
chery+0.006
Neg logit contributions
please-0.012
 chickens-0.011
 ge-0.011
 gor-0.011
 North-0.011
Token.
Token ID13
Feature activation-4.97e-01

Pos logit contributions
please+0.007
 chickens+0.007
 gor+0.007
 ge+0.007
 pigs+0.006
Neg logit contributions
oys-0.007
fs-0.007
 compartment-0.006
 handshake-0.006
 globe-0.006
Token He
Token ID679
Feature activation+7.88e-01

Pos logit contributions
please+0.020
 chickens+0.019
 gor+0.019
 pigs+0.018
 ge+0.018
Neg logit contributions
oys-0.010
 compartment-0.008
 handshake-0.008
fs-0.008
kin-0.008
Token said
Token ID531
Feature activation+3.93e-01

Pos logit contributions
oys+0.020
fs+0.019
 globe+0.018
 compartment+0.018
kin+0.018
Neg logit contributions
please-0.026
 chickens-0.024
 gor-0.024
 ge-0.024
 pigs-0.023
Token idea
Token ID2126
Feature activation+1.11e+00

Pos logit contributions
please+0.022
 gor+0.020
 chickens+0.020
 ge+0.019
 pigs+0.018
Neg logit contributions
oys-0.016
 handshake-0.015
fs-0.015
 compartment-0.015
kin-0.014
Token what
Token ID644
Feature activation-5.42e-01

Pos logit contributions
oys+0.029
fs+0.027
 compartment+0.026
 globe+0.026
 handshake+0.026
Neg logit contributions
please-0.037
 gor-0.034
 chickens-0.034
 ge-0.033
 pigs-0.032
Token it
Token ID340
Feature activation-1.23e+00

Pos logit contributions
please+0.017
 chickens+0.016
 gor+0.016
 ge+0.015
 pigs+0.015
Neg logit contributions
oys-0.015
fs-0.014
 compartment-0.014
 globe-0.013
kin-0.013
Token was
Token ID373
Feature activation-2.74e-01

Pos logit contributions
please+0.049
 gor+0.046
 chickens+0.046
 ge+0.044
 North+0.044
Neg logit contributions
oys-0.022
fs-0.021
 compartment-0.020
 globe-0.020
 handshake-0.020
Token.
Token ID13
Feature activation+6.05e-01

Pos logit contributions
please+0.010
 gor+0.009
 chickens+0.009
 North+0.009
 ge+0.009
Neg logit contributions
oys-0.006
 compartment-0.006
 handshake-0.006
 box-0.005
fs-0.005
Token The
Token ID383
Feature activation-1.46e+00

Pos logit contributions
oys+0.017
fs+0.016
 compartment+0.016
 handshake+0.016
 globe+0.016
Neg logit contributions
please-0.019
 gor-0.017
 chickens-0.017
 ge-0.016
 North-0.016
Token machine
Token ID4572
Feature activation+7.07e-01

Pos logit contributions
please+0.045
 gor+0.041
 chickens+0.040
 ge+0.039
 pigs+0.037
Neg logit contributions
oys-0.042
fs-0.040
 compartment-0.038
 globe-0.038
kin-0.038
Token suddenly
Token ID6451
Feature activation-5.89e-01

Pos logit contributions
oys+0.020
fs+0.019
 compartment+0.018
 handshake+0.018
 globe+0.018
Neg logit contributions
please-0.021
 gor-0.020
 chickens-0.020
 ge-0.019
 pigs-0.018
Token replaced
Token ID6928
Feature activation+2.91e-01

Pos logit contributions
please+0.019
 gor+0.017
 chickens+0.016
 ge+0.015
 North+0.015
Neg logit contributions
oys-0.016
 compartment-0.016
fs-0.015
 handshake-0.015
 box-0.015
Token the
Token ID262
Feature activation+6.71e-01

Pos logit contributions
oys+0.008
fs+0.008
 compartment+0.007
 globe+0.007
 handshake+0.007
Neg logit contributions
please-0.009
 gor-0.008
 chickens-0.008
 ge-0.008
 pigs-0.008
Token blue
Token ID4171
Feature activation+1.55e+00

Pos logit contributions
oys+0.017
fs+0.016
 compartment+0.016
 globe+0.016
kin+0.015
Neg logit contributions
please-0.022
 gor-0.021
 chickens-0.020
 ge-0.020
 pigs-0.019
Token animal
Token ID5044
Feature activation-7.51e-02

Pos logit contributions
oys+0.011
fs+0.010
 compartment+0.010
 globe+0.010
 handshake+0.010
Neg logit contributions
please-0.013
 gor-0.012
 chickens-0.012
 ge-0.011
 bored-0.011
Token trying
Token ID2111
Feature activation-2.42e-02

Pos logit contributions
please+0.003
 gor+0.002
 chickens+0.002
 ge+0.002
 North+0.002
Neg logit contributions
oys-0.002
fs-0.002
 compartment-0.002
 globe-0.002
 handshake-0.002
Token to
Token ID284
Feature activation-1.15e-01

Pos logit contributions
please+0.001
 gor+0.001
 chickens+0.001
 ge+0.001
 pigs+0.001
Neg logit contributions
oys-0.001
fs-0.001
 handshake-0.001
 compartment-0.001
 globe-0.001
Token get
Token ID651
Feature activation-8.03e-01

Pos logit contributions
please+0.004
 gor+0.004
 chickens+0.004
 ge+0.004
 pigs+0.004
Neg logit contributions
oys-0.003
fs-0.003
 handshake-0.002
 compartment-0.002
kin-0.002
Token attention
Token ID3241
Feature activation-1.01e+00

Pos logit contributions
please+0.026
 gor+0.025
 chickens+0.024
 ge+0.023
 North+0.023
Neg logit contributions
oys-0.020
fs-0.019
 handshake-0.019
 compartment-0.018
kin-0.018
Token.
Token ID13
Feature activation+1.79e-01

Pos logit contributions
please+0.036
 gor+0.033
 chickens+0.032
 ge+0.032
 North+0.031
Neg logit contributions
oys-0.024
fs-0.023
 globe-0.021
 compartment-0.021
 handshake-0.021
Token From
Token ID3574
Feature activation+3.52e-02

Pos logit contributions
oys+0.004
 compartment+0.004
fs+0.004
 handshake+0.004
 globe+0.004
Neg logit contributions
please-0.006
 chickens-0.006
 gor-0.006
 ge-0.006
 pigs-0.005
Token then
Token ID788
Feature activation-4.62e-01

Pos logit contributions
oys+0.001
fs+0.001
 compartment+0.001
kin+0.001
 handshake+0.001
Neg logit contributions
please-0.001
 chickens-0.001
 gor-0.001
 ge-0.001
 bored-0.001
Token on
Token ID319
Feature activation-5.18e-03

Pos logit contributions
please+0.015
 chickens+0.014
 gor+0.014
 ge+0.013
 bored+0.013
Neg logit contributions
oys-0.012
fs-0.012
 compartment-0.011
 handshake-0.011
 globe-0.011
Token,
Token ID11
Feature activation-1.05e-01

Pos logit contributions
please+0.000
 gor+0.000
 chickens+0.000
 ge+0.000
 pigs+0.000
Neg logit contributions
oys-0.000
fs-0.000
 compartment-0.000
 handshake-0.000
 globe-0.000
Token the
Token ID262
Feature activation-4.59e-01

Pos logit contributions
please+0.004
 chickens+0.004
 gor+0.003
 pasture+0.003
 pigs+0.003
Neg logit contributions
 globe-0.002
oys-0.002
fs-0.002
kin-0.002
 seek-0.002
Token smile
Token ID8212
Feature activation-1.97e+00

Pos logit contributions
oys+0.014
fs+0.013
 compartment+0.012
lf+0.012
 globe+0.012
Neg logit contributions
please-0.017
 chickens-0.016
 ge-0.015
 gor-0.015
 bored-0.015
Token.
Token ID13
Feature activation-5.35e-01

Pos logit contributions
please+0.074
 gor+0.070
 chickens+0.068
 ge+0.065
 North+0.065
Neg logit contributions
oys-0.041
kin-0.040
 globe-0.039
 handshake-0.038
 box-0.037
Token It
Token ID632
Feature activation+1.30e+00

Pos logit contributions
please+0.020
 gor+0.018
 chickens+0.018
 ge+0.017
 bored+0.016
Neg logit contributions
oys-0.013
 handshake-0.011
 compartment-0.011
fs-0.011
kin-0.011
Token said
Token ID531
Feature activation+2.14e-01

Pos logit contributions
oys+0.023
fs+0.022
 compartment+0.021
 globe+0.021
kin+0.021
Neg logit contributions
please-0.053
 gor-0.049
 chickens-0.049
 ge-0.048
 pigs-0.047
Token,
Token ID11
Feature activation-5.85e-01

Pos logit contributions
oys+0.005
fs+0.004
kin+0.004
 handshake+0.004
 globe+0.004
Neg logit contributions
please-0.008
 gor-0.008
 chickens-0.007
 ge-0.007
 North-0.007
Token "
Token ID366
Feature activation-1.03e+00

Pos logit contributions
please+0.021
 gor+0.020
 ge+0.019
 chickens+0.019
 colony+0.018
Neg logit contributions
oys-0.013
 globe-0.013
fs-0.012
kin-0.012
 handshake-0.011
TokenThank
Token ID10449
Feature activation+4.72e-01

Pos logit contributions
please+0.028
 gor+0.025
 chickens+0.025
 ge+0.024
 pigs+0.023
Neg logit contributions
oys-0.033
fs-0.031
 compartment-0.030
 globe-0.030
 handshake-0.030
Token you
Token ID345
Feature activation+6.44e-01

Pos logit contributions
 compartment+0.017
belt+0.015
 globe+0.015
oys+0.015
 handshake+0.015
Neg logit contributions
please-0.011
 ge-0.010
 chickens-0.010
 gor-0.010
 bees-0.010
Token,
Token ID11
Feature activation+6.84e-02

Pos logit contributions
fs+0.017
oys+0.017
 compartment+0.016
 globe+0.016
kin+0.016
Neg logit contributions
please-0.020
 chickens-0.019
 gor-0.019
 ge-0.018
 pigs-0.018
Token thank
Token ID5875
Feature activation-7.95e-01

Pos logit contributions
oys+0.002
fs+0.002
 compartment+0.002
kin+0.002
 handshake+0.002
Neg logit contributions
please-0.002
 gor-0.002
 chickens-0.002
 ge-0.002
 pigs-0.002
Token you
Token ID345
Feature activation-2.28e-01

Pos logit contributions
please+0.027
 chickens+0.024
 ge+0.024
 gor+0.024
 pigs+0.022
Neg logit contributions
oys-0.020
 compartment-0.020
fs-0.020
 globe-0.019
 handshake-0.019
Token bed
Token ID3996
Feature activation-5.29e-01

Pos logit contributions
oys+0.012
fs+0.012
 compartment+0.011
 handshake+0.011
 globe+0.011
Neg logit contributions
please-0.010
 gor-0.009
 chickens-0.009
 ge-0.009
 pigs-0.009
Token.
Token ID13
Feature activation+4.35e-01

Pos logit contributions
please+0.020
 gor+0.018
 chickens+0.018
 ge+0.018
 pigs+0.017
Neg logit contributions
oys-0.012
fs-0.011
 compartment-0.011
kin-0.011
 globe-0.010
Token
Token ID198
Feature activation+1.57e-01

Pos logit contributions
oys+0.011
 compartment+0.010
fs+0.010
 handshake+0.010
 globe+0.010
Neg logit contributions
please-0.015
 gor-0.014
 ge-0.013
 chickens-0.013
 pigs-0.012
Token
Token ID198
Feature activation+3.40e-01

Pos logit contributions
oys+0.004
fs+0.004
 compartment+0.003
 handshake+0.003
 globe+0.003
Neg logit contributions
please-0.005
 gor-0.005
 chickens-0.005
 ge-0.005
 pigs-0.005
TokenThe
Token ID464
Feature activation-9.38e-01

Pos logit contributions
oys+0.008
fs+0.008
 handshake+0.007
kin+0.007
 compartment+0.007
Neg logit contributions
please-0.012
 gor-0.011
 chickens-0.011
 ge-0.010
 bored-0.010
Token princess
Token ID21752
Feature activation-3.48e-01

Pos logit contributions
please+0.032
 chickens+0.030
 gor+0.030
 ge+0.029
 pigs+0.028
Neg logit contributions
oys-0.023
fs-0.022
 compartment-0.020
 handshake-0.020
 globe-0.020
Token was
Token ID373
Feature activation-2.46e-01

Pos logit contributions
please+0.012
 gor+0.011
 chickens+0.011
 ge+0.011
 pigs+0.010
Neg logit contributions
oys-0.009
fs-0.008
 compartment-0.008
kin-0.008
 globe-0.008
Token so
Token ID523
Feature activation+3.83e-01

Pos logit contributions
please+0.008
 gor+0.008
 ge+0.007
 chickens+0.007
 pigs+0.007
Neg logit contributions
kin-0.006
 compartment-0.006
 seek-0.006
box-0.006
lf-0.005
Token happy
Token ID3772
Feature activation+5.02e-01

Pos logit contributions
kin+0.009
oys+0.008
fs+0.008
 handshake+0.008
 note+0.008
Neg logit contributions
please-0.013
 gor-0.013
 chickens-0.013
 pigs-0.012
 ge-0.012
Token.
Token ID13
Feature activation+4.83e-01

Pos logit contributions
oys+0.012
fs+0.011
 compartment+0.011
 handshake+0.010
 globe+0.010
Neg logit contributions
please-0.018
 gor-0.016
 chickens-0.016
 ge-0.016
 pigs-0.016
Token She
Token ID1375
Feature activation-2.39e-01

Pos logit contributions
oys+0.014
 compartment+0.013
 Kate+0.012
 handshake+0.012
 globe+0.012
Neg logit contributions
please-0.015
 ge-0.014
 gor-0.014
 chickens-0.014
 pigs-0.013
Token losing
Token ID6078
Feature activation-1.49e-01

Pos logit contributions
oys+0.036
fs+0.033
 compartment+0.032
 globe+0.032
kin+0.032
Neg logit contributions
please-0.051
 gor-0.048
 chickens-0.047
 ge-0.046
 pigs-0.044
Token her
Token ID607
Feature activation+1.48e-01

Pos logit contributions
please+0.004
 gor+0.004
 chickens+0.004
 ge+0.004
 pigs+0.003
Neg logit contributions
oys-0.005
fs-0.004
 compartment-0.004
kin-0.004
 globe-0.004
Token glove
Token ID29144
Feature activation+1.02e-01

Pos logit contributions
oys+0.004
fs+0.004
 compartment+0.004
 globe+0.004
cher+0.004
Neg logit contributions
please-0.005
 chickens-0.004
 ge-0.004
 bored-0.004
 pigs-0.004
Token,
Token ID11
Feature activation-4.65e-02

Pos logit contributions
oys+0.003
fs+0.003
 compartment+0.002
kin+0.002
 globe+0.002
Neg logit contributions
please-0.003
 gor-0.003
 chickens-0.003
 ge-0.003
 pigs-0.003
Token so
Token ID523
Feature activation+2.32e-01

Pos logit contributions
please+0.002
 gor+0.002
 chickens+0.002
 ge+0.002
 North+0.002
Neg logit contributions
oys-0.001
fs-0.001
 compartment-0.001
 globe-0.001
 box-0.001
Token she
Token ID673
Feature activation-1.10e+00

Pos logit contributions
oys+0.006
fs+0.006
 compartment+0.006
kin+0.006
 handshake+0.006
Neg logit contributions
please-0.007
 gor-0.007
 chickens-0.007
 ge-0.006
 pigs-0.006
Token asked
Token ID1965
Feature activation+2.01e-01

Pos logit contributions
please+0.029
 gor+0.027
 chickens+0.026
 ge+0.025
 North+0.024
Neg logit contributions
oys-0.035
fs-0.034
 compartment-0.033
kin-0.032
 handshake-0.032
Token her
Token ID607
Feature activation-2.23e-01

Pos logit contributions
oys+0.007
fs+0.006
 compartment+0.006
kin+0.006
 handshake+0.006
Neg logit contributions
please-0.005
 gor-0.005
 chickens-0.005
 ge-0.005
 North-0.005
Token mom
Token ID1995
Feature activation-8.82e-01

Pos logit contributions
please+0.008
 chickens+0.008
 gor+0.008
 ge+0.008
 pigs+0.007
Neg logit contributions
oys-0.005
fs-0.005
 compartment-0.004
 globe-0.004
 handshake-0.004
Token for
Token ID329
Feature activation-1.30e+00

Pos logit contributions
please+0.033
 gor+0.031
 ge+0.031
 chickens+0.030
 North+0.029
Neg logit contributions
oys-0.019
fs-0.017
 compartment-0.016
 box-0.016
box-0.016
Token help
Token ID1037
Feature activation-8.38e-01

Pos logit contributions
please+0.037
 chickens+0.034
 ge+0.032
 going+0.032
 pigs+0.031
Neg logit contributions
suit-0.042
 globe-0.041
oys-0.041
 compartment-0.039
ouse-0.039
Token of
Token ID286
Feature activation+8.55e-01

Pos logit contributions
please+0.017
 chickens+0.016
 gor+0.015
 ge+0.015
 bored+0.015
Neg logit contributions
oys-0.012
fs-0.012
 globe-0.011
 handshake-0.011
 compartment-0.011
Token soup
Token ID17141
Feature activation-1.87e+00

Pos logit contributions
oys+0.023
fs+0.022
kin+0.022
 box+0.021
 handshake+0.021
Neg logit contributions
please-0.026
 gor-0.024
 chickens-0.023
 North-0.023
 ge-0.022
Token with
Token ID351
Feature activation+4.92e-01

Pos logit contributions
please+0.065
 gor+0.061
 chickens+0.060
 ge+0.059
 North+0.057
Neg logit contributions
oys-0.046
fs-0.042
kin-0.042
 compartment-0.041
 globe-0.040
Token noodles
Token ID31103
Feature activation-2.16e+00

Pos logit contributions
 box+0.008
oys+0.008
fs+0.008
 present+0.008
 card+0.008
Neg logit contributions
please-0.019
 gor-0.018
 North-0.017
 pasture-0.017
 colony-0.016
Token and
Token ID290
Feature activation-7.86e-01

Pos logit contributions
please+0.083
 gor+0.077
 North+0.073
 ge+0.073
 chickens+0.072
Neg logit contributions
oys-0.048
kin-0.044
fs-0.043
box-0.041
 box-0.041
Token vegetables
Token ID13701
Feature activation-3.39e+00

Pos logit contributions
please+0.025
 gor+0.023
 North+0.022
 pasture+0.022
 colony+0.022
Neg logit contributions
oys-0.019
kin-0.018
fs-0.018
 box-0.018
 card-0.018
Token.
Token ID13
Feature activation-5.19e-01

Pos logit contributions
please+0.141
 North+0.133
 gor+0.133
 pasture+0.125
 Australian+0.124
Neg logit contributions
oys-0.058
kin-0.057
 box-0.057
 note-0.052
fs-0.052
Token Tom
Token ID4186
Feature activation+8.60e-01

Pos logit contributions
please+0.019
 chickens+0.017
 gor+0.017
 ge+0.016
 North+0.016
Neg logit contributions
oys-0.013
fs-0.010
 compartment-0.010
box-0.010
 box-0.010
Token looks
Token ID3073
Feature activation-7.26e-01

Pos logit contributions
 box+0.016
oys+0.016
 card+0.015
 globe+0.014
 compartment+0.014
Neg logit contributions
please-0.033
 gor-0.029
 harming-0.029
 North-0.029
 chickens-0.028
Token at
Token ID379
Feature activation+1.04e+00

Pos logit contributions
please+0.022
 North+0.020
 gor+0.020
 chickens+0.019
 pasture+0.018
Neg logit contributions
 box-0.019
 compartment-0.018
suit-0.018
 costume-0.018
 present-0.018
Token Lily
Token ID20037
Feature activation+2.25e-01

Pos logit contributions
oys+0.029
fs+0.027
 compartment+0.026
 globe+0.026
kin+0.026
Neg logit contributions
please-0.033
 gor-0.030
 chickens-0.030
 ge-0.029
 pigs-0.028
Token
Token ID220
Feature activation-1.99e+00

Pos logit contributions
please+0.014
 gor+0.013
 chickens+0.013
 ge+0.013
 North+0.012
Neg logit contributions
oys-0.011
 Kate-0.010
 compartment-0.010
 handshake-0.010
fs-0.010
Token
Token ID198
Feature activation-3.76e-01

Pos logit contributions
please+0.072
 gor+0.067
 chickens+0.067
 ge+0.065
 North+0.063
Neg logit contributions
oys-0.047
fs-0.043
 compartment-0.041
 handshake-0.040
kin-0.040
Token
Token ID198
Feature activation+7.50e-02

Pos logit contributions
please+0.013
 gor+0.012
 chickens+0.012
 ge+0.011
 pigs+0.011
Neg logit contributions
oys-0.010
fs-0.009
 compartment-0.008
 handshake-0.008
 globe-0.008
TokenThe
Token ID464
Feature activation-1.13e+00

Pos logit contributions
oys+0.002
fs+0.002
 compartment+0.002
 handshake+0.002
kin+0.002
Neg logit contributions
please-0.002
 gor-0.002
 chickens-0.002
 ge-0.002
 pigs-0.002
Token girl
Token ID2576
Feature activation-1.46e-01

Pos logit contributions
please+0.037
 gor+0.034
 chickens+0.033
 ge+0.032
 pigs+0.031
Neg logit contributions
oys-0.030
fs-0.029
 compartment-0.027
kin-0.027
 globe-0.027
Token felt
Token ID2936
Feature activation-2.44e+00

Pos logit contributions
please+0.005
 gor+0.005
 chickens+0.005
 North+0.005
 pigs+0.005
Neg logit contributions
oys-0.003
 compartment-0.003
fs-0.003
 handshake-0.003
lf-0.003
Token happy
Token ID3772
Feature activation+3.71e-01

Pos logit contributions
please+0.081
 gor+0.074
 chickens+0.071
 pigs+0.071
 North+0.069
Neg logit contributions
kin-0.059
fs-0.058
 handshake-0.055
 compartment-0.055
oys-0.054
Token and
Token ID290
Feature activation-5.76e-03

Pos logit contributions
oys+0.009
fs+0.008
 compartment+0.008
 handshake+0.008
kin+0.008
Neg logit contributions
please-0.013
 gor-0.012
 chickens-0.012
 ge-0.012
 pigs-0.011
Token amazed
Token ID24562
Feature activation-4.98e-01

Pos logit contributions
please+0.000
 gor+0.000
 chickens+0.000
 ge+0.000
 pigs+0.000
Neg logit contributions
oys-0.000
fs-0.000
 handshake-0.000
 compartment-0.000
kin-0.000
Token.
Token ID13
Feature activation-1.19e-01

Pos logit contributions
please+0.017
 chickens+0.016
 gor+0.016
 North+0.015
 ge+0.015
Neg logit contributions
oys-0.012
fs-0.011
 handshake-0.011
 compartment-0.011
kin-0.011
Token This
Token ID770
Feature activation-5.26e-01

Pos logit contributions
please+0.004
 gor+0.004
 ge+0.004
 chickens+0.004
 pigs+0.003
Neg logit contributions
oys-0.003
 compartment-0.003
 Kate-0.003
 Sure-0.003
 globe-0.003
Token bring
Token ID2222
Feature activation+7.36e-01

Pos logit contributions
please+0.005
 gor+0.005
 chickens+0.005
 ge+0.004
 pigs+0.004
Neg logit contributions
oys-0.003
fs-0.003
 handshake-0.003
kin-0.003
 compartment-0.003
Token him
Token ID683
Feature activation-3.35e-02

Pos logit contributions
oys+0.021
fs+0.020
 compartment+0.019
 globe+0.019
kin+0.019
Neg logit contributions
please-0.022
 gor-0.020
 chickens-0.020
 ge-0.019
 pigs-0.019
Token two
Token ID734
Feature activation-1.52e+00

Pos logit contributions
please+0.001
 gor+0.001
 chickens+0.001
 ge+0.001
 North+0.001
Neg logit contributions
oys-0.001
fs-0.001
kin-0.001
 handshake-0.001
 globe-0.001
Token delicious
Token ID12625
Feature activation-2.43e+00

Pos logit contributions
please+0.048
 gor+0.041
 North+0.040
ained+0.040
 harming+0.040
Neg logit contributions
fs-0.037
oys-0.036
 box-0.036
 present-0.035
kin-0.034
Token w
Token ID266
Feature activation-1.59e+00

Pos logit contributions
please+0.073
 gor+0.067
 chickens+0.066
 North+0.065
 harming+0.064
Neg logit contributions
oys-0.062
fs-0.059
 box-0.058
 present-0.057
kin-0.056
Tokenaffles
Token ID48501
Feature activation-2.76e+00

Pos logit contributions
please+0.050
 gor+0.046
 ge+0.046
 North+0.046
 chickens+0.046
Neg logit contributions
fs-0.043
oys-0.042
kin-0.041
f-0.037
bes-0.035
Token.
Token ID13
Feature activation+6.96e-01

Pos logit contributions
please+0.092
 gor+0.086
 chickens+0.082
 ge+0.082
 North+0.082
Neg logit contributions
oys-0.071
fs-0.068
kin-0.065
 globe-0.062
box-0.062
Token
Token ID198
Feature activation+3.26e-01

Pos logit contributions
oys+0.017
fs+0.016
 compartment+0.015
 globe+0.015
 handshake+0.015
Neg logit contributions
please-0.024
 chickens-0.022
 gor-0.022
 ge-0.021
 bored-0.020
Token
Token ID198
Feature activation+7.73e-01

Pos logit contributions
oys+0.009
fs+0.008
 compartment+0.008
 globe+0.008
 handshake+0.008
Neg logit contributions
please-0.011
 gor-0.010
 chickens-0.010
 ge-0.009
 pigs-0.009
TokenJohn
Token ID7554
Feature activation+1.74e+00

Pos logit contributions
oys+0.014
fs+0.013
lf+0.012
 globe+0.012
 compartment+0.011
Neg logit contributions
please-0.031
 chickens-0.029
 bored-0.029
 gor-0.029
 ge-0.028
Token loved
Token ID6151
Feature activation-4.17e-01

Pos logit contributions
oys+0.028
 box+0.027
lf+0.027
 belt+0.026
 compartment+0.026
Neg logit contributions
please-0.067
 gor-0.062
 harming-0.061
 ge-0.061
 chickens-0.060
Token She
Token ID1375
Feature activation+4.50e-01

Pos logit contributions
oys+0.003
 compartment+0.002
 handshake+0.002
 box+0.002
 globe+0.002
Neg logit contributions
please-0.003
 gor-0.003
 chickens-0.003
 ge-0.003
 North-0.003
Token wanted
Token ID2227
Feature activation+1.52e+00

Pos logit contributions
oys+0.013
fs+0.012
 compartment+0.012
 handshake+0.012
 box+0.012
Neg logit contributions
please-0.013
 gor-0.012
 chickens-0.012
 ge-0.011
 North-0.011
Token to
Token ID284
Feature activation+9.04e-01

Pos logit contributions
oys+0.041
fs+0.039
 compartment+0.038
 handshake+0.038
 globe+0.037
Neg logit contributions
please-0.048
 gor-0.044
 chickens-0.043
 ge-0.042
 pigs-0.040
Token know
Token ID760
Feature activation+1.33e-01

Pos logit contributions
oys+0.024
fs+0.023
 compartment+0.022
 globe+0.021
kin+0.021
Neg logit contributions
please-0.030
 gor-0.027
 chickens-0.027
 ge-0.026
 pigs-0.025
Token why
Token ID1521
Feature activation+2.49e-01

Pos logit contributions
oys+0.004
fs+0.004
kin+0.004
 globe+0.004
 compartment+0.004
Neg logit contributions
please-0.004
 chickens-0.004
 gor-0.004
 ge-0.003
 pigs-0.003
Token the
Token ID262
Feature activation-3.01e+00

Pos logit contributions
oys+0.006
fs+0.006
kin+0.006
 handshake+0.005
 compartment+0.005
Neg logit contributions
please-0.009
 gor-0.008
 chickens-0.008
 ge-0.008
 North-0.007
Token river
Token ID7850
Feature activation-1.94e+00

Pos logit contributions
please+0.081
 gor+0.071
 chickens+0.063
 North+0.063
 ge+0.061
Neg logit contributions
oys-0.094
 box-0.094
 globe-0.092
 handshake-0.088
 tube-0.087
Token had
Token ID550
Feature activation+5.44e-01

Pos logit contributions
please+0.074
 gor+0.068
 chickens+0.068
 ge+0.065
 North+0.064
Neg logit contributions
oys-0.041
fs-0.038
 compartment-0.036
 globe-0.036
 handshake-0.036
Token flooded
Token ID21050
Feature activation-6.03e-02

Pos logit contributions
oys+0.013
 handshake+0.012
 compartment+0.012
 box+0.012
fs+0.012
Neg logit contributions
please-0.019
 gor-0.017
 chickens-0.016
 North-0.016
 pasture-0.016
Token.
Token ID13
Feature activation+1.34e-01

Pos logit contributions
please+0.002
 chickens+0.002
 gor+0.002
 ge+0.002
 pigs+0.002
Neg logit contributions
oys-0.002
 compartment-0.001
fs-0.001
 globe-0.001
kin-0.001
Token
Token ID220
Feature activation-1.47e+00

Pos logit contributions
oys+0.003
fs+0.003
 compartment+0.003
 handshake+0.003
 globe+0.003
Neg logit contributions
please-0.005
 gor-0.005
 chickens-0.005
 ge-0.004
 pigs-0.004
Token It
Token ID632
Feature activation+1.18e+00

Pos logit contributions
oys+0.003
fs+0.003
 compartment+0.003
 handshake+0.003
kin+0.003
Neg logit contributions
please-0.005
 gor-0.004
 chickens-0.004
 ge-0.004
 North-0.004
Token opened
Token ID4721
Feature activation+5.49e-02

Pos logit contributions
oys+0.029
fs+0.028
 compartment+0.027
 handshake+0.027
 globe+0.026
Neg logit contributions
please-0.040
 chickens-0.037
 gor-0.037
 ge-0.035
 North-0.034
Token its
Token ID663
Feature activation+8.08e-01

Pos logit contributions
oys+0.001
 compartment+0.001
 globe+0.001
fs+0.001
 handshake+0.001
Neg logit contributions
please-0.002
 chickens-0.002
 gor-0.002
 ge-0.002
 pigs-0.002
Token mouth
Token ID5422
Feature activation-1.06e+00

Pos logit contributions
oys+0.021
fs+0.021
kin+0.019
 globe+0.019
 compartment+0.019
Neg logit contributions
please-0.026
 gor-0.024
 chickens-0.024
 ge-0.024
 pigs-0.023
Token and
Token ID290
Feature activation+3.16e-01

Pos logit contributions
please+0.035
 chickens+0.032
 gor+0.032
 ge+0.031
 pigs+0.030
Neg logit contributions
oys-0.028
fs-0.026
 compartment-0.025
 globe-0.025
kin-0.025
Token swallowed
Token ID27961
Feature activation-2.56e+00

Pos logit contributions
oys+0.009
fs+0.009
 compartment+0.008
 globe+0.008
 handshake+0.008
Neg logit contributions
please-0.010
 gor-0.009
 chickens-0.009
 ge-0.008
 pigs-0.008
Token the
Token ID262
Feature activation-6.40e-02

Pos logit contributions
please+0.099
 gor+0.089
 chickens+0.088
 North+0.085
 ge+0.083
Neg logit contributions
oys-0.057
fs-0.055
kin-0.051
 compartment-0.051
 globe-0.050
Token trash
Token ID13913
Feature activation-7.17e-01

Pos logit contributions
please+0.002
 gor+0.002
 chickens+0.002
 ge+0.002
 pigs+0.002
Neg logit contributions
oys-0.002
fs-0.002
kin-0.002
 compartment-0.002
 handshake-0.002
Token.
Token ID13
Feature activation-1.60e-01

Pos logit contributions
please+0.023
 gor+0.021
 chickens+0.021
 ge+0.020
 North+0.019
Neg logit contributions
oys-0.020
fs-0.019
 compartment-0.018
 globe-0.018
kin-0.018
Token Lily
Token ID20037
Feature activation+9.95e-01

Pos logit contributions
please+0.006
 gor+0.006
 chickens+0.005
 ge+0.005
 North+0.005
Neg logit contributions
oys-0.004
fs-0.003
 compartment-0.003
kin-0.003
 handshake-0.003
Token and
Token ID290
Feature activation-1.65e-01

Pos logit contributions
oys+0.023
fs+0.021
 compartment+0.021
kin+0.020
 globe+0.020
Neg logit contributions
please-0.036
 gor-0.034
 chickens-0.033
 ge-0.032
 North-0.031