TOP ACTIVATIONS
MAX = 0.100

the yellow duck fl apped her wings . The
favorite sweater and cl apped her hands . Grand pa smiled
!" The duck fl apped her wings and flew off into
child l aug heed in delight . " Oh
board , Bella cl apped her hands . The
. The butterfly fl apped her wings and moved slowly away
the bottom and cl apped her hands . Suddenly
on the shore and waved her arms . She hoped someone
whip away and cl apped her hands . The bird flew
it !" Lily cl apped her hands and said , "
at the dog and waved her arms . The dog stopped
Lily smiled and cl apped her hands . She was happy
eyes as the fairy waved her wand and chanted some secret
very happy and cl apped her hands too .
, so she fl apped her wings and flew away .
Anna . He l icks her face . Anna laughs .
would laugh and cl ap her hands . It was so
so excited and cl apped her hands ! Daddy
!" Mom said , waving her arms . The
her seat and cl apped her hands , ready to eat

MIN ACTIVATIONS
MIN = -0.119

didn 't know what " sm oother " meant , but
shell . The tort oise was happy to help
little er aser . The er aser was sad because it
would . The tort oise started on his adventure
that said " No T resp assing ". But Lily was
old man ?" The tort oise replied : " I
are you here , little cater pillar ?" the monkey said
he noticed something strange . Among st the keys , there
that said " No T resp assing ". "
big sign that said " EX IT ". He asked his
other er aser . This er aser was independent and could
. The er aser put everything in the
that said " No T resp assing ". And behind the
a cater pillar . The cater pillar wanted to go on
One day , the er aser met a kind boy
talk ! The er aser said , " Hi
heard . The rh in oc eros decided to
full of colourful things . Among them , there was a
. " Wow , that cater pillar is so big !"
it ." The rac coon laughed and pointed to

INTERVAL 0.045 - 0.100
CONTAINS 3.327%

up Jack 's eyes with excitement . But one night ,
cheered and Sammy felt so happy . Sam my
harsh voice , but John smiled anyways . He â
late . The pipe grabbed him and pulled him in .
noticed something unusual . On her bedroom floor there was a

INTERVAL -0.010 - 0.045
CONTAINS 64.238%

more careful and never got her bl ouse dirty again .
to come . <|endoftext|> Ell ie was feeling very lonely .
but she shouldn 't feel weak . Everyone has different tastes
The moral of this story is to make sure you
learned an important lesson that day - the dry ness of

INTERVAL -0.064 - -0.010
CONTAINS 32.040%

down on the beach . Jane was scared , but her
each other and promised to be friends . Their mom smiled
Tim 's mom saw the problem . She said
. She ran her fingers over the velvet dress and felt
of the monster . His mom my and daddy told him

INTERVAL -0.119 - -0.064
CONTAINS 0.395%

the bird and asked , " Do you want to come
They played with the k ite all day . Tom
many papers . " Mom , can we go
Mrs Johnson was the best mel on - cho oser in
angry . She said , " Why did you disob ey
Token the
Token ID262
Feature activation-1.17e-02

Pos logit contributions
edly+0.013
 faces+0.012
 face+0.012
by+0.011
 tum+0.011
Neg logit contributions
 course-0.018
ions-0.017
 July-0.016
é-0.016
issa-0.016
Token yellow
Token ID7872
Feature activation-1.61e-02

Pos logit contributions
 course+0.024
 July+0.023
ions+0.022
é+0.021
neath+0.021
Neg logit contributions
edly-0.017
 faces-0.015
by-0.015
 face-0.015
aj-0.015
Token duck
Token ID22045
Feature activation-1.94e-02

Pos logit contributions
 course+0.032
ions+0.030
 July+0.029
é+0.028
neath+0.028
Neg logit contributions
edly-0.024
aj-0.022
powers-0.021
by-0.021
 faces-0.021
Token fl
Token ID781
Feature activation-5.69e-03

Pos logit contributions
 course+0.040
ions+0.037
é+0.037
hing+0.035
e+0.035
Neg logit contributions
edly-0.026
aj-0.025
powers-0.024
 faces-0.023
 spine-0.022
Tokenapped
Token ID6320
Feature activation-9.92e-05

Pos logit contributions
ions+0.008
ang+0.007
 course+0.007
é+0.007
osh+0.007
Neg logit contributions
 faces-0.011
edly-0.011
 face-0.011
 tum-0.010
 teeth-0.010
Token her
Token ID607
Feature activation+9.95e-02

Pos logit contributions
 course+0.000
ions+0.000
 July+0.000
e+0.000
hing+0.000
Neg logit contributions
aj-0.000
powers-0.000
edly-0.000
 spine-0.000
suits-0.000
Token wings
Token ID12098
Feature activation+3.66e-02

Pos logit contributions
edly+0.139
aj+0.137
 upon+0.134
powers+0.127
by+0.123
Neg logit contributions
e-0.174
 course-0.170
osh-0.146
igh-0.146
hing-0.143
Token.
Token ID13
Feature activation+1.83e-03

Pos logit contributions
edly+0.037
aj+0.036
 faces+0.034
powers+0.033
by+0.032
Neg logit contributions
 course-0.086
ions-0.081
hing-0.081
é-0.077
ner-0.077
Token
Token ID198
Feature activation+1.25e-02

Pos logit contributions
edly+0.002
 face+0.002
 faces+0.002
 tum+0.002
by+0.002
Neg logit contributions
 course-0.004
ions-0.004
 July-0.004
é-0.004
issa-0.004
Token
Token ID198
Feature activation-5.70e-03

Pos logit contributions
edly+0.012
 faces+0.011
 face+0.011
by+0.010
 tum+0.010
Neg logit contributions
 course-0.031
ions-0.030
 July-0.030
é-0.029
issa-0.028
TokenThe
Token ID464
Feature activation-2.79e-02

Pos logit contributions
 course+0.011
ions+0.011
igh+0.011
issa+0.010
é+0.010
Neg logit contributions
edly-0.008
 face-0.008
 faces-0.007
powers-0.007
 upon-0.007
Token favorite
Token ID4004
Feature activation+4.67e-02

Pos logit contributions
edly+0.097
by+0.084
 faces+0.084
powers+0.079
 face+0.078
Neg logit contributions
 course-0.136
 July-0.128
issa-0.126
ions-0.123
neath-0.119
Token sweater
Token ID35685
Feature activation+1.95e-02

Pos logit contributions
edly+0.059
 faces+0.052
 face+0.051
by+0.048
powers+0.047
Neg logit contributions
 course-0.105
ions-0.100
 July-0.099
é-0.095
issa-0.094
Token and
Token ID290
Feature activation+1.92e-02

Pos logit contributions
 faces+0.018
edly+0.017
by+0.016
 face+0.016
powers+0.015
Neg logit contributions
 course-0.049
ions-0.047
ner-0.045
é-0.045
issa-0.044
Token cl
Token ID537
Feature activation-2.21e-02

Pos logit contributions
edly+0.024
 faces+0.022
 face+0.022
 tum+0.020
by+0.020
Neg logit contributions
 course-0.043
ions-0.041
 July-0.040
é-0.039
issa-0.039
Tokenapped
Token ID6320
Feature activation-7.47e-03

Pos logit contributions
 course+0.040
ions+0.039
 July+0.038
é+0.036
issa+0.036
Neg logit contributions
edly-0.037
 faces-0.035
 face-0.035
by-0.032
 tum-0.032
Token her
Token ID607
Feature activation+9.70e-02

Pos logit contributions
 course+0.012
ions+0.011
 July+0.011
 introducing+0.010
é+0.010
Neg logit contributions
edly-0.013
aj-0.013
powers-0.013
 spine-0.013
suits-0.012
Token hands
Token ID2832
Feature activation+3.65e-02

Pos logit contributions
edly+0.137
 faces+0.121
by+0.118
aj+0.114
powers+0.114
Neg logit contributions
 course-0.184
 July-0.174
issa-0.169
igh-0.167
é-0.167
Token.
Token ID13
Feature activation+1.28e-02

Pos logit contributions
edly+0.025
 faces+0.022
 face+0.021
aj+0.019
by+0.019
Neg logit contributions
 course-0.099
ions-0.097
 July-0.095
é-0.094
ner-0.093
Token Grand
Token ID5675
Feature activation-4.77e-02

Pos logit contributions
edly+0.012
 face+0.012
 faces+0.012
 tum+0.010
by+0.010
Neg logit contributions
 course-0.032
ions-0.031
 July-0.030
é-0.029
issa-0.029
Tokenpa
Token ID8957
Feature activation-1.48e-02

Pos logit contributions
ions+0.048
 course+0.046
 July+0.045
ner+0.044
neath+0.044
Neg logit contributions
edly-0.110
 faces-0.110
 face-0.110
by-0.107
aj-0.103
Token smiled
Token ID13541
Feature activation+4.47e-02

Pos logit contributions
opl+0.039
 course+0.038
ions+0.037
 July+0.036
neath+0.035
Neg logit contributions
by-0.013
 to-0.013
 felt-0.012
 face-0.012
 back-0.012
Token!"
Token ID2474
Feature activation+1.15e-02

Pos logit contributions
 July+0.039
ions+0.037
 course+0.037
neath+0.036
issa+0.036
Neg logit contributions
edly-0.020
 back-0.019
 to-0.019
 face-0.019
 faces-0.018
Token The
Token ID383
Feature activation-4.18e-02

Pos logit contributions
edly+0.017
 faces+0.016
 face+0.016
aj+0.016
by+0.016
Neg logit contributions
 course-0.023
ions-0.021
 July-0.021
é-0.021
neath-0.020
Token duck
Token ID22045
Feature activation-1.62e-02

Pos logit contributions
ions+0.090
 course+0.088
 July+0.086
é+0.085
issa+0.085
Neg logit contributions
 face-0.055
 faces-0.053
edly-0.052
 tum-0.048
 back-0.048
Token fl
Token ID781
Feature activation-1.67e-02

Pos logit contributions
 course+0.035
é+0.032
ions+0.031
 July+0.031
neath+0.030
Neg logit contributions
edly-0.021
 faces-0.020
aj-0.019
powers-0.018
 face-0.018
Tokenapped
Token ID6320
Feature activation-1.07e-02

Pos logit contributions
ions+0.027
 course+0.026
é+0.024
igh+0.024
 July+0.024
Neg logit contributions
edly-0.030
 faces-0.030
 face-0.029
 tum-0.027
by-0.027
Token her
Token ID607
Feature activation+9.69e-02

Pos logit contributions
 course+0.019
 July+0.017
ions+0.017
e+0.016
hing+0.016
Neg logit contributions
aj-0.018
powers-0.018
edly-0.018
suits-0.017
uddle-0.016
Token wings
Token ID12098
Feature activation+3.34e-02

Pos logit contributions
edly+0.133
aj+0.132
 faces+0.119
 upon+0.119
by+0.117
Neg logit contributions
 course-0.186
é-0.165
 July-0.161
neath-0.161
ions-0.159
Token and
Token ID290
Feature activation+2.39e-02

Pos logit contributions
edly+0.035
 faces+0.033
aj+0.033
 face+0.031
by+0.030
Neg logit contributions
 course-0.079
ions-0.074
hing-0.072
é-0.072
neath-0.072
Token flew
Token ID13112
Feature activation-4.60e-03

Pos logit contributions
edly+0.033
 faces+0.031
 face+0.031
 tum+0.028
by+0.028
Neg logit contributions
 course-0.050
ions-0.048
 July-0.047
é-0.046
issa-0.045
Token off
Token ID572
Feature activation+3.87e-02

Pos logit contributions
 course+0.008
ions+0.008
é+0.008
neath+0.008
ting+0.007
Neg logit contributions
aj-0.007
edly-0.007
 faces-0.007
 tum-0.007
 face-0.007
Token into
Token ID656
Feature activation-2.24e-02

Pos logit contributions
edly+0.056
 faces+0.053
 face+0.052
aj+0.049
by+0.049
Neg logit contributions
 course-0.079
ions-0.075
 July-0.074
é-0.072
ner-0.071
Token child
Token ID1200
Feature activation-2.51e-02

Pos logit contributions
 course+0.054
ions+0.053
é+0.049
neath+0.049
 July+0.048
Neg logit contributions
 face-0.040
 faces-0.040
edly-0.040
 tum-0.037
 eyes-0.036
Token l
Token ID300
Feature activation-4.33e-02

Pos logit contributions
 course+0.057
ions+0.055
 July+0.054
é+0.053
issa+0.052
Neg logit contributions
edly-0.030
 faces-0.028
 face-0.027
 tum-0.025
by-0.025
Tokenaug
Token ID7493
Feature activation-5.72e-02

Pos logit contributions
 course+0.076
ions+0.076
é+0.070
 July+0.070
issa+0.069
Neg logit contributions
edly-0.073
 faces-0.072
 face-0.071
 tum-0.067
by-0.067
Tokenheed
Token ID23616
Feature activation+4.37e-03

Pos logit contributions
 course+0.082
 July+0.076
ions+0.075
igh+0.069
 introducing+0.066
Neg logit contributions
edly-0.115
aj-0.100
 face-0.097
suit-0.096
ro-0.094
Token in
Token ID287
Feature activation-2.71e-03

Pos logit contributions
edly+0.006
powers+0.006
aj+0.006
 faces+0.006
 tum+0.006
Neg logit contributions
 July-0.008
ions-0.008
 course-0.008
neath-0.008
é-0.008
Token delight
Token ID10974
Feature activation+9.62e-02

Pos logit contributions
 course+0.005
 July+0.005
ions+0.005
neath+0.005
é+0.005
Neg logit contributions
edly-0.004
 faces-0.004
aj-0.004
 face-0.004
by-0.004
Token.
Token ID13
Feature activation+1.32e-02

Pos logit contributions
powers+0.099
suits+0.087
aj+0.085
edly+0.085
by+0.082
Neg logit contributions
 July-0.202
 course-0.200
ions-0.198
 introducing-0.194
é-0.193
Token
Token ID198
Feature activation+1.92e-02

Pos logit contributions
edly+0.018
 faces+0.017
 face+0.017
by+0.016
powers+0.016
Neg logit contributions
 course-0.027
 July-0.026
ions-0.026
é-0.025
neath-0.025
Token
Token ID198
Feature activation+2.71e-03

Pos logit contributions
edly+0.023
 faces+0.021
 face+0.020
 tum+0.019
by+0.019
Neg logit contributions
 course-0.044
ions-0.043
 July-0.042
é-0.041
issa-0.040
Token"
Token ID1
Feature activation-4.69e-02

Pos logit contributions
edly+0.004
 face+0.004
 faces+0.004
aj+0.004
by+0.004
Neg logit contributions
 course-0.005
ions-0.005
igh-0.005
é-0.005
 July-0.005
TokenOh
Token ID5812
Feature activation-7.93e-03

Pos logit contributions
 course+0.085
ions+0.082
igh+0.081
opl+0.079
 July+0.078
Neg logit contributions
edly-0.081
 face-0.067
 faces-0.067
 upon-0.066
by-0.066
Token board
Token ID3096
Feature activation-2.43e-03

Pos logit contributions
 course+0.000
 July+0.000
issa+0.000
é+0.000
ner+0.000
Neg logit contributions
edly-0.000
aj-0.000
powers-0.000
by-0.000
 faces-0.000
Token,
Token ID11
Feature activation-2.14e-02

Pos logit contributions
 course+0.006
ions+0.005
 July+0.005
é+0.005
neath+0.005
Neg logit contributions
edly-0.003
 faces-0.003
 face-0.003
aj-0.002
by-0.002
Token Bella
Token ID39304
Feature activation-7.05e-04

Pos logit contributions
 course+0.042
ions+0.041
 July+0.040
é+0.039
issa+0.038
Neg logit contributions
edly-0.031
 faces-0.030
 face-0.030
 tum-0.027
by-0.027
Token cl
Token ID537
Feature activation-1.06e-02

Pos logit contributions
 course+0.002
é+0.001
 July+0.001
ions+0.001
issa+0.001
Neg logit contributions
edly-0.001
 faces-0.001
 face-0.001
 tum-0.001
aj-0.001
Tokenapped
Token ID6320
Feature activation-3.04e-03

Pos logit contributions
 course+0.016
ions+0.016
é+0.015
 July+0.015
igh+0.015
Neg logit contributions
edly-0.020
 faces-0.019
 face-0.018
powers-0.018
 tum-0.018
Token her
Token ID607
Feature activation+9.62e-02

Pos logit contributions
 course+0.005
ions+0.005
 July+0.004
 introducing+0.004
hing+0.004
Neg logit contributions
aj-0.005
edly-0.005
powers-0.005
suits-0.005
 spine-0.005
Token hands
Token ID2832
Feature activation+4.41e-02

Pos logit contributions
edly+0.151
aj+0.144
 upon+0.136
powers+0.129
by+0.128
Neg logit contributions
 course-0.152
e-0.146
osh-0.136
igh-0.135
neath-0.134
Token.
Token ID13
Feature activation+1.01e-02

Pos logit contributions
edly+0.046
aj+0.043
 faces+0.043
powers+0.041
 face+0.041
Neg logit contributions
 course-0.103
ions-0.098
 July-0.094
é-0.094
ner-0.093
Token
Token ID198
Feature activation+1.41e-02

Pos logit contributions
edly+0.012
 faces+0.011
 face+0.011
by+0.010
 tum+0.010
Neg logit contributions
 course-0.023
ions-0.022
 July-0.022
é-0.021
neath-0.021
Token
Token ID198
Feature activation-3.56e-03

Pos logit contributions
edly+0.014
 faces+0.013
 face+0.013
by+0.012
 tum+0.012
Neg logit contributions
 course-0.035
ions-0.033
 July-0.033
é-0.032
issa-0.032
TokenThe
Token ID464
Feature activation-2.45e-02

Pos logit contributions
 course+0.007
issa+0.007
 July+0.007
igh+0.007
ions+0.007
Neg logit contributions
edly-0.005
 face-0.004
 to-0.004
 faces-0.004
powers-0.004
Token.
Token ID13
Feature activation+1.13e-02

Pos logit contributions
edly+0.011
 faces+0.009
 face+0.009
aj+0.008
powers+0.008
Neg logit contributions
 course-0.036
ions-0.034
 July-0.033
é-0.033
neath-0.032
Token The
Token ID383
Feature activation-3.19e-02

Pos logit contributions
edly+0.017
 faces+0.016
 face+0.016
by+0.015
aj+0.015
Neg logit contributions
 course-0.022
ions-0.021
 July-0.021
é-0.020
neath-0.020
Token butterfly
Token ID35113
Feature activation-2.00e-02

Pos logit contributions
 course+0.055
ions+0.054
 July+0.052
é+0.050
neath+0.049
Neg logit contributions
edly-0.055
 face-0.054
 faces-0.053
 tum-0.049
by-0.049
Token fl
Token ID781
Feature activation-1.17e-02

Pos logit contributions
 course+0.038
ions+0.036
 July+0.036
é+0.035
neath+0.034
Neg logit contributions
edly-0.031
 faces-0.029
 face-0.029
by-0.027
 tum-0.027
Tokenapped
Token ID6320
Feature activation-1.99e-02

Pos logit contributions
ions+0.019
 course+0.018
é+0.017
neath+0.016
igh+0.016
Neg logit contributions
 faces-0.021
edly-0.021
 face-0.020
 tum-0.019
powers-0.019
Token her
Token ID607
Feature activation+9.52e-02

Pos logit contributions
 course+0.032
ions+0.029
 July+0.028
neath+0.027
é+0.026
Neg logit contributions
aj-0.037
edly-0.036
powers-0.035
suits-0.034
 tum-0.033
Token wings
Token ID12098
Feature activation+2.77e-02

Pos logit contributions
edly+0.130
aj+0.129
 faces+0.119
by+0.116
powers+0.115
Neg logit contributions
 course-0.179
e-0.159
é-0.155
neath-0.152
ions-0.151
Token and
Token ID290
Feature activation+1.83e-02

Pos logit contributions
edly+0.031
aj+0.030
 faces+0.029
powers+0.028
 face+0.027
Neg logit contributions
 course-0.063
ions-0.059
hing-0.058
neath-0.058
é-0.057
Token moved
Token ID3888
Feature activation-1.16e-02

Pos logit contributions
edly+0.027
 faces+0.026
 face+0.026
 tum+0.024
by+0.024
Neg logit contributions
 course-0.036
ions-0.035
 July-0.035
é-0.034
issa-0.033
Token slowly
Token ID6364
Feature activation+3.80e-02

Pos logit contributions
 course+0.023
ions+0.021
neath+0.020
é+0.020
 July+0.020
Neg logit contributions
aj-0.018
edly-0.018
 faces-0.017
 tum-0.017
 face-0.017
Token away
Token ID1497
Feature activation+5.40e-02

Pos logit contributions
edly+0.038
 faces+0.034
aj+0.034
 face+0.033
by+0.031
Neg logit contributions
 course-0.093
ions-0.089
 July-0.087
é-0.086
neath-0.085
Token the
Token ID262
Feature activation+1.68e-03

Pos logit contributions
 face+0.002
 back+0.002
 faces+0.002
 to+0.002
edly+0.002
Neg logit contributions
ions-0.004
issa-0.004
 July-0.004
igh-0.004
 course-0.004
Token bottom
Token ID4220
Feature activation+1.82e-03

Pos logit contributions
edly+0.003
 faces+0.003
 face+0.003
by+0.003
aj+0.003
Neg logit contributions
 course-0.003
ions-0.003
 July-0.003
é-0.003
issa-0.002
Token and
Token ID290
Feature activation+1.30e-02

Pos logit contributions
edly+0.002
 face+0.002
 faces+0.001
 to+0.001
 back+0.001
Neg logit contributions
 course-0.005
 July-0.004
ions-0.004
issa-0.004
neath-0.004
Token cl
Token ID537
Feature activation-1.25e-02

Pos logit contributions
edly+0.017
 faces+0.016
 face+0.016
by+0.015
 back+0.014
Neg logit contributions
 course-0.028
ions-0.027
 July-0.027
é-0.026
issa-0.025
Tokenapped
Token ID6320
Feature activation-1.93e-03

Pos logit contributions
 course+0.021
ions+0.020
 July+0.019
é+0.019
neath+0.018
Neg logit contributions
edly-0.022
 faces-0.021
 face-0.021
 tum-0.020
by-0.020
Token her
Token ID607
Feature activation+9.52e-02

Pos logit contributions
 course+0.003
ions+0.003
 July+0.003
 introducing+0.002
é+0.002
Neg logit contributions
edly-0.004
powers-0.004
aj-0.004
suits-0.003
 spine-0.003
Token hands
Token ID2832
Feature activation+4.41e-02

Pos logit contributions
edly+0.151
aj+0.136
 faces+0.125
 upon+0.124
by+0.123
Neg logit contributions
 course-0.154
 July-0.149
é-0.147
e-0.146
osh-0.143
Token.
Token ID13
Feature activation+1.29e-02

Pos logit contributions
edly+0.036
 faces+0.033
aj+0.032
 face+0.031
 tum+0.029
Neg logit contributions
 course-0.112
ions-0.110
 July-0.106
é-0.106
ner-0.105
Token
Token ID198
Feature activation+2.42e-02

Pos logit contributions
edly+0.018
 faces+0.016
 face+0.016
by+0.015
aj+0.015
Neg logit contributions
 course-0.027
ions-0.026
 July-0.026
é-0.025
neath-0.025
Token
Token ID198
Feature activation+2.61e-03

Pos logit contributions
edly+0.026
 faces+0.024
 face+0.024
 tum+0.021
by+0.021
Neg logit contributions
 course-0.058
ions-0.056
 July-0.055
é-0.054
issa-0.053
TokenSuddenly
Token ID38582
Feature activation-1.14e-02

Pos logit contributions
edly+0.004
 face+0.004
 faces+0.004
 tum+0.003
powers+0.003
Neg logit contributions
 course-0.005
ions-0.005
 July-0.005
igh-0.005
é-0.005
Token on
Token ID319
Feature activation-2.23e-02

Pos logit contributions
 course+0.003
 July+0.003
ions+0.003
issa+0.003
neath+0.003
Neg logit contributions
powers-0.004
edly-0.004
 faces-0.003
 tum-0.003
by-0.003
Token the
Token ID262
Feature activation+1.93e-02

Pos logit contributions
 course+0.037
ions+0.033
 July+0.033
issa+0.031
neath+0.031
Neg logit contributions
edly-0.042
aj-0.039
powers-0.038
 faces-0.038
 face-0.038
Token shore
Token ID15191
Feature activation-1.93e-03

Pos logit contributions
edly+0.033
powers+0.031
by+0.030
aj+0.029
ossom+0.027
Neg logit contributions
 course-0.032
 July-0.031
issa-0.028
neath-0.027
 piano-0.027
Token and
Token ID290
Feature activation+1.44e-02

Pos logit contributions
 course+0.005
ions+0.005
 July+0.005
é+0.004
ner+0.004
Neg logit contributions
edly-0.002
powers-0.002
 faces-0.002
aj-0.002
by-0.002
Token waved
Token ID26834
Feature activation+5.11e-04

Pos logit contributions
powers+0.019
edly+0.019
aj+0.018
by+0.018
 faces+0.018
Neg logit contributions
 course-0.029
 July-0.027
issa-0.027
ions-0.026
neath-0.026
Token her
Token ID607
Feature activation+9.49e-02

Pos logit contributions
aj+0.001
ouse+0.001
edly+0.001
 contents+0.001
uddle+0.001
Neg logit contributions
 course-0.001
ting-0.001
ions-0.001
 July-0.001
 pian-0.001
Token arms
Token ID5101
Feature activation+2.87e-02

Pos logit contributions
edly+0.137
by+0.125
powers+0.119
 faces+0.117
aj+0.116
Neg logit contributions
 course-0.172
ions-0.166
 July-0.164
igh-0.158
issa-0.153
Token.
Token ID13
Feature activation+1.24e-02

Pos logit contributions
edly+0.028
 faces+0.026
aj+0.026
 face+0.024
powers+0.024
Neg logit contributions
 course-0.069
ions-0.067
é-0.064
hing-0.064
neath-0.063
Token She
Token ID1375
Feature activation+3.26e-02

Pos logit contributions
by+0.026
edly+0.025
 faces+0.024
ies+0.024
 contents+0.024
Neg logit contributions
 July-0.014
neath-0.014
ions-0.014
 course-0.014
otion-0.013
Token hoped
Token ID10719
Feature activation+8.83e-03

Pos logit contributions
edly+0.045
powers+0.042
 faces+0.041
 spine+0.041
by+0.040
Neg logit contributions
 course-0.065
issa-0.060
é-0.056
 July-0.056
osh-0.056
Token someone
Token ID2130
Feature activation-1.11e-02

Pos logit contributions
edly+0.014
aj+0.013
 faces+0.013
 face+0.013
powers+0.013
Neg logit contributions
 course-0.017
ions-0.016
 July-0.016
issa-0.015
ner-0.015
Token whip
Token ID22972
Feature activation+6.73e-03

Feature not active on prev token;
no predictions were affected.
Token away
Token ID1497
Feature activation+3.91e-02

Pos logit contributions
edly+0.007
 faces+0.007
 face+0.007
aj+0.006
by+0.006
Neg logit contributions
 course-0.016
ions-0.015
 July-0.015
é-0.015
neath-0.015
Token and
Token ID290
Feature activation+2.35e-03

Pos logit contributions
powers+0.042
aj+0.042
 faces+0.041
 tum+0.039
edly+0.039
Neg logit contributions
 course-0.084
ions-0.084
issa-0.081
é-0.079
ner-0.079
Token cl
Token ID537
Feature activation-1.32e-02

Pos logit contributions
edly+0.003
 face+0.003
 faces+0.003
 tum+0.003
 back+0.003
Neg logit contributions
 course-0.005
ions-0.005
opl-0.005
 July-0.005
é-0.005
Tokenapped
Token ID6320
Feature activation-2.32e-03

Pos logit contributions
 course+0.023
ions+0.023
 July+0.022
é+0.021
neath+0.021
Neg logit contributions
edly-0.022
 faces-0.021
 face-0.021
by-0.020
 tum-0.020
Token her
Token ID607
Feature activation+9.48e-02

Pos logit contributions
 course+0.004
ions+0.003
 July+0.003
e+0.003
 introducing+0.003
Neg logit contributions
edly-0.004
powers-0.004
aj-0.004
 spine-0.004
suits-0.004
Token hands
Token ID2832
Feature activation+4.01e-02

Pos logit contributions
edly+0.132
aj+0.118
 faces+0.115
by+0.110
 upon+0.109
Neg logit contributions
 course-0.177
 July-0.169
é-0.167
ions-0.166
neath-0.165
Token.
Token ID13
Feature activation+7.69e-03

Pos logit contributions
edly+0.034
 faces+0.031
aj+0.030
 face+0.029
by+0.028
Neg logit contributions
 course-0.101
ions-0.099
 July-0.095
é-0.095
ner-0.094
Token The
Token ID383
Feature activation-1.77e-02

Pos logit contributions
edly+0.010
 faces+0.010
 face+0.010
by+0.009
 tum+0.009
Neg logit contributions
 course-0.016
ions-0.016
 July-0.015
é-0.015
issa-0.015
Token bird
Token ID6512
Feature activation-1.41e-02

Pos logit contributions
ions+0.049
 course+0.048
é+0.047
 July+0.046
issa+0.046
Neg logit contributions
 face-0.013
 faces-0.013
edly-0.012
 tum-0.011
 back-0.009
Token flew
Token ID13112
Feature activation-4.74e-03

Pos logit contributions
 course+0.032
ions+0.030
 July+0.030
é+0.029
neath+0.029
Neg logit contributions
edly-0.017
 faces-0.016
 face-0.015
aj-0.015
powers-0.014
Token it
Token ID340
Feature activation+1.40e-02

Pos logit contributions
 face+0.033
 faces+0.033
edly+0.032
 tum+0.031
 to+0.030
Neg logit contributions
 course-0.033
ions-0.033
 July-0.033
é-0.032
issa-0.031
Token!"
Token ID2474
Feature activation+1.39e-02

Pos logit contributions
edly+0.023
 faces+0.021
 face+0.021
 tum+0.020
by+0.020
Neg logit contributions
 course-0.026
ions-0.025
 July-0.024
é-0.024
issa-0.023
Token Lily
Token ID20037
Feature activation+1.31e-02

Pos logit contributions
edly+0.025
by+0.024
eed+0.024
 upon+0.023
aj+0.023
Neg logit contributions
 course-0.020
ions-0.017
ner-0.017
on-0.017
é-0.016
Token cl
Token ID537
Feature activation-5.86e-03

Pos logit contributions
edly+0.017
 faces+0.015
 face+0.014
powers+0.014
aj+0.014
Neg logit contributions
 course-0.029
ions-0.027
é-0.026
 July-0.026
issa-0.026
Tokenapped
Token ID6320
Feature activation+2.52e-03

Pos logit contributions
 course+0.011
ions+0.010
 July+0.010
é+0.010
issa+0.010
Neg logit contributions
edly-0.010
 faces-0.009
 face-0.009
 tum-0.008
by-0.008
Token her
Token ID607
Feature activation+9.47e-02

Pos logit contributions
powers+0.005
aj+0.005
suits+0.005
edly+0.005
 spine+0.004
Neg logit contributions
 course-0.004
 introducing-0.003
 July-0.003
ions-0.003
hing-0.003
Token hands
Token ID2832
Feature activation+4.95e-02

Pos logit contributions
edly+0.141
aj+0.119
 faces+0.117
powers+0.115
 upon+0.112
Neg logit contributions
 course-0.178
 July-0.164
é-0.161
neath-0.160
ions-0.158
Token and
Token ID290
Feature activation+2.87e-02

Pos logit contributions
edly+0.047
 faces+0.042
 face+0.041
aj+0.040
powers+0.039
Neg logit contributions
 course-0.123
ions-0.117
 July-0.116
é-0.113
neath-0.112
Token said
Token ID531
Feature activation+3.43e-02

Pos logit contributions
edly+0.040
 faces+0.036
 face+0.036
powers+0.035
aj+0.035
Neg logit contributions
 course-0.060
ions-0.056
 July-0.055
ner-0.053
é-0.053
Token,
Token ID11
Feature activation+2.57e-02

Pos logit contributions
edly+0.045
 faces+0.041
aj+0.040
 face+0.039
powers+0.038
Neg logit contributions
 course-0.074
ions-0.071
 July-0.068
ner-0.068
é-0.067
Token "
Token ID366
Feature activation-4.87e-02

Pos logit contributions
edly+0.033
 faces+0.031
 face+0.031
by+0.029
 tum+0.028
Neg logit contributions
 course-0.056
ions-0.053
 July-0.053
é-0.051
issa-0.050
Token at
Token ID379
Feature activation-3.77e-02

Pos logit contributions
edly+0.024
by+0.022
powers+0.021
aj+0.021
 spine+0.020
Neg logit contributions
 course-0.027
ions-0.027
 July-0.026
otion-0.025
é-0.025
Token the
Token ID262
Feature activation-1.27e-02

Pos logit contributions
 course+0.071
ions+0.070
opl+0.069
é+0.068
 July+0.068
Neg logit contributions
 faces-0.055
edly-0.054
 face-0.053
 back-0.050
 eyes-0.048
Token dog
Token ID3290
Feature activation+2.32e-02

Pos logit contributions
 course+0.024
ner+0.020
 July+0.020
issa+0.020
ions+0.020
Neg logit contributions
edly-0.022
aj-0.019
by-0.019
powers-0.018
 faces-0.017
Token and
Token ID290
Feature activation+2.12e-02

Pos logit contributions
edly+0.023
 faces+0.019
aj+0.019
powers+0.019
 face+0.019
Neg logit contributions
 course-0.058
ions-0.055
é-0.054
 July-0.053
issa-0.051
Token waved
Token ID26834
Feature activation+2.65e-03

Pos logit contributions
edly+0.025
 faces+0.024
 face+0.024
 tum+0.021
 back+0.021
Neg logit contributions
 course-0.048
ions-0.047
 July-0.046
é-0.045
opl-0.044
Token her
Token ID607
Feature activation+9.43e-02

Pos logit contributions
edly+0.004
aj+0.004
 spine+0.004
ouse+0.004
powers+0.004
Neg logit contributions
 course-0.005
 July-0.004
ting-0.004
ions-0.004
é-0.004
Token arms
Token ID5101
Feature activation+3.27e-02

Pos logit contributions
edly+0.126
by+0.112
aj+0.108
 faces+0.105
powers+0.105
Neg logit contributions
 course-0.188
ions-0.185
 July-0.181
é-0.174
igh-0.174
Token.
Token ID13
Feature activation+8.07e-03

Pos logit contributions
edly+0.031
 faces+0.027
aj+0.027
by+0.026
 face+0.026
Neg logit contributions
 course-0.080
ions-0.078
é-0.075
hing-0.075
 July-0.075
Token The
Token ID383
Feature activation-3.02e-02

Pos logit contributions
edly+0.012
by+0.011
 faces+0.011
 face+0.011
aj+0.010
Neg logit contributions
 course-0.016
 July-0.015
ions-0.015
é-0.015
neath-0.015
Token dog
Token ID3290
Feature activation-2.43e-03

Pos logit contributions
 course+0.061
 July+0.055
ions+0.055
issa+0.053
ner+0.052
Neg logit contributions
edly-0.047
by-0.040
 faces-0.040
aj-0.040
 face-0.039
Token stopped
Token ID5025
Feature activation+2.36e-02

Pos logit contributions
 course+0.005
é+0.004
neath+0.004
issa+0.004
 introducing+0.004
Neg logit contributions
edly-0.003
aj-0.003
by-0.003
 spine-0.003
powers-0.003
Token Lily
Token ID20037
Feature activation+2.34e-03

Pos logit contributions
 course+0.009
ions+0.008
 July+0.008
é+0.008
issa+0.008
Neg logit contributions
edly-0.007
 face-0.007
 faces-0.007
 tum-0.006
by-0.006
Token smiled
Token ID13541
Feature activation+4.86e-02

Pos logit contributions
edly+0.003
 faces+0.003
 face+0.003
by+0.002
powers+0.002
Neg logit contributions
 course-0.005
ions-0.005
 July-0.005
issa-0.005
é-0.005
Token and
Token ID290
Feature activation+1.79e-02

Pos logit contributions
powers+0.089
aj+0.084
 spine+0.084
suits+0.084
suit+0.079
Neg logit contributions
 introducing-0.062
ting-0.062
ning-0.061
 course-0.060
neath-0.052
Token cl
Token ID537
Feature activation-5.91e-03

Pos logit contributions
edly+0.026
 faces+0.024
 face+0.023
by+0.023
powers+0.022
Neg logit contributions
 course-0.036
ions-0.034
 July-0.034
é-0.032
issa-0.032
Tokenapped
Token ID6320
Feature activation+1.98e-03

Pos logit contributions
 course+0.010
ions+0.010
 July+0.010
é+0.009
issa+0.009
Neg logit contributions
edly-0.010
 faces-0.010
 face-0.010
 tum-0.009
by-0.009
Token her
Token ID607
Feature activation+9.38e-02

Pos logit contributions
powers+0.004
edly+0.004
aj+0.004
 spine+0.004
suits+0.004
Neg logit contributions
 course-0.003
ions-0.003
 July-0.002
 introducing-0.002
e-0.002
Token hands
Token ID2832
Feature activation+4.30e-02

Pos logit contributions
edly+0.141
aj+0.124
by+0.117
 faces+0.116
powers+0.116
Neg logit contributions
 course-0.164
 July-0.162
e-0.156
ions-0.155
é-0.154
Token.
Token ID13
Feature activation+1.01e-02

Pos logit contributions
edly+0.037
aj+0.035
 faces+0.032
powers+0.032
by+0.032
Neg logit contributions
 course-0.105
ions-0.102
 July-0.099
ner-0.099
é-0.098
Token She
Token ID1375
Feature activation+2.89e-02

Pos logit contributions
edly+0.014
 faces+0.013
by+0.013
 face+0.013
aj+0.012
Neg logit contributions
 course-0.021
ions-0.020
 July-0.020
neath-0.019
é-0.019
Token was
Token ID373
Feature activation+1.01e-03

Pos logit contributions
edly+0.038
 faces+0.035
 face+0.034
by+0.033
powers+0.033
Neg logit contributions
 course-0.061
ions-0.057
 July-0.056
issa-0.056
é-0.055
Token happy
Token ID3772
Feature activation+5.12e-02

Pos logit contributions
edly+0.001
 face+0.001
 faces+0.001
 tum+0.001
by+0.001
Neg logit contributions
 course-0.002
ions-0.002
 July-0.002
é-0.002
neath-0.002
Token eyes
Token ID2951
Feature activation+2.02e-02

Pos logit contributions
edly+0.019
aj+0.015
suit+0.014
powers+0.014
 face+0.014
Neg logit contributions
 course-0.048
 July-0.047
ions-0.045
neath-0.044
é-0.044
Token as
Token ID355
Feature activation-1.34e-03

Pos logit contributions
edly+0.018
 faces+0.017
 face+0.017
by+0.015
 tum+0.015
Neg logit contributions
 course-0.052
ions-0.050
 July-0.049
é-0.048
issa-0.047
Token the
Token ID262
Feature activation-2.14e-02

Pos logit contributions
 course+0.003
ions+0.003
 July+0.003
é+0.003
issa+0.003
Neg logit contributions
edly-0.001
 face-0.001
 faces-0.001
by-0.001
 tum-0.001
Token fairy
Token ID25607
Feature activation-2.13e-02

Pos logit contributions
 course+0.045
ions+0.044
 July+0.041
neath+0.041
é+0.041
Neg logit contributions
 face-0.029
 faces-0.029
edly-0.028
 tum-0.026
 back-0.025
Token waved
Token ID26834
Feature activation-2.94e-03

Pos logit contributions
 course+0.044
ions+0.042
 July+0.042
é+0.040
issa+0.039
Neg logit contributions
edly-0.030
 face-0.028
 faces-0.028
by-0.026
 tum-0.025
Token her
Token ID607
Feature activation+9.37e-02

Pos logit contributions
 course+0.005
ting+0.005
 July+0.005
hing+0.005
neath+0.005
Neg logit contributions
aj-0.005
edly-0.004
suits-0.004
powers-0.004
ouse-0.004
Token wand
Token ID11569
Feature activation+3.20e-02

Pos logit contributions
edly+0.117
by+0.103
powers+0.101
 faces+0.101
aj+0.099
Neg logit contributions
 course-0.196
ions-0.189
 July-0.187
igh-0.185
issa-0.183
Token and
Token ID290
Feature activation+2.00e-02

Pos logit contributions
edly+0.033
 faces+0.032
aj+0.031
by+0.030
powers+0.030
Neg logit contributions
 course-0.075
ions-0.072
neath-0.068
 July-0.068
é-0.068
Token chanted
Token ID42261
Feature activation+6.87e-03

Pos logit contributions
edly+0.029
 faces+0.027
 face+0.027
aj+0.027
powers+0.025
Neg logit contributions
 course-0.040
ions-0.038
 July-0.037
é-0.036
neath-0.035
Token some
Token ID617
Feature activation+1.47e-02

Pos logit contributions
edly+0.009
aj+0.009
by+0.008
 faces+0.008
powers+0.008
Neg logit contributions
 course-0.014
ions-0.014
 July-0.013
é-0.013
neath-0.013
Token secret
Token ID3200
Feature activation+4.35e-02

Pos logit contributions
edly+0.019
 faces+0.017
 face+0.017
aj+0.016
by+0.016
Neg logit contributions
 course-0.032
ions-0.031
 July-0.031
é-0.029
issa-0.029
Token very
Token ID845
Feature activation+1.99e-02

Pos logit contributions
 face+0.005
edly+0.005
 back+0.005
 faces+0.005
 tum+0.005
Neg logit contributions
 course-0.012
ions-0.012
 July-0.011
neath-0.011
é-0.011
Token happy
Token ID3772
Feature activation+5.03e-02

Pos logit contributions
edly+0.022
 faces+0.020
 face+0.019
by+0.018
 tum+0.017
Neg logit contributions
 course-0.047
ions-0.046
 July-0.046
é-0.044
issa-0.043
Token and
Token ID290
Feature activation+1.10e-02

Pos logit contributions
aj+0.062
suits+0.061
powers+0.060
edly+0.060
suit+0.057
Neg logit contributions
 course-0.112
ions-0.103
 introducing-0.100
 July-0.095
ting-0.094
Token cl
Token ID537
Feature activation-9.95e-03

Pos logit contributions
edly+0.014
 faces+0.013
 face+0.013
by+0.012
aj+0.012
Neg logit contributions
 course-0.024
ions-0.023
 July-0.022
é-0.022
issa-0.021
Tokenapped
Token ID6320
Feature activation+1.41e-03

Pos logit contributions
 course+0.018
ions+0.017
 July+0.017
é+0.016
neath+0.016
Neg logit contributions
edly-0.017
 faces-0.016
 face-0.016
 tum-0.015
by-0.015
Token her
Token ID607
Feature activation+9.36e-02

Pos logit contributions
aj+0.002
powers+0.002
edly+0.002
suits+0.002
 spine+0.002
Neg logit contributions
 course-0.003
ions-0.002
 July-0.002
 introducing-0.002
é-0.002
Token hands
Token ID2832
Feature activation+4.12e-02

Pos logit contributions
edly+0.137
aj+0.126
 faces+0.118
 upon+0.113
by+0.110
Neg logit contributions
 course-0.170
é-0.158
 July-0.157
ions-0.155
e-0.155
Token too
Token ID1165
Feature activation+4.44e-02

Pos logit contributions
edly+0.035
aj+0.033
 faces+0.033
 face+0.031
powers+0.030
Neg logit contributions
 course-0.104
ions-0.100
ner-0.097
é-0.097
 July-0.096
Token.
Token ID13
Feature activation+2.83e-03

Pos logit contributions
edly+0.037
 faces+0.033
aj+0.032
 face+0.031
by+0.030
Neg logit contributions
 course-0.114
ions-0.109
 July-0.108
ner-0.107
é-0.107
Token
Token ID198
Feature activation+1.28e-02

Pos logit contributions
edly+0.004
 faces+0.003
 face+0.003
by+0.003
 tum+0.003
Neg logit contributions
 course-0.006
ions-0.006
 July-0.006
é-0.006
neath-0.006
Token
Token ID198
Feature activation-8.63e-03

Pos logit contributions
edly+0.013
 faces+0.012
 face+0.012
 tum+0.011
by+0.011
Neg logit contributions
 course-0.031
ions-0.030
 July-0.030
é-0.029
neath-0.028
Token,
Token ID11
Feature activation+1.14e-02

Pos logit contributions
 course+0.031
ions+0.028
neath+0.026
é+0.026
hing+0.025
Neg logit contributions
edly-0.029
 faces-0.027
 tum-0.026
aj-0.026
 face-0.026
Token so
Token ID523
Feature activation+1.67e-02

Pos logit contributions
edly+0.013
 face+0.013
 faces+0.012
 tum+0.011
 back+0.011
Neg logit contributions
 course-0.026
ions-0.026
 July-0.025
é-0.025
issa-0.025
Token she
Token ID673
Feature activation+1.41e-02

Pos logit contributions
edly+0.024
 faces+0.023
 face+0.022
by+0.021
 tum+0.021
Neg logit contributions
 course-0.034
ions-0.032
 July-0.032
é-0.031
issa-0.030
Token fl
Token ID781
Feature activation-7.89e-03

Pos logit contributions
edly+0.020
 faces+0.019
 face+0.018
aj+0.017
 tum+0.017
Neg logit contributions
 course-0.029
ions-0.027
 July-0.027
é-0.026
issa-0.026
Tokenapped
Token ID6320
Feature activation-1.39e-02

Pos logit contributions
ions+0.012
 course+0.011
é+0.011
neath+0.010
igh+0.010
Neg logit contributions
 faces-0.015
edly-0.015
 face-0.014
powers-0.014
 tum-0.014
Token her
Token ID607
Feature activation+9.35e-02

Pos logit contributions
 course+0.025
ions+0.023
 July+0.022
é+0.020
neath+0.020
Neg logit contributions
edly-0.024
aj-0.023
powers-0.023
suits-0.022
 tum-0.022
Token wings
Token ID12098
Feature activation+2.49e-02

Pos logit contributions
edly+0.148
aj+0.141
 faces+0.134
suits+0.133
by+0.130
Neg logit contributions
 course-0.156
 July-0.135
igh-0.135
e-0.133
osh-0.132
Token and
Token ID290
Feature activation+2.28e-02

Pos logit contributions
edly+0.030
 faces+0.029
aj+0.029
by+0.027
 face+0.027
Neg logit contributions
 course-0.055
ions-0.051
hing-0.050
neath-0.049
é-0.049
Token flew
Token ID13112
Feature activation-8.96e-03

Pos logit contributions
edly+0.036
 faces+0.034
 face+0.033
aj+0.032
by+0.032
Neg logit contributions
 course-0.043
ions-0.040
 July-0.040
é-0.038
neath-0.038
Token away
Token ID1497
Feature activation+5.79e-02

Pos logit contributions
 course+0.016
ions+0.015
neath+0.014
é+0.014
ting+0.013
Neg logit contributions
aj-0.016
 faces-0.015
 tum-0.015
edly-0.015
ouse-0.015
Token.
Token ID13
Feature activation+1.36e-02

Pos logit contributions
aj+0.081
 faces+0.074
ouse+0.069
by+0.069
 tum+0.068
Neg logit contributions
 course-0.113
ner-0.110
ions-0.107
ting-0.106
é-0.106
Token Anna
Token ID11735
Feature activation+2.35e-02

Pos logit contributions
 course+0.023
ions+0.022
 July+0.022
é+0.020
issa+0.020
Neg logit contributions
edly-0.020
 face-0.018
 faces-0.018
aj-0.017
by-0.017
Token.
Token ID13
Feature activation-4.16e-03

Pos logit contributions
edly+0.029
aj+0.026
powers+0.024
 spine+0.024
 faces+0.024
Neg logit contributions
 course-0.054
e-0.049
ions-0.048
é-0.046
igh-0.046
Token He
Token ID679
Feature activation+2.49e-02

Pos logit contributions
 course+0.009
ions+0.009
 July+0.009
ner+0.008
neath+0.008
Neg logit contributions
edly-0.005
by-0.005
 faces-0.005
aj-0.004
 face-0.004
Token l
Token ID300
Feature activation-2.80e-02

Pos logit contributions
edly+0.037
 faces+0.032
 face+0.032
 spine+0.031
by+0.030
Neg logit contributions
 course-0.053
é-0.046
 July-0.046
ions-0.045
neath-0.045
Tokenicks
Token ID3378
Feature activation-3.41e-02

Pos logit contributions
 course+0.044
ions+0.043
é+0.038
 July+0.038
igh+0.037
Neg logit contributions
edly-0.053
 faces-0.052
 face-0.051
by-0.050
aj-0.050
Token her
Token ID607
Feature activation+9.34e-02

Pos logit contributions
 course+0.065
ions+0.063
 July+0.060
é+0.057
neath+0.057
Neg logit contributions
edly-0.054
aj-0.050
powers-0.048
 faces-0.048
 face-0.046
Token face
Token ID1986
Feature activation+5.21e-02

Pos logit contributions
aj+0.104
powers+0.095
by+0.095
edly+0.092
 forgot+0.090
Neg logit contributions
e-0.193
 course-0.173
on-0.171
hing-0.164
bug-0.159
Token.
Token ID13
Feature activation-2.56e-03

Pos logit contributions
aj+0.047
by+0.046
edly+0.046
powers+0.045
 faces+0.044
Neg logit contributions
 course-0.121
ions-0.116
on-0.112
é-0.111
ner-0.109
Token Anna
Token ID11735
Feature activation+1.73e-02

Pos logit contributions
 course+0.005
 July+0.005
ions+0.005
ner+0.005
é+0.005
Neg logit contributions
edly-0.004
by-0.003
 faces-0.003
 face-0.003
aj-0.003
Token laughs
Token ID22051
Feature activation+4.20e-02

Pos logit contributions
edly+0.027
aj+0.023
 faces+0.023
 spine+0.023
 face+0.023
Neg logit contributions
 course-0.035
é-0.030
ions-0.030
neath-0.030
 July-0.029
Token.
Token ID13
Feature activation+3.79e-03

Pos logit contributions
by+0.059
aj+0.058
 spine+0.057
edly+0.057
powers+0.054
Neg logit contributions
 course-0.085
ting-0.076
ions-0.076
 introducing-0.075
hing-0.073
Token would
Token ID561
Feature activation+3.21e-02

Pos logit contributions
edly+0.012
 faces+0.011
 face+0.011
 tum+0.010
aj+0.010
Neg logit contributions
 course-0.020
ions-0.019
 July-0.018
é-0.018
issa-0.018
Token laugh
Token ID6487
Feature activation+1.95e-02

Pos logit contributions
edly+0.047
 faces+0.044
 face+0.043
by+0.041
 tum+0.040
Neg logit contributions
 course-0.065
ions-0.062
 July-0.061
é-0.059
neath-0.058
Token and
Token ID290
Feature activation+2.94e-02

Pos logit contributions
edly+0.024
aj+0.023
by+0.022
 spine+0.022
 tum+0.022
Neg logit contributions
 course-0.041
ions-0.040
 July-0.038
hing-0.038
é-0.037
Token cl
Token ID537
Feature activation-9.59e-03

Pos logit contributions
edly+0.044
 faces+0.039
 face+0.038
aj+0.038
by+0.038
Neg logit contributions
 course-0.058
 July-0.055
ions-0.055
é-0.052
neath-0.051
Tokenap
Token ID499
Feature activation-6.18e-03

Pos logit contributions
 course+0.013
ions+0.012
 July+0.012
é+0.011
issa+0.011
Neg logit contributions
edly-0.020
 faces-0.019
 face-0.019
by-0.018
 tum-0.018
Token her
Token ID607
Feature activation+9.32e-02

Pos logit contributions
ions+0.012
 course+0.012
e+0.011
 July+0.011
hing+0.011
Neg logit contributions
aj-0.008
edly-0.008
powers-0.008
 spine-0.008
 tum-0.007
Token hands
Token ID2832
Feature activation+3.36e-02

Pos logit contributions
edly+0.149
aj+0.133
 upon+0.129
by+0.127
powers+0.124
Neg logit contributions
 course-0.149
e-0.148
 July-0.146
ions-0.142
é-0.138
Token.
Token ID13
Feature activation+9.14e-03

Pos logit contributions
edly+0.027
 faces+0.024
aj+0.024
 face+0.022
powers+0.022
Neg logit contributions
 course-0.087
ions-0.084
é-0.081
 July-0.081
ner-0.080
Token It
Token ID632
Feature activation-1.43e-02

Pos logit contributions
edly+0.012
 faces+0.011
 face+0.011
by+0.010
 tum+0.010
Neg logit contributions
 course-0.020
ions-0.019
 July-0.019
é-0.018
neath-0.018
Token was
Token ID373
Feature activation-5.85e-03

Pos logit contributions
 course+0.033
ions+0.032
 July+0.032
ner+0.030
é+0.030
Neg logit contributions
edly-0.016
 face-0.015
 faces-0.015
 tum-0.013
by-0.013
Token so
Token ID523
Feature activation+1.05e-02

Pos logit contributions
 course+0.013
ions+0.012
 July+0.012
é+0.012
issa+0.012
Neg logit contributions
edly-0.008
 faces-0.007
 face-0.007
by-0.006
 tum-0.006
Token so
Token ID523
Feature activation+1.58e-02

Pos logit contributions
 course+0.012
ions+0.012
 July+0.011
é+0.011
neath+0.011
Neg logit contributions
edly-0.008
 face-0.007
 faces-0.007
 tum-0.007
by-0.007
Token excited
Token ID6568
Feature activation+3.79e-02

Pos logit contributions
edly+0.020
 faces+0.019
 face+0.018
by+0.017
aj+0.017
Neg logit contributions
 course-0.034
 July-0.033
ions-0.032
é-0.032
ner-0.031
Token and
Token ID290
Feature activation+1.06e-02

Pos logit contributions
edly+0.047
aj+0.046
suits+0.044
powers+0.042
suit+0.041
Neg logit contributions
 course-0.085
ions-0.077
ting-0.073
 introducing-0.073
issa-0.072
Token cl
Token ID537
Feature activation-1.96e-02

Pos logit contributions
edly+0.015
aj+0.013
by+0.013
 faces+0.013
powers+0.013
Neg logit contributions
 course-0.021
 July-0.020
ner-0.019
 introducing-0.019
ions-0.019
Tokenapped
Token ID6320
Feature activation-3.69e-03

Pos logit contributions
 course+0.032
ions+0.031
 July+0.030
é+0.030
neath+0.029
Neg logit contributions
edly-0.035
 faces-0.034
 face-0.033
 tum-0.031
by-0.031
Token her
Token ID607
Feature activation+9.31e-02

Pos logit contributions
 course+0.006
ions+0.006
 July+0.006
 introducing+0.006
hing+0.005
Neg logit contributions
aj-0.006
powers-0.006
suits-0.006
edly-0.006
 spine-0.006
Token hands
Token ID2832
Feature activation+4.02e-02

Pos logit contributions
edly+0.132
aj+0.121
 faces+0.117
by+0.110
suits+0.108
Neg logit contributions
 course-0.174
 July-0.161
neath-0.160
é-0.160
ions-0.158
Token!
Token ID0
Feature activation+7.53e-03

Pos logit contributions
edly+0.035
 faces+0.033
 face+0.031
aj+0.031
by+0.029
Neg logit contributions
 course-0.101
ions-0.098
 July-0.095
é-0.094
ner-0.094
Token
Token ID198
Feature activation+7.54e-03

Pos logit contributions
edly+0.011
 faces+0.010
 face+0.010
by+0.009
 tum+0.009
Neg logit contributions
 course-0.016
ions-0.015
 July-0.015
é-0.014
neath-0.014
Token
Token ID198
Feature activation-6.56e-03

Pos logit contributions
edly+0.009
 face+0.009
 faces+0.008
by+0.008
aj+0.008
Neg logit contributions
 course-0.017
ions-0.016
 July-0.016
é-0.016
issa-0.015
TokenDaddy
Token ID48280
Feature activation-1.34e-02

Pos logit contributions
 course+0.013
ions+0.013
 July+0.012
é+0.012
igh+0.012
Neg logit contributions
edly-0.010
 face-0.009
 faces-0.009
powers-0.008
by-0.008
Token!"
Token ID2474
Feature activation+1.19e-02

Pos logit contributions
edly+0.016
 faces+0.016
 face+0.015
aj+0.015
 tum+0.015
Neg logit contributions
 course-0.016
ions-0.015
 July-0.014
é-0.014
ner-0.014
Token Mom
Token ID11254
Feature activation-8.67e-03

Pos logit contributions
edly+0.026
by+0.024
suit+0.023
eed+0.023
suits+0.022
Neg logit contributions
 course-0.018
 July-0.017
ions-0.015
on-0.015
neath-0.014
Token said
Token ID531
Feature activation+3.70e-02

Pos logit contributions
 course+0.020
ions+0.020
 July+0.019
é+0.019
neath+0.019
Neg logit contributions
edly-0.010
 face-0.009
 faces-0.009
 tum-0.008
by-0.008
Token,
Token ID11
Feature activation+2.88e-02

Pos logit contributions
aj+0.053
edly+0.053
powers+0.052
by+0.052
 pockets+0.049
Neg logit contributions
ner-0.064
ions-0.063
 course-0.061
otion-0.059
é-0.058
Token waving
Token ID25849
Feature activation+2.46e-03

Pos logit contributions
edly+0.050
 face+0.048
 faces+0.048
 tum+0.044
 back+0.044
Neg logit contributions
 course-0.050
ions-0.048
 July-0.046
é-0.045
issa-0.044
Token her
Token ID607
Feature activation+9.31e-02

Pos logit contributions
aj+0.004
edly+0.004
powers+0.004
 spine+0.004
ouse+0.004
Neg logit contributions
 course-0.004
ions-0.004
ting-0.004
 July-0.004
e-0.004
Token arms
Token ID5101
Feature activation+3.55e-02

Pos logit contributions
edly+0.125
aj+0.113
 faces+0.111
powers+0.110
 tum+0.108
Neg logit contributions
 course-0.184
ions-0.173
igh-0.166
 July-0.164
é-0.163
Token.
Token ID13
Feature activation+5.49e-03

Pos logit contributions
edly+0.038
 faces+0.036
aj+0.035
 face+0.033
powers+0.033
Neg logit contributions
 course-0.082
ions-0.079
é-0.076
neath-0.076
hing-0.076
Token
Token ID198
Feature activation+4.12e-03

Pos logit contributions
by+0.012
edly+0.011
 faces+0.010
powers+0.009
ies+0.009
Neg logit contributions
 course-0.008
ions-0.007
 July-0.007
neath-0.007
ner-0.007
Token
Token ID198
Feature activation-1.20e-02

Pos logit contributions
edly+0.007
 faces+0.006
 spine+0.006
suit+0.006
 face+0.006
Neg logit contributions
 course-0.008
ions-0.008
 July-0.007
neath-0.007
Little-0.007
TokenThe
Token ID464
Feature activation-3.93e-02

Pos logit contributions
 course+0.021
ions+0.020
 July+0.019
neath+0.018
ner+0.018
Neg logit contributions
edly-0.021
 faces-0.020
 face-0.020
 tum-0.019
by-0.019
Token her
Token ID607
Feature activation+6.61e-02

Pos logit contributions
 face+0.006
 faces+0.005
 back+0.005
 to+0.005
edly+0.005
Neg logit contributions
 course-0.008
ions-0.008
opl-0.008
é-0.007
 July-0.007
Token seat
Token ID5852
Feature activation+9.53e-03

Pos logit contributions
edly+0.100
powers+0.085
by+0.081
 faces+0.080
emo+0.078
Neg logit contributions
 course-0.121
issa-0.118
 July-0.116
ions-0.115
é-0.113
Token and
Token ID290
Feature activation+2.27e-02

Pos logit contributions
edly+0.010
powers+0.009
 faces+0.009
 face+0.009
by+0.008
Neg logit contributions
 course-0.022
ions-0.022
é-0.021
issa-0.021
 July-0.021
Token cl
Token ID537
Feature activation-7.72e-03

Pos logit contributions
edly+0.028
 face+0.027
 faces+0.026
by+0.024
 tum+0.024
Neg logit contributions
 course-0.051
ions-0.049
 July-0.048
é-0.047
opl-0.046
Tokenapped
Token ID6320
Feature activation+4.00e-03

Pos logit contributions
 course+0.014
ions+0.013
 July+0.013
é+0.012
issa+0.012
Neg logit contributions
edly-0.013
 faces-0.013
 face-0.013
by-0.012
 tum-0.012
Token her
Token ID607
Feature activation+9.31e-02

Pos logit contributions
powers+0.006
edly+0.006
aj+0.006
 spine+0.006
suits+0.006
Neg logit contributions
ions-0.006
 course-0.006
 July-0.006
 introducing-0.006
é-0.006
Token hands
Token ID2832
Feature activation+4.42e-02

Pos logit contributions
edly+0.139
aj+0.119
 faces+0.115
powers+0.115
by+0.113
Neg logit contributions
 July-0.162
 course-0.160
é-0.155
ions-0.153
issa-0.153
Token,
Token ID11
Feature activation+3.72e-02

Pos logit contributions
edly+0.035
 faces+0.031
 face+0.030
aj+0.028
powers+0.028
Neg logit contributions
 course-0.115
ions-0.112
 July-0.110
é-0.108
ner-0.107
Token ready
Token ID3492
Feature activation+4.50e-02

Pos logit contributions
edly+0.053
 faces+0.049
 face+0.048
by+0.048
powers+0.047
Neg logit contributions
 course-0.075
 July-0.071
ions-0.071
é-0.068
neath-0.067
Token to
Token ID284
Feature activation+1.32e-02

Pos logit contributions
edly+0.037
powers+0.033
aj+0.032
by+0.031
 faces+0.030
Neg logit contributions
 course-0.112
ions-0.110
 July-0.108
ner-0.104
é-0.103
Token eat
Token ID4483
Feature activation-1.43e-02

Pos logit contributions
 face+0.016
 faces+0.013
 back+0.013
 tum+0.012
 to+0.011
Neg logit contributions
ions-0.031
 course-0.031
issa-0.030
é-0.029
 July-0.029
Token didn
Token ID1422
Feature activation+4.37e-02

Pos logit contributions
 course+0.013
é+0.011
issa+0.011
 July+0.011
neath+0.011
Neg logit contributions
edly-0.008
 spine-0.007
powers-0.007
 faces-0.007
aj-0.007
Token't
Token ID470
Feature activation+2.84e-02

Pos logit contributions
 face+0.078
edly+0.076
 to+0.075
 back+0.074
 faces+0.073
Neg logit contributions
 course-0.068
 July-0.065
ions-0.063
igh-0.060
issa-0.059
Token know
Token ID760
Feature activation+2.04e-02

Pos logit contributions
edly+0.048
aj+0.047
 faces+0.041
powers+0.040
 tum+0.040
Neg logit contributions
 course-0.051
 July-0.048
hing-0.047
ner-0.047
é-0.046
Token what
Token ID644
Feature activation-3.15e-03

Pos logit contributions
 faces+0.024
 face+0.024
 eyes+0.022
 to+0.021
edly+0.021
Neg logit contributions
 July-0.044
é-0.044
ions-0.043
neath-0.043
 course-0.042
Token "
Token ID366
Feature activation-5.85e-02

Pos logit contributions
 course+0.008
ions+0.008
 July+0.007
neath+0.007
é+0.007
Neg logit contributions
edly-0.003
 face-0.003
 faces-0.003
 tum-0.003
by-0.002
Tokensm
Token ID5796
Feature activation-1.19e-01

Pos logit contributions
 course+0.097
 July+0.094
ions+0.093
issa+0.087
é+0.085
Neg logit contributions
edly-0.106
 face-0.097
 faces-0.097
by-0.095
powers-0.092
Tokenoother
Token ID31724
Feature activation-5.44e-02

Pos logit contributions
roads+0.146
Michael+0.144
icians+0.142
 course+0.138
ner+0.137
Neg logit contributions
 to-0.222
edly-0.216
by-0.215
 faces-0.211
 back-0.208
Token"
Token ID1
Feature activation-1.86e-02

Pos logit contributions
 course+0.116
rob+0.104
 July+0.103
opl+0.103
eday+0.101
Neg logit contributions
 to-0.068
.-0.066
 face-0.066
edly-0.062
ouse-0.058
Token meant
Token ID4001
Feature activation-5.64e-03

Pos logit contributions
 course+0.028
ions+0.026
é+0.025
ner+0.024
issa+0.024
Neg logit contributions
edly-0.035
 faces-0.034
 face-0.033
aj-0.033
by-0.032
Token,
Token ID11
Feature activation-2.18e-03

Pos logit contributions
 course+0.013
ions+0.012
ner+0.012
 July+0.012
é+0.012
Neg logit contributions
edly-0.007
by-0.006
 faces-0.006
aj-0.006
 face-0.006
Token but
Token ID475
Feature activation-1.68e-02

Pos logit contributions
 course+0.005
ions+0.005
 July+0.005
é+0.005
ner+0.005
Neg logit contributions
edly-0.003
by-0.002
 faces-0.002
 face-0.002
aj-0.002
Token shell
Token ID7582
Feature activation+1.52e-02

Pos logit contributions
edly+0.073
aj+0.063
ashed+0.057
 upon+0.056
by+0.056
Neg logit contributions
 course-0.113
ions-0.103
 July-0.099
opl-0.099
e-0.098
Token.
Token ID13
Feature activation+1.60e-03

Pos logit contributions
edly+0.015
 faces+0.015
by+0.013
powers+0.013
aj+0.012
Neg logit contributions
 course-0.038
ions-0.036
on-0.033
igh-0.033
ner-0.033
Token
Token ID198
Feature activation+1.01e-02

Pos logit contributions
edly+0.002
 face+0.002
 faces+0.002
 tum+0.002
by+0.002
Neg logit contributions
 course-0.003
ions-0.003
 July-0.003
issa-0.003
é-0.003
Token
Token ID198
Feature activation-3.91e-03

Pos logit contributions
edly+0.008
 face+0.007
 faces+0.007
by+0.006
powers+0.006
Neg logit contributions
 course-0.027
ions-0.026
 July-0.026
issa-0.025
é-0.025
TokenThe
Token ID464
Feature activation-3.92e-02

Pos logit contributions
ions+0.007
 July+0.007
 course+0.007
issa+0.007
otion+0.007
Neg logit contributions
edly-0.006
 face-0.006
powers-0.005
 faces-0.005
 to-0.005
Token tort
Token ID7619
Feature activation-1.17e-01

Pos logit contributions
ions+0.085
 July+0.081
 course+0.081
issa+0.080
neath+0.078
Neg logit contributions
edly-0.048
 face-0.047
 faces-0.046
 back-0.045
 tum-0.041
Tokenoise
Token ID25678
Feature activation-2.90e-02

Pos logit contributions
 July+0.162
aster+0.149
 oven+0.136
 cinema+0.130
 bolt+0.130
Neg logit contributions
edly-0.230
by-0.217
 upon-0.207
ed-0.199
pt-0.199
Token was
Token ID373
Feature activation+4.65e-03

Pos logit contributions
 course+0.064
ions+0.060
é+0.059
 July+0.058
issa+0.057
Neg logit contributions
edly-0.036
 faces-0.035
 face-0.033
aj-0.032
 tum-0.031
Token happy
Token ID3772
Feature activation+5.07e-02

Pos logit contributions
edly+0.006
 faces+0.005
 face+0.005
by+0.005
 tum+0.005
Neg logit contributions
 course-0.010
ions-0.010
 July-0.010
é-0.010
issa-0.009
Token to
Token ID284
Feature activation+2.74e-04

Pos logit contributions
aj+0.071
suits+0.065
edly+0.065
powers+0.064
suit+0.062
Neg logit contributions
 course-0.109
 introducing-0.100
ions-0.099
ting-0.091
on-0.089
Token help
Token ID1037
Feature activation-1.73e-02

Pos logit contributions
edly+0.000
aj+0.000
 faces+0.000
powers+0.000
suits+0.000
Neg logit contributions
 course-0.001
 July-0.000
ions-0.000
e-0.000
é-0.000
Token little
Token ID1310
Feature activation-2.06e-02

Pos logit contributions
 course+0.014
 July+0.013
ions+0.013
é+0.013
issa+0.013
Neg logit contributions
edly-0.009
 faces-0.008
 face-0.007
aj-0.007
powers-0.007
Token er
Token ID1931
Feature activation-1.06e-01

Pos logit contributions
 course+0.044
ions+0.042
 July+0.041
é+0.040
neath+0.040
Neg logit contributions
edly-0.027
 face-0.026
 faces-0.026
 tum-0.024
by-0.024
Tokenaser
Token ID6005
Feature activation-5.00e-02

Pos logit contributions
 July+0.186
neath+0.177
 course+0.176
ions+0.160
opl+0.156
Neg logit contributions
edly-0.174
 face-0.168
powers-0.163
 faces-0.160
by-0.157
Token.
Token ID13
Feature activation-2.20e-02

Pos logit contributions
 course+0.126
 July+0.124
ions+0.124
neath+0.120
é+0.118
Neg logit contributions
edly-0.045
 face-0.042
 faces-0.039
 tum-0.037
by-0.034
Token The
Token ID383
Feature activation-1.97e-02

Pos logit contributions
 course+0.050
ions+0.048
issa+0.048
 July+0.046
icians+0.046
Neg logit contributions
 face-0.025
edly-0.025
 tum-0.023
 faces-0.022
 He-0.022
Token er
Token ID1931
Feature activation-1.17e-01

Pos logit contributions
 course+0.027
ions+0.026
 July+0.025
é+0.024
issa+0.023
Neg logit contributions
edly-0.041
 faces-0.039
 face-0.039
 tum-0.037
by-0.037
Tokenaser
Token ID6005
Feature activation-4.75e-02

Pos logit contributions
 July+0.220
 course+0.209
neath+0.203
hing+0.186
ions+0.184
Neg logit contributions
edly-0.177
 face-0.167
powers-0.166
 faces-0.158
ouse-0.156
Token was
Token ID373
Feature activation-1.43e-02

Pos logit contributions
 course+0.096
issa+0.085
é+0.084
ions+0.084
on+0.083
Neg logit contributions
edly-0.073
 faces-0.068
aj-0.064
by-0.062
 spine-0.061
Token sad
Token ID6507
Feature activation+4.09e-02

Pos logit contributions
 course+0.033
 July+0.032
ions+0.032
é+0.031
issa+0.030
Neg logit contributions
edly-0.017
 faces-0.015
 face-0.014
by-0.014
aj-0.014
Token because
Token ID780
Feature activation-3.34e-02

Pos logit contributions
edly+0.067
 faces+0.064
aj+0.062
by+0.059
powers+0.059
Neg logit contributions
ions-0.071
 course-0.071
é-0.064
 July-0.063
issa-0.061
Token it
Token ID340
Feature activation-2.15e-02

Pos logit contributions
 course+0.067
hing+0.066
é+0.064
rob+0.064
neath+0.062
Neg logit contributions
 face-0.044
 to-0.042
 eyes-0.042
 back-0.041
 tum-0.040
Token would
Token ID561
Feature activation+2.96e-02

Pos logit contributions
edly+0.031
 faces+0.030
aj+0.029
 face+0.029
 spine+0.027
Neg logit contributions
 course-0.042
é-0.040
ions-0.039
issa-0.039
ner-0.038
Token.
Token ID13
Feature activation+1.23e-02

Pos logit contributions
edly+0.047
 faces+0.044
aj+0.041
 face+0.041
powers+0.041
Neg logit contributions
 course-0.057
ions-0.052
 July-0.052
é-0.051
neath-0.049
Token
Token ID198
Feature activation+1.63e-02

Pos logit contributions
edly+0.017
 faces+0.016
 face+0.015
powers+0.014
by+0.014
Neg logit contributions
 course-0.026
ions-0.025
 July-0.024
é-0.024
neath-0.024
Token
Token ID198
Feature activation-2.81e-04

Pos logit contributions
edly+0.015
 face+0.014
 faces+0.014
by+0.013
 tum+0.012
Neg logit contributions
 course-0.041
ions-0.039
 July-0.039
é-0.038
issa-0.037
TokenThe
Token ID464
Feature activation-2.83e-02

Pos logit contributions
ions+0.001
 course+0.001
 July+0.001
igh+0.001
issa+0.001
Neg logit contributions
edly-0.000
 face-0.000
 faces-0.000
by-0.000
powers-0.000
Token tort
Token ID7619
Feature activation-1.15e-01

Pos logit contributions
 course+0.055
ions+0.055
 July+0.052
neath+0.051
é+0.050
Neg logit contributions
edly-0.043
 face-0.041
 faces-0.040
 tum-0.038
by-0.037
Tokenoise
Token ID25678
Feature activation-3.18e-02

Pos logit contributions
 July+0.167
 tornado+0.148
 fireworks+0.140
 course+0.134
 bolt+0.133
Neg logit contributions
by-0.224
edly-0.218
 upon-0.200
pt-0.191
ies-0.188
Token started
Token ID2067
Feature activation+9.14e-03

Pos logit contributions
 course+0.070
ions+0.066
 July+0.065
é+0.064
issa+0.063
Neg logit contributions
edly-0.040
 faces-0.038
 face-0.037
aj-0.034
 tum-0.034
Token on
Token ID319
Feature activation-6.90e-03

Pos logit contributions
edly+0.009
 faces+0.008
 face+0.007
aj+0.007
powers+0.007
Neg logit contributions
 course-0.022
 July-0.022
ions-0.022
é-0.021
neath-0.021
Token his
Token ID465
Feature activation+4.59e-02

Pos logit contributions
 course+0.012
ions+0.011
 July+0.011
é+0.011
issa+0.011
Neg logit contributions
edly-0.012
 faces-0.011
 face-0.011
 tum-0.011
by-0.011
Token adventure
Token ID8855
Feature activation+4.60e-03

Pos logit contributions
edly+0.069
aj+0.060
 faces+0.060
 face+0.059
 upon+0.058
Neg logit contributions
 course-0.091
 July-0.083
issa-0.079
opl-0.079
é-0.078
Token that
Token ID326
Feature activation-5.87e-03

Pos logit contributions
 course+0.057
ions+0.057
 July+0.057
é+0.054
igh+0.054
Neg logit contributions
 face-0.038
 to-0.037
edly-0.037
 faces-0.036
 back-0.036
Token said
Token ID531
Feature activation+1.68e-02

Pos logit contributions
 course+0.015
ions+0.014
 July+0.014
é+0.014
issa+0.013
Neg logit contributions
edly-0.006
 face-0.005
 faces-0.005
 tum-0.005
by-0.004
Token "
Token ID366
Feature activation-6.94e-02

Pos logit contributions
edly+0.026
 faces+0.025
aj+0.024
by+0.024
powers+0.023
Neg logit contributions
 course-0.032
ions-0.031
 July-0.030
ner-0.029
é-0.029
TokenNo
Token ID2949
Feature activation-5.60e-03

Pos logit contributions
 course+0.120
ions+0.116
 July+0.116
issa+0.110
é+0.107
Neg logit contributions
edly-0.121
by-0.108
 faces-0.107
 face-0.107
powers-0.104
Token T
Token ID309
Feature activation-4.20e-02

Pos logit contributions
ions+0.012
 course+0.012
 July+0.012
neath+0.012
é+0.012
Neg logit contributions
 face-0.007
 faces-0.007
 tum-0.007
 eyes-0.006
edly-0.006
Tokenresp
Token ID4363
Feature activation-1.15e-01

Pos logit contributions
 course+0.078
ions+0.075
 July+0.071
é+0.070
issa+0.069
Neg logit contributions
edly-0.068
 faces-0.065
 face-0.063
 tum-0.060
by-0.059
Tokenassing
Token ID19696
Feature activation-4.09e-02

Pos logit contributions
ions+0.222
 July+0.220
otion+0.199
issa+0.199
azy+0.192
Neg logit contributions
 back-0.155
edly-0.154
 faces-0.146
 face-0.143
 spine-0.136
Token".
Token ID1911
Feature activation+1.35e-02

Pos logit contributions
 course+0.078
 July+0.078
ions+0.072
neath+0.072
issa+0.072
Neg logit contributions
edly-0.057
 to-0.055
 back-0.053
 face-0.053
 upon-0.051
Token But
Token ID887
Feature activation-1.38e-02

Pos logit contributions
by+0.024
edly+0.024
powers+0.023
aj+0.022
emo+0.021
Neg logit contributions
 course-0.019
issa-0.018
neath-0.018
é-0.017
ner-0.017
Token Lily
Token ID20037
Feature activation+3.07e-04

Pos logit contributions
 course+0.026
ions+0.025
 July+0.024
é+0.023
issa+0.023
Neg logit contributions
edly-0.022
 face-0.021
 faces-0.021
 tum-0.020
by-0.019
Token was
Token ID373
Feature activation+7.66e-03

Pos logit contributions
edly+0.000
 faces+0.000
powers+0.000
aj+0.000
 face+0.000
Neg logit contributions
 course-0.001
issa-0.001
é-0.001
neath-0.001
azy-0.001
Token old
Token ID1468
Feature activation-3.94e-02

Pos logit contributions
ions+0.090
 course+0.089
 July+0.087
é+0.086
ner+0.085
Neg logit contributions
 face-0.050
 tum-0.042
 back-0.041
 faces-0.040
 eyes-0.040
Token man
Token ID582
Feature activation-3.56e-02

Pos logit contributions
 course+0.078
ions+0.075
 July+0.074
é+0.072
neath+0.071
Neg logit contributions
edly-0.057
 face-0.056
 faces-0.054
by-0.051
 tum-0.051
Token?"
Token ID1701
Feature activation+4.72e-03

Pos logit contributions
 course+0.065
ions+0.065
 July+0.063
neath+0.060
é+0.060
Neg logit contributions
 face-0.053
 to-0.053
by-0.052
 back-0.051
 upon-0.050
Token
Token ID198
Feature activation+1.28e-02

Pos logit contributions
edly+0.006
 face+0.005
 faces+0.005
 tum+0.005
by+0.005
Neg logit contributions
 course-0.011
ions-0.010
 July-0.010
é-0.010
issa-0.010
TokenThe
Token ID464
Feature activation-3.29e-02

Pos logit contributions
edly+0.009
 face+0.009
 faces+0.008
by+0.007
 tum+0.007
Neg logit contributions
ions-0.034
 course-0.034
 July-0.033
issa-0.032
é-0.032
Token tort
Token ID7619
Feature activation-1.14e-01

Pos logit contributions
 course+0.076
ions+0.073
 July+0.072
é+0.070
issa+0.069
Neg logit contributions
edly-0.039
 faces-0.036
 face-0.036
 tum-0.032
by-0.032
Tokenoise
Token ID25678
Feature activation-2.76e-02

Pos logit contributions
 July+0.172
 fireworks+0.155
 tornado+0.146
 cinema+0.138
 films+0.137
Neg logit contributions
edly-0.218
by-0.210
pt-0.190
ouse-0.187
 upon-0.185
Token replied
Token ID8712
Feature activation+4.18e-02

Pos logit contributions
 course+0.060
ions+0.056
 July+0.056
é+0.055
issa+0.054
Neg logit contributions
edly-0.036
 faces-0.033
 face-0.033
aj-0.030
 tum-0.030
Token:
Token ID25
Feature activation+1.18e-02

Pos logit contributions
edly+0.055
aj+0.053
 faces+0.053
powers+0.052
by+0.050
Neg logit contributions
 course-0.087
ions-0.081
 July-0.079
é-0.078
ner-0.077
Token "
Token ID366
Feature activation-5.98e-02

Pos logit contributions
edly+0.013
 face+0.013
 faces+0.012
 tum+0.011
 back+0.011
Neg logit contributions
 course-0.027
ions-0.027
 July-0.026
é-0.026
issa-0.025
TokenI
Token ID40
Feature activation+2.95e-04

Pos logit contributions
 July+0.103
 course+0.101
ions+0.098
issa+0.095
igh+0.095
Neg logit contributions
edly-0.104
powers-0.090
by-0.086
 upon-0.084
 to-0.084
Token are
Token ID389
Feature activation-3.87e-02

Pos logit contributions
 course+0.037
ions+0.036
 July+0.036
é+0.034
neath+0.034
Neg logit contributions
 face-0.020
 faces-0.020
edly-0.019
by-0.018
 tum-0.018
Token you
Token ID345
Feature activation-3.04e-03

Pos logit contributions
opl+0.060
 course+0.059
ions+0.059
hing+0.058
é+0.058
Neg logit contributions
 faces-0.069
 face-0.067
 to-0.067
 tum-0.065
edly-0.065
Token here
Token ID994
Feature activation-4.09e-03

Pos logit contributions
 course+0.006
ions+0.006
 July+0.005
neath+0.005
é+0.005
Neg logit contributions
 face-0.005
 back-0.004
 upon-0.004
 to-0.004
 faces-0.004
Token,
Token ID11
Feature activation-6.12e-02

Pos logit contributions
 course+0.007
ions+0.007
 July+0.006
é+0.006
issa+0.006
Neg logit contributions
 face-0.007
edly-0.007
 faces-0.007
 back-0.007
 tum-0.007
Token little
Token ID1310
Feature activation-5.61e-02

Pos logit contributions
ions+0.129
issa+0.126
 course+0.125
é+0.124
 July+0.123
Neg logit contributions
 face-0.083
 tum-0.075
 faces-0.073
 back-0.071
 upon-0.070
Token cater
Token ID20825
Feature activation-1.13e-01

Pos logit contributions
 course+0.112
ions+0.110
 July+0.109
é+0.106
issa+0.104
Neg logit contributions
edly-0.078
 face-0.077
 faces-0.076
 tum-0.071
by-0.071
Tokenpillar
Token ID41643
Feature activation-4.84e-02

Pos logit contributions
 July+0.183
neath+0.160
issa+0.155
ators+0.155
ions+0.152
Neg logit contributions
powers-0.196
edly-0.193
 to-0.178
 face-0.177
 upon-0.174
Token?"
Token ID1701
Feature activation-2.08e-03

Pos logit contributions
 course+0.076
ions+0.075
 July+0.074
é+0.070
neath+0.069
Neg logit contributions
 face-0.086
edly-0.085
 faces-0.082
 tum-0.081
by-0.081
Token the
Token ID262
Feature activation-4.87e-02

Pos logit contributions
ions+0.004
 course+0.004
 July+0.004
issa+0.004
opl+0.004
Neg logit contributions
edly-0.003
 face-0.003
 faces-0.003
 tum-0.003
by-0.002
Token monkey
Token ID21657
Feature activation-1.24e-02

Pos logit contributions
 course+0.099
ions+0.094
 July+0.093
é+0.090
issa+0.088
Neg logit contributions
edly-0.071
 faces-0.066
 face-0.066
by-0.061
 tum-0.061
Token said
Token ID531
Feature activation+4.00e-02

Pos logit contributions
 course+0.026
ions+0.024
 July+0.024
é+0.024
issa+0.023
Neg logit contributions
edly-0.017
 faces-0.016
 face-0.016
 tum-0.015
by-0.015
Token he
Token ID339
Feature activation+9.03e-03

Pos logit contributions
 course+0.033
ions+0.032
 July+0.032
é+0.031
issa+0.030
Neg logit contributions
edly-0.013
 faces-0.012
 face-0.012
by-0.011
 tum-0.011
Token noticed
Token ID6810
Feature activation-5.51e-03

Pos logit contributions
edly+0.012
 faces+0.011
 face+0.010
powers+0.010
aj+0.009
Neg logit contributions
 course-0.019
 July-0.018
issa-0.018
ions-0.018
é-0.018
Token something
Token ID1223
Feature activation+1.91e-02

Pos logit contributions
 course+0.012
ions+0.011
 July+0.011
é+0.011
neath+0.011
Neg logit contributions
edly-0.007
 face-0.007
 faces-0.007
 tum-0.006
by-0.006
Token strange
Token ID6283
Feature activation+5.80e-03

Pos logit contributions
edly+0.027
 faces+0.024
aj+0.023
by+0.023
 face+0.023
Neg logit contributions
 course-0.041
ions-0.039
 July-0.037
issa-0.037
é-0.036
Token.
Token ID13
Feature activation+4.74e-03

Pos logit contributions
edly+0.006
aj+0.005
 faces+0.005
powers+0.005
 face+0.005
Neg logit contributions
 course-0.014
ions-0.014
 July-0.013
é-0.013
issa-0.013
Token Among
Token ID9754
Feature activation-1.13e-01

Pos logit contributions
edly+0.006
 faces+0.006
 face+0.006
by+0.006
aj+0.006
Neg logit contributions
 course-0.010
ions-0.010
 July-0.009
é-0.009
neath-0.009
Tokenst
Token ID301
Feature activation-1.93e-02

Pos logit contributions
otion+0.211
ner+0.205
opl+0.203
 course+0.202
 bolt+0.200
Neg logit contributions
by-0.162
 to-0.156
 upon-0.153
 back-0.148
edly-0.133
Token the
Token ID262
Feature activation-9.14e-03

Pos logit contributions
 course+0.036
ions+0.035
 July+0.034
é+0.033
neath+0.033
Neg logit contributions
edly-0.032
 faces-0.030
 face-0.029
 tum-0.027
by-0.027
Token keys
Token ID8251
Feature activation-3.66e-02

Pos logit contributions
ions+0.019
 July+0.018
 course+0.018
é+0.018
neath+0.017
Neg logit contributions
 face-0.012
 faces-0.012
 eyes-0.011
 back-0.010
 to-0.010
Token,
Token ID11
Feature activation-1.81e-02

Pos logit contributions
 course+0.086
 July+0.084
ions+0.083
é+0.080
issa+0.079
Neg logit contributions
edly-0.038
 face-0.037
 faces-0.036
by-0.032
 tum-0.032
Token there
Token ID612
Feature activation+8.88e-04

Pos logit contributions
 course+0.042
ions+0.040
 July+0.039
é+0.039
opl+0.038
Neg logit contributions
edly-0.021
 face-0.020
 faces-0.020
 tum-0.018
by-0.017
Token that
Token ID326
Feature activation-1.45e-02

Pos logit contributions
 course+0.060
 July+0.059
ions+0.059
é+0.057
neath+0.057
Neg logit contributions
edly-0.046
 face-0.046
 faces-0.044
 to-0.043
 back-0.043
Token said
Token ID531
Feature activation+3.88e-03

Pos logit contributions
 course+0.036
ions+0.035
 July+0.035
é+0.034
issa+0.034
Neg logit contributions
edly-0.013
 face-0.013
 faces-0.013
 tum-0.011
 back-0.010
Token "
Token ID366
Feature activation-6.96e-02

Pos logit contributions
edly+0.005
 faces+0.005
 face+0.005
by+0.004
 tum+0.004
Neg logit contributions
 course-0.008
ions-0.008
 July-0.008
é-0.008
issa-0.007
TokenNo
Token ID2949
Feature activation-8.69e-03

Pos logit contributions
 course+0.117
 July+0.111
issa+0.110
ions+0.109
é+0.104
Neg logit contributions
edly-0.123
 faces-0.111
by-0.110
 face-0.109
powers-0.107
Token T
Token ID309
Feature activation-3.51e-02

Pos logit contributions
 course+0.018
ions+0.018
 July+0.018
é+0.018
neath+0.018
Neg logit contributions
 face-0.011
 faces-0.011
edly-0.010
 tum-0.010
 eyes-0.010
Tokenresp
Token ID4363
Feature activation-1.13e-01

Pos logit contributions
 course+0.066
ions+0.063
 July+0.062
é+0.059
issa+0.058
Neg logit contributions
edly-0.056
 faces-0.053
 face-0.053
by-0.049
 tum-0.049
Tokenassing
Token ID19696
Feature activation-4.15e-02

Pos logit contributions
 July+0.207
ions+0.206
issa+0.199
otion+0.190
 course+0.183
Neg logit contributions
 back-0.161
edly-0.154
 faces-0.150
suit-0.148
suits-0.145
Token".
Token ID1911
Feature activation+1.72e-02

Pos logit contributions
 course+0.080
 July+0.079
issa+0.075
neath+0.073
é+0.073
Neg logit contributions
edly-0.056
 to-0.055
 back-0.053
 face-0.052
 faces-0.050
Token
Token ID198
Feature activation+9.37e-03

Pos logit contributions
edly+0.025
by+0.023
aj+0.023
 faces+0.022
 face+0.022
Neg logit contributions
 course-0.035
ions-0.034
 July-0.032
é-0.032
neath-0.032
Token
Token ID198
Feature activation-5.88e-03

Pos logit contributions
edly+0.010
 faces+0.010
 face+0.009
by+0.009
 tum+0.009
Neg logit contributions
 course-0.022
ions-0.021
 July-0.021
é-0.021
neath-0.020
Token"
Token ID1
Feature activation-5.53e-02

Pos logit contributions
 course+0.010
ions+0.010
 July+0.010
é+0.009
issa+0.009
Neg logit contributions
edly-0.010
 faces-0.009
 face-0.009
by-0.009
 tum-0.009
Token big
Token ID1263
Feature activation-2.62e-02

Pos logit contributions
 face+0.022
edly+0.022
 faces+0.022
 tum+0.020
by+0.019
Neg logit contributions
 course-0.054
ions-0.053
 July-0.051
é-0.050
neath-0.049
Token sign
Token ID1051
Feature activation-2.73e-02

Pos logit contributions
ions+0.061
é+0.058
azy+0.056
 course+0.056
neath+0.056
Neg logit contributions
 face-0.033
 faces-0.029
 back-0.029
 tum-0.028
 eyes-0.028
Token that
Token ID326
Feature activation-1.11e-02

Pos logit contributions
 July+0.056
ions+0.055
 course+0.055
é+0.053
igh+0.053
Neg logit contributions
 face-0.037
 to-0.036
edly-0.035
 back-0.035
 faces-0.035
Token said
Token ID531
Feature activation+1.03e-02

Pos logit contributions
 course+0.028
ions+0.028
 July+0.027
é+0.026
opl+0.026
Neg logit contributions
 face-0.009
edly-0.009
 faces-0.009
 tum-0.008
 back-0.007
Token "
Token ID366
Feature activation-7.10e-02

Pos logit contributions
edly+0.015
 faces+0.014
 face+0.013
by+0.013
aj+0.013
Neg logit contributions
 course-0.021
ions-0.020
 July-0.020
é-0.019
ner-0.019
TokenEX
Token ID6369
Feature activation-1.13e-01

Pos logit contributions
 course+0.121
ions+0.115
 July+0.114
issa+0.109
é+0.105
Neg logit contributions
edly-0.126
 faces-0.114
 face-0.114
by-0.113
powers-0.109
TokenIT
Token ID2043
Feature activation-3.82e-02

Pos logit contributions
ions+0.190
otion+0.174
 July+0.170
osh+0.165
igh+0.159
Neg logit contributions
 back-0.180
edly-0.176
 faces-0.168
 himself-0.166
 face-0.165
Token".
Token ID1911
Feature activation+6.22e-03

Pos logit contributions
 course+0.072
 July+0.068
ions+0.065
neath+0.061
é+0.061
Neg logit contributions
edly-0.060
 face-0.058
 back-0.057
 to-0.056
 faces-0.055
Token He
Token ID679
Feature activation+2.15e-02

Pos logit contributions
edly+0.007
by+0.007
aj+0.006
powers+0.006
 faces+0.006
Neg logit contributions
 course-0.014
é-0.013
ions-0.013
 July-0.013
neath-0.013
Token asked
Token ID1965
Feature activation+1.11e-02

Pos logit contributions
edly+0.029
 faces+0.027
 face+0.026
aj+0.025
 spine+0.025
Neg logit contributions
 course-0.043
 July-0.042
issa-0.042
é-0.041
ions-0.040
Token his
Token ID465
Feature activation+3.92e-02

Pos logit contributions
edly+0.010
 faces+0.009
 face+0.009
by+0.007
 tum+0.007
Neg logit contributions
 course-0.029
ions-0.028
 July-0.028
é-0.027
issa-0.027
Token other
Token ID584
Feature activation-3.55e-02

Pos logit contributions
 course+0.025
ions+0.024
 July+0.024
é+0.023
issa+0.023
Neg logit contributions
edly-0.018
 faces-0.016
 face-0.016
by-0.015
aj-0.015
Token er
Token ID1931
Feature activation-1.09e-01

Pos logit contributions
 course+0.073
ions+0.071
 July+0.071
neath+0.068
é+0.068
Neg logit contributions
 faces-0.048
 face-0.048
edly-0.048
 tum-0.043
by-0.043
Tokenaser
Token ID6005
Feature activation-3.26e-02

Pos logit contributions
 course+0.174
 July+0.173
ions+0.158
neath+0.157
é+0.154
Neg logit contributions
edly-0.195
 faces-0.187
 face-0.186
powers-0.184
by-0.179
Token.
Token ID13
Feature activation-1.31e-03

Pos logit contributions
 course+0.093
ions+0.090
 July+0.090
é+0.087
neath+0.086
Neg logit contributions
edly-0.020
 faces-0.017
 face-0.017
 tum-0.014
by-0.014
Token This
Token ID770
Feature activation-3.03e-02

Pos logit contributions
 course+0.003
ions+0.003
 July+0.002
é+0.002
issa+0.002
Neg logit contributions
edly-0.002
 face-0.002
 faces-0.002
 tum-0.002
by-0.002
Token er
Token ID1931
Feature activation-1.12e-01

Pos logit contributions
 July+0.045
 course+0.044
ions+0.044
é+0.040
neath+0.039
Neg logit contributions
 face-0.058
 faces-0.055
edly-0.055
by-0.054
 tum-0.054
Tokenaser
Token ID6005
Feature activation-4.40e-02

Pos logit contributions
 July+0.192
 course+0.184
neath+0.173
ions+0.167
é+0.161
Neg logit contributions
edly-0.188
 face-0.181
powers-0.180
 faces-0.179
by-0.173
Token was
Token ID373
Feature activation-2.60e-02

Pos logit contributions
 course+0.103
ions+0.099
 July+0.097
é+0.095
issa+0.094
Neg logit contributions
edly-0.050
 faces-0.046
 face-0.045
by-0.041
 tum-0.041
Token independent
Token ID4795
Feature activation-2.93e-03

Pos logit contributions
 course+0.059
ions+0.057
 July+0.056
é+0.054
issa+0.054
Neg logit contributions
edly-0.031
 faces-0.029
 face-0.028
by-0.026
 tum-0.026
Token and
Token ID290
Feature activation-2.23e-02

Pos logit contributions
 course+0.007
ions+0.006
issa+0.006
ner+0.006
é+0.006
Neg logit contributions
edly-0.004
by-0.003
aj-0.003
 faces-0.003
 face-0.003
Token could
Token ID714
Feature activation+2.83e-02

Pos logit contributions
 course+0.051
ions+0.049
 July+0.048
é+0.047
issa+0.046
Neg logit contributions
edly-0.027
 faces-0.024
 face-0.023
aj-0.022
by-0.022
Token.
Token ID13
Feature activation-1.81e-03

Pos logit contributions
 course+0.025
ions+0.024
 July+0.024
é+0.023
issa+0.023
Neg logit contributions
edly-0.005
 faces-0.005
 face-0.004
by-0.004
powers-0.004
Token
Token ID220
Feature activation+4.60e-02

Pos logit contributions
 course+0.004
ions+0.004
 July+0.004
issa+0.004
é+0.004
Neg logit contributions
edly-0.002
 face-0.002
 faces-0.002
 tum-0.002
by-0.001
Token
Token ID198
Feature activation+1.78e-02

Pos logit contributions
edly+0.046
 faces+0.042
 face+0.042
by+0.037
 tum+0.037
Neg logit contributions
 course-0.113
ions-0.109
 July-0.108
é-0.105
issa-0.103
Token
Token ID198
Feature activation-2.09e-03

Pos logit contributions
edly+0.015
 face+0.013
 faces+0.013
by+0.012
 tum+0.012
Neg logit contributions
 course-0.046
ions-0.045
 July-0.044
é-0.043
issa-0.043
TokenThe
Token ID464
Feature activation-2.77e-02

Pos logit contributions
igh+0.004
ions+0.004
 course+0.004
opl+0.004
issa+0.004
Neg logit contributions
edly-0.002
 upon-0.002
 face-0.002
 faces-0.002
powers-0.002
Token er
Token ID1931
Feature activation-1.12e-01

Pos logit contributions
 course+0.060
ions+0.060
neath+0.058
 July+0.058
é+0.057
Neg logit contributions
 face-0.035
 faces-0.033
edly-0.033
 tum-0.031
by-0.029
Tokenaser
Token ID6005
Feature activation-3.84e-02

Pos logit contributions
 July+0.219
 course+0.202
neath+0.199
opl+0.182
ions+0.182
Neg logit contributions
edly-0.167
 face-0.156
powers-0.154
 faces-0.147
ouse-0.146
Token put
Token ID1234
Feature activation-5.26e-02

Pos logit contributions
 course+0.088
ions+0.085
 July+0.084
é+0.081
issa+0.080
Neg logit contributions
edly-0.045
 faces-0.042
 face-0.041
by-0.038
 tum-0.038
Token everything
Token ID2279
Feature activation+1.49e-02

Pos logit contributions
 course+0.104
ner+0.101
 July+0.100
issa+0.099
opl+0.098
Neg logit contributions
edly-0.070
 faces-0.070
 face-0.069
 back-0.069
by-0.064
Token in
Token ID287
Feature activation-2.12e-02

Pos logit contributions
edly+0.020
 faces+0.019
 face+0.019
by+0.018
 tum+0.018
Neg logit contributions
 course-0.031
ions-0.030
 July-0.029
é-0.029
neath-0.028
Token the
Token ID262
Feature activation+1.10e-03

Pos logit contributions
 course+0.042
ions+0.040
é+0.040
issa+0.040
ner+0.039
Neg logit contributions
 face-0.029
edly-0.029
 faces-0.028
 to-0.027
 back-0.027
Token that
Token ID326
Feature activation-2.01e-02

Pos logit contributions
 course+0.060
 July+0.058
ions+0.058
é+0.057
neath+0.056
Neg logit contributions
edly-0.030
 face-0.030
 faces-0.029
 back-0.027
 to-0.027
Token said
Token ID531
Feature activation-1.20e-03

Pos logit contributions
 course+0.050
ions+0.048
 July+0.048
é+0.047
opl+0.046
Neg logit contributions
edly-0.019
 face-0.019
 faces-0.018
 tum-0.016
 back-0.015
Token "
Token ID366
Feature activation-6.96e-02

Pos logit contributions
 course+0.003
ions+0.002
 July+0.002
é+0.002
issa+0.002
Neg logit contributions
edly-0.002
 faces-0.001
 face-0.001
by-0.001
 tum-0.001
TokenNo
Token ID2949
Feature activation-1.23e-02

Pos logit contributions
 course+0.118
 July+0.110
ions+0.110
issa+0.106
é+0.103
Neg logit contributions
edly-0.125
 faces-0.113
by-0.111
 face-0.111
powers-0.108
Token T
Token ID309
Feature activation-3.72e-02

Pos logit contributions
 course+0.025
ions+0.025
 July+0.024
neath+0.024
é+0.024
Neg logit contributions
 face-0.016
 faces-0.016
edly-0.015
 tum-0.015
 eyes-0.015
Tokenresp
Token ID4363
Feature activation-1.12e-01

Pos logit contributions
 course+0.070
ions+0.066
 July+0.065
é+0.063
neath+0.063
Neg logit contributions
edly-0.059
 face-0.056
 faces-0.056
by-0.052
 tum-0.052
Tokenassing
Token ID19696
Feature activation-3.75e-02

Pos logit contributions
ions+0.216
 July+0.206
issa+0.197
otion+0.192
 course+0.190
Neg logit contributions
 back-0.153
edly-0.147
 faces-0.142
suits-0.135
 face-0.135
Token".
Token ID1911
Feature activation+1.00e-02

Pos logit contributions
 course+0.076
 July+0.074
neath+0.070
issa+0.070
ions+0.068
Neg logit contributions
edly-0.049
 to-0.048
 back-0.045
 face-0.045
.-0.044
Token And
Token ID843
Feature activation-2.80e-03

Pos logit contributions
edly+0.017
by+0.016
powers+0.016
aj+0.015
 recovered+0.015
Neg logit contributions
ions-0.015
 July-0.015
 course-0.015
issa-0.015
neath-0.015
Token behind
Token ID2157
Feature activation-3.89e-02

Pos logit contributions
 course+0.005
ions+0.005
 July+0.004
issa+0.004
ner+0.004
Neg logit contributions
edly-0.005
by-0.004
powers-0.004
 faces-0.004
aj-0.004
Token the
Token ID262
Feature activation-1.83e-02

Pos logit contributions
 course+0.063
ions+0.060
 July+0.059
é+0.057
issa+0.056
Neg logit contributions
edly-0.070
 face-0.068
 faces-0.068
 tum-0.064
by-0.063
Token a
Token ID257
Feature activation-1.52e-03

Pos logit contributions
 course+0.033
ions+0.032
é+0.032
 July+0.032
issa+0.031
Neg logit contributions
edly-0.014
 face-0.013
 faces-0.013
 tum-0.012
 back-0.011
Token cater
Token ID20825
Feature activation-9.65e-02

Pos logit contributions
 course+0.004
ions+0.004
 July+0.004
é+0.004
neath+0.004
Neg logit contributions
 face-0.001
edly-0.001
 faces-0.001
 tum-0.001
by-0.001
Tokenpillar
Token ID41643
Feature activation-4.53e-02

Pos logit contributions
 July+0.156
neath+0.152
ions+0.143
 course+0.140
ators+0.134
Neg logit contributions
edly-0.173
 face-0.166
powers-0.164
 tum-0.159
by-0.157
Token.
Token ID13
Feature activation-2.77e-02

Pos logit contributions
 course+0.083
ions+0.080
 July+0.080
neath+0.077
é+0.075
Neg logit contributions
edly-0.073
 face-0.070
 faces-0.068
 tum-0.065
by-0.065
Token The
Token ID383
Feature activation-2.87e-02

Pos logit contributions
 course+0.062
ions+0.060
issa+0.059
 July+0.058
é+0.057
Neg logit contributions
edly-0.033
 face-0.032
 tum-0.031
 faces-0.029
 eyes-0.028
Token cater
Token ID20825
Feature activation-1.12e-01

Pos logit contributions
 course+0.043
ions+0.037
 July+0.037
é+0.036
issa+0.034
Neg logit contributions
edly-0.059
 faces-0.056
aj-0.053
 face-0.052
powers-0.052
Tokenpillar
Token ID41643
Feature activation-4.04e-02

Pos logit contributions
 July+0.179
 course+0.161
ions+0.159
neath+0.154
ators+0.150
Neg logit contributions
edly-0.207
powers-0.190
 face-0.188
by-0.180
 felt-0.179
Token wanted
Token ID2227
Feature activation-1.05e-02

Pos logit contributions
 course+0.083
ions+0.076
é+0.075
 July+0.074
issa+0.073
Neg logit contributions
edly-0.059
 faces-0.057
 face-0.053
aj-0.051
powers-0.050
Token to
Token ID284
Feature activation-2.61e-03

Pos logit contributions
 course+0.028
ions+0.027
 July+0.026
on+0.025
ner+0.024
Neg logit contributions
edly-0.010
aj-0.009
 faces-0.008
suit-0.008
suits-0.007
Token go
Token ID467
Feature activation-1.70e-02

Pos logit contributions
 course+0.005
ions+0.005
 July+0.005
é+0.005
neath+0.005
Neg logit contributions
edly-0.004
 faces-0.003
 face-0.003
by-0.003
aj-0.003
Token on
Token ID319
Feature activation-1.05e-02

Pos logit contributions
 course+0.033
ions+0.031
 July+0.029
ner+0.029
é+0.029
Neg logit contributions
edly-0.029
 faces-0.026
aj-0.024
 face-0.024
powers-0.023
Token
Token ID198
Feature activation+1.23e-03

Pos logit contributions
edly+0.021
 faces+0.020
 face+0.019
 tum+0.017
aj+0.017
Neg logit contributions
 course-0.046
ions-0.044
 July-0.044
é-0.043
issa-0.042
TokenOne
Token ID3198
Feature activation-1.39e-02

Pos logit contributions
edly+0.002
 face+0.002
 faces+0.002
by+0.002
powers+0.002
Neg logit contributions
ions-0.002
 July-0.002
issa-0.002
 course-0.002
igh-0.002
Token day
Token ID1110
Feature activation-1.44e-02

Pos logit contributions
 course+0.024
ions+0.023
 July+0.023
é+0.022
issa+0.021
Neg logit contributions
edly-0.024
 face-0.023
 faces-0.023
 tum-0.021
by-0.021
Token,
Token ID11
Feature activation-3.09e-02

Pos logit contributions
 course+0.036
ions+0.035
 July+0.035
é+0.034
issa+0.034
Neg logit contributions
edly-0.013
 face-0.012
 faces-0.012
 tum-0.011
by-0.011
Token the
Token ID262
Feature activation-3.03e-02

Pos logit contributions
 course+0.058
ions+0.056
 July+0.055
é+0.054
issa+0.054
Neg logit contributions
edly-0.048
 face-0.047
 faces-0.046
 tum-0.044
by-0.042
Token er
Token ID1931
Feature activation-1.11e-01

Pos logit contributions
 course+0.057
ions+0.055
 July+0.054
é+0.052
neath+0.051
Neg logit contributions
edly-0.047
 face-0.047
 faces-0.046
 tum-0.043
by-0.042
Tokenaser
Token ID6005
Feature activation-3.46e-02

Pos logit contributions
 July+0.216
 course+0.206
neath+0.198
opl+0.183
hing+0.182
Neg logit contributions
edly-0.165
 face-0.155
powers-0.154
 faces-0.146
ouse-0.143
Token met
Token ID1138
Feature activation-4.08e-04

Pos logit contributions
 course+0.076
ions+0.070
 July+0.069
issa+0.068
neath+0.068
Neg logit contributions
edly-0.045
 faces-0.042
 face-0.039
aj-0.039
by-0.038
Token a
Token ID257
Feature activation+3.79e-02

Pos logit contributions
 course+0.001
ions+0.001
 July+0.001
é+0.001
neath+0.001
Neg logit contributions
edly-0.001
 faces-0.001
aj-0.001
 face-0.001
powers-0.001
Token kind
Token ID1611
Feature activation-1.41e-03

Pos logit contributions
edly+0.048
 faces+0.041
aj+0.039
 face+0.038
by+0.036
Neg logit contributions
 course-0.087
 July-0.083
ions-0.079
issa-0.078
ner-0.078
Token boy
Token ID2933
Feature activation-7.34e-03

Pos logit contributions
 course+0.003
ions+0.003
 July+0.003
é+0.003
issa+0.003
Neg logit contributions
edly-0.002
 faces-0.002
 face-0.002
by-0.002
 tum-0.002
Token talk
Token ID1561
Feature activation+1.15e-02

Pos logit contributions
edly+0.038
 faces+0.034
by+0.032
aj+0.032
 face+0.032
Neg logit contributions
 course-0.061
ions-0.058
 July-0.057
é-0.056
neath-0.055
Token!
Token ID0
Feature activation+1.52e-03

Pos logit contributions
edly+0.013
aj+0.011
 faces+0.010
 spine+0.010
 tum+0.010
Neg logit contributions
 course-0.027
ions-0.027
é-0.025
 July-0.024
issa-0.024
Token
Token ID198
Feature activation+1.96e-02

Pos logit contributions
 face+0.002
edly+0.002
 faces+0.002
 tum+0.002
aj+0.002
Neg logit contributions
 course-0.003
ions-0.003
 July-0.003
issa-0.003
é-0.003
Token
Token ID198
Feature activation+9.33e-05

Pos logit contributions
edly+0.022
 faces+0.020
 face+0.020
 tum+0.017
by+0.017
Neg logit contributions
 course-0.046
ions-0.045
 July-0.044
é-0.043
issa-0.042
TokenThe
Token ID464
Feature activation-1.76e-02

Pos logit contributions
edly+0.000
 face+0.000
 faces+0.000
 tum+0.000
by+0.000
Neg logit contributions
 course-0.000
ions-0.000
 July-0.000
é-0.000
issa-0.000
Token er
Token ID1931
Feature activation-1.11e-01

Pos logit contributions
 course+0.038
 July+0.036
ions+0.036
é+0.035
issa+0.035
Neg logit contributions
edly-0.024
 faces-0.021
 face-0.020
by-0.019
aj-0.019
Tokenaser
Token ID6005
Feature activation-3.80e-02

Pos logit contributions
 July+0.199
 course+0.194
neath+0.183
ions+0.177
é+0.169
Neg logit contributions
edly-0.180
 face-0.174
 faces-0.168
powers-0.167
by-0.160
Token said
Token ID531
Feature activation+1.37e-02

Pos logit contributions
 course+0.080
issa+0.071
ions+0.071
 July+0.071
é+0.070
Neg logit contributions
edly-0.054
 faces-0.049
aj-0.047
 face-0.046
by-0.046
Token,
Token ID11
Feature activation+1.12e-02

Pos logit contributions
edly+0.022
aj+0.020
 spine+0.019
by+0.019
suit+0.019
Neg logit contributions
 course-0.026
ions-0.024
 July-0.024
on-0.024
ner-0.023
Token "
Token ID366
Feature activation-7.26e-02

Pos logit contributions
edly+0.018
by+0.017
 faces+0.016
aj+0.016
 face+0.016
Neg logit contributions
 course-0.021
 July-0.020
ions-0.020
ner-0.019
é-0.019
TokenHi
Token ID17250
Feature activation-2.58e-03

Pos logit contributions
ions+0.134
igh+0.128
 July+0.127
issa+0.127
é+0.124
Neg logit contributions
edly-0.119
 faces-0.109
by-0.104
 face-0.101
powers-0.099
Token heard
Token ID2982
Feature activation-6.80e-03

Pos logit contributions
edly+0.047
aj+0.042
 faces+0.040
 face+0.040
powers+0.040
Neg logit contributions
 course-0.046
 July-0.039
e-0.038
ions-0.038
igh-0.038
Token.
Token ID13
Feature activation+7.58e-03

Pos logit contributions
 course+0.019
ions+0.018
 July+0.018
é+0.018
issa+0.017
Neg logit contributions
edly-0.005
 face-0.004
 faces-0.004
 tum-0.003
 back-0.003
Token
Token ID198
Feature activation+1.70e-02

Pos logit contributions
edly+0.008
 faces+0.008
 face+0.008
by+0.007
 tum+0.007
Neg logit contributions
 course-0.018
ions-0.017
 July-0.017
é-0.017
issa-0.016
Token
Token ID198
Feature activation-3.85e-03

Pos logit contributions
edly+0.018
 faces+0.016
 face+0.016
 tum+0.015
by+0.015
Neg logit contributions
 course-0.041
ions-0.040
 July-0.039
é-0.038
issa-0.038
TokenThe
Token ID464
Feature activation-2.83e-02

Pos logit contributions
ions+0.008
 July+0.007
 course+0.007
igh+0.007
issa+0.007
Neg logit contributions
edly-0.006
 face-0.005
 faces-0.005
powers-0.005
by-0.005
Token rh
Token ID9529
Feature activation-1.11e-01

Pos logit contributions
ions+0.056
neath+0.050
 course+0.050
é+0.050
issa+0.049
Neg logit contributions
 face-0.045
 faces-0.043
edly-0.043
 tum-0.042
 back-0.040
Tokenin
Token ID259
Feature activation-5.19e-02

Pos logit contributions
 July+0.204
ions+0.195
neath+0.192
 course+0.183
opl+0.179
Neg logit contributions
edly-0.169
by-0.162
 face-0.160
 upon-0.151
 faces-0.149
Tokenoc
Token ID420
Feature activation-1.02e-01

Pos logit contributions
ions+0.073
 course+0.069
 July+0.067
neath+0.064
ner+0.064
Neg logit contributions
edly-0.108
 face-0.104
 faces-0.102
by-0.101
 tum-0.098
Tokeneros
Token ID27498
Feature activation-1.92e-02

Pos logit contributions
 course+0.152
ions+0.149
 July+0.147
neath+0.140
issa+0.137
Neg logit contributions
edly-0.204
 faces-0.191
 face-0.189
by-0.182
 tum-0.177
Token decided
Token ID3066
Feature activation+3.55e-02

Pos logit contributions
 course+0.044
 July+0.040
é+0.040
ions+0.039
issa+0.039
Neg logit contributions
edly-0.023
 faces-0.022
 face-0.021
aj-0.020
powers-0.020
Token to
Token ID284
Feature activation+1.10e-02

Pos logit contributions
edly+0.041
aj+0.040
suit+0.037
suits+0.036
 spine+0.032
Neg logit contributions
 course-0.083
 introducing-0.076
 whoever-0.075
ner-0.074
ting-0.074
Token full
Token ID1336
Feature activation-2.73e-02

Pos logit contributions
edly+0.003
 faces+0.002
 face+0.002
 tum+0.002
by+0.002
Neg logit contributions
 course-0.015
ions-0.015
 July-0.014
é-0.014
issa-0.014
Token of
Token ID286
Feature activation-4.30e-02

Pos logit contributions
 course+0.040
ions+0.038
 July+0.037
é+0.036
issa+0.036
Neg logit contributions
edly-0.054
 face-0.053
 faces-0.052
 tum-0.050
by-0.048
Token colourful
Token ID44799
Feature activation-1.61e-02

Pos logit contributions
 course+0.099
ions+0.096
 July+0.094
é+0.091
issa+0.090
Neg logit contributions
edly-0.050
 faces-0.045
 face-0.045
by-0.042
 tum-0.041
Token things
Token ID1243
Feature activation+7.35e-03

Pos logit contributions
 course+0.034
ions+0.033
 July+0.032
neath+0.030
issa+0.030
Neg logit contributions
edly-0.022
 face-0.019
aj-0.019
by-0.019
 spine-0.019
Token.
Token ID13
Feature activation+2.64e-03

Pos logit contributions
edly+0.007
 faces+0.006
 face+0.006
aj+0.006
 tum+0.006
Neg logit contributions
 course-0.018
ions-0.018
 July-0.017
é-0.017
ner-0.017
Token Among
Token ID9754
Feature activation-1.11e-01

Pos logit contributions
edly+0.003
 face+0.003
 faces+0.003
 tum+0.003
aj+0.002
Neg logit contributions
 course-0.006
ions-0.006
 July-0.006
é-0.006
igh-0.006
Token them
Token ID606
Feature activation+1.53e-02

Pos logit contributions
opl+0.224
ner+0.222
hing+0.215
 bolt+0.215
otion+0.213
Neg logit contributions
by-0.152
 upon-0.140
 to-0.139
 back-0.135
 faces-0.120
Token,
Token ID11
Feature activation-2.02e-02

Pos logit contributions
edly+0.017
 faces+0.016
 face+0.015
aj+0.014
 tum+0.014
Neg logit contributions
 course-0.036
ions-0.035
 July-0.034
é-0.033
neath-0.033
Token there
Token ID612
Feature activation-3.56e-03

Pos logit contributions
 course+0.045
opl+0.045
ions+0.042
é+0.042
hing+0.041
Neg logit contributions
 face-0.025
 faces-0.025
edly-0.025
 upon-0.022
 eyes-0.022
Token was
Token ID373
Feature activation-1.57e-02

Pos logit contributions
 course+0.008
ions+0.008
 July+0.008
é+0.008
issa+0.008
Neg logit contributions
edly-0.004
 face-0.004
 faces-0.004
by-0.003
 tum-0.003
Token a
Token ID257
Feature activation+2.48e-02

Pos logit contributions
 course+0.036
ions+0.035
 July+0.034
é+0.034
ner+0.033
Neg logit contributions
edly-0.018
 face-0.017
 faces-0.017
 tum-0.015
by-0.015
Token.
Token ID13
Feature activation-4.21e-04

Pos logit contributions
 course+0.022
ions+0.022
 July+0.021
é+0.021
neath+0.021
Neg logit contributions
edly-0.005
 faces-0.004
 face-0.004
by-0.003
aj-0.003
Token "
Token ID366
Feature activation-6.58e-02

Pos logit contributions
 course+0.001
 July+0.001
ions+0.001
neath+0.001
é+0.001
Neg logit contributions
edly-0.001
 faces-0.001
by-0.001
 face-0.001
powers-0.001
TokenWow
Token ID22017
Feature activation-2.87e-03

Pos logit contributions
ions+0.128
 course+0.118
igh+0.114
 July+0.113
ner+0.112
Neg logit contributions
edly-0.103
 faces-0.096
 face-0.094
 back-0.087
 felt-0.085
Token,
Token ID11
Feature activation-3.22e-02

Pos logit contributions
 course+0.007
ions+0.006
 July+0.006
é+0.006
issa+0.006
Neg logit contributions
edly-0.003
 faces-0.003
 face-0.003
by-0.003
 tum-0.003
Token that
Token ID326
Feature activation-6.00e-02

Pos logit contributions
ions+0.065
 course+0.063
é+0.062
 July+0.062
opl+0.061
Neg logit contributions
 face-0.048
 faces-0.045
 tum-0.044
 back-0.041
edly-0.041
Token cater
Token ID20825
Feature activation-1.10e-01

Pos logit contributions
ions+0.121
 July+0.119
 course+0.116
é+0.112
issa+0.110
Neg logit contributions
 face-0.086
 faces-0.082
 tum-0.080
edly-0.080
by-0.076
Tokenpillar
Token ID41643
Feature activation-5.31e-02

Pos logit contributions
 July+0.180
ators+0.157
issa+0.153
neath+0.151
é+0.150
Neg logit contributions
powers-0.192
edly-0.191
 face-0.179
 tum-0.177
 to-0.175
Token is
Token ID318
Feature activation-2.38e-02

Pos logit contributions
 July+0.113
ions+0.108
 course+0.106
é+0.103
neath+0.103
Neg logit contributions
 face-0.067
 faces-0.064
edly-0.064
 tum-0.062
by-0.061
Token so
Token ID523
Feature activation-1.19e-02

Pos logit contributions
 course+0.056
ions+0.056
 July+0.054
igh+0.054
é+0.054
Neg logit contributions
 face-0.025
 back-0.023
 faces-0.023
edly-0.023
 to-0.022
Token big
Token ID1263
Feature activation-3.09e-02

Pos logit contributions
 course+0.031
ions+0.030
 July+0.029
neath+0.029
é+0.029
Neg logit contributions
edly-0.010
 face-0.010
 faces-0.009
 tum-0.008
powers-0.008
Token!"
Token ID2474
Feature activation+1.06e-02

Pos logit contributions
 July+0.062
 course+0.062
ions+0.061
é+0.061
issa+0.060
Neg logit contributions
edly-0.041
 face-0.039
 to-0.039
 faces-0.038
 back-0.037
Token it
Token ID340
Feature activation+2.29e-02

Pos logit contributions
 course+0.050
ions+0.048
 July+0.048
é+0.047
issa+0.046
Neg logit contributions
edly-0.045
 faces-0.043
 face-0.043
 back-0.041
 tum-0.040
Token."
Token ID526
Feature activation+2.07e-02

Pos logit contributions
edly+0.032
 faces+0.030
 face+0.029
by+0.028
 tum+0.027
Neg logit contributions
 course-0.048
ions-0.045
 July-0.044
é-0.043
neath-0.043
Token
Token ID198
Feature activation+2.39e-02

Pos logit contributions
edly+0.028
 faces+0.027
 face+0.025
aj+0.025
by+0.025
Neg logit contributions
 course-0.045
ions-0.041
ner-0.040
 July-0.040
é-0.040
Token
Token ID198
Feature activation+4.79e-03

Pos logit contributions
edly+0.030
 faces+0.028
 face+0.027
by+0.025
 tum+0.024
Neg logit contributions
 course-0.054
ions-0.051
 July-0.050
é-0.049
ner-0.048
TokenThe
Token ID464
Feature activation-3.58e-02

Pos logit contributions
edly+0.009
 face+0.008
 faces+0.008
powers+0.008
by+0.008
Neg logit contributions
 course-0.008
ions-0.008
 July-0.007
é-0.007
neath-0.007
Token rac
Token ID3444
Feature activation-1.10e-01

Pos logit contributions
 course+0.075
ions+0.072
 July+0.071
é+0.069
issa+0.068
Neg logit contributions
edly-0.050
 faces-0.047
 face-0.047
by-0.043
 tum-0.043
Tokencoon
Token ID20912
Feature activation-1.30e-02

Pos logit contributions
 July+0.155
neath+0.134
ions+0.132
 course+0.129
opl+0.120
Neg logit contributions
edly-0.235
 face-0.200
 back-0.190
 faces-0.189
 himself-0.186
Token laughed
Token ID13818
Feature activation+3.48e-02

Pos logit contributions
 course+0.030
 July+0.027
é+0.026
ions+0.026
issa+0.026
Neg logit contributions
edly-0.016
 faces-0.015
powers-0.014
aj-0.014
 face-0.014
Token and
Token ID290
Feature activation+1.99e-02

Pos logit contributions
aj+0.060
powers+0.056
by+0.054
suits+0.054
 spine+0.053
Neg logit contributions
 course-0.060
ting-0.054
ions-0.051
 introducing-0.051
hing-0.049
Token pointed
Token ID6235
Feature activation+2.19e-03

Pos logit contributions
edly+0.025
 faces+0.024
 face+0.023
by+0.022
aj+0.022
Neg logit contributions
 course-0.044
ions-0.041
 July-0.041
é-0.040
issa-0.039
Token to
Token ID284
Feature activation+3.69e-03

Pos logit contributions
aj+0.004
edly+0.004
ouse+0.003
powers+0.003
 spine+0.003
Neg logit contributions
 course-0.004
neath-0.003
ions-0.003
 July-0.003
é-0.003
Token up
Token ID510
Feature activation+1.87e-02

Pos logit contributions
 course+0.028
ions+0.027
 July+0.027
é+0.026
issa+0.026
Neg logit contributions
edly-0.023
 faces-0.022
 face-0.022
by-0.021
 tum-0.020
Token Jack
Token ID3619
Feature activation+3.50e-02

Pos logit contributions
edly+0.028
aj+0.026
 faces+0.025
 face+0.025
 tum+0.024
Neg logit contributions
 course-0.037
ions-0.035
 July-0.035
é-0.033
neath-0.033
Token's
Token ID338
Feature activation+5.96e-02

Pos logit contributions
edly+0.050
 faces+0.047
 face+0.047
 tum+0.044
by+0.044
Neg logit contributions
 course-0.071
ions-0.068
 July-0.067
é-0.065
issa-0.064
Token eyes
Token ID2951
Feature activation+3.13e-02

Pos logit contributions
edly+0.074
aj+0.064
suits+0.062
by+0.061
 recovered+0.060
Neg logit contributions
 course-0.124
 July-0.122
é-0.115
issa-0.111
opl-0.111
Token with
Token ID351
Feature activation-2.93e-02

Pos logit contributions
edly+0.023
 faces+0.020
aj+0.019
 face+0.018
by+0.018
Neg logit contributions
 course-0.084
ions-0.081
 July-0.080
neath-0.079
é-0.078
Token excitement
Token ID14067
Feature activation+4.73e-02

Pos logit contributions
 course+0.063
 July+0.059
ions+0.059
neath+0.056
é+0.055
Neg logit contributions
edly-0.041
 faces-0.035
aj-0.035
 face-0.035
by-0.034
Token.
Token ID13
Feature activation+1.10e-02

Pos logit contributions
edly+0.048
powers+0.043
aj+0.040
by+0.040
suits+0.039
Neg logit contributions
ions-0.108
 course-0.107
 July-0.103
é-0.101
hing-0.101
Token But
Token ID887
Feature activation-1.47e-02

Pos logit contributions
edly+0.012
 faces+0.010
 face+0.010
by+0.010
aj+0.009
Neg logit contributions
 course-0.027
ions-0.026
 July-0.026
é-0.025
neath-0.024
Token one
Token ID530
Feature activation-1.21e-02

Pos logit contributions
 course+0.025
ions+0.025
 July+0.024
é+0.024
issa+0.023
Neg logit contributions
edly-0.025
 face-0.024
 faces-0.024
 tum-0.023
by-0.022
Token night
Token ID1755
Feature activation-6.25e-03

Pos logit contributions
ions+0.025
 course+0.024
ner+0.023
neath+0.023
 July+0.023
Neg logit contributions
 face-0.017
 tum-0.017
edly-0.016
by-0.016
 faces-0.015
Token,
Token ID11
Feature activation-1.95e-02

Pos logit contributions
 course+0.013
ions+0.013
 July+0.013
neath+0.012
ner+0.012
Neg logit contributions
edly-0.008
 faces-0.007
aj-0.007
powers-0.007
 face-0.007
Token cheered
Token ID34767
Feature activation+2.14e-02

Pos logit contributions
edly+0.007
 faces+0.007
 face+0.007
aj+0.006
by+0.006
Neg logit contributions
 course-0.015
ions-0.015
 July-0.014
é-0.014
neath-0.014
Token and
Token ID290
Feature activation+1.55e-02

Pos logit contributions
edly+0.024
powers+0.023
aj+0.022
 tum+0.022
 faces+0.021
Neg logit contributions
 course-0.048
ions-0.047
 July-0.044
é-0.044
hing-0.044
Token Sammy
Token ID44067
Feature activation+5.98e-03

Pos logit contributions
edly+0.022
 faces+0.021
 face+0.021
 tum+0.019
by+0.019
Neg logit contributions
 course-0.031
ions-0.030
 July-0.030
é-0.029
issa-0.028
Token felt
Token ID2936
Feature activation-1.69e-03

Pos logit contributions
edly+0.008
 faces+0.007
 face+0.007
aj+0.007
 tum+0.007
Neg logit contributions
 course-0.013
ions-0.012
é-0.012
issa-0.012
neath-0.011
Token so
Token ID523
Feature activation+1.83e-02

Pos logit contributions
 course+0.003
ions+0.003
 July+0.003
neath+0.003
ner+0.003
Neg logit contributions
edly-0.003
aj-0.002
 faces-0.002
by-0.002
 spine-0.002
Token happy
Token ID3772
Feature activation+5.69e-02

Pos logit contributions
edly+0.026
 faces+0.023
aj+0.023
by+0.022
 face+0.021
Neg logit contributions
 course-0.038
 July-0.035
ions-0.035
ner-0.034
hing-0.034
Token.
Token ID13
Feature activation+9.46e-03

Pos logit contributions
aj+0.060
edly+0.059
by+0.055
suits+0.054
 faces+0.054
Neg logit contributions
 course-0.133
ions-0.129
 introducing-0.120
ner-0.117
ting-0.116
Token
Token ID198
Feature activation+1.49e-02

Pos logit contributions
edly+0.013
 faces+0.012
 face+0.011
aj+0.010
by+0.010
Neg logit contributions
 course-0.021
ions-0.020
 July-0.019
é-0.019
neath-0.019
Token
Token ID198
Feature activation-6.03e-03

Pos logit contributions
edly+0.013
 face+0.012
 faces+0.012
by+0.011
 tum+0.011
Neg logit contributions
 course-0.038
ions-0.037
 July-0.036
é-0.035
issa-0.035
TokenSam
Token ID16305
Feature activation-1.91e-02

Pos logit contributions
 course+0.012
ions+0.012
 July+0.011
issa+0.011
é+0.011
Neg logit contributions
edly-0.009
 face-0.008
 faces-0.008
powers-0.008
by-0.008
Tokenmy
Token ID1820
Feature activation+7.07e-03

Pos logit contributions
 course+0.028
ions+0.027
 July+0.026
é+0.025
issa+0.024
Neg logit contributions
edly-0.038
 faces-0.036
 face-0.036
by-0.035
 tum-0.034
Token harsh
Token ID11859
Feature activation+9.36e-03

Pos logit contributions
 face+0.036
 faces+0.035
edly+0.034
 tum+0.033
 back+0.032
Neg logit contributions
ions-0.073
 course-0.072
 July-0.069
é-0.069
neath-0.068
Token voice
Token ID3809
Feature activation+4.13e-02

Pos logit contributions
edly+0.011
 faces+0.009
aj+0.009
powers+0.009
 face+0.009
Neg logit contributions
 course-0.022
 July-0.021
ions-0.020
é-0.020
issa-0.020
Token,
Token ID11
Feature activation+2.14e-02

Pos logit contributions
edly+0.035
 faces+0.031
 face+0.031
by+0.027
 tum+0.027
Neg logit contributions
 course-0.108
ions-0.104
 July-0.103
é-0.101
issa-0.099
Token but
Token ID475
Feature activation-1.51e-02

Pos logit contributions
edly+0.026
 faces+0.024
 face+0.024
by+0.023
 tum+0.022
Neg logit contributions
 course-0.048
ions-0.046
 July-0.046
é-0.044
issa-0.043
Token John
Token ID1757
Feature activation-3.90e-03

Pos logit contributions
ions+0.031
 course+0.030
 July+0.030
opl+0.029
é+0.029
Neg logit contributions
 face-0.021
edly-0.021
 back-0.020
 to-0.019
 faces-0.019
Token smiled
Token ID13541
Feature activation+5.12e-02

Pos logit contributions
 course+0.009
ions+0.008
é+0.008
 July+0.008
issa+0.008
Neg logit contributions
edly-0.005
 faces-0.005
 face-0.004
powers-0.004
 tum-0.004
Token anyways
Token ID32845
Feature activation+7.90e-02

Pos logit contributions
powers+0.076
aj+0.070
suits+0.068
 faces+0.066
edly+0.064
Neg logit contributions
ning-0.084
ting-0.082
 course-0.081
 introducing-0.079
neath-0.078
Token.
Token ID13
Feature activation+6.49e-03

Pos logit contributions
edly+0.081
 faces+0.080
aj+0.076
powers+0.076
suit+0.072
Neg logit contributions
 course-0.182
ions-0.169
ner-0.163
é-0.163
opl-0.162
Token He
Token ID679
Feature activation+2.17e-02

Pos logit contributions
edly+0.007
 faces+0.006
 face+0.006
 tum+0.005
by+0.005
Neg logit contributions
 course-0.016
ions-0.015
 July-0.015
é-0.015
issa-0.014
Tokenâ
Token ID22940
Feature activation-2.69e-02

Pos logit contributions
edly+0.028
 faces+0.027
 face+0.025
aj+0.024
 tum+0.023
Neg logit contributions
 course-0.047
issa-0.043
 July-0.043
ions-0.043
é-0.043
Token
Token ID26391
Feature activation+1.64e-02

Pos logit contributions
ions+0.047
 course+0.046
 July+0.045
é+0.042
neath+0.041
Neg logit contributions
edly-0.047
 face-0.043
 faces-0.042
by-0.041
 back-0.040
Token late
Token ID2739
Feature activation+2.27e-02

Pos logit contributions
edly+0.038
 faces+0.035
 face+0.034
 tum+0.031
by+0.031
Neg logit contributions
 course-0.063
 July-0.061
ions-0.061
é-0.059
neath-0.058
Token.
Token ID13
Feature activation+2.99e-03

Pos logit contributions
edly+0.022
powers+0.020
 face+0.020
 faces+0.020
 tum+0.019
Neg logit contributions
 course-0.056
ions-0.052
 July-0.052
ner-0.050
issa-0.050
Token The
Token ID383
Feature activation-1.52e-02

Pos logit contributions
edly+0.004
 face+0.003
 faces+0.003
 tum+0.003
aj+0.003
Neg logit contributions
 course-0.007
ions-0.006
 July-0.006
é-0.006
issa-0.006
Token pipe
Token ID12656
Feature activation-4.71e-02

Pos logit contributions
 course+0.030
ions+0.029
 July+0.029
é+0.028
issa+0.027
Neg logit contributions
edly-0.022
 faces-0.021
 face-0.021
 tum-0.019
by-0.019
Token grabbed
Token ID13176
Feature activation-4.02e-02

Pos logit contributions
 course+0.102
ions+0.095
 July+0.094
é+0.093
neath+0.092
Neg logit contributions
edly-0.062
 faces-0.056
 face-0.055
by-0.053
aj-0.053
Token him
Token ID683
Feature activation+6.18e-02

Pos logit contributions
 course+0.072
ions+0.069
 July+0.069
é+0.067
issa+0.065
Neg logit contributions
edly-0.065
 faces-0.064
 face-0.064
 back-0.059
by-0.059
Token and
Token ID290
Feature activation+5.83e-03

Pos logit contributions
edly+0.072
 faces+0.063
 face+0.059
powers+0.058
aj+0.058
Neg logit contributions
 course-0.143
ions-0.136
igh-0.130
neath-0.130
é-0.130
Token pulled
Token ID5954
Feature activation-4.06e-02

Pos logit contributions
edly+0.008
 faces+0.007
 face+0.007
 tum+0.007
by+0.007
Neg logit contributions
 course-0.012
ions-0.012
 July-0.012
é-0.011
issa-0.011
Token him
Token ID683
Feature activation+6.16e-02

Pos logit contributions
 course+0.076
ions+0.072
 July+0.071
é+0.069
issa+0.068
Neg logit contributions
edly-0.065
 faces-0.062
 face-0.061
by-0.057
 tum-0.057
Token in
Token ID287
Feature activation-5.02e-03

Pos logit contributions
edly+0.094
 faces+0.083
aj+0.081
 tum+0.080
 face+0.078
Neg logit contributions
 course-0.113
ions-0.112
neath-0.107
é-0.107
igh-0.104
Token.
Token ID13
Feature activation-4.68e-04

Pos logit contributions
 course+0.010
ions+0.010
 July+0.010
neath+0.009
é+0.009
Neg logit contributions
edly-0.007
 faces-0.007
 face-0.007
aj-0.006
by-0.006
Token noticed
Token ID6810
Feature activation-8.31e-03

Pos logit contributions
edly+0.016
 faces+0.014
 face+0.013
powers+0.013
aj+0.012
Neg logit contributions
 course-0.022
 July-0.020
neath-0.020
issa-0.020
ions-0.020
Token something
Token ID1223
Feature activation+2.00e-02

Pos logit contributions
 course+0.019
ions+0.018
 July+0.018
é+0.018
neath+0.018
Neg logit contributions
edly-0.009
 face-0.009
 faces-0.009
 tum-0.008
by-0.008
Token unusual
Token ID8468
Feature activation+1.32e-03

Pos logit contributions
edly+0.027
 faces+0.024
aj+0.024
 face+0.023
by+0.023
Neg logit contributions
 course-0.043
ions-0.041
 July-0.040
é-0.039
neath-0.038
Token.
Token ID13
Feature activation+2.49e-03

Pos logit contributions
edly+0.001
aj+0.001
 faces+0.001
powers+0.001
 face+0.001
Neg logit contributions
 course-0.003
ions-0.003
 July-0.003
é-0.003
neath-0.003
Token On
Token ID1550
Feature activation-4.78e-02

Pos logit contributions
edly+0.004
 faces+0.004
by+0.004
 face+0.004
aj+0.004
Neg logit contributions
 course-0.004
ions-0.004
 July-0.004
é-0.004
neath-0.004
Token her
Token ID607
Feature activation+5.20e-02

Pos logit contributions
hing+0.109
igh+0.104
iling+0.104
ting+0.101
icians+0.101
Neg logit contributions
 to-0.050
 back-0.049
 eyes-0.047
 face-0.045
 himself-0.043
Token bedroom
Token ID14043
Feature activation+1.99e-02

Pos logit contributions
edly+0.074
aj+0.064
 faces+0.061
powers+0.060
suit+0.059
Neg logit contributions
 course-0.104
 July-0.100
é-0.098
issa-0.098
neath-0.096
Token floor
Token ID4314
Feature activation-2.02e-02

Pos logit contributions
edly+0.026
 faces+0.023
powers+0.023
 face+0.022
aj+0.022
Neg logit contributions
 course-0.044
ions-0.042
 July-0.040
é-0.040
issa-0.040
Token there
Token ID612
Feature activation+5.17e-03

Pos logit contributions
 course+0.048
ions+0.046
 July+0.045
é+0.044
neath+0.044
Neg logit contributions
edly-0.023
 faces-0.021
 face-0.020
by-0.019
 tum-0.019
Token was
Token ID373
Feature activation-1.43e-02

Pos logit contributions
edly+0.007
 faces+0.006
 face+0.006
powers+0.006
aj+0.006
Neg logit contributions
 course-0.012
ions-0.011
 July-0.011
neath-0.011
é-0.011
Token a
Token ID257
Feature activation+1.43e-02

Pos logit contributions
 course+0.030
ions+0.029
 July+0.028
neath+0.027
é+0.027
Neg logit contributions
edly-0.020
 faces-0.018
 face-0.018
by-0.017
aj-0.017
Token more
Token ID517
Feature activation+2.05e-02

Pos logit contributions
 course+0.007
ions+0.007
 July+0.006
neath+0.006
é+0.006
Neg logit contributions
 back-0.004
 face-0.003
 to-0.003
edly-0.003
 tum-0.003
Token careful
Token ID8161
Feature activation+4.55e-02

Pos logit contributions
edly+0.024
 faces+0.023
 face+0.022
by+0.021
 tum+0.020
Neg logit contributions
 course-0.047
ions-0.045
 July-0.044
é-0.043
issa-0.043
Token and
Token ID290
Feature activation-1.22e-02

Pos logit contributions
edly+0.056
 face+0.054
 faces+0.051
 to+0.050
 back+0.048
Neg logit contributions
 course-0.103
 July-0.099
ions-0.098
neath-0.096
é-0.095
Token never
Token ID1239
Feature activation+2.32e-02

Pos logit contributions
 course+0.025
ions+0.024
 July+0.024
é+0.023
neath+0.023
Neg logit contributions
edly-0.017
 face-0.017
 faces-0.016
by-0.015
 tum-0.015
Token got
Token ID1392
Feature activation-2.72e-02

Pos logit contributions
edly+0.033
 faces+0.026
suits+0.025
aj+0.025
powers+0.025
Neg logit contributions
 course-0.047
 July-0.045
ner-0.044
é-0.043
 introducing-0.042
Token her
Token ID607
Feature activation+3.71e-02

Pos logit contributions
ions+0.053
 course+0.053
é+0.051
 July+0.051
issa+0.050
Neg logit contributions
edly-0.041
 spine-0.033
 face-0.033
 faces-0.033
atively-0.032
Token bl
Token ID698
Feature activation+1.22e-02

Pos logit contributions
edly+0.071
 spine+0.061
ows+0.060
atively+0.060
powers+0.057
Neg logit contributions
 course-0.058
issa-0.052
ions-0.049
ner-0.049
 pian-0.049
Tokenouse
Token ID1076
Feature activation+6.71e-03

Pos logit contributions
 faces+0.013
edly+0.012
 face+0.010
 tum+0.010
 eyes+0.010
Neg logit contributions
ions-0.025
ang-0.025
ner-0.025
issa-0.024
emaker-0.024
Token dirty
Token ID11841
Feature activation+2.29e-02

Pos logit contributions
edly+0.007
 faces+0.006
powers+0.006
aj+0.005
suits+0.005
Neg logit contributions
 course-0.016
ions-0.015
ner-0.015
on-0.015
neath-0.015
Token again
Token ID757
Feature activation+2.88e-02

Pos logit contributions
 faces+0.031
edly+0.030
powers+0.030
by+0.030
aj+0.030
Neg logit contributions
 course-0.046
ions-0.043
ner-0.040
ting-0.040
issa-0.038
Token.
Token ID13
Feature activation-5.52e-03

Pos logit contributions
edly+0.022
 faces+0.020
 face+0.019
powers+0.017
 tum+0.016
Neg logit contributions
 course-0.078
ions-0.075
 July-0.073
é-0.071
issa-0.071
Token to
Token ID284
Feature activation-1.48e-03

Pos logit contributions
edly+0.032
 faces+0.030
 face+0.030
 tum+0.026
aj+0.026
Neg logit contributions
 course-0.089
 July-0.086
ions-0.086
ner-0.083
é-0.082
Token come
Token ID1282
Feature activation+2.06e-02

Pos logit contributions
 course+0.003
ions+0.003
 July+0.003
é+0.003
issa+0.003
Neg logit contributions
edly-0.002
 face-0.002
 faces-0.002
 tum-0.002
by-0.002
Token.
Token ID13
Feature activation+1.82e-03

Pos logit contributions
edly+0.025
 faces+0.023
 tum+0.022
 face+0.021
powers+0.021
Neg logit contributions
 course-0.045
ions-0.044
neath-0.043
é-0.042
 July-0.042
Token<|endoftext|>
Token ID50256
Feature activation-1.31e-02

Pos logit contributions
edly+0.001
 face+0.001
 tum+0.001
 faces+0.001
 He+0.001
Neg logit contributions
 course-0.005
 July-0.005
ions-0.005
issa-0.005
é-0.004
TokenEll
Token ID30639
Feature activation-2.41e-02

Pos logit contributions
 course+0.029
ions+0.027
 July+0.027
é+0.026
issa+0.026
Neg logit contributions
edly-0.017
 faces-0.015
 face-0.015
by-0.014
 tum-0.014
Tokenie
Token ID494
Feature activation+3.79e-03

Pos logit contributions
 course+0.039
ions+0.039
 July+0.038
ner+0.037
neath+0.035
Neg logit contributions
by-0.042
edly-0.041
 faces-0.040
 face-0.040
aj-0.038
Token was
Token ID373
Feature activation+2.50e-02

Pos logit contributions
edly+0.004
 faces+0.004
 face+0.004
 tum+0.004
powers+0.004
Neg logit contributions
 course-0.009
ions-0.008
 July-0.008
é-0.008
neath-0.008
Token feeling
Token ID4203
Feature activation-7.28e-03

Pos logit contributions
 face+0.024
 back+0.023
edly+0.023
 favorite+0.022
 tum+0.021
Neg logit contributions
 course-0.063
ions-0.060
ner-0.057
hing-0.057
é-0.057
Token very
Token ID845
Feature activation+2.48e-02

Pos logit contributions
 course+0.017
ions+0.016
 July+0.016
é+0.015
issa+0.015
Neg logit contributions
edly-0.009
 faces-0.008
 face-0.007
aj-0.007
by-0.007
Token lonely
Token ID21757
Feature activation+4.86e-02

Pos logit contributions
edly+0.035
 faces+0.031
by+0.030
aj+0.029
suits+0.028
Neg logit contributions
 July-0.050
 course-0.048
ions-0.046
é-0.046
ner-0.046
Token.
Token ID13
Feature activation+5.08e-03

Pos logit contributions
 faces+0.046
edly+0.044
 face+0.042
powers+0.041
aj+0.040
Neg logit contributions
 course-0.122
ions-0.115
issa-0.110
ner-0.110
é-0.109
Token but
Token ID475
Feature activation-1.92e-02

Pos logit contributions
 course+0.032
ions+0.031
 July+0.031
é+0.031
issa+0.030
Neg logit contributions
edly-0.018
 faces-0.017
 face-0.017
 tum-0.016
 back-0.015
Token she
Token ID673
Feature activation+1.69e-02

Pos logit contributions
 course+0.040
ions+0.039
 July+0.038
é+0.037
issa+0.037
Neg logit contributions
edly-0.026
 face-0.025
 faces-0.025
 tum-0.023
by-0.022
Token shouldn
Token ID6584
Feature activation-2.11e-02

Pos logit contributions
edly+0.026
 faces+0.024
 face+0.024
aj+0.022
powers+0.022
Neg logit contributions
 course-0.033
ions-0.031
 July-0.030
é-0.030
issa-0.030
Token't
Token ID470
Feature activation+4.89e-03

Pos logit contributions
 July+0.029
 course+0.029
ions+0.029
issa+0.027
é+0.026
Neg logit contributions
 face-0.040
edly-0.040
 back-0.038
 faces-0.038
 to-0.038
Token feel
Token ID1254
Feature activation-2.02e-02

Pos logit contributions
edly+0.007
 faces+0.006
aj+0.006
 face+0.006
by+0.006
Neg logit contributions
 course-0.010
ions-0.010
 July-0.009
é-0.009
neath-0.009
Token weak
Token ID4939
Feature activation+9.94e-03

Pos logit contributions
 course+0.044
ions+0.042
 July+0.041
é+0.040
issa+0.039
Neg logit contributions
edly-0.027
 faces-0.025
 face-0.024
aj-0.023
by-0.023
Token.
Token ID13
Feature activation+7.25e-03

Pos logit contributions
edly+0.009
 faces+0.007
 face+0.007
aj+0.007
by+0.007
Neg logit contributions
 course-0.026
ions-0.025
 July-0.024
é-0.024
ner-0.023
Token Everyone
Token ID11075
Feature activation-1.24e-02

Pos logit contributions
edly+0.012
suit+0.011
by+0.011
suits+0.011
aj+0.011
Neg logit contributions
 course-0.012
é-0.011
 July-0.011
ions-0.011
neath-0.010
Token has
Token ID468
Feature activation-8.03e-04

Pos logit contributions
 course+0.027
ions+0.025
 July+0.024
é+0.024
issa+0.024
Neg logit contributions
edly-0.017
aj-0.016
 faces-0.015
 face-0.015
powers-0.015
Token different
Token ID1180
Feature activation-1.09e-02

Pos logit contributions
 course+0.002
ions+0.001
 July+0.001
neath+0.001
 introducing+0.001
Neg logit contributions
edly-0.001
suit-0.001
 spine-0.001
 faces-0.001
aj-0.001
Token tastes
Token ID18221
Feature activation-1.92e-02

Pos logit contributions
 course+0.021
ions+0.019
 July+0.019
neath+0.017
ons+0.017
Neg logit contributions
edly-0.019
aj-0.016
 spine-0.016
powers-0.016
suit-0.016
Token
Token ID198
Feature activation-5.05e-03

Pos logit contributions
edly+0.020
 faces+0.019
 face+0.018
by+0.017
aj+0.016
Neg logit contributions
 course-0.031
ions-0.030
 July-0.029
é-0.028
neath-0.028
TokenThe
Token ID464
Feature activation-2.85e-02

Pos logit contributions
 July+0.009
 course+0.009
ions+0.008
issa+0.008
é+0.008
Neg logit contributions
edly-0.009
 face-0.008
 faces-0.008
by-0.008
powers-0.007
Token moral
Token ID6573
Feature activation-7.87e-03

Pos logit contributions
 course+0.057
ions+0.055
 July+0.054
é+0.052
neath+0.052
Neg logit contributions
edly-0.042
 face-0.039
 faces-0.039
by-0.036
 tum-0.036
Token of
Token ID286
Feature activation-5.41e-02

Pos logit contributions
 July+0.013
 course+0.013
ions+0.012
neath+0.012
é+0.012
Neg logit contributions
edly-0.013
 face-0.013
powers-0.013
 to-0.013
by-0.012
Token this
Token ID428
Feature activation+1.74e-02

Pos logit contributions
ner+0.110
 pian+0.104
 bolt+0.103
 July+0.102
hing+0.102
Neg logit contributions
 to-0.068
 back-0.067
by-0.065
 face-0.063
 himself-0.063
Token story
Token ID1621
Feature activation+9.83e-03

Pos logit contributions
edly+0.022
 face+0.021
 faces+0.020
by+0.019
 tum+0.019
Neg logit contributions
 course-0.038
ions-0.037
 July-0.036
é-0.035
neath-0.035
Token is
Token ID318
Feature activation+5.55e-03

Pos logit contributions
edly+0.015
 face+0.014
 faces+0.014
by+0.013
 tum+0.013
Neg logit contributions
 course-0.019
ions-0.018
 July-0.018
é-0.017
issa-0.017
Token to
Token ID284
Feature activation-1.36e-03

Pos logit contributions
edly+0.009
 face+0.008
 faces+0.008
by+0.008
 tum+0.008
Neg logit contributions
 course-0.010
ions-0.010
 July-0.010
é-0.009
issa-0.009
Token make
Token ID787
Feature activation-3.11e-02

Pos logit contributions
 course+0.003
ions+0.003
 July+0.003
é+0.002
issa+0.002
Neg logit contributions
edly-0.002
 face-0.002
 faces-0.002
by-0.002
 tum-0.002
Token sure
Token ID1654
Feature activation+5.70e-04

Pos logit contributions
 course+0.070
 July+0.069
ions+0.068
é+0.067
neath+0.066
Neg logit contributions
 face-0.035
 faces-0.035
edly-0.035
by-0.032
 back-0.031
Token you
Token ID345
Feature activation+1.11e-02

Pos logit contributions
edly+0.001
 faces+0.001
 face+0.001
 tum+0.001
by+0.001
Neg logit contributions
 course-0.001
ions-0.001
 July-0.001
é-0.001
issa-0.001
Token learned
Token ID4499
Feature activation+3.19e-02

Pos logit contributions
edly+0.002
 faces+0.002
 face+0.002
aj+0.002
 spine+0.002
Neg logit contributions
 course-0.002
ions-0.002
 July-0.002
neath-0.002
issa-0.002
Token an
Token ID281
Feature activation+1.35e-02

Pos logit contributions
edly+0.058
 faces+0.056
aj+0.053
suit+0.052
 face+0.052
Neg logit contributions
 course-0.056
ions-0.049
issa-0.046
 July-0.046
neath-0.045
Token important
Token ID1593
Feature activation+3.13e-02

Pos logit contributions
edly+0.016
 face+0.015
 faces+0.015
 tum+0.013
by+0.013
Neg logit contributions
 course-0.030
ions-0.030
 July-0.029
neath-0.028
é-0.028
Token lesson
Token ID11483
Feature activation+3.19e-02

Pos logit contributions
edly+0.027
 faces+0.026
aj+0.025
 face+0.023
 eyes+0.020
Neg logit contributions
 course-0.082
 July-0.076
ions-0.076
é-0.075
issa-0.074
Token that
Token ID326
Feature activation-1.97e-02

Pos logit contributions
edly+0.048
 faces+0.046
 face+0.045
 tum+0.042
by+0.042
Neg logit contributions
 course-0.063
ions-0.060
 July-0.058
é-0.057
issa-0.056
Token day
Token ID1110
Feature activation+2.45e-02

Pos logit contributions
ions+0.047
icians+0.047
é+0.046
ner+0.045
issa+0.045
Neg logit contributions
 to-0.020
 face-0.019
 tum-0.019
 lesson-0.017
 back-0.017
Token -
Token ID532
Feature activation-1.65e-02

Pos logit contributions
edly+0.023
 face+0.021
 faces+0.021
by+0.018
 tum+0.018
Neg logit contributions
 course-0.061
 July-0.059
ions-0.059
é-0.058
neath-0.056
Token the
Token ID262
Feature activation-1.43e-02

Pos logit contributions
 July+0.032
issa+0.032
é+0.032
 course+0.032
ions+0.032
Neg logit contributions
 to-0.023
 face-0.022
edly-0.021
 faces-0.021
 tum-0.021
Token dry
Token ID5894
Feature activation-4.21e-02

Pos logit contributions
 course+0.028
 July+0.026
ions+0.025
é+0.024
neath+0.024
Neg logit contributions
edly-0.023
 faces-0.021
 face-0.020
by-0.020
aj-0.020
Tokenness
Token ID1108
Feature activation-1.21e-02

Pos logit contributions
 course+0.085
ions+0.077
 July+0.076
é+0.073
opl+0.073
Neg logit contributions
edly-0.065
 faces-0.058
powers-0.057
aj-0.057
 face-0.055
Token of
Token ID286
Feature activation-4.79e-02

Pos logit contributions
 course+0.022
ions+0.021
 July+0.020
é+0.019
issa+0.019
Neg logit contributions
edly-0.020
 face-0.019
 faces-0.019
 tum-0.018
by-0.018
Token down
Token ID866
Feature activation+2.30e-03

Pos logit contributions
 course+0.022
ions+0.021
 July+0.021
é+0.021
neath+0.020
Neg logit contributions
edly-0.017
 faces-0.016
 face-0.016
aj-0.016
powers-0.016
Token on
Token ID319
Feature activation-2.54e-02

Pos logit contributions
edly+0.003
 faces+0.003
 face+0.003
 tum+0.002
by+0.002
Neg logit contributions
 course-0.005
ions-0.005
 July-0.005
é-0.005
neath-0.005
Token the
Token ID262
Feature activation-1.35e-03

Pos logit contributions
 course+0.047
ions+0.046
 July+0.045
é+0.044
igh+0.044
Neg logit contributions
edly-0.039
 faces-0.038
 face-0.038
 back-0.035
 tum-0.034
Token beach
Token ID10481
Feature activation+1.38e-02

Pos logit contributions
 course+0.003
 July+0.002
ions+0.002
é+0.002
issa+0.002
Neg logit contributions
edly-0.002
aj-0.002
 faces-0.002
by-0.002
 face-0.002
Token.
Token ID13
Feature activation+1.36e-03

Pos logit contributions
edly+0.011
 faces+0.009
aj+0.009
powers+0.008
 face+0.008
Neg logit contributions
 course-0.037
ions-0.035
 July-0.035
é-0.034
ner-0.033
Token Jane
Token ID12091
Feature activation-1.25e-02

Pos logit contributions
edly+0.003
 faces+0.002
by+0.002
aj+0.002
 face+0.002
Neg logit contributions
 course-0.002
 July-0.002
ions-0.002
é-0.002
neath-0.002
Token was
Token ID373
Feature activation+3.59e-03

Pos logit contributions
 course+0.029
ions+0.028
 July+0.028
é+0.027
issa+0.027
Neg logit contributions
edly-0.014
 faces-0.013
 face-0.013
by-0.012
 tum-0.011
Token scared
Token ID12008
Feature activation+1.55e-02

Pos logit contributions
 face+0.003
edly+0.003
 faces+0.003
 back+0.003
 tum+0.003
Neg logit contributions
 course-0.009
ions-0.009
 July-0.009
neath-0.008
issa-0.008
Token,
Token ID11
Feature activation+5.71e-03

Pos logit contributions
edly+0.018
 faces+0.016
aj+0.015
 face+0.015
by+0.015
Neg logit contributions
 course-0.036
ions-0.035
 July-0.034
é-0.033
issa-0.033
Token but
Token ID475
Feature activation-5.36e-03

Pos logit contributions
edly+0.008
aj+0.007
by+0.007
powers+0.007
 faces+0.007
Neg logit contributions
 course-0.012
 July-0.011
ions-0.011
é-0.011
ner-0.011
Token her
Token ID607
Feature activation+1.92e-02

Pos logit contributions
 course+0.012
ions+0.011
 July+0.011
é+0.011
neath+0.010
Neg logit contributions
edly-0.007
 face-0.007
 faces-0.007
 tum-0.006
 back-0.006
Token each
Token ID1123
Feature activation-1.94e-02

Pos logit contributions
 course+0.014
 July+0.012
ions+0.012
é+0.012
neath+0.012
Neg logit contributions
edly-0.012
powers-0.012
aj-0.012
suit-0.011
ouse-0.011
Token other
Token ID584
Feature activation+5.31e-02

Pos logit contributions
 course+0.038
 July+0.035
ions+0.035
é+0.033
issa+0.033
Neg logit contributions
edly-0.031
aj-0.028
 faces-0.028
 face-0.027
powers-0.027
Token and
Token ID290
Feature activation+1.95e-02

Pos logit contributions
edly+0.063
powers+0.060
aj+0.059
suit+0.054
 upon+0.053
Neg logit contributions
 course-0.117
 July-0.102
ions-0.101
issa-0.100
é-0.100
Token promised
Token ID8072
Feature activation+3.30e-02

Pos logit contributions
edly+0.029
powers+0.027
aj+0.027
 spine+0.026
suit+0.026
Neg logit contributions
 course-0.040
 July-0.034
ions-0.034
issa-0.032
 introducing-0.032
Token to
Token ID284
Feature activation+7.59e-04

Pos logit contributions
suit+0.042
aj+0.037
edly+0.037
suits+0.036
uddle+0.032
Neg logit contributions
 course-0.070
 July-0.069
 whoever-0.067
ner-0.066
 introducing-0.065
Token be
Token ID307
Feature activation-1.18e-02

Pos logit contributions
edly+0.001
 faces+0.001
 face+0.001
 tum+0.001
by+0.001
Neg logit contributions
 course-0.002
ions-0.002
 July-0.002
é-0.002
issa-0.002
Token friends
Token ID2460
Feature activation+3.54e-02

Pos logit contributions
 course+0.022
ions+0.020
 July+0.020
é+0.018
 introducing+0.018
Neg logit contributions
edly-0.020
 spine-0.018
suit-0.017
aj-0.017
 faces-0.017
Token.
Token ID13
Feature activation+1.56e-02

Pos logit contributions
edly+0.044
aj+0.043
 faces+0.042
 spine+0.040
powers+0.040
Neg logit contributions
 course-0.080
ions-0.069
ner-0.067
issa-0.065
 introducing-0.065
Token Their
Token ID5334
Feature activation+3.93e-02

Pos logit contributions
edly+0.023
by+0.023
 face+0.022
powers+0.021
 faces+0.021
Neg logit contributions
 July-0.029
 course-0.028
ions-0.027
neath-0.026
é-0.025
Token mom
Token ID1995
Feature activation-9.99e-04

Pos logit contributions
edly+0.065
suit+0.054
powers+0.051
 spine+0.049
atively+0.048
Neg logit contributions
 July-0.077
 course-0.072
é-0.063
ions-0.063
 introducing-0.062
Token smiled
Token ID13541
Feature activation+4.73e-02

Pos logit contributions
 course+0.002
ions+0.002
 July+0.002
é+0.002
issa+0.002
Neg logit contributions
edly-0.001
 faces-0.001
 face-0.001
 tum-0.001
by-0.001
Token
Token ID198
Feature activation+1.86e-02

Pos logit contributions
edly+0.008
by+0.007
 faces+0.007
 face+0.007
aj+0.007
Neg logit contributions
 course-0.012
ions-0.011
 July-0.011
é-0.011
neath-0.011
Token
Token ID198
Feature activation-2.28e-03

Pos logit contributions
edly+0.022
 faces+0.020
 face+0.019
aj+0.017
by+0.017
Neg logit contributions
 course-0.044
ions-0.042
 July-0.041
é-0.040
issa-0.039
TokenTim
Token ID14967
Feature activation-8.24e-03

Pos logit contributions
ions+0.004
 course+0.004
 July+0.004
issa+0.004
igh+0.004
Neg logit contributions
edly-0.004
 face-0.003
 faces-0.003
 tum-0.003
by-0.003
Token's
Token ID338
Feature activation+2.38e-02

Pos logit contributions
 course+0.017
é+0.015
issa+0.015
 July+0.015
neath+0.014
Neg logit contributions
edly-0.012
 faces-0.011
powers-0.011
 face-0.010
 spine-0.010
Token mom
Token ID1995
Feature activation-2.56e-03

Pos logit contributions
edly+0.040
powers+0.031
by+0.030
eed+0.030
 spine+0.029
Neg logit contributions
 course-0.040
 July-0.040
issa-0.037
 piano-0.037
 tornado-0.035
Token saw
Token ID2497
Feature activation-2.64e-02

Pos logit contributions
 course+0.006
ions+0.006
 July+0.006
é+0.006
opl+0.006
Neg logit contributions
edly-0.003
 face-0.003
 faces-0.003
by-0.002
 tum-0.002
Token the
Token ID262
Feature activation-6.40e-03

Pos logit contributions
 course+0.047
issa+0.047
 July+0.047
ions+0.047
é+0.046
Neg logit contributions
 faces-0.040
 face-0.039
edly-0.038
 back-0.036
 to-0.035
Token problem
Token ID1917
Feature activation+2.37e-03

Pos logit contributions
 course+0.013
 July+0.012
issa+0.012
ions+0.012
é+0.011
Neg logit contributions
edly-0.010
by-0.008
 faces-0.008
powers-0.008
aj-0.008
Token.
Token ID13
Feature activation-5.86e-03

Pos logit contributions
edly+0.002
 faces+0.002
 face+0.002
by+0.002
aj+0.002
Neg logit contributions
 course-0.006
ions-0.006
 July-0.006
é-0.006
issa-0.006
Token She
Token ID1375
Feature activation+2.90e-02

Pos logit contributions
 course+0.010
 July+0.010
ions+0.010
é+0.010
ner+0.009
Neg logit contributions
edly-0.010
by-0.010
aj-0.009
 faces-0.008
powers-0.008
Token said
Token ID531
Feature activation+1.33e-02

Pos logit contributions
edly+0.037
 faces+0.034
 face+0.033
powers+0.031
aj+0.031
Neg logit contributions
 course-0.064
ions-0.061
 July-0.060
é-0.059
issa-0.059
Token.
Token ID13
Feature activation+7.30e-03

Pos logit contributions
aj+0.030
 faces+0.029
edly+0.028
 tum+0.027
 face+0.027
Neg logit contributions
 course-0.064
ions-0.060
issa-0.056
é-0.055
 introducing-0.054
Token She
Token ID1375
Feature activation+2.21e-02

Pos logit contributions
edly+0.011
 faces+0.010
by+0.010
 face+0.010
powers+0.009
Neg logit contributions
 course-0.014
 July-0.013
ions-0.013
é-0.013
neath-0.013
Token ran
Token ID4966
Feature activation-4.66e-03

Pos logit contributions
edly+0.034
 faces+0.031
 face+0.030
 spine+0.029
powers+0.028
Neg logit contributions
 course-0.043
 July-0.039
ions-0.039
issa-0.039
é-0.038
Token her
Token ID607
Feature activation+6.29e-02

Pos logit contributions
 course+0.010
ions+0.009
 July+0.009
é+0.009
ner+0.009
Neg logit contributions
edly-0.006
aj-0.006
 faces-0.006
 face-0.006
 tum-0.005
Token fingers
Token ID9353
Feature activation+1.65e-02

Pos logit contributions
edly+0.102
 faces+0.083
 face+0.081
powers+0.079
eed+0.079
Neg logit contributions
 course-0.115
issa-0.113
ions-0.105
 July-0.104
é-0.103
Token over
Token ID625
Feature activation-1.05e-02

Pos logit contributions
edly+0.021
 faces+0.020
 face+0.019
aj+0.019
 tum+0.019
Neg logit contributions
 course-0.035
ions-0.034
 July-0.033
é-0.033
neath-0.032
Token the
Token ID262
Feature activation+5.23e-03

Pos logit contributions
 course+0.024
ions+0.023
 July+0.023
é+0.022
issa+0.022
Neg logit contributions
edly-0.012
 faces-0.011
 face-0.011
 tum-0.010
by-0.010
Token velvet
Token ID47750
Feature activation+2.24e-02

Pos logit contributions
edly+0.009
by+0.008
 faces+0.008
powers+0.008
 face+0.008
Neg logit contributions
 course-0.009
 July-0.008
ions-0.008
issa-0.008
neath-0.008
Token dress
Token ID6576
Feature activation+1.90e-02

Pos logit contributions
powers+0.032
edly+0.032
aj+0.031
 faces+0.031
by+0.030
Neg logit contributions
 course-0.042
ions-0.040
issa-0.036
é-0.036
ner-0.036
Token and
Token ID290
Feature activation+1.98e-02

Pos logit contributions
edly+0.017
 faces+0.016
 face+0.015
powers+0.014
by+0.014
Neg logit contributions
 course-0.048
ions-0.046
 July-0.044
é-0.044
issa-0.044
Token felt
Token ID2936
Feature activation-1.08e-02

Pos logit contributions
edly+0.027
 faces+0.025
 face+0.025
 tum+0.023
by+0.023
Neg logit contributions
 course-0.042
ions-0.041
 July-0.040
é-0.039
neath-0.038
Token of
Token ID286
Feature activation-3.18e-02

Pos logit contributions
edly+0.021
 faces+0.019
 face+0.019
by+0.018
aj+0.018
Neg logit contributions
 course-0.033
ions-0.031
 July-0.031
é-0.030
issa-0.029
Token the
Token ID262
Feature activation-1.46e-02

Pos logit contributions
 course+0.058
ions+0.058
 July+0.055
neath+0.055
é+0.055
Neg logit contributions
 faces-0.049
 face-0.048
edly-0.047
 eyes-0.045
 to-0.044
Token monster
Token ID9234
Feature activation+8.15e-03

Pos logit contributions
ions+0.026
 course+0.026
 July+0.025
neath+0.024
é+0.024
Neg logit contributions
 faces-0.023
 face-0.023
edly-0.023
 tum-0.022
by-0.021
Token.
Token ID13
Feature activation+7.40e-03

Pos logit contributions
edly+0.007
 faces+0.006
 face+0.006
aj+0.006
 tum+0.006
Neg logit contributions
 course-0.021
ions-0.020
 July-0.020
é-0.020
issa-0.019
Token His
Token ID2399
Feature activation+3.78e-02

Pos logit contributions
edly+0.007
 faces+0.007
by+0.006
 face+0.006
aj+0.006
Neg logit contributions
 course-0.018
 July-0.017
ions-0.017
é-0.017
neath-0.017
Token mom
Token ID1995
Feature activation-2.10e-02

Pos logit contributions
edly+0.056
suit+0.045
powers+0.045
suits+0.045
ows+0.044
Neg logit contributions
 July-0.073
issa-0.069
 course-0.068
é-0.064
 introducing-0.063
Tokenmy
Token ID1820
Feature activation-2.68e-03

Pos logit contributions
ions+0.043
 July+0.042
 course+0.041
opl+0.040
neath+0.039
Neg logit contributions
 face-0.028
 faces-0.026
by-0.026
 to-0.026
 back-0.026
Token and
Token ID290
Feature activation-7.33e-03

Pos logit contributions
ions+0.006
 course+0.006
 July+0.006
opl+0.006
é+0.006
Neg logit contributions
 face-0.003
edly-0.003
 faces-0.003
by-0.003
 back-0.002
Token daddy
Token ID35822
Feature activation+7.63e-04

Pos logit contributions
 course+0.015
ions+0.015
 July+0.015
é+0.014
issa+0.014
Neg logit contributions
edly-0.010
 faces-0.009
 face-0.009
by-0.009
 tum-0.009
Token told
Token ID1297
Feature activation-3.27e-02

Pos logit contributions
edly+0.001
 face+0.001
 faces+0.001
by+0.001
 tum+0.001
Neg logit contributions
 course-0.002
ions-0.002
 July-0.002
opl-0.002
é-0.002
Token him
Token ID683
Feature activation+3.67e-02

Pos logit contributions
 course+0.064
ions+0.061
 July+0.060
é+0.057
issa+0.056
Neg logit contributions
edly-0.051
 faces-0.047
 face-0.047
by-0.044
 tum-0.044
Token the
Token ID262
Feature activation+1.15e-02

Pos logit contributions
 course+0.007
ions+0.007
 July+0.007
é+0.007
issa+0.007
Neg logit contributions
edly-0.005
 face-0.005
 faces-0.005
by-0.005
 tum-0.005
Token bird
Token ID6512
Feature activation+1.28e-03

Pos logit contributions
edly+0.017
 faces+0.016
aj+0.015
 face+0.015
by+0.015
Neg logit contributions
 course-0.023
 July-0.021
ions-0.021
é-0.020
ner-0.020
Token and
Token ID290
Feature activation+1.17e-02

Pos logit contributions
edly+0.001
 faces+0.001
 face+0.001
 tum+0.001
by+0.001
Neg logit contributions
 course-0.003
ions-0.003
 July-0.003
é-0.003
issa-0.003
Token asked
Token ID1965
Feature activation+9.70e-03

Pos logit contributions
edly+0.015
 faces+0.015
 face+0.014
by+0.013
 tum+0.013
Neg logit contributions
 course-0.025
ions-0.024
 July-0.024
é-0.023
issa-0.023
Token,
Token ID11
Feature activation+7.99e-03

Pos logit contributions
edly+0.013
 faces+0.012
 face+0.012
 tum+0.011
by+0.011
Neg logit contributions
 course-0.020
ions-0.020
 July-0.019
é-0.019
issa-0.018
Token "
Token ID366
Feature activation-7.02e-02

Pos logit contributions
edly+0.011
 face+0.010
 faces+0.010
 tum+0.009
by+0.009
Neg logit contributions
 course-0.017
ions-0.016
 July-0.016
é-0.015
issa-0.015
TokenDo
Token ID5211
Feature activation-2.12e-02

Pos logit contributions
ions+0.128
opl+0.124
é+0.123
 July+0.122
otion+0.119
Neg logit contributions
edly-0.107
 back-0.098
 upon-0.098
 face-0.091
by-0.091
Token you
Token ID345
Feature activation+8.31e-03

Pos logit contributions
 course+0.027
ions+0.025
 July+0.024
é+0.023
issa+0.022
Neg logit contributions
edly-0.047
 faces-0.045
 face-0.044
by-0.043
 tum-0.042
Token want
Token ID765
Feature activation-1.66e-02

Pos logit contributions
edly+0.011
 faces+0.010
 face+0.010
by+0.009
 tum+0.009
Neg logit contributions
 course-0.018
ions-0.017
 July-0.017
é-0.017
issa-0.016
Token to
Token ID284
Feature activation-1.58e-03

Pos logit contributions
 course+0.050
ions+0.049
 July+0.048
é+0.047
issa+0.046
Neg logit contributions
edly-0.008
 faces-0.006
 face-0.006
by-0.005
 tum-0.005
Token come
Token ID1282
Feature activation+6.46e-03

Pos logit contributions
 course+0.003
ions+0.003
 July+0.003
é+0.003
neath+0.003
Neg logit contributions
edly-0.002
 faces-0.002
 face-0.002
by-0.002
 tum-0.002
Token
Token ID198
Feature activation-6.29e-03

Pos logit contributions
edly+0.008
 faces+0.007
 face+0.007
by+0.006
 tum+0.006
Neg logit contributions
 course-0.018
ions-0.017
 July-0.017
é-0.017
issa-0.016
TokenThey
Token ID2990
Feature activation+1.12e-02

Pos logit contributions
 course+0.013
ions+0.013
 July+0.012
é+0.012
igh+0.012
Neg logit contributions
edly-0.009
 face-0.008
 faces-0.008
 upon-0.008
aj-0.008
Token played
Token ID2826
Feature activation-9.52e-03

Pos logit contributions
edly+0.012
 face+0.012
 faces+0.012
by+0.011
 tum+0.011
Neg logit contributions
 course-0.026
ions-0.026
 July-0.025
é-0.024
opl-0.024
Token with
Token ID351
Feature activation-4.26e-02

Pos logit contributions
 course+0.019
ions+0.018
 July+0.018
é+0.017
neath+0.017
Neg logit contributions
edly-0.014
 faces-0.013
 face-0.013
by-0.012
aj-0.012
Token the
Token ID262
Feature activation+1.94e-03

Pos logit contributions
 course+0.083
ions+0.082
ner+0.081
 July+0.080
é+0.079
Neg logit contributions
 faces-0.063
 face-0.060
 back-0.057
 tum-0.056
by-0.055
Token k
Token ID479
Feature activation-7.48e-02

Pos logit contributions
 face+0.003
 faces+0.003
 back+0.003
 tum+0.003
 favorite+0.003
Neg logit contributions
ions-0.003
 course-0.003
 July-0.003
issa-0.003
igh-0.003
Tokenite
Token ID578
Feature activation-1.45e-03

Pos logit contributions
 July+0.139
 course+0.129
 whoever+0.123
issa+0.121
 introducing+0.118
Neg logit contributions
powers-0.116
 back-0.112
uddle-0.110
s-0.109
 to-0.108
Token all
Token ID477
Feature activation-2.27e-02

Pos logit contributions
 course+0.003
ions+0.003
 July+0.003
é+0.003
issa+0.003
Neg logit contributions
edly-0.002
 face-0.002
 faces-0.002
by-0.001
 tum-0.001
Token day
Token ID1110
Feature activation+5.17e-02

Pos logit contributions
ograph+0.054
ner+0.052
igh+0.051
ions+0.050
icians+0.050
Neg logit contributions
 back-0.025
 to-0.024
by-0.023
 upon-0.021
.-0.021
Token.
Token ID13
Feature activation-2.81e-04

Pos logit contributions
edly+0.041
 faces+0.036
 face+0.035
by+0.032
aj+0.031
Neg logit contributions
 course-0.137
ions-0.133
 July-0.130
é-0.127
ner-0.127
Token Tom
Token ID4186
Feature activation+1.14e-03

Pos logit contributions
 course+0.001
ions+0.001
 July+0.001
é+0.001
issa+0.001
Neg logit contributions
edly-0.000
 face-0.000
 faces-0.000
 tum-0.000
aj-0.000
Token many
Token ID867
Feature activation-1.50e-02

Pos logit contributions
 course+0.095
ions+0.090
 July+0.090
é+0.089
issa+0.087
Neg logit contributions
edly-0.051
 faces-0.050
 face-0.049
 tum-0.045
 eyes-0.044
Token papers
Token ID9473
Feature activation-7.03e-03

Pos logit contributions
 course+0.032
ions+0.030
 July+0.030
é+0.029
issa+0.028
Neg logit contributions
edly-0.021
 face-0.018
 faces-0.018
by-0.018
aj-0.017
Token.
Token ID13
Feature activation+2.35e-04

Pos logit contributions
 course+0.018
ions+0.017
 July+0.017
é+0.016
ner+0.016
Neg logit contributions
edly-0.006
 faces-0.006
 face-0.006
by-0.005
 tum-0.005
Token
Token ID198
Feature activation+8.49e-03

Pos logit contributions
edly+0.000
 faces+0.000
 face+0.000
by+0.000
powers+0.000
Neg logit contributions
 course-0.000
ions-0.000
 July-0.000
é-0.000
neath-0.000
Token
Token ID198
Feature activation-6.66e-03

Pos logit contributions
edly+0.007
 face+0.006
 faces+0.006
by+0.005
aj+0.005
Neg logit contributions
 course-0.022
ions-0.022
 July-0.021
é-0.021
issa-0.021
Token"
Token ID1
Feature activation-6.80e-02

Pos logit contributions
 course+0.015
igh+0.014
opl+0.014
é+0.014
ions+0.014
Neg logit contributions
edly-0.008
 face-0.007
 faces-0.007
aj-0.007
 upon-0.007
TokenMom
Token ID29252
Feature activation-1.24e-02

Pos logit contributions
opl+0.154
 course+0.145
igh+0.143
é+0.142
ions+0.141
Neg logit contributions
edly-0.084
 faces-0.076
 face-0.074
 upon-0.071
 back-0.070
Token,
Token ID11
Feature activation-4.21e-02

Pos logit contributions
ions+0.031
 course+0.031
 July+0.030
é+0.029
neath+0.029
Neg logit contributions
 face-0.011
by-0.010
 faces-0.010
 back-0.010
aj-0.010
Token can
Token ID460
Feature activation-2.18e-02

Pos logit contributions
opl+0.100
 course+0.099
é+0.097
ions+0.096
hing+0.095
Neg logit contributions
 face-0.047
 to-0.043
 tum-0.043
 back-0.043
 faces-0.042
Token we
Token ID356
Feature activation-1.46e-02

Pos logit contributions
opl+0.051
hing+0.047
rob+0.046
 course+0.046
 sorts+0.045
Neg logit contributions
 face-0.028
 faces-0.026
 to-0.025
 eyes-0.024
 tum-0.024
Token go
Token ID467
Feature activation-1.26e-02

Pos logit contributions
rob+0.035
ions+0.034
 course+0.034
opl+0.034
issa+0.033
Neg logit contributions
 face-0.017
 to-0.015
 back-0.015
 tum-0.013
 faces-0.013
Token Mrs
Token ID9074
Feature activation-1.11e-02

Pos logit contributions
ions+0.052
hing+0.050
 July+0.050
 course+0.049
é+0.048
Neg logit contributions
 face-0.030
by-0.030
 tum-0.030
 to-0.030
edly-0.030
Token Johnson
Token ID5030
Feature activation+3.12e-03

Pos logit contributions
 course+0.029
ions+0.028
 July+0.027
é+0.027
issa+0.027
Neg logit contributions
edly-0.009
 faces-0.009
 face-0.008
powers-0.008
 tum-0.008
Token was
Token ID373
Feature activation-1.04e-02

Pos logit contributions
edly+0.004
 faces+0.003
 face+0.003
by+0.003
 tum+0.003
Neg logit contributions
 course-0.007
ions-0.007
 July-0.007
é-0.007
issa-0.007
Token the
Token ID262
Feature activation-3.02e-03

Pos logit contributions
ions+0.022
 course+0.021
hing+0.021
neath+0.021
ner+0.021
Neg logit contributions
 back-0.013
 to-0.013
 face-0.012
edly-0.012
 tum-0.011
Token best
Token ID1266
Feature activation-2.60e-04

Pos logit contributions
ions+0.006
 course+0.006
 July+0.006
é+0.006
neath+0.006
Neg logit contributions
 face-0.004
edly-0.004
 faces-0.004
 back-0.004
 tum-0.004
Token mel
Token ID7758
Feature activation-6.82e-02

Pos logit contributions
ions+0.000
 course+0.000
 July+0.000
neath+0.000
é+0.000
Neg logit contributions
 face-0.000
edly-0.000
 faces-0.000
 tum-0.000
by-0.000
Tokenon
Token ID261
Feature activation-1.97e-02

Pos logit contributions
 July+0.075
neath+0.057
issa+0.054
 cinema+0.054
é+0.052
Neg logit contributions
edly-0.158
by-0.156
 to-0.155
ouse-0.152
 back-0.152
Token-
Token ID12
Feature activation-4.60e-02

Pos logit contributions
 course+0.039
 July+0.038
ions+0.037
neath+0.036
é+0.036
Neg logit contributions
edly-0.028
 face-0.027
 faces-0.026
by-0.025
 back-0.024
Tokencho
Token ID6679
Feature activation-2.53e-02

Pos logit contributions
 July+0.084
 course+0.082
ions+0.079
neath+0.076
issa+0.076
Neg logit contributions
edly-0.075
by-0.071
powers-0.066
 to-0.066
 faces-0.065
Tokenoser
Token ID13416
Feature activation+1.87e-02

Pos logit contributions
 course+0.041
 July+0.040
ions+0.038
é+0.035
neath+0.035
Neg logit contributions
edly-0.047
by-0.044
 face-0.043
 faces-0.042
 back-0.042
Token in
Token ID287
Feature activation-1.75e-02

Pos logit contributions
edly+0.019
powers+0.016
aj+0.016
 faces+0.016
 face+0.016
Neg logit contributions
 course-0.045
ions-0.043
 July-0.041
é-0.041
neath-0.040
Token angry
Token ID7954
Feature activation+1.76e-02

Pos logit contributions
 course+0.025
ions+0.024
 July+0.024
é+0.023
issa+0.023
Neg logit contributions
edly-0.016
 faces-0.014
 face-0.014
by-0.013
powers-0.013
Token.
Token ID13
Feature activation+9.28e-06

Pos logit contributions
aj+0.021
powers+0.020
edly+0.020
by+0.019
ouse+0.019
Neg logit contributions
 course-0.040
ting-0.037
ions-0.037
otion-0.035
ner-0.035
Token She
Token ID1375
Feature activation+2.18e-02

Feature not active on prev token;
no predictions were affected.
Token said
Token ID531
Feature activation-1.50e-03

Pos logit contributions
edly+0.032
 faces+0.028
 face+0.027
powers+0.027
by+0.027
Neg logit contributions
 course-0.045
 July-0.042
issa-0.042
ions-0.041
é-0.041
Token,
Token ID11
Feature activation+5.19e-03

Pos logit contributions
 course+0.003
ions+0.003
 July+0.003
é+0.003
ner+0.003
Neg logit contributions
edly-0.002
 faces-0.002
 face-0.002
by-0.002
powers-0.002
Token "
Token ID366
Feature activation-6.79e-02

Pos logit contributions
edly+0.008
by+0.007
 faces+0.007
powers+0.007
 face+0.007
Neg logit contributions
 course-0.010
 July-0.010
ions-0.010
é-0.009
ner-0.009
TokenWhy
Token ID5195
Feature activation-1.15e-02

Pos logit contributions
ions+0.118
 course+0.117
 July+0.111
igh+0.110
é+0.109
Neg logit contributions
edly-0.115
 faces-0.111
 face-0.106
by-0.100
 back-0.099
Token did
Token ID750
Feature activation-1.32e-02

Pos logit contributions
 course+0.020
ions+0.019
 July+0.018
issa+0.017
é+0.017
Neg logit contributions
edly-0.022
 faces-0.019
 face-0.018
powers-0.018
by-0.018
Token you
Token ID345
Feature activation+9.07e-03

Pos logit contributions
 course+0.019
ions+0.019
 July+0.018
é+0.018
neath+0.017
Neg logit contributions
 faces-0.026
 face-0.026
edly-0.025
 tum-0.024
powers-0.023
Token disob
Token ID30163
Feature activation-3.10e-02

Pos logit contributions
edly+0.012
 face+0.012
 faces+0.012
 tum+0.011
by+0.010
Neg logit contributions
 course-0.019
ions-0.019
 July-0.018
é-0.018
issa-0.017
Tokeney
Token ID2959
Feature activation-2.82e-02

Pos logit contributions
 course+0.060
ions+0.059
 July+0.055
neath+0.053
issa+0.050
Neg logit contributions
 face-0.049
edly-0.048
 back-0.048
 faces-0.045
 to-0.044