TOP ACTIVATIONS
MAX = 0.454

as he opened the ch ocol ates and started eating them
not eat too many ch ocol ates and his friend learned
was filled with delicious ch ocol ates . He was happy
colorful cand ies and ch ocol ates . She said to
a big box of ch ocol ates . It was so
was a box of ch ocol ates . " For me
giving him too many ch ocol ates ." The king learned
a big box of ch ocol ates ! <|endoftext|> Once upon
and started eating the ch ocol ates . The shop keeper
, cup cakes and ch ocol ates .
enjoyed her cool , ch ocol at y treat !
a big box of ch ocol ates !
a large selection of ch ocol ates . Ell a picked
't have all these ch ocol ates ! Now , go
was a box of ch ocol ates ! Alice and her
find a box of ch ocol ates . "
looking at all the ch ocol ates , Lex i chose
was a box of ch ocol ates . Lily loved ch
started eating the delicious ch ocol ates . She was so
she paid for the ch ocol ates and the two of

MIN ACTIVATIONS
MIN = -0.312

ala named Chris . Chris said , " Harry , be
Jack smiled and said , " Yes , let
Jack was surprised and said , " Why did you
Jack 's mother said : " If you want
big tree . B ally said , " Hello , tree
Da isy smiled and said , " It 's so
Johnny smiled and said , " You 're welcome
Jack smiled and said , " I can 't
â Jane said ,
Sarah smiled back and said , " Yes !"
. Charlie said , " Hello , Lily
No ! â said the rude man .
The truck said , " Let 's go
Jack 's mom said , " No , I
Her mum smiled and said ,
Jimmy took a sip and said , " This is the
L ily said , " I don 't
L ily said , " Why won 't
me ". Sarah said , " It 's OK
" Poor kitten ," Sarah said . The kitten

INTERVAL 0.263 - 0.454
CONTAINS 0.165%

had zero balloons . <|endoftext|> L ily and Tom were playing
too . L ily decided to go home
nodded and said , " Bl ank ie !". The mom
smart dog !" L ily and Ben get their
was not bad . <|endoftext|> L ily and Ben were playing

INTERVAL 0.071 - 0.263
CONTAINS 15.329%

said , " Oh ! That 's very nice ! You
find things to mark that you can remove easily and that
" Oh no , you are a bad pirate !"
hammer and pretended to fix things around the yard . Suddenly
had been very brave and faced his fear with courage .

INTERVAL -0.120 - 0.071
CONTAINS 78.685%

dirty from carrying the rock . Twe ety said , "
cartoon machine that he did not hear his mom call him
searched the house , but could not find the cord .
to his mom . From that day on ,
time , there was a little girl named Lily . Lily

INTERVAL -0.312 - -0.120
CONTAINS 5.821%

happy and it made her mom smile too . <|endoftext|> Once
was glad that her mom my was feeling better , and
what to do . But then , Lily 's mom came
be very nice to go up the hill .
today . She was very excited to show her teacher how
Token as
Token ID355
Feature activation+7.55e-02

Pos logit contributions
ocol+0.258
ions+0.231
 situations+0.221
+0.220
ipping+0.214
Neg logit contributions
 said-0.103
 smile-0.086
 say-0.084
 big-0.078
 asked-0.076
Token he
Token ID339
Feature activation-4.17e-02

Pos logit contributions
ocol+0.187
ions+0.163
 situations+0.159
+0.159
ipping+0.154
Neg logit contributions
 said-0.105
 smile-0.090
 say-0.087
 big-0.087
 dad-0.083
Token opened
Token ID4721
Feature activation-4.21e-02

Pos logit contributions
 said+0.043
 smile+0.037
 say+0.035
 big+0.033
 asked+0.032
Neg logit contributions
ocol-0.114
ions-0.102
 situations-0.099
-0.098
ipping-0.095
Token the
Token ID262
Feature activation+2.21e-03

Pos logit contributions
 said+0.054
 smile+0.047
 say+0.045
 big+0.043
 asked+0.042
Neg logit contributions
ocol-0.106
ions-0.094
 situations-0.091
-0.089
ipping-0.087
Token ch
Token ID442
Feature activation+2.38e-02

Pos logit contributions
ocol+0.006
ions+0.005
 situations+0.005
+0.005
umm+0.005
Neg logit contributions
 said-0.003
 asked-0.002
 say-0.002
 smile-0.002
 understand-0.002
Tokenocol
Token ID4668
Feature activation+4.54e-01

Pos logit contributions
ocol+0.017
+0.016
ions+0.015
mm+0.014
+0.014
Neg logit contributions
 said-0.065
 smile-0.064
 big-0.061
 say-0.060
 surprised-0.059
Tokenates
Token ID689
Feature activation+1.41e-01

Pos logit contributions
ocol+0.535
 situations+0.501
Too+0.493
+0.463
+0.443
Neg logit contributions
 said-1.033
 smile-0.925
 dad-0.908
 say-0.908
 asked-0.906
Token and
Token ID290
Feature activation-2.19e-02

Pos logit contributions
ocol+0.368
ions+0.327
 situations+0.316
+0.311
ipping+0.303
Neg logit contributions
 said-0.168
 smile-0.143
 say-0.139
 big-0.130
 asked-0.129
Token started
Token ID2067
Feature activation-4.63e-02

Pos logit contributions
 said+0.025
 smile+0.022
 say+0.021
 big+0.020
 picture+0.020
Neg logit contributions
ocol-0.055
ions-0.049
 situations-0.047
-0.046
ubb-0.044
Token eating
Token ID6600
Feature activation+1.30e-01

Pos logit contributions
 said+0.056
 smile+0.049
 say+0.046
 big+0.045
 asked+0.044
Neg logit contributions
ocol-0.123
ions-0.109
 situations-0.106
-0.106
umm-0.103
Token them
Token ID606
Feature activation+3.05e-02

Pos logit contributions
ocol+0.334
+0.308
 situations+0.299
ions+0.298
Too+0.296
Neg logit contributions
 said-0.164
 big-0.145
 smile-0.145
 say-0.134
 open-0.127
Token not
Token ID407
Feature activation-4.16e-03

Pos logit contributions
 said+0.137
 say+0.119
 smile+0.119
 asked+0.105
 big+0.101
Neg logit contributions
ocol-0.343
ions-0.307
 situations-0.294
-0.289
ipping-0.287
Token eat
Token ID4483
Feature activation+1.19e-01

Pos logit contributions
 said+0.005
 smile+0.004
 say+0.004
 asked+0.004
 big+0.004
Neg logit contributions
ocol-0.011
ions-0.010
 situations-0.010
-0.009
ipping-0.009
Token too
Token ID1165
Feature activation+2.42e-02

Pos logit contributions
ocol+0.275
+0.258
ions+0.251
+0.247
umm+0.246
Neg logit contributions
 said-0.173
 big-0.164
 smile-0.154
 open-0.145
 say-0.145
Token many
Token ID867
Feature activation+1.97e-02

Pos logit contributions
ocol+0.057
ions+0.050
+0.048
 situations+0.048
ipping+0.047
Neg logit contributions
 said-0.036
 big-0.032
 smile-0.031
 say-0.030
 asked-0.030
Token ch
Token ID442
Feature activation+2.64e-02

Pos logit contributions
ocol+0.041
ions+0.036
+0.036
+0.035
ipping+0.034
Neg logit contributions
 said-0.034
 smile-0.030
 big-0.030
 say-0.029
 asked-0.029
Tokenocol
Token ID4668
Feature activation+4.54e-01

Pos logit contributions
+0.011
Too+0.009
+0.008
ions+0.008
Time+0.008
Neg logit contributions
 smile-0.079
 said-0.077
 big-0.076
 know-0.074
 dad-0.074
Tokenates
Token ID689
Feature activation+1.40e-01

Pos logit contributions
ocol+0.566
 situations+0.496
Too+0.492
+0.455
+0.439
Neg logit contributions
 said-1.045
 smile-0.930
 dad-0.918
 asked-0.917
 say-0.916
Token and
Token ID290
Feature activation-1.69e-02

Pos logit contributions
ocol+0.378
 situations+0.331
ions+0.330
+0.327
ipping+0.316
Neg logit contributions
 said-0.162
 smile-0.134
 say-0.129
 big-0.124
 asked-0.117
Token his
Token ID465
Feature activation+1.06e-02

Pos logit contributions
 said+0.022
 smile+0.019
 say+0.019
 big+0.018
 asked+0.017
Neg logit contributions
ocol-0.043
ions-0.038
 situations-0.036
-0.036
ipping-0.035
Token friend
Token ID1545
Feature activation-9.67e-02

Pos logit contributions
ocol+0.026
ions+0.022
 situations+0.022
+0.021
ubb+0.021
Neg logit contributions
 said-0.014
 say-0.012
 smile-0.012
 asked-0.011
 big-0.011
Token learned
Token ID4499
Feature activation-6.12e-02

Pos logit contributions
 said+0.102
 smile+0.084
 say+0.082
 big+0.076
 asked+0.075
Neg logit contributions
ocol-0.267
ions-0.239
 situations-0.231
-0.227
ipping-0.222
Token was
Token ID373
Feature activation+4.06e-02

Pos logit contributions
 said+0.048
 smile+0.040
 say+0.039
 big+0.037
 asked+0.036
Neg logit contributions
ocol-0.111
ions-0.099
 situations-0.095
-0.094
ipping-0.091
Token filled
Token ID5901
Feature activation+1.76e-02

Pos logit contributions
ocol+0.100
ions+0.088
 situations+0.084
+0.082
ipping+0.081
Neg logit contributions
 said-0.053
 smile-0.046
 say-0.045
 says-0.042
 big-0.042
Token with
Token ID351
Feature activation+1.14e-01

Pos logit contributions
ocol+0.045
ions+0.039
 situations+0.038
+0.038
+0.037
Neg logit contributions
 said-0.024
 smile-0.020
 big-0.020
 say-0.019
 asked-0.019
Token delicious
Token ID12625
Feature activation+4.60e-02

Pos logit contributions
ocol+0.279
ions+0.245
 situations+0.236
+0.227
ipping+0.224
Neg logit contributions
 said-0.153
 smile-0.131
 say-0.129
 asked-0.122
 big-0.120
Token ch
Token ID442
Feature activation+2.36e-02

Pos logit contributions
ocol+0.094
ions+0.087
+0.084
+0.080
 situations+0.080
Neg logit contributions
 said-0.079
 smile-0.074
 big-0.070
 say-0.069
 picture-0.067
Tokenocol
Token ID4668
Feature activation+4.53e-01

Pos logit contributions
+0.017
Too+0.015
Sp+0.015
+0.014
mm+0.014
Neg logit contributions
 smile-0.062
 big-0.061
 said-0.061
 old-0.058
 open-0.058
Tokenates
Token ID689
Feature activation+1.38e-01

Pos logit contributions
ocol+0.573
Too+0.542
 situations+0.521
+0.486
 pleasures+0.462
Neg logit contributions
 said-1.016
 smile-0.910
 asked-0.877
 answered-0.876
 dad-0.874
Token.
Token ID13
Feature activation-2.27e-02

Pos logit contributions
ocol+0.354
ions+0.317
 situations+0.303
+0.297
ipping+0.293
Neg logit contributions
 said-0.166
 smile-0.142
 say-0.139
 big-0.130
 asked-0.130
Token He
Token ID679
Feature activation+2.31e-02

Pos logit contributions
 said+0.029
 smile+0.024
 say+0.024
 asked+0.023
 big+0.022
Neg logit contributions
ocol-0.060
ions-0.052
 situations-0.051
-0.049
ipping-0.049
Token was
Token ID373
Feature activation+4.06e-02

Pos logit contributions
ocol+0.060
ions+0.054
+0.052
 situations+0.051
+0.051
Neg logit contributions
 said-0.021
 say-0.019
 smile-0.019
 picture-0.017
 big-0.017
Token happy
Token ID3772
Feature activation-9.27e-02

Pos logit contributions
ocol+0.097
ions+0.088
ubb+0.082
 accompl+0.082
ipping+0.080
Neg logit contributions
 said-0.048
 say-0.041
 says-0.041
 picture-0.040
 smile-0.039
Token colorful
Token ID20239
Feature activation-5.41e-02

Pos logit contributions
ocol+0.039
ions+0.034
 situations+0.033
+0.032
ipping+0.031
Neg logit contributions
 said-0.023
 smile-0.020
 say-0.019
 asked-0.018
 big-0.018
Token cand
Token ID2658
Feature activation+1.78e-01

Pos logit contributions
 said+0.092
 smile+0.082
 big+0.081
 say+0.079
 asked+0.077
Neg logit contributions
ocol-0.113
ions-0.100
-0.096
-0.095
 situations-0.094
Tokenies
Token ID444
Feature activation+2.68e-02

Pos logit contributions
ocol+0.390
ions+0.339
 situations+0.316
+0.311
ipping+0.309
Neg logit contributions
 said-0.285
 smile-0.257
 say-0.256
 asked-0.239
 big-0.236
Token and
Token ID290
Feature activation-2.38e-02

Pos logit contributions
ocol+0.069
ions+0.062
 situations+0.059
+0.058
ipping+0.057
Neg logit contributions
 said-0.033
 smile-0.028
 say-0.027
 big-0.025
 asked-0.025
Token ch
Token ID442
Feature activation+2.45e-02

Pos logit contributions
 said+0.025
 smile+0.024
 say+0.024
 understand+0.023
 know+0.022
Neg logit contributions
ocol-0.057
ions-0.050
pill-0.047
 situations-0.046
-0.046
Tokenocol
Token ID4668
Feature activation+4.51e-01

Pos logit contributions
ocol+0.014
 situations+0.012
+0.012
ions+0.011
Too+0.011
Neg logit contributions
 said-0.072
 smile-0.070
 asked-0.067
 say-0.066
 big-0.066
Tokenates
Token ID689
Feature activation+1.37e-01

Pos logit contributions
ocol+0.554
 situations+0.505
Too+0.485
+0.454
 pleasures+0.434
Neg logit contributions
 said-1.068
 smile-0.938
 asked-0.937
 dad-0.926
 say-0.921
Token.
Token ID13
Feature activation-1.92e-02

Pos logit contributions
ocol+0.338
ions+0.308
ipping+0.277
 situations+0.275
+0.274
Neg logit contributions
 said-0.163
 say-0.144
 smile-0.144
 big-0.133
 says-0.133
Token She
Token ID1375
Feature activation-5.20e-03

Pos logit contributions
 said+0.025
 smile+0.021
 say+0.020
 big+0.019
 asked+0.019
Neg logit contributions
ocol-0.049
ions-0.043
 situations-0.042
-0.041
ipping-0.040
Token said
Token ID531
Feature activation-2.92e-01

Pos logit contributions
 say+0.004
 smile+0.004
 understand+0.004
 picture+0.004
 know+0.004
Neg logit contributions
ocol-0.012
-0.011
ions-0.011
Too-0.011
pill-0.011
Token to
Token ID284
Feature activation-5.16e-02

Pos logit contributions
 said+0.375
 smile+0.322
 say+0.313
 big+0.297
 asked+0.293
Neg logit contributions
ocol-0.742
ions-0.658
 situations-0.636
-0.623
ipping-0.608
Token a
Token ID257
Feature activation-6.67e-02

Pos logit contributions
 said+0.142
 smile+0.141
 say+0.134
 know+0.123
 says+0.121
Neg logit contributions
ocol-0.235
ions-0.219
 situations-0.201
orns-0.187
ipping-0.185
Token big
Token ID1263
Feature activation-8.33e-02

Pos logit contributions
 said+0.065
 say+0.052
 smile+0.049
 asked+0.048
 says+0.048
Neg logit contributions
ocol-0.183
ions-0.160
-0.158
 situations-0.157
-0.155
Token box
Token ID3091
Feature activation-9.70e-03

Pos logit contributions
 said+0.107
 smile+0.095
 say+0.090
 big+0.087
 asked+0.083
Neg logit contributions
ocol-0.210
ions-0.188
 situations-0.181
-0.178
ipping-0.173
Token of
Token ID286
Feature activation-4.76e-03

Pos logit contributions
 said+0.013
 smile+0.012
 know+0.011
 say+0.011
 understand+0.011
Neg logit contributions
ocol-0.022
ions-0.020
ipping-0.019
-0.018
 situations-0.018
Token ch
Token ID442
Feature activation+2.49e-02

Pos logit contributions
 said+0.008
 smile+0.007
 say+0.007
 asked+0.006
 says+0.006
Neg logit contributions
ocol-0.011
ions-0.009
 situations-0.009
-0.008
 pleasures-0.008
Tokenocol
Token ID4668
Feature activation+4.51e-01

Pos logit contributions
+0.019
+0.017
mm+0.017
Too+0.017
 accompl+0.016
Neg logit contributions
 said-0.063
 big-0.063
 smile-0.063
 open-0.063
 dad-0.061
Tokenates
Token ID689
Feature activation+1.35e-01

Pos logit contributions
ocol+0.555
 situations+0.493
Too+0.492
+0.457
 pleasures+0.449
Neg logit contributions
 said-1.026
 smile-0.926
 dad-0.915
 asked-0.896
 picture-0.894
Token.
Token ID13
Feature activation-1.38e-02

Pos logit contributions
ocol+0.358
ions+0.321
 situations+0.308
+0.304
ipping+0.297
Neg logit contributions
 said-0.154
 smile-0.131
 say-0.127
 big-0.118
 asked-0.117
Token It
Token ID632
Feature activation+2.24e-02

Pos logit contributions
 said+0.018
 asked+0.014
 smile+0.014
 say+0.014
 big+0.014
Neg logit contributions
ocol-0.037
ions-0.032
 situations-0.032
ipping-0.031
rim-0.031
Token was
Token ID373
Feature activation+4.77e-02

Pos logit contributions
ocol+0.049
ions+0.046
+0.044
ar+0.042
pill+0.042
Neg logit contributions
 smile-0.027
 understand-0.026
 say-0.026
 dad-0.025
 said-0.025
Token so
Token ID523
Feature activation-1.02e-01

Pos logit contributions
ocol+0.132
ions+0.118
 situations+0.111
+0.109
ubb+0.107
Neg logit contributions
 said-0.049
 smile-0.041
 say-0.040
 asked-0.035
 says-0.035
Token was
Token ID373
Feature activation+3.86e-02

Pos logit contributions
ocol+0.124
ions+0.111
 situations+0.105
+0.104
ipping+0.102
Neg logit contributions
 said-0.061
 smile-0.053
 say-0.052
 big-0.048
 asked-0.047
Token a
Token ID257
Feature activation-7.32e-02

Pos logit contributions
ocol+0.103
ions+0.094
 situations+0.089
+0.086
ubb+0.085
Neg logit contributions
 said-0.041
 smile-0.035
 say-0.034
 asked-0.031
 says-0.030
Token box
Token ID3091
Feature activation-1.03e-02

Pos logit contributions
 said+0.075
 say+0.061
 says+0.057
 asked+0.057
 smile+0.056
Neg logit contributions
ocol-0.193
ions-0.169
 situations-0.166
-0.165
-0.164
Token of
Token ID286
Feature activation-9.71e-03

Pos logit contributions
 said+0.014
 smile+0.012
 say+0.012
 know+0.012
 says+0.012
Neg logit contributions
ocol-0.023
ions-0.021
ipping-0.019
-0.019
 situations-0.019
Token ch
Token ID442
Feature activation+2.42e-02

Pos logit contributions
 said+0.014
 smile+0.012
 say+0.012
 asked+0.011
 says+0.011
Neg logit contributions
ocol-0.023
ions-0.020
 situations-0.019
-0.018
ubb-0.018
Tokenocol
Token ID4668
Feature activation+4.50e-01

Pos logit contributions
+0.012
+0.011
mm+0.010
Too+0.009
ocol+0.009
Neg logit contributions
 said-0.070
 smile-0.068
 big-0.068
 open-0.066
 surprised-0.065
Tokenates
Token ID689
Feature activation+1.33e-01

Pos logit contributions
ocol+0.574
Too+0.528
 situations+0.521
+0.484
 pleasures+0.483
Neg logit contributions
 said-0.998
 smile-0.897
 dad-0.868
 say-0.866
 answered-0.865
Token.
Token ID13
Feature activation-2.04e-02

Pos logit contributions
ocol+0.337
ions+0.301
 situations+0.287
+0.283
ipping+0.278
Neg logit contributions
 said-0.166
 smile-0.143
 say-0.140
 big-0.131
 asked-0.130
Token "
Token ID366
Feature activation+1.09e-01

Pos logit contributions
 said+0.025
 asked+0.020
 say+0.020
 smile+0.020
 big+0.019
Neg logit contributions
ocol-0.054
ions-0.047
 situations-0.047
ipping-0.046
-0.046
TokenFor
Token ID1890
Feature activation+1.66e-01

Pos logit contributions
ocol+0.224
 situations+0.195
ions+0.182
ipping+0.180
igs+0.175
Neg logit contributions
 said-0.195
 say-0.173
 asked-0.173
 smile-0.169
 understand-0.163
Token me
Token ID502
Feature activation-1.62e-02

Pos logit contributions
ocol+0.426
+0.377
ipping+0.368
ions+0.366
 situations+0.364
Neg logit contributions
 said-0.213
 big-0.182
 say-0.178
 smile-0.174
 asked-0.169
Token giving
Token ID3501
Feature activation+1.53e-01

Pos logit contributions
ocol+0.203
ions+0.182
+0.176
 situations+0.172
ipping+0.170
Neg logit contributions
 said-0.177
 say-0.158
 smile-0.156
 asked-0.152
 big-0.151
Token him
Token ID683
Feature activation+1.41e-02

Pos logit contributions
ocol+0.351
ions+0.304
 situations+0.293
+0.284
orns+0.275
Neg logit contributions
 said-0.221
 smile-0.201
 say-0.194
 surprised-0.182
 understand-0.181
Token too
Token ID1165
Feature activation+2.39e-02

Pos logit contributions
ocol+0.035
ions+0.031
 situations+0.030
+0.029
ipping+0.028
Neg logit contributions
 said-0.019
 smile-0.016
 say-0.016
 asked-0.015
 big-0.015
Token many
Token ID867
Feature activation+2.15e-02

Pos logit contributions
ocol+0.050
ions+0.043
 situations+0.042
+0.040
 pleasures+0.039
Neg logit contributions
 said-0.039
 say-0.035
 smile-0.035
 asked-0.033
 dad-0.033
Token ch
Token ID442
Feature activation+2.83e-02

Pos logit contributions
ocol+0.052
ions+0.046
+0.045
 situations+0.044
+0.044
Neg logit contributions
 said-0.030
 smile-0.026
 say-0.025
 big-0.024
 asked-0.024
Tokenocol
Token ID4668
Feature activation+4.50e-01

Pos logit contributions
+0.029
ocol+0.027
ions+0.026
+0.026
Too+0.025
Neg logit contributions
 said-0.069
 smile-0.068
 big-0.064
 say-0.063
 dad-0.062
Tokenates
Token ID689
Feature activation+1.37e-01

Pos logit contributions
ocol+0.534
 situations+0.469
Too+0.462
+0.438
+0.421
Neg logit contributions
 said-1.052
 smile-0.947
 dad-0.936
 say-0.929
 asked-0.929
Token."
Token ID526
Feature activation-6.46e-02

Pos logit contributions
ocol+0.356
ions+0.316
 situations+0.306
+0.301
ipping+0.294
Neg logit contributions
 said-0.165
 smile-0.140
 say-0.136
 big-0.128
 asked-0.126
Token The
Token ID383
Feature activation+7.64e-02

Pos logit contributions
 said+0.076
 smile+0.065
 say+0.063
 big+0.060
 asked+0.058
Neg logit contributions
ocol-0.168
ions-0.150
 situations-0.145
-0.143
ipping-0.139
Token king
Token ID5822
Feature activation-4.42e-02

Pos logit contributions
ocol+0.200
ions+0.172
 situations+0.168
+0.166
ubb+0.164
Neg logit contributions
 said-0.086
 smile-0.070
 say-0.068
 asked-0.066
 says-0.063
Token learned
Token ID4499
Feature activation-5.96e-02

Pos logit contributions
 big+0.032
 picture+0.031
 dad+0.030
 say+0.029
 lost+0.028
Neg logit contributions
ocol-0.102
ions-0.098
ar-0.097
rim-0.094
ipping-0.092
Token a
Token ID257
Feature activation-7.21e-02

Pos logit contributions
ocol+0.306
ions+0.274
 situations+0.263
+0.255
+0.251
Neg logit contributions
 said-0.125
 smile-0.106
 say-0.103
 asked-0.094
 big-0.093
Token big
Token ID1263
Feature activation-8.53e-02

Pos logit contributions
 said+0.077
 say+0.062
 asked+0.061
 says+0.061
 exclaimed+0.054
Neg logit contributions
ocol-0.183
 situations-0.158
-0.158
ions-0.156
-0.155
Token box
Token ID3091
Feature activation-8.42e-03

Pos logit contributions
 said+0.063
 say+0.044
 smile+0.041
 asked+0.041
 says+0.037
Neg logit contributions
ocol-0.261
ions-0.232
 situations-0.227
-0.224
-0.221
Token of
Token ID286
Feature activation-1.14e-02

Pos logit contributions
 said+0.011
 smile+0.010
 say+0.009
 says+0.009
 know+0.009
Neg logit contributions
ocol-0.020
ions-0.018
 situations-0.017
ipping-0.017
-0.016
Token ch
Token ID442
Feature activation+2.15e-02

Pos logit contributions
 said+0.017
 smile+0.015
 say+0.015
 asked+0.015
 surprised+0.014
Neg logit contributions
ocol-0.026
ions-0.022
 situations-0.021
 pleasures-0.019
ubb-0.019
Tokenocol
Token ID4668
Feature activation+4.50e-01

Pos logit contributions
+0.011
mm+0.009
+0.009
Too+0.008
Sp+0.008
Neg logit contributions
 smile-0.062
 said-0.062
 big-0.060
 open-0.059
 dad-0.058
Tokenates
Token ID689
Feature activation+1.38e-01

Pos logit contributions
ocol+0.575
Too+0.541
 situations+0.509
+0.495
 pleasures+0.483
Neg logit contributions
 said-0.995
 smile-0.894
 answered-0.874
 dad-0.873
 asked-0.864
Token!
Token ID0
Feature activation+3.26e-02

Pos logit contributions
ocol+0.352
ions+0.319
 situations+0.300
+0.296
ipping+0.293
Neg logit contributions
 said-0.164
 smile-0.143
 say-0.139
 big-0.131
 asked-0.129
Token<|endoftext|>
Token ID50256
Feature activation-2.55e-02

Pos logit contributions
ocol+0.088
ions+0.079
 situations+0.076
+0.075
ipping+0.073
Neg logit contributions
 said-0.037
 smile-0.031
 say-0.030
 big-0.028
 asked-0.028
TokenOnce
Token ID7454
Feature activation+3.24e-02

Pos logit contributions
 said+0.046
 asked+0.038
 big+0.038
 smile+0.038
 saw+0.038
Neg logit contributions
ocol-0.056
 situations-0.045
ions-0.045
ubb-0.044
-0.043
Token upon
Token ID2402
Feature activation+6.32e-02

Pos logit contributions
ocol+0.046
ions+0.042
ipping+0.032
+0.032
igs+0.032
Neg logit contributions
 address-0.061
 say-0.060
 wrote-0.060
 write-0.060
 said-0.059
Token and
Token ID290
Feature activation-2.69e-02

Pos logit contributions
 said+0.012
 smile+0.012
 know+0.012
 say+0.012
 understand+0.012
Neg logit contributions
ocol-0.023
ions-0.022
ipping-0.020
-0.020
umm-0.020
Token started
Token ID2067
Feature activation-4.57e-02

Pos logit contributions
 smile+0.024
 understand+0.024
 say+0.022
 said+0.022
 picture+0.021
Neg logit contributions
ocol-0.066
ions-0.061
-0.058
ar-0.055
ipping-0.054
Token eating
Token ID6600
Feature activation+1.28e-01

Pos logit contributions
 said+0.042
 smile+0.034
 say+0.031
 big+0.030
 asked+0.030
Neg logit contributions
ocol-0.136
ions-0.122
 situations-0.120
-0.119
umm-0.115
Token the
Token ID262
Feature activation-1.10e-04

Pos logit contributions
ocol+0.368
 situations+0.334
+0.334
ions+0.331
rim+0.325
Neg logit contributions
 said-0.130
 big-0.109
 smile-0.105
 say-0.096
 open-0.092
Token ch
Token ID442
Feature activation+2.35e-02

Pos logit contributions
 said+0.000
 smile+0.000
 say+0.000
 asked+0.000
 dad+0.000
Neg logit contributions
ocol-0.000
ions-0.000
 situations-0.000
-0.000
ipping-0.000
Tokenocol
Token ID4668
Feature activation+4.49e-01

Pos logit contributions
Sp+0.007
+0.006
 accompl+0.006
 pic+0.005
Dri+0.005
Neg logit contributions
 smile-0.070
 said-0.069
 picture-0.068
 big-0.068
 open-0.067
Tokenates
Token ID689
Feature activation+1.32e-01

Pos logit contributions
ocol+0.564
 situations+0.542
Too+0.518
+0.469
 pleasures+0.466
Neg logit contributions
 said-0.985
 smile-0.880
 dad-0.865
 say-0.862
 picture-0.859
Token.
Token ID13
Feature activation-2.03e-02

Pos logit contributions
ocol+0.348
ions+0.311
 situations+0.299
+0.294
ipping+0.288
Neg logit contributions
 said-0.154
 smile-0.131
 say-0.127
 big-0.119
 asked-0.118
Token The
Token ID383
Feature activation+6.98e-02

Pos logit contributions
 said+0.029
 smile+0.025
 say+0.024
 asked+0.024
 big+0.024
Neg logit contributions
ocol-0.050
ions-0.044
 situations-0.043
ipping-0.041
-0.041
Token shop
Token ID6128
Feature activation+1.20e-02

Pos logit contributions
ocol+0.152
ions+0.133
 situations+0.127
+0.124
ipping+0.121
Neg logit contributions
 said-0.115
 smile-0.103
 say-0.101
 big-0.097
 asked-0.096
Tokenkeeper
Token ID13884
Feature activation-1.29e-01

Pos logit contributions
ocol+0.028
ions+0.026
+0.025
ar+0.024
umm+0.024
Neg logit contributions
 said-0.013
 say-0.012
 understand-0.012
 smile-0.012
 know-0.012
Token,
Token ID11
Feature activation+7.11e-02

Pos logit contributions
ocol+0.251
ions+0.226
 situations+0.216
+0.213
ipping+0.210
Neg logit contributions
 said-0.109
 smile-0.092
 say-0.090
 asked-0.083
 big-0.083
Token cup
Token ID6508
Feature activation+1.56e-01

Pos logit contributions
ocol+0.190
 situations+0.168
ions+0.166
+0.162
ubb+0.158
Neg logit contributions
 said-0.093
 smile-0.079
 say-0.075
 big-0.073
 asked-0.072
Tokencakes
Token ID37263
Feature activation+4.61e-02

Pos logit contributions
ocol+0.388
 situations+0.337
ions+0.324
+0.322
ubb+0.313
Neg logit contributions
 said-0.231
 smile-0.194
 big-0.188
 say-0.184
 asked-0.174
Token and
Token ID290
Feature activation-1.81e-02

Pos logit contributions
ocol+0.129
ions+0.114
 situations+0.113
+0.109
ubb+0.107
Neg logit contributions
 said-0.051
 smile-0.042
 say-0.040
 big-0.038
 asked-0.037
Token ch
Token ID442
Feature activation+2.73e-02

Pos logit contributions
 said+0.028
 smile+0.024
 say+0.024
 asked+0.023
 big+0.023
Neg logit contributions
ocol-0.042
ions-0.037
 situations-0.036
-0.035
ipping-0.035
Tokenocol
Token ID4668
Feature activation+4.48e-01

Pos logit contributions
 situations+0.014
+0.012
 accompl+0.011
+0.010
Too+0.010
Neg logit contributions
 said-0.081
 smile-0.079
 asked-0.075
 dad-0.075
 big-0.074
Tokenates
Token ID689
Feature activation+1.33e-01

Pos logit contributions
 situations+0.643
Too+0.622
ocol+0.587
+0.558
 pleasures+0.557
Neg logit contributions
 said-0.905
 dad-0.773
 asked-0.772
 answered-0.761
 smile-0.759
Token.
Token ID13
Feature activation-1.13e-02

Pos logit contributions
ocol+0.354
ions+0.316
 situations+0.304
+0.300
ipping+0.293
Neg logit contributions
 said-0.152
 smile-0.128
 say-0.125
 big-0.116
 asked-0.115
Token
Token ID220
Feature activation-6.79e-02

Pos logit contributions
 said+0.014
 say+0.011
 asked+0.011
 smile+0.010
 dad+0.010
Neg logit contributions
ocol-0.031
ions-0.028
 situations-0.027
ubb-0.026
ipping-0.026
Token
Token ID198
Feature activation+2.13e-04

Pos logit contributions
 said+0.077
 smile+0.066
 say+0.064
 big+0.059
 asked+0.058
Neg logit contributions
ocol-0.180
ions-0.161
 situations-0.155
-0.153
ipping-0.149
Token
Token ID198
Feature activation+4.06e-04

Pos logit contributions
ocol+0.001
ions+0.001
 situations+0.001
+0.001
ipping+0.000
Neg logit contributions
 said-0.000
 smile-0.000
 say-0.000
 asked-0.000
 big-0.000
Token enjoyed
Token ID8359
Feature activation-2.68e-02

Pos logit contributions
 said+0.026
 smile+0.022
 say+0.021
 big+0.020
 asked+0.019
Neg logit contributions
ocol-0.065
ions-0.058
 situations-0.056
-0.055
ipping-0.054
Token her
Token ID607
Feature activation-5.34e-02

Pos logit contributions
 said+0.027
 smile+0.023
 big+0.022
 dad+0.021
 asked+0.020
Neg logit contributions
ocol-0.077
 situations-0.073
-0.071
umm-0.069
Too-0.069
Token cool
Token ID3608
Feature activation-2.01e-02

Pos logit contributions
 said+0.081
 smile+0.073
 say+0.070
 big+0.069
 picture+0.065
Neg logit contributions
ocol-0.124
ions-0.110
 situations-0.106
-0.104
ipping-0.100
Token,
Token ID11
Feature activation+6.64e-02

Pos logit contributions
 said+0.029
 smile+0.028
 big+0.025
 say+0.025
 picture+0.024
Neg logit contributions
ocol-0.047
ions-0.043
 situations-0.042
 accompl-0.041
-0.040
Token ch
Token ID442
Feature activation+2.49e-02

Pos logit contributions
ocol+0.173
ions+0.154
 situations+0.150
+0.147
ipping+0.142
Neg logit contributions
 said-0.084
 smile-0.073
 say-0.070
 big-0.067
 asked-0.065
Tokenocol
Token ID4668
Feature activation+4.48e-01

Pos logit contributions
+0.031
 situations+0.031
Sp+0.030
+0.028
ocol+0.028
Neg logit contributions
 smile-0.054
 said-0.053
 surprised-0.048
 dad-0.048
 say-0.048
Tokenat
Token ID265
Feature activation+4.36e-02

Pos logit contributions
ocol+0.567
 situations+0.534
Too+0.500
+0.480
 pleasures+0.459
Neg logit contributions
 said-1.019
 smile-0.909
 dad-0.888
 answered-0.881
 asked-0.878
Tokeny
Token ID88
Feature activation-3.45e-02

Pos logit contributions
ocol+0.102
ions+0.088
 situations+0.087
+0.084
Too+0.080
Neg logit contributions
 said-0.066
 smile-0.058
 say-0.056
 big-0.054
 asked-0.052
Token treat
Token ID2190
Feature activation+8.90e-02

Pos logit contributions
 said+0.046
 smile+0.040
 say+0.038
 big+0.037
 asked+0.036
Neg logit contributions
ocol-0.086
ions-0.076
 situations-0.073
-0.072
ipping-0.070
Token!
Token ID0
Feature activation+3.37e-02

Pos logit contributions
ocol+0.240
 situations+0.214
+0.211
ions+0.206
Too+0.203
Neg logit contributions
 said-0.111
 smile-0.089
 say-0.088
 big-0.083
 asked-0.078
Token
Token ID198
Feature activation-3.10e-03

Pos logit contributions
ocol+0.095
ions+0.083
 situations+0.083
ipping+0.079
+0.079
Neg logit contributions
 said-0.037
 say-0.029
 smile-0.029
 asked-0.028
 dad-0.026
Token a
Token ID257
Feature activation-7.21e-02

Pos logit contributions
ocol+0.207
ions+0.183
 situations+0.179
+0.174
rim+0.173
Neg logit contributions
 said-0.082
 smile-0.069
 say-0.066
 big-0.065
 asked-0.061
Token big
Token ID1263
Feature activation-8.23e-02

Pos logit contributions
 said+0.080
 say+0.067
 says+0.062
 asked+0.062
 smile+0.061
Neg logit contributions
ocol-0.186
ions-0.160
-0.158
 situations-0.156
-0.155
Token box
Token ID3091
Feature activation-1.31e-02

Pos logit contributions
 said+0.105
 say+0.088
 smile+0.086
 asked+0.083
 understand+0.080
Neg logit contributions
ocol-0.206
ions-0.180
 situations-0.175
-0.173
-0.167
Token of
Token ID286
Feature activation-1.03e-02

Pos logit contributions
 said+0.019
 smile+0.017
 say+0.016
 understand+0.015
 asked+0.015
Neg logit contributions
ocol-0.030
ions-0.027
 situations-0.025
-0.025
ipping-0.024
Token ch
Token ID442
Feature activation+2.28e-02

Pos logit contributions
 said+0.016
 smile+0.014
 say+0.014
 asked+0.013
 surprised+0.013
Neg logit contributions
ocol-0.023
ions-0.020
 situations-0.019
-0.017
 pleasures-0.017
Tokenocol
Token ID4668
Feature activation+4.48e-01

Pos logit contributions
+0.007
+0.006
Too+0.006
we+0.005
mm+0.005
Neg logit contributions
 smile-0.068
 said-0.067
 open-0.067
 big-0.067
 picture-0.065
Tokenates
Token ID689
Feature activation+1.31e-01

Pos logit contributions
ocol+0.575
Too+0.511
 situations+0.509
 pleasures+0.469
+0.464
Neg logit contributions
 said-1.010
 smile-0.916
 dad-0.885
 answered-0.883
 big-0.880
Token!
Token ID0
Feature activation+3.32e-02

Pos logit contributions
ocol+0.324
ions+0.285
 situations+0.276
+0.272
ipping+0.263
Neg logit contributions
 said-0.176
 smile-0.152
 say-0.148
 big-0.140
 asked-0.138
Token
Token ID220
Feature activation-7.28e-02

Pos logit contributions
ocol+0.083
ions+0.074
 situations+0.071
+0.070
+0.068
Neg logit contributions
 said-0.044
 smile-0.038
 say-0.037
 big-0.035
 asked-0.034
Token
Token ID198
Feature activation-1.31e-03

Pos logit contributions
 said+0.091
 smile+0.082
 say+0.078
 picture+0.072
 asked+0.072
Neg logit contributions
ocol-0.179
ions-0.164
-0.154
 situations-0.154
-0.151
Token
Token ID198
Feature activation-1.42e-03

Pos logit contributions
 smile+0.002
 said+0.002
 picture+0.002
 say+0.002
 dad+0.002
Neg logit contributions
ocol-0.002
ions-0.002
-0.002
F-0.002
-0.002
Token a
Token ID257
Feature activation-6.88e-02

Pos logit contributions
 said+0.079
 smile+0.066
 say+0.065
 big+0.062
 asked+0.061
Neg logit contributions
ocol-0.175
ions-0.155
 situations-0.150
-0.148
ipping-0.145
Token large
Token ID1588
Feature activation-3.34e-02

Pos logit contributions
 said+0.090
 say+0.076
 smile+0.072
 asked+0.072
 says+0.071
Neg logit contributions
ocol-0.167
ions-0.143
-0.140
 situations-0.139
-0.138
Token selection
Token ID6356
Feature activation+1.07e-01

Pos logit contributions
 said+0.044
 smile+0.039
 say+0.037
 big+0.036
 picture+0.035
Neg logit contributions
ocol-0.083
ions-0.075
 situations-0.071
-0.070
ubb-0.069
Token of
Token ID286
Feature activation-9.46e-03

Pos logit contributions
ocol+0.304
 situations+0.270
+0.264
ions+0.261
ubb+0.256
Neg logit contributions
 said-0.114
 smile-0.087
 big-0.085
 say-0.084
 asked-0.080
Token ch
Token ID442
Feature activation+2.26e-02

Pos logit contributions
 said+0.015
 smile+0.013
 say+0.013
 big+0.013
 asked+0.012
Neg logit contributions
ocol-0.021
ions-0.019
-0.018
 situations-0.018
ipping-0.018
Tokenocol
Token ID4668
Feature activation+4.48e-01

Pos logit contributions
mm+0.009
Too+0.009
Fl+0.008
+0.008
 accompl+0.008
Neg logit contributions
 big-0.064
 smile-0.064
 said-0.062
 old-0.062
 open-0.062
Tokenates
Token ID689
Feature activation+1.31e-01

Pos logit contributions
Too+0.581
 situations+0.568
ocol+0.555
 pleasures+0.503
+0.499
Neg logit contributions
 said-0.924
 dad-0.820
 smile-0.817
 asked-0.808
 answered-0.802
Token.
Token ID13
Feature activation-2.06e-02

Pos logit contributions
ocol+0.372
ions+0.326
 situations+0.326
+0.320
ipping+0.310
Neg logit contributions
 said-0.135
 smile-0.106
 say-0.104
 big-0.097
 asked-0.095
Token Ell
Token ID7122
Feature activation+4.21e-02

Pos logit contributions
 said+0.029
 asked+0.025
 say+0.023
 smile+0.023
 dad+0.023
Neg logit contributions
ocol-0.051
 situations-0.045
ions-0.044
rim-0.043
orns-0.043
Tokena
Token ID64
Feature activation-1.15e-03

Pos logit contributions
ocol+0.079
ions+0.070
we+0.066
ipping+0.066
+0.066
Neg logit contributions
 smile-0.069
 say-0.068
 said-0.068
 big-0.066
 picture-0.066
Token picked
Token ID6497
Feature activation-3.35e-02

Pos logit contributions
 said+0.001
 smile+0.001
 say+0.001
 big+0.001
 picture+0.001
Neg logit contributions
ocol-0.003
ions-0.003
-0.003
 situations-0.003
-0.003
Token't
Token ID470
Feature activation+2.32e-02

Pos logit contributions
ocol+0.072
+0.065
ions+0.064
+0.063
 situations+0.059
Neg logit contributions
 said-0.050
 surprised-0.043
 smile-0.043
 says-0.042
 big-0.041
Token have
Token ID423
Feature activation+2.97e-03

Pos logit contributions
ocol+0.060
ions+0.053
 situations+0.051
+0.051
+0.049
Neg logit contributions
 said-0.028
 smile-0.023
 say-0.022
 big-0.021
 surprised-0.021
Token all
Token ID477
Feature activation-1.27e-01

Pos logit contributions
ocol+0.007
ions+0.006
 situations+0.006
+0.006
ipping+0.006
Neg logit contributions
 said-0.004
 smile-0.004
 say-0.004
 big-0.003
 surprised-0.003
Token these
Token ID777
Feature activation+1.18e-01

Pos logit contributions
 said+0.177
 smile+0.156
 say+0.151
 big+0.143
 asked+0.142
Neg logit contributions
ocol-0.303
ions-0.268
 situations-0.258
-0.248
ipping-0.243
Token ch
Token ID442
Feature activation+2.40e-02

Pos logit contributions
ocol+0.261
ions+0.227
 situations+0.217
+0.214
ipping+0.207
Neg logit contributions
 said-0.189
 smile-0.167
 say-0.164
 big-0.157
 asked-0.155
Tokenocol
Token ID4668
Feature activation+4.47e-01

Pos logit contributions
Sp+0.011
+0.010
Too+0.009
Dri+0.008
Fl+0.007
Neg logit contributions
 smile-0.070
 said-0.069
 know-0.067
 dad-0.066
 say-0.066
Tokenates
Token ID689
Feature activation+1.34e-01

Pos logit contributions
Too+0.592
 situations+0.579
ocol+0.577
umm+0.530
+0.524
Neg logit contributions
 said-0.934
 asked-0.809
 dad-0.807
 smile-0.804
 answered-0.802
Token!
Token ID0
Feature activation+3.43e-02

Pos logit contributions
ocol+0.336
ions+0.294
 situations+0.287
+0.281
ipping+0.275
Neg logit contributions
 said-0.178
 smile-0.152
 say-0.148
 big-0.141
 asked-0.138
Token Now
Token ID2735
Feature activation+1.56e-01

Pos logit contributions
ocol+0.071
ipping+0.059
 situations+0.058
ions+0.057
ubb+0.057
Neg logit contributions
 said-0.063
 asked-0.057
 say-0.056
 know-0.055
 says-0.054
Token,
Token ID11
Feature activation+6.59e-02

Pos logit contributions
ocol+0.372
ions+0.323
 situations+0.300
rim+0.295
ipping+0.294
Neg logit contributions
 said-0.249
 smile-0.221
 say-0.218
 big-0.205
 asked-0.204
Token go
Token ID467
Feature activation+9.29e-02

Pos logit contributions
ocol+0.151
ions+0.130
 situations+0.125
ipping+0.123
+0.122
Neg logit contributions
 said-0.108
 say-0.097
 smile-0.095
 asked-0.091
 big-0.090
Token was
Token ID373
Feature activation+3.92e-02

Pos logit contributions
 said+0.133
 smile+0.121
 say+0.118
 big+0.110
 picture+0.108
Neg logit contributions
ocol-0.281
ions-0.256
 situations-0.237
-0.236
-0.231
Token a
Token ID257
Feature activation-7.42e-02

Pos logit contributions
ocol+0.091
ions+0.087
 accompl+0.074
 degrees+0.073
+0.072
Neg logit contributions
 said-0.045
 smile-0.044
 say-0.043
 says-0.040
 know-0.039
Token box
Token ID3091
Feature activation-1.06e-02

Pos logit contributions
 said+0.072
 say+0.059
 says+0.055
 asked+0.053
 smile+0.053
Neg logit contributions
ocol-0.198
ions-0.172
-0.171
-0.170
 situations-0.168
Token of
Token ID286
Feature activation-1.36e-02

Pos logit contributions
 said+0.014
 smile+0.013
 know+0.013
 says+0.013
 understand+0.012
Neg logit contributions
ocol-0.023
ions-0.021
ipping-0.020
-0.019
-0.018
Token ch
Token ID442
Feature activation+2.08e-02

Pos logit contributions
 said+0.018
 smile+0.016
 say+0.015
 asked+0.015
 says+0.015
Neg logit contributions
ocol-0.033
ions-0.029
 situations-0.027
 pleasures-0.025
-0.025
Tokenocol
Token ID4668
Feature activation+4.47e-01

Pos logit contributions
+0.007
+0.006
Too+0.006
mm+0.006
ions+0.004
Neg logit contributions
 said-0.062
 smile-0.061
 open-0.061
 big-0.061
 old-0.059
Tokenates
Token ID689
Feature activation+1.33e-01

Pos logit contributions
ocol+0.598
Too+0.513
 situations+0.499
+0.474
 pleasures+0.470
Neg logit contributions
 said-1.012
 smile-0.912
 dad-0.891
 say-0.890
 asked-0.883
Token!
Token ID0
Feature activation+3.08e-02

Pos logit contributions
ocol+0.334
ions+0.302
 situations+0.281
ipping+0.278
+0.277
Neg logit contributions
 said-0.160
 smile-0.142
 say-0.137
 big-0.129
 asked-0.127
Token Alice
Token ID14862
Feature activation-1.43e-01

Pos logit contributions
ocol+0.057
+0.056
+0.053
ions+0.052
 situations+0.049
Neg logit contributions
 said-0.046
 say-0.044
 smile-0.044
 lost-0.042
 picture-0.041
Token and
Token ID290
Feature activation-2.68e-02

Pos logit contributions
 say+0.144
 smile+0.138
 said+0.137
 understand+0.133
 big+0.131
Neg logit contributions
ocol-0.322
ions-0.301
-0.300
-0.282
ar-0.281
Token her
Token ID607
Feature activation-5.86e-02

Pos logit contributions
 surprised+0.039
 lost+0.039
 said+0.039
 big+0.036
 know+0.036
Neg logit contributions
ocol-0.040
-0.038
 Wed-0.036
 occasions-0.036
ions-0.036
Token find
Token ID1064
Feature activation-1.54e-02

Pos logit contributions
ocol+0.295
ions+0.260
+0.249
+0.246
ubb+0.236
Neg logit contributions
 said-0.160
 picture-0.131
 exclaimed-0.130
 surprised-0.128
 replied-0.128
Token a
Token ID257
Feature activation-6.52e-02

Pos logit contributions
 said+0.022
 smile+0.021
 say+0.020
 asked+0.019
 replied+0.018
Neg logit contributions
ocol-0.034
ions-0.033
 situations-0.029
ubb-0.027
-0.027
Token box
Token ID3091
Feature activation-1.18e-02

Pos logit contributions
 said+0.086
 say+0.072
 smile+0.072
 asked+0.069
 says+0.067
Neg logit contributions
ocol-0.157
ions-0.139
 situations-0.133
-0.132
-0.128
Token of
Token ID286
Feature activation-5.97e-03

Pos logit contributions
 said+0.016
 smile+0.015
 say+0.014
 understand+0.014
 know+0.014
Neg logit contributions
ocol-0.025
ions-0.024
ipping-0.021
-0.021
ar-0.021
Token ch
Token ID442
Feature activation+2.35e-02

Pos logit contributions
 said+0.010
 smile+0.008
 say+0.008
 asked+0.008
 big+0.008
Neg logit contributions
ocol-0.013
ions-0.011
 situations-0.011
-0.011
ipping-0.010
Tokenocol
Token ID4668
Feature activation+4.47e-01

Pos logit contributions
+0.021
+0.020
Too+0.020
 accompl+0.019
umm+0.018
Neg logit contributions
 big-0.058
 smile-0.058
 said-0.057
 open-0.056
 dad-0.056
Tokenates
Token ID689
Feature activation+1.32e-01

Pos logit contributions
ocol+0.594
Too+0.550
 situations+0.532
+0.490
 pleasures+0.490
Neg logit contributions
 said-0.974
 smile-0.885
 dad-0.874
 big-0.858
 picture-0.857
Token.
Token ID13
Feature activation-9.96e-03

Pos logit contributions
ocol+0.368
ions+0.329
 situations+0.320
+0.315
ipping+0.307
Neg logit contributions
 said-0.137
 smile-0.112
 say-0.109
 big-0.100
 asked-0.099
Token
Token ID198
Feature activation+8.80e-03

Pos logit contributions
 said+0.013
 big+0.011
 say+0.010
 asked+0.010
 smile+0.010
Neg logit contributions
ocol-0.027
 situations-0.023
ions-0.023
ipping-0.022
ubb-0.022
Token
Token ID198
Feature activation+8.98e-03

Pos logit contributions
ocol+0.019
ions+0.018
+0.017
 situations+0.016
+0.016
Neg logit contributions
 said-0.012
 smile-0.012
 say-0.011
 picture-0.010
 lost-0.010
Token"
Token ID1
Feature activation+1.53e-01

Pos logit contributions
ocol+0.022
umm+0.018
 situations+0.018
ubb+0.018
ions+0.017
Neg logit contributions
 said-0.016
 big-0.013
 asked-0.013
 saw-0.013
 say-0.013
Token looking
Token ID2045
Feature activation+3.11e-02

Pos logit contributions
ocol+0.401
rim+0.376
umm+0.375
 situations+0.371
orns+0.357
Neg logit contributions
 said-0.147
 mom-0.122
 dad-0.121
 big-0.119
 asked-0.113
Token at
Token ID379
Feature activation-2.88e-02

Pos logit contributions
ocol+0.075
ions+0.065
+0.063
 situations+0.063
ipping+0.062
Neg logit contributions
 said-0.040
 say-0.035
 picture-0.034
 smile-0.033
 asked-0.032
Token all
Token ID477
Feature activation-1.29e-01

Pos logit contributions
 said+0.035
 smile+0.030
 say+0.029
 big+0.027
 asked+0.027
Neg logit contributions
ocol-0.075
ions-0.066
 situations-0.064
-0.063
ipping-0.062
Token the
Token ID262
Feature activation-8.50e-04

Pos logit contributions
 said+0.147
 smile+0.123
 say+0.119
 big+0.113
 asked+0.111
Neg logit contributions
ocol-0.349
ions-0.309
 situations-0.301
-0.297
ipping-0.289
Token ch
Token ID442
Feature activation+2.18e-02

Pos logit contributions
 said+0.001
 smile+0.001
 say+0.001
 big+0.001
 asked+0.001
Neg logit contributions
ocol-0.002
ions-0.002
 situations-0.001
-0.001
ipping-0.001
Tokenocol
Token ID4668
Feature activation+4.47e-01

Pos logit contributions
Sp+0.009
+0.007
ground+0.006
Ag+0.005
 situations+0.005
Neg logit contributions
 smile-0.065
 said-0.064
 big-0.062
 know-0.061
 say-0.061
Tokenates
Token ID689
Feature activation+1.33e-01

Pos logit contributions
ocol+0.602
 situations+0.577
Too+0.571
 pleasures+0.510
+0.503
Neg logit contributions
 said-0.944
 answered-0.829
 smile-0.826
 dad-0.824
 asked-0.816
Token,
Token ID11
Feature activation+6.46e-02

Pos logit contributions
ocol+0.344
ions+0.302
 situations+0.295
+0.289
ipping+0.281
Neg logit contributions
 said-0.168
 smile-0.142
 say-0.138
 big-0.131
 asked-0.128
Token Lex
Token ID17210
Feature activation-3.75e-02

Pos logit contributions
ocol+0.151
ions+0.131
 situations+0.126
ipping+0.121
rim+0.119
Neg logit contributions
 said-0.102
 smile-0.086
 say-0.085
 big-0.085
 asked-0.084
Tokeni
Token ID72
Feature activation-3.28e-02

Pos logit contributions
 said+0.068
 smile+0.062
 say+0.061
 big+0.059
 asked+0.058
Neg logit contributions
ocol-0.074
ions-0.063
 situations-0.059
-0.059
ipping-0.057
Token chose
Token ID7690
Feature activation-5.48e-02

Pos logit contributions
 said+0.025
 say+0.023
 smile+0.023
 big+0.020
 picture+0.020
Neg logit contributions
ocol-0.090
ions-0.082
-0.081
ipping-0.079
-0.078
Token was
Token ID373
Feature activation+5.05e-02

Pos logit contributions
ocol+0.044
ions+0.044
+0.040
ar+0.040
we+0.038
Neg logit contributions
 smile-0.027
 understand-0.027
 say-0.026
 said-0.025
 dad-0.025
Token a
Token ID257
Feature activation-6.29e-02

Pos logit contributions
ocol+0.136
ions+0.121
 situations+0.112
+0.108
ubb+0.108
Neg logit contributions
 said-0.054
 smile-0.046
 say-0.044
 understand-0.041
 asked-0.041
Token box
Token ID3091
Feature activation-1.24e-02

Pos logit contributions
 said+0.074
 smile+0.061
 say+0.061
 asked+0.057
 says+0.055
Neg logit contributions
ocol-0.164
ions-0.144
 situations-0.139
-0.138
-0.134
Token of
Token ID286
Feature activation-5.42e-03

Pos logit contributions
 said+0.018
 smile+0.016
 say+0.015
 asked+0.015
 understand+0.015
Neg logit contributions
ocol-0.028
ions-0.025
 situations-0.023
-0.023
ipping-0.023
Token ch
Token ID442
Feature activation+2.54e-02

Pos logit contributions
 said+0.008
 smile+0.007
 say+0.007
 asked+0.007
 surprised+0.007
Neg logit contributions
ocol-0.012
ions-0.011
 situations-0.010
-0.010
 pleasures-0.009
Tokenocol
Token ID4668
Feature activation+4.47e-01

Pos logit contributions
 accompl+0.016
Too+0.015
+0.015
+0.015
mm+0.013
Neg logit contributions
 big-0.069
 open-0.068
 said-0.067
 smile-0.066
 picture-0.065
Tokenates
Token ID689
Feature activation+1.31e-01

Pos logit contributions
Too+0.563
ocol+0.556
 situations+0.548
 pleasures+0.505
+0.499
Neg logit contributions
 said-0.942
 smile-0.840
 dad-0.837
 asked-0.819
 say-0.816
Token.
Token ID13
Feature activation-9.95e-03

Pos logit contributions
ocol+0.353
ions+0.312
 situations+0.306
+0.302
ipping+0.292
Neg logit contributions
 said-0.149
 smile-0.123
 say-0.120
 big-0.114
 asked-0.110
Token Lily
Token ID20037
Feature activation+5.67e-02

Pos logit contributions
 said+0.012
 asked+0.009
 say+0.009
 dad+0.009
 says+0.009
Neg logit contributions
ocol-0.027
 situations-0.024
ions-0.024
rim-0.024
ipping-0.024
Token loved
Token ID6151
Feature activation-7.49e-02

Pos logit contributions
ocol+0.132
ions+0.127
+0.124
pill+0.119
ar+0.115
Neg logit contributions
 smile-0.052
 say-0.050
 understand-0.049
 said-0.048
 picture-0.047
Token ch
Token ID442
Feature activation+2.52e-02

Pos logit contributions
 said+0.113
 smile+0.098
 big+0.095
 say+0.095
 asked+0.093
Neg logit contributions
ocol-0.175
 situations-0.153
ions-0.153
-0.152
ipping-0.148
Token started
Token ID2067
Feature activation-4.77e-02

Pos logit contributions
 smile+0.026
 understand+0.025
 say+0.024
 said+0.024
 picture+0.023
Neg logit contributions
ocol-0.063
ions-0.061
-0.056
ipping-0.053
ar-0.052
Token eating
Token ID6600
Feature activation+1.25e-01

Pos logit contributions
 said+0.056
 smile+0.047
 say+0.045
 big+0.044
 asked+0.043
Neg logit contributions
ocol-0.128
ions-0.114
 situations-0.112
-0.110
umm-0.106
Token the
Token ID262
Feature activation-1.96e-03

Pos logit contributions
ocol+0.348
 situations+0.318
+0.310
ions+0.309
Too+0.305
Neg logit contributions
 said-0.139
 smile-0.115
 big-0.114
 say-0.107
 open-0.099
Token delicious
Token ID12625
Feature activation+4.17e-02

Pos logit contributions
 said+0.003
 smile+0.003
 say+0.002
 asked+0.002
 dad+0.002
Neg logit contributions
ocol-0.005
ions-0.004
 situations-0.004
-0.004
ipping-0.004
Token ch
Token ID442
Feature activation+2.09e-02

Pos logit contributions
ocol+0.089
ions+0.085
 situations+0.079
+0.079
ubb+0.077
Neg logit contributions
 said-0.067
 smile-0.062
 big-0.061
 picture-0.058
 say-0.058
Tokenocol
Token ID4668
Feature activation+4.47e-01

Pos logit contributions
Sp+0.008
+0.006
Too+0.006
Fl+0.005
 accompl+0.005
Neg logit contributions
 smile-0.060
 said-0.060
 picture-0.060
 big-0.060
 old-0.059
Tokenates
Token ID689
Feature activation+1.33e-01

Pos logit contributions
ocol+0.589
 situations+0.511
Too+0.506
+0.459
 pleasures+0.454
Neg logit contributions
 said-1.006
 smile-0.892
 dad-0.889
 say-0.887
 picture-0.884
Token.
Token ID13
Feature activation-2.21e-02

Pos logit contributions
ocol+0.348
ions+0.313
 situations+0.298
+0.294
ipping+0.288
Neg logit contributions
 said-0.155
 smile-0.133
 say-0.129
 big-0.120
 asked-0.120
Token She
Token ID1375
Feature activation-7.06e-03

Pos logit contributions
 said+0.031
 smile+0.028
 say+0.028
 lost+0.026
 big+0.026
Neg logit contributions
ocol-0.047
-0.044
ions-0.043
-0.043
 situations-0.040
Token was
Token ID373
Feature activation+4.02e-02

Pos logit contributions
 say+0.007
 smile+0.007
 said+0.007
 understand+0.007
 picture+0.006
Neg logit contributions
ocol-0.016
ions-0.015
-0.014
pill-0.014
-0.014
Token so
Token ID523
Feature activation-1.05e-01

Pos logit contributions
ocol+0.089
ions+0.081
 accompl+0.075
ubb+0.074
 centimeters+0.073
Neg logit contributions
 said-0.051
 say-0.046
 says-0.045
 picture-0.043
 smile-0.042
Token she
Token ID673
Feature activation-5.57e-02

Pos logit contributions
ocol+0.162
ions+0.142
 situations+0.138
+0.132
orns+0.132
Neg logit contributions
 said-0.088
 smile-0.074
 say-0.071
 asked-0.069
 big-0.067
Token paid
Token ID3432
Feature activation+6.88e-02

Pos logit contributions
 smile+0.046
 picture+0.044
 understand+0.042
 say+0.042
 said+0.040
Neg logit contributions
ocol-0.135
ions-0.125
ar-0.119
-0.119
-0.114
Token for
Token ID329
Feature activation+7.50e-02

Pos logit contributions
ocol+0.169
ions+0.151
 situations+0.142
+0.139
ipping+0.137
Neg logit contributions
 said-0.090
 smile-0.079
 say-0.076
 asked-0.072
 picture-0.072
Token the
Token ID262
Feature activation+2.53e-03

Pos logit contributions
ocol+0.217
+0.201
 situations+0.199
ions+0.197
pill+0.197
Neg logit contributions
 said-0.072
 smile-0.060
 dad-0.059
 big-0.058
 say-0.057
Token ch
Token ID442
Feature activation+2.27e-02

Pos logit contributions
ocol+0.006
ions+0.005
 situations+0.005
+0.005
ipping+0.005
Neg logit contributions
 said-0.004
 smile-0.003
 say-0.003
 big-0.003
 asked-0.003
Tokenocol
Token ID4668
Feature activation+4.47e-01

Pos logit contributions
Sp+0.004
 situations+0.003
ions+0.003
+0.003
 accompl+0.002
Neg logit contributions
 smile-0.072
 said-0.071
 big-0.069
 say-0.069
 picture-0.068
Tokenates
Token ID689
Feature activation+1.33e-01

Pos logit contributions
ocol+0.576
 situations+0.558
Too+0.524
 pleasures+0.475
+0.471
Neg logit contributions
 said-0.975
 smile-0.870
 dad-0.864
 say-0.858
 answered-0.846
Token and
Token ID290
Feature activation-2.58e-02

Pos logit contributions
ocol+0.348
ions+0.311
 situations+0.299
+0.295
ipping+0.288
Neg logit contributions
 said-0.155
 smile-0.132
 say-0.128
 big-0.120
 asked-0.119
Token the
Token ID262
Feature activation+2.50e-03

Pos logit contributions
 said+0.026
 smile+0.022
 say+0.021
 big+0.020
 asked+0.019
Neg logit contributions
ocol-0.071
ions-0.064
 situations-0.061
-0.060
ipping-0.059
Token two
Token ID734
Feature activation-1.32e-01

Pos logit contributions
ocol+0.006
ions+0.005
 situations+0.005
+0.005
ipping+0.005
Neg logit contributions
 said-0.004
 smile-0.003
 say-0.003
 big-0.003
 asked-0.003
Token of
Token ID286
Feature activation-1.11e-02

Pos logit contributions
 said+0.177
 smile+0.154
 say+0.150
 big+0.142
 asked+0.141
Neg logit contributions
ocol-0.326
ions-0.288
 situations-0.277
-0.272
ipping-0.265
Tokenala
Token ID6081
Feature activation+5.41e-03

Pos logit contributions
ocol+0.233
ions+0.194
 situations+0.187
ubb+0.177
ipping+0.175
Neg logit contributions
 said-0.203
 asked-0.179
 saw-0.173
 say-0.172
 smile-0.163
Token named
Token ID3706
Feature activation+2.54e-02

Pos logit contributions
ocol+0.013
ions+0.011
+0.011
 situations+0.011
ipping+0.010
Neg logit contributions
 said-0.007
 smile-0.006
 picture-0.006
 big-0.006
 say-0.006
Token Chris
Token ID5180
Feature activation-1.17e-01

Pos logit contributions
ocol+0.045
ions+0.039
 situations+0.036
+0.034
ubb+0.034
Neg logit contributions
 said-0.048
 smile-0.045
 say-0.043
 picture-0.043
 surprised-0.041
Token.
Token ID13
Feature activation-3.41e-02

Pos logit contributions
 said+0.139
 smile+0.129
 say+0.124
 picture+0.122
 big+0.120
Neg logit contributions
ocol-0.274
ions-0.248
-0.240
 situations-0.237
ipping-0.226
Token Chris
Token ID5180
Feature activation-1.10e-01

Pos logit contributions
 said+0.049
 say+0.042
 smile+0.041
 asked+0.039
 big+0.039
Neg logit contributions
ocol-0.083
ions-0.072
 situations-0.070
-0.068
ipping-0.067
Token said
Token ID531
Feature activation-3.08e-01

Pos logit contributions
 said+0.115
 smile+0.110
 say+0.106
 picture+0.100
 big+0.100
Neg logit contributions
ocol-0.266
ions-0.244
-0.242
 situations-0.237
pill-0.226
Token,
Token ID11
Feature activation+5.00e-02

Pos logit contributions
 said+0.398
 smile+0.342
 say+0.333
 big+0.315
 asked+0.312
Neg logit contributions
ocol-0.777
ions-0.690
 situations-0.664
-0.653
ipping-0.636
Token "
Token ID366
Feature activation+9.77e-02

Pos logit contributions
ocol+0.114
ions+0.099
 situations+0.096
+0.093
ipping+0.091
Neg logit contributions
 said-0.078
 smile-0.069
 say-0.068
 big-0.064
 asked-0.064
TokenHarry
Token ID18308
Feature activation-3.74e-02

Pos logit contributions
ocol+0.199
ions+0.166
 situations+0.159
+0.148
ipping+0.148
Neg logit contributions
 said-0.185
 say-0.164
 smile-0.162
 asked-0.159
 saw-0.151
Token,
Token ID11
Feature activation+5.06e-02

Pos logit contributions
 said+0.048
 smile+0.041
 say+0.040
 big+0.038
 asked+0.038
Neg logit contributions
ocol-0.095
ions-0.084
 situations-0.081
-0.080
ipping-0.078
Token be
Token ID307
Feature activation+1.09e-01

Pos logit contributions
ocol+0.124
ions+0.109
 situations+0.105
+0.104
+0.104
Neg logit contributions
 said-0.072
 say-0.063
 smile-0.063
 big-0.059
 asked-0.057
Token
Token ID198
Feature activation-5.20e-03

Pos logit contributions
 said+0.181
 smile+0.158
 say+0.154
 big+0.145
 asked+0.145
Neg logit contributions
ocol-0.318
ions-0.282
 situations-0.270
-0.265
ipping-0.258
Token
Token ID198
Feature activation-6.97e-03

Pos logit contributions
 said+0.007
 say+0.006
 smile+0.006
 asked+0.006
 big+0.006
Neg logit contributions
ocol-0.013
ions-0.011
 situations-0.011
umm-0.011
ipping-0.011
TokenJack
Token ID14295
Feature activation-1.44e-02

Pos logit contributions
 said+0.012
 asked+0.011
 say+0.011
 smile+0.010
 big+0.010
Neg logit contributions
ocol-0.015
ions-0.013
umm-0.013
 situations-0.012
ipping-0.012
Token smiled
Token ID13541
Feature activation-1.84e-01

Pos logit contributions
 picture+0.014
 big+0.014
 dad+0.013
 say+0.013
 smile+0.013
Neg logit contributions
ocol-0.032
-0.030
ions-0.029
ipping-0.029
ar-0.029
Token and
Token ID290
Feature activation-3.88e-02

Pos logit contributions
 said+0.250
 say+0.218
 picture+0.217
 know+0.213
 says+0.205
Neg logit contributions
ocol-0.396
ipping-0.344
-0.338
 situations-0.336
umm-0.330
Token said
Token ID531
Feature activation-3.06e-01

Pos logit contributions
 picture+0.022
 understand+0.018
 smile+0.018
 say+0.017
bell+0.016
Neg logit contributions
ocol-0.101
ions-0.093
ar-0.092
-0.092
ipping-0.092
Token,
Token ID11
Feature activation+5.35e-02

Pos logit contributions
 said+0.376
 smile+0.322
 say+0.299
 big+0.296
 asked+0.285
Neg logit contributions
ocol-0.812
ions-0.722
 situations-0.707
-0.678
ipping-0.667
Token "
Token ID366
Feature activation+1.00e-01

Pos logit contributions
ocol+0.123
ions+0.107
 situations+0.104
+0.099
ipping+0.098
Neg logit contributions
 said-0.083
 smile-0.074
 say-0.072
 big-0.069
 asked-0.068
TokenYes
Token ID5297
Feature activation-1.20e-02

Pos logit contributions
ocol+0.216
ions+0.181
 situations+0.179
ipping+0.164
ubb+0.161
Neg logit contributions
 said-0.179
 say-0.156
 smile-0.154
 asked-0.154
 understand-0.145
Token,
Token ID11
Feature activation+5.72e-02

Pos logit contributions
 said+0.016
 smile+0.013
 say+0.013
 big+0.012
 asked+0.012
Neg logit contributions
ocol-0.031
ions-0.027
 situations-0.026
-0.026
ipping-0.025
Token let
Token ID1309
Feature activation-2.39e-02

Pos logit contributions
ocol+0.126
ions+0.110
 situations+0.104
+0.103
ipping+0.099
Neg logit contributions
 said-0.091
 smile-0.081
 say-0.079
 big-0.076
 asked-0.076
Token
Token ID198
Feature activation-6.97e-03

Pos logit contributions
 said+0.007
 say+0.006
 smile+0.006
 asked+0.006
 big+0.006
Neg logit contributions
ocol-0.013
ions-0.011
 situations-0.011
umm-0.011
ipping-0.011
TokenJack
Token ID14295
Feature activation-1.44e-02

Pos logit contributions
 said+0.012
 asked+0.011
 say+0.011
 smile+0.010
 big+0.010
Neg logit contributions
ocol-0.015
ions-0.013
umm-0.013
 situations-0.012
ipping-0.012
Token was
Token ID373
Feature activation+2.63e-02

Pos logit contributions
 picture+0.014
 big+0.014
 dad+0.013
 say+0.013
 smile+0.013
Neg logit contributions
ocol-0.032
-0.030
ions-0.029
ipping-0.029
ar-0.029
Token surprised
Token ID6655
Feature activation-2.19e-01

Pos logit contributions
ocol+0.061
 accompl+0.054
ions+0.053
ubb+0.051
ipping+0.050
Neg logit contributions
 said-0.030
 say-0.027
 picture-0.027
 says-0.027
 smile-0.025
Token and
Token ID290
Feature activation-3.70e-02

Pos logit contributions
 said+0.263
 smile+0.224
 say+0.218
 big+0.204
 asked+0.202
Neg logit contributions
ocol-0.570
ions-0.507
 situations-0.491
-0.480
ipping-0.471
Token said
Token ID531
Feature activation-3.05e-01

Pos logit contributions
 picture+0.030
 said+0.028
 smile+0.027
 say+0.027
 understand+0.026
Neg logit contributions
ocol-0.092
 accompl-0.080
 situations-0.080
Too-0.079
ions-0.079
Token,
Token ID11
Feature activation+5.45e-02

Pos logit contributions
 said+0.380
 smile+0.324
 say+0.310
 big+0.299
 asked+0.293
Neg logit contributions
ocol-0.796
ions-0.709
 situations-0.686
-0.668
ipping-0.656
Token "
Token ID366
Feature activation+1.01e-01

Pos logit contributions
ocol+0.126
ions+0.110
 situations+0.106
+0.102
ipping+0.100
Neg logit contributions
 said-0.085
 smile-0.075
 say-0.073
 big-0.070
 asked-0.069
TokenWhy
Token ID5195
Feature activation+1.78e-01

Pos logit contributions
ocol+0.211
ions+0.178
 situations+0.171
ipping+0.159
igs+0.158
Neg logit contributions
 said-0.187
 say-0.164
 asked-0.164
 smile-0.160
 understand-0.153
Token did
Token ID750
Feature activation-1.95e-02

Pos logit contributions
ocol+0.415
ions+0.367
 situations+0.352
+0.345
ipping+0.333
Neg logit contributions
 said-0.255
 smile-0.224
 say-0.219
 asked-0.209
 big-0.207
Token you
Token ID345
Feature activation+8.43e-02

Pos logit contributions
 said+0.037
 smile+0.033
 say+0.033
 asked+0.031
 big+0.031
Neg logit contributions
ocol-0.037
ions-0.031
 situations-0.030
-0.029
ipping-0.028
Token
Token ID198
Feature activation-5.20e-03

Pos logit contributions
 said+0.068
 say+0.062
 smile+0.060
 big+0.057
 open+0.056
Neg logit contributions
ocol-0.156
ions-0.140
ipping-0.133
 situations-0.133
-0.132
Token
Token ID198
Feature activation-6.97e-03

Pos logit contributions
 said+0.007
 say+0.006
 smile+0.006
 asked+0.006
 big+0.006
Neg logit contributions
ocol-0.013
ions-0.011
 situations-0.011
umm-0.011
ipping-0.011
TokenJack
Token ID14295
Feature activation-1.44e-02

Pos logit contributions
 said+0.012
 asked+0.011
 say+0.011
 smile+0.010
 big+0.010
Neg logit contributions
ocol-0.015
ions-0.013
umm-0.013
 situations-0.012
ipping-0.012
Token's
Token ID338
Feature activation-3.65e-02

Pos logit contributions
 picture+0.014
 big+0.014
 dad+0.013
 say+0.013
 smile+0.013
Neg logit contributions
ocol-0.032
-0.030
ions-0.029
ipping-0.029
ar-0.029
Token mother
Token ID2802
Feature activation-2.17e-01

Pos logit contributions
 said+0.035
 say+0.028
 whispered+0.026
 asked+0.026
 know+0.025
Neg logit contributions
ocol-0.097
-0.084
ions-0.083
ubb-0.083
 situations-0.083
Token said
Token ID531
Feature activation-3.05e-01

Pos logit contributions
 said+0.139
 picture+0.137
 smile+0.134
 big+0.131
 understand+0.125
Neg logit contributions
ocol-0.604
ions-0.545
-0.530
ipping-0.526
ar-0.511
Token:
Token ID25
Feature activation-5.82e-02

Pos logit contributions
 said+0.373
 smile+0.313
 say+0.296
 big+0.285
 asked+0.281
Neg logit contributions
ocol-0.814
ions-0.726
 situations-0.716
-0.681
orns-0.681
Token "
Token ID366
Feature activation+9.90e-02

Pos logit contributions
 said+0.072
 smile+0.062
 dad+0.059
 big+0.059
 say+0.058
Neg logit contributions
ocol-0.158
 situations-0.146
ions-0.141
orns-0.139
 pleasures-0.135
TokenIf
Token ID1532
Feature activation+6.15e-02

Pos logit contributions
ocol+0.210
ions+0.173
 situations+0.173
ubb+0.159
ipping+0.159
Neg logit contributions
 said-0.183
 say-0.158
 asked-0.157
 smile-0.154
 saw-0.147
Token you
Token ID345
Feature activation+8.17e-02

Pos logit contributions
ocol+0.127
ions+0.105
ubb+0.099
ipping+0.099
umm+0.098
Neg logit contributions
 said-0.119
 say-0.105
 smile-0.102
 dad-0.101
 asked-0.101
Token want
Token ID765
Feature activation-2.87e-02

Pos logit contributions
ocol+0.197
ions+0.174
+0.167
 situations+0.166
+0.162
Neg logit contributions
 said-0.105
 smile-0.089
 big-0.087
 surprised-0.086
 picture-0.085
Token big
Token ID1263
Feature activation-1.03e-01

Pos logit contributions
 said+0.098
 say+0.080
 smile+0.079
 asked+0.075
 says+0.073
Neg logit contributions
ocol-0.230
ions-0.203
 situations-0.197
-0.196
-0.191
Token tree
Token ID5509
Feature activation-9.40e-02

Pos logit contributions
 said+0.136
 smile+0.121
 say+0.116
 big+0.112
 picture+0.106
Neg logit contributions
ocol-0.257
ions-0.231
 situations-0.221
-0.219
ipping-0.211
Token.
Token ID13
Feature activation-3.36e-02

Pos logit contributions
 said+0.122
 smile+0.104
 say+0.102
 big+0.096
 asked+0.095
Neg logit contributions
ocol-0.240
ions-0.211
 situations-0.205
-0.201
ipping-0.195
Token B
Token ID347
Feature activation+1.54e-01

Pos logit contributions
 said+0.037
 asked+0.030
 say+0.027
 smile+0.027
 big+0.027
Neg logit contributions
ocol-0.097
ions-0.086
 situations-0.083
ubb-0.081
rim-0.081
Tokenally
Token ID453
Feature activation+1.21e-02

Pos logit contributions
ocol+0.321
ions+0.279
 situations+0.268
+0.251
 accompl+0.246
Neg logit contributions
 said-0.277
 say-0.242
 smile-0.241
 big-0.229
 asked-0.229
Token said
Token ID531
Feature activation-3.04e-01

Pos logit contributions
+0.025
pill+0.024
ocol+0.024
F+0.024
Fl+0.023
Neg logit contributions
 smile-0.012
 letter-0.011
 understand-0.011
 picture-0.011
 say-0.011
Token,
Token ID11
Feature activation+5.40e-02

Pos logit contributions
 said+0.402
 smile+0.347
 say+0.341
 big+0.320
 asked+0.318
Neg logit contributions
ocol-0.758
ions-0.671
 situations-0.645
-0.637
ipping-0.619
Token "
Token ID366
Feature activation+1.02e-01

Pos logit contributions
ocol+0.124
ions+0.108
 situations+0.105
+0.101
ipping+0.099
Neg logit contributions
 said-0.084
 smile-0.074
 say-0.072
 big-0.069
 asked-0.069
TokenHello
Token ID15496
Feature activation-1.70e-02

Pos logit contributions
ocol+0.214
ions+0.182
 situations+0.174
ubb+0.162
ipping+0.161
Neg logit contributions
 said-0.187
 say-0.164
 smile-0.163
 asked-0.160
 big-0.156
Token,
Token ID11
Feature activation+5.48e-02

Pos logit contributions
 said+0.022
 big+0.019
 smile+0.018
 say+0.017
 open+0.017
Neg logit contributions
ocol-0.045
ions-0.040
 situations-0.039
 pleasures-0.037
ubb-0.037
Token tree
Token ID5509
Feature activation-8.70e-02

Pos logit contributions
ocol+0.141
ions+0.126
 situations+0.121
+0.119
ipping+0.116
Neg logit contributions
 said-0.068
 smile-0.058
 say-0.056
 big-0.053
 asked-0.052
Token
Token ID198
Feature activation-6.97e-03

Pos logit contributions
 said+0.007
 say+0.006
 smile+0.006
 asked+0.006
 big+0.006
Neg logit contributions
ocol-0.013
ions-0.011
 situations-0.011
umm-0.011
ipping-0.011
TokenDa
Token ID26531
Feature activation+1.19e-01

Pos logit contributions
 said+0.012
 asked+0.011
 say+0.011
 smile+0.010
 big+0.010
Neg logit contributions
ocol-0.015
ions-0.013
umm-0.013
 situations-0.012
ipping-0.012
Tokenisy
Token ID13560
Feature activation-1.53e-01

Pos logit contributions
ocol+0.236
ions+0.208
 situations+0.204
+0.196
rim+0.196
Neg logit contributions
 said-0.177
 smile-0.174
 big-0.169
 dad-0.168
 say-0.165
Token smiled
Token ID13541
Feature activation-1.80e-01

Pos logit contributions
 said+0.147
 smile+0.131
 say+0.130
 big+0.119
 picture+0.118
Neg logit contributions
ocol-0.398
ions-0.358
-0.356
 situations-0.347
ipping-0.341
Token and
Token ID290
Feature activation-3.78e-02

Pos logit contributions
 said+0.240
 say+0.216
 picture+0.213
 know+0.210
 says+0.201
Neg logit contributions
ocol-0.384
ipping-0.337
-0.334
umm-0.323
 situations-0.322
Token said
Token ID531
Feature activation-3.04e-01

Pos logit contributions
 picture+0.025
 smile+0.023
 understand+0.022
 say+0.022
 know+0.019
Neg logit contributions
ocol-0.093
ions-0.088
-0.087
ar-0.085
ipping-0.084
Token,
Token ID11
Feature activation+5.48e-02

Pos logit contributions
 said+0.378
 smile+0.323
 say+0.303
 big+0.297
 asked+0.288
Neg logit contributions
ocol-0.802
ions-0.711
 situations-0.696
-0.667
ipping-0.657
Token "
Token ID366
Feature activation+1.02e-01

Pos logit contributions
ocol+0.129
ions+0.113
 situations+0.110
+0.103
ipping+0.103
Neg logit contributions
 said-0.084
 smile-0.074
 say-0.072
 big-0.069
 asked-0.069
TokenIt
Token ID1026
Feature activation-2.82e-02

Pos logit contributions
ocol+0.229
ions+0.191
 situations+0.191
ipping+0.176
ubb+0.173
Neg logit contributions
 said-0.175
 say-0.152
 smile-0.150
 asked-0.150
 understand-0.140
Token's
Token ID338
Feature activation-3.07e-02

Pos logit contributions
 said+0.028
 smile+0.024
 say+0.023
 dad+0.022
 big+0.021
Neg logit contributions
ocol-0.074
ions-0.068
 situations-0.066
-0.066
ipping-0.063
Token so
Token ID523
Feature activation-1.12e-01

Pos logit contributions
 said+0.041
 smile+0.035
 say+0.035
 big+0.034
 asked+0.033
Neg logit contributions
ocol-0.078
ions-0.069
-0.067
 situations-0.067
ipping-0.066
Token
Token ID198
Feature activation-1.63e-02

Pos logit contributions
 said+0.097
 smile+0.088
 say+0.087
 picture+0.082
 open+0.081
Neg logit contributions
ocol-0.177
ions-0.160
-0.151
-0.149
ipping-0.146
Token
Token ID198
Feature activation-1.57e-02

Pos logit contributions
 said+0.019
 smile+0.016
 say+0.015
 big+0.014
 asked+0.014
Neg logit contributions
ocol-0.044
ions-0.039
 situations-0.038
-0.037
ipping-0.036
TokenJohnny
Token ID44960
Feature activation+6.93e-02

Pos logit contributions
 said+0.027
 asked+0.024
 say+0.023
 smile+0.023
 big+0.023
Neg logit contributions
ocol-0.035
 situations-0.029
ions-0.029
umm-0.028
ipping-0.027
Token smiled
Token ID13541
Feature activation-1.81e-01

Pos logit contributions
ocol+0.149
+0.144
ipping+0.141
ions+0.141
+0.135
Neg logit contributions
 smile-0.067
 big-0.066
 picture-0.066
 say-0.065
 understand-0.064
Token and
Token ID290
Feature activation-3.74e-02

Pos logit contributions
 said+0.256
 say+0.226
 picture+0.225
 know+0.225
 says+0.218
Neg logit contributions
ocol-0.380
ipping-0.327
-0.322
umm-0.313
 situations-0.311
Token said
Token ID531
Feature activation-3.04e-01

Pos logit contributions
 picture+0.021
 smile+0.018
 understand+0.018
 say+0.016
 said+0.015
Neg logit contributions
ocol-0.098
ions-0.091
ar-0.089
-0.089
ipping-0.088
Token,
Token ID11
Feature activation+5.37e-02

Pos logit contributions
 said+0.377
 smile+0.323
 say+0.301
 big+0.298
 asked+0.287
Neg logit contributions
ocol-0.807
ions-0.719
 situations-0.702
-0.674
orns-0.666
Token "
Token ID366
Feature activation+1.01e-01

Pos logit contributions
ocol+0.124
ions+0.108
 situations+0.105
+0.100
ipping+0.098
Neg logit contributions
 said-0.084
 smile-0.074
 say-0.072
 big-0.069
 asked-0.068
TokenYou
Token ID1639
Feature activation+1.01e-01

Pos logit contributions
ocol+0.220
ions+0.185
 situations+0.184
ipping+0.167
igs+0.165
Neg logit contributions
 said-0.179
 say-0.156
 asked-0.154
 smile-0.154
 understand-0.145
Token're
Token ID821
Feature activation+2.84e-02

Pos logit contributions
ocol+0.184
ions+0.168
+0.154
 situations+0.149
+0.148
Neg logit contributions
 said-0.146
 picture-0.139
 smile-0.139
 old-0.135
 mother-0.134
Token welcome
Token ID7062
Feature activation+3.55e-02

Pos logit contributions
ocol+0.070
ions+0.062
 situations+0.061
+0.060
ipping+0.058
Neg logit contributions
 said-0.040
 smile-0.035
 say-0.034
 big-0.032
 asked-0.032
Token
Token ID198
Feature activation+9.94e-04

Pos logit contributions
 said+0.161
 smile+0.146
 say+0.139
 big+0.137
 picture+0.136
Neg logit contributions
ocol-0.323
ions-0.283
-0.272
 situations-0.272
-0.264
Token
Token ID198
Feature activation-1.39e-03

Pos logit contributions
ocol+0.003
ions+0.002
 situations+0.002
+0.002
ipping+0.002
Neg logit contributions
 said-0.001
 smile-0.001
 say-0.001
 big-0.001
 asked-0.001
TokenJack
Token ID14295
Feature activation-9.21e-03

Pos logit contributions
 said+0.003
 say+0.002
 smile+0.002
 asked+0.002
 big+0.002
Neg logit contributions
ocol-0.003
ions-0.002
 situations-0.002
umm-0.002
ipping-0.002
Token smiled
Token ID13541
Feature activation-1.78e-01

Pos logit contributions
 picture+0.008
 big+0.008
 dad+0.008
 understand+0.007
 letter+0.007
Neg logit contributions
ocol-0.020
-0.019
ipping-0.019
ar-0.018
Too-0.018
Token and
Token ID290
Feature activation-3.73e-02

Pos logit contributions
 said+0.257
 picture+0.240
 know+0.226
 say+0.224
 rang+0.223
Neg logit contributions
ocol-0.327
ipping-0.285
-0.278
 situations-0.268
 pleasures-0.267
Token said
Token ID531
Feature activation-3.04e-01

Pos logit contributions
 picture+0.021
 understand+0.017
bell+0.014
 letter+0.014
 smile+0.014
Neg logit contributions
ocol-0.095
ar-0.089
ions-0.088
-0.088
ipping-0.087
Token,
Token ID11
Feature activation+5.55e-02

Pos logit contributions
 said+0.374
 smile+0.321
 say+0.300
 big+0.292
 asked+0.285
Neg logit contributions
ocol-0.803
ions-0.715
 situations-0.702
-0.672
orns-0.661
Token "
Token ID366
Feature activation+1.02e-01

Pos logit contributions
ocol+0.126
ions+0.110
 situations+0.106
+0.102
ipping+0.100
Neg logit contributions
 said-0.087
 smile-0.077
 say-0.076
 big-0.072
 asked-0.072
TokenI
Token ID40
Feature activation+3.72e-02

Pos logit contributions
ocol+0.223
ions+0.192
 situations+0.185
+0.177
ipping+0.175
Neg logit contributions
 said-0.170
 smile-0.150
 say-0.148
 asked-0.142
 big-0.140
Token can
Token ID460
Feature activation+2.58e-02

Pos logit contributions
ocol+0.085
ions+0.074
+0.071
 situations+0.070
+0.069
Neg logit contributions
 said-0.051
 smile-0.045
 dad-0.044
 big-0.044
 say-0.044
Token't
Token ID470
Feature activation+2.33e-02

Pos logit contributions
ocol+0.058
ions+0.052
+0.051
+0.050
ipping+0.048
Neg logit contributions
 said-0.036
 smile-0.029
 big-0.029
 says-0.029
 picture-0.029
Tokenâ
Token ID22940
Feature activation+9.54e-02

Pos logit contributions
ocol+0.063
ions+0.055
 situations+0.053
+0.051
ipping+0.051
Neg logit contributions
 said-0.041
 smile-0.035
 say-0.035
 asked-0.033
 big-0.032
Token
Token ID26391
Feature activation+2.13e-02

Pos logit contributions
ocol+0.194
ions+0.166
 situations+0.159
+0.155
ipping+0.150
Neg logit contributions
 said-0.170
 smile-0.153
 say-0.150
 big-0.145
 asked-0.143
Token
Token ID198
Feature activation-9.73e-03

Pos logit contributions
ocol+0.053
ions+0.048
+0.046
 situations+0.046
+0.045
Neg logit contributions
 said-0.026
 smile-0.022
 say-0.022
 big-0.021
 asked-0.020
Token
Token ID198
Feature activation-9.35e-03

Pos logit contributions
 said+0.010
 smile+0.008
 say+0.008
 asked+0.007
 big+0.007
Neg logit contributions
ocol-0.028
ions-0.025
 situations-0.024
-0.024
ipping-0.024
TokenJane
Token ID41083
Feature activation+1.01e-03

Pos logit contributions
 said+0.016
 asked+0.014
 say+0.013
 big+0.013
 understand+0.013
Neg logit contributions
ocol-0.021
 situations-0.017
ions-0.017
umm-0.017
ipping-0.016
Token said
Token ID531
Feature activation-3.04e-01

Pos logit contributions
ocol+0.003
ions+0.002
+0.002
 situations+0.002
ipping+0.002
Neg logit contributions
 said-0.001
 smile-0.001
 picture-0.001
 say-0.001
 big-0.001
Token,
Token ID11
Feature activation+5.64e-02

Pos logit contributions
 said+0.393
 smile+0.337
 say+0.326
 big+0.310
 asked+0.306
Neg logit contributions
ocol-0.773
ions-0.686
 situations-0.663
-0.646
ipping-0.632
Token
Token ID6184
Feature activation+1.66e-01

Pos logit contributions
ocol+0.131
ions+0.114
 situations+0.110
+0.105
ipping+0.104
Neg logit contributions
 said-0.088
 smile-0.078
 say-0.076
 big-0.073
 asked-0.072
Token
Token ID95
Feature activation+1.55e-01

Pos logit contributions
ocol+0.271
 situations+0.245
ions+0.243
orns+0.241
 pleasures+0.222
Neg logit contributions
 said-0.348
 smile-0.327
 say-0.324
 open-0.311
 replied-0.308
Token
Token ID26391
Feature activation+1.85e-02

Pos logit contributions
ocol+0.314
ions+0.268
ubb+0.256
 situations+0.250
orns+0.240
Neg logit contributions
 said-0.287
 say-0.262
 smile-0.250
 open-0.247
 says-0.243
Token
Token ID129
Feature activation+2.18e-01

Pos logit contributions
ocol+0.040
ions+0.034
 situations+0.032
ubb+0.031
+0.030
Neg logit contributions
 said-0.033
 smile-0.029
 say-0.028
 big-0.027
 asked-0.027
Token
Token ID198
Feature activation-5.20e-03

Pos logit contributions
 said+0.048
 smile+0.040
 say+0.038
 big+0.037
 asked+0.036
Neg logit contributions
ocol-0.122
ions-0.108
 situations-0.105
-0.104
ipping-0.100
TokenSarah
Token ID29284
Feature activation-6.46e-02

Pos logit contributions
 said+0.007
 say+0.006
 smile+0.006
 asked+0.006
 big+0.006
Neg logit contributions
ocol-0.013
ions-0.011
 situations-0.011
umm-0.011
ipping-0.011
Token smiled
Token ID13541
Feature activation-1.86e-01

Pos logit contributions
 big+0.067
 picture+0.065
 smile+0.065
 say+0.064
 dad+0.064
Neg logit contributions
ocol-0.132
-0.128
ions-0.128
ipping-0.125
pill-0.122
Token back
Token ID736
Feature activation-1.06e-01

Pos logit contributions
 said+0.262
 picture+0.237
 know+0.237
 say+0.232
 says+0.221
Neg logit contributions
ocol-0.369
ipping-0.327
-0.315
umm-0.312
ubb-0.306
Token and
Token ID290
Feature activation-3.77e-02

Pos logit contributions
 said+0.140
 say+0.128
 understand+0.122
 write+0.120
 picture+0.120
Neg logit contributions
ocol-0.237
ipping-0.204
-0.198
ions-0.196
 situations-0.196
Token said
Token ID531
Feature activation-3.04e-01

Pos logit contributions
 picture+0.026
 understand+0.021
bell+0.021
 letter+0.021
 big+0.019
Neg logit contributions
ocol-0.087
ipping-0.083
-0.083
ar-0.082
ions-0.081
Token,
Token ID11
Feature activation+5.33e-02

Pos logit contributions
 said+0.398
 smile+0.343
 say+0.335
 big+0.316
 asked+0.313
Neg logit contributions
ocol-0.761
ions-0.675
 situations-0.650
-0.639
ipping-0.622
Token "
Token ID366
Feature activation+1.01e-01

Pos logit contributions
ocol+0.120
ions+0.104
 situations+0.100
+0.097
ipping+0.095
Neg logit contributions
 said-0.084
 smile-0.075
 say-0.073
 big-0.070
 asked-0.069
TokenYes
Token ID5297
Feature activation-1.04e-02

Pos logit contributions
ocol+0.214
ions+0.180
 situations+0.177
ipping+0.163
+0.162
Neg logit contributions
 said-0.181
 smile-0.160
 say-0.159
 asked-0.154
 big-0.148
Token!"
Token ID2474
Feature activation-9.06e-02

Pos logit contributions
 said+0.015
 smile+0.013
 say+0.013
 asked+0.012
 big+0.012
Neg logit contributions
ocol-0.024
ions-0.021
-0.020
 situations-0.020
ipping-0.020
Token
Token ID198
Feature activation-9.38e-03

Pos logit contributions
 said+0.113
 smile+0.098
 say+0.096
 big+0.091
 asked+0.088
Neg logit contributions
ocol-0.225
ions-0.203
 situations-0.194
-0.193
-0.189
Token.
Token ID13
Feature activation-3.24e-02

Pos logit contributions
 said+0.192
 smile+0.173
 say+0.167
 big+0.160
 picture+0.159
Neg logit contributions
ocol-0.415
ions-0.371
-0.354
 situations-0.353
ipping-0.342
Token
Token ID220
Feature activation-7.93e-02

Pos logit contributions
 said+0.042
 smile+0.036
 say+0.036
 big+0.034
 picture+0.033
Neg logit contributions
ocol-0.079
ions-0.070
-0.068
 situations-0.068
-0.066
Token
Token ID198
Feature activation-1.50e-02

Pos logit contributions
 said+0.108
 smile+0.101
 say+0.097
 picture+0.093
 open+0.089
Neg logit contributions
ocol-0.180
ions-0.167
-0.152
-0.150
 situations-0.150
Token
Token ID198
Feature activation-1.40e-02

Pos logit contributions
 said+0.022
 smile+0.021
 say+0.020
 lost+0.019
 picture+0.018
Neg logit contributions
ocol-0.032
ions-0.029
-0.029
 situations-0.027
-0.027
TokenCharlie
Token ID37136
Feature activation-8.69e-02

Pos logit contributions
 said+0.027
 smile+0.025
 say+0.025
 big+0.024
 picture+0.023
Neg logit contributions
ocol-0.025
ions-0.021
-0.021
 situations-0.020
ipping-0.019
Token said
Token ID531
Feature activation-3.03e-01

Pos logit contributions
 said+0.070
 smile+0.070
 picture+0.067
 say+0.067
 big+0.066
Neg logit contributions
ocol-0.218
-0.202
ions-0.197
 situations-0.190
ar-0.190
Token,
Token ID11
Feature activation+5.22e-02

Pos logit contributions
 said+0.400
 smile+0.345
 say+0.341
 big+0.317
 asked+0.316
Neg logit contributions
ocol-0.750
ions-0.664
 situations-0.638
-0.631
ipping-0.613
Token "
Token ID366
Feature activation+1.02e-01

Pos logit contributions
ocol+0.121
ions+0.106
 situations+0.103
+0.098
ipping+0.097
Neg logit contributions
 said-0.080
 smile-0.071
 say-0.069
 big-0.066
 asked-0.066
TokenHello
Token ID15496
Feature activation-1.57e-02

Pos logit contributions
ocol+0.209
ions+0.177
 situations+0.168
ipping+0.159
ubb+0.157
Neg logit contributions
 said-0.190
 smile-0.168
 say-0.167
 asked-0.164
 big-0.157
Token,
Token ID11
Feature activation+5.35e-02

Pos logit contributions
 said+0.022
 smile+0.019
 say+0.018
 asked+0.017
 big+0.017
Neg logit contributions
ocol-0.038
ions-0.034
 situations-0.032
-0.032
ipping-0.031
Token Lily
Token ID20037
Feature activation+4.52e-02

Pos logit contributions
ocol+0.122
ions+0.107
 situations+0.100
+0.098
ubb+0.096
Neg logit contributions
 said-0.072
 smile-0.062
 asked-0.061
 whispered-0.060
 say-0.059
Token
Token ID241
Feature activation+7.71e-02

Pos logit contributions
ocol+0.413
ions+0.346
 situations+0.330
+0.323
ipping+0.311
Neg logit contributions
 said-0.453
 smile-0.415
 say-0.405
 big-0.393
 asked-0.389
TokenNo
Token ID2949
Feature activation+1.01e-01

Pos logit contributions
ocol+0.178
ions+0.153
 situations+0.148
+0.144
ipping+0.142
Neg logit contributions
 said-0.120
 say-0.104
 smile-0.102
 asked-0.100
 big-0.098
Token!
Token ID0
Feature activation+2.56e-02

Pos logit contributions
ocol+0.278
ions+0.250
+0.235
umm+0.234
ubb+0.233
Neg logit contributions
 said-0.120
 smile-0.098
 say-0.097
 big-0.090
 asked-0.086
Tokenâ
Token ID22940
Feature activation+1.00e-01

Pos logit contributions
ocol+0.055
ions+0.046
 situations+0.046
ipping+0.044
rim+0.044
Neg logit contributions
 said-0.044
 say-0.039
 asked-0.038
 smile-0.037
 mom-0.037
Token
Token ID26391
Feature activation+2.11e-02

Pos logit contributions
ocol+0.222
ions+0.193
 situations+0.179
pill+0.178
ubb+0.177
Neg logit contributions
 said-0.173
 say-0.157
 smile-0.145
 says-0.140
 asked-0.140
Token said
Token ID531
Feature activation-3.03e-01

Pos logit contributions
ocol+0.049
ions+0.042
 situations+0.040
ubb+0.039
umm+0.039
Neg logit contributions
 said-0.035
 smile-0.030
 say-0.029
 asked-0.028
 big-0.028
Token the
Token ID262
Feature activation-1.77e-03

Pos logit contributions
 said+0.400
 smile+0.312
 say+0.310
 big+0.307
 mom+0.306
Neg logit contributions
ocol-0.797
ions-0.712
 situations-0.709
orns-0.692
umm-0.689
Token rude
Token ID22066
Feature activation+3.09e-02

Pos logit contributions
 said+0.003
 smile+0.002
 say+0.002
 big+0.002
 asked+0.002
Neg logit contributions
ocol-0.004
ions-0.004
 situations-0.004
-0.004
ipping-0.003
Token man
Token ID582
Feature activation-1.28e-01

Pos logit contributions
ocol+0.087
ions+0.079
+0.074
 situations+0.073
+0.073
Neg logit contributions
 said-0.034
 smile-0.028
 say-0.027
 big-0.026
 dad-0.025
Token.
Token ID13
Feature activation-2.17e-02

Pos logit contributions
 said+0.153
 say+0.121
 smile+0.120
 asked+0.112
 says+0.109
Neg logit contributions
ocol-0.351
ions-0.310
 situations-0.304
-0.296
umm-0.292
Token
Token ID6184
Feature activation+1.72e-01

Pos logit contributions
 said+0.028
 asked+0.023
 say+0.022
 says+0.021
 smile+0.021
Neg logit contributions
ocol-0.058
ions-0.051
 situations-0.051
ipping-0.049
rim-0.049
Token
Token ID220
Feature activation-7.54e-02

Pos logit contributions
 said+0.026
 smile+0.026
 say+0.024
 picture+0.023
 understand+0.023
Neg logit contributions
ocol-0.067
ions-0.061
-0.059
 situations-0.059
ipping-0.057
Token
Token ID198
Feature activation-1.63e-02

Pos logit contributions
 said+0.097
 smile+0.088
 say+0.087
 picture+0.082
 open+0.081
Neg logit contributions
ocol-0.177
ions-0.160
-0.151
-0.149
ipping-0.146
Token
Token ID198
Feature activation-1.57e-02

Pos logit contributions
 said+0.019
 smile+0.016
 say+0.015
 big+0.014
 asked+0.014
Neg logit contributions
ocol-0.044
ions-0.039
 situations-0.038
-0.037
ipping-0.036
TokenThe
Token ID464
Feature activation+3.29e-02

Pos logit contributions
 said+0.027
 asked+0.024
 say+0.023
 smile+0.023
 big+0.023
Neg logit contributions
ocol-0.035
 situations-0.029
ions-0.029
umm-0.028
ipping-0.027
Token truck
Token ID7779
Feature activation-4.56e-03

Pos logit contributions
ocol+0.085
ions+0.076
 situations+0.073
+0.072
ipping+0.070
Neg logit contributions
 said-0.039
 smile-0.033
 say-0.032
 asked-0.030
 big-0.030
Token said
Token ID531
Feature activation-3.03e-01

Pos logit contributions
 said+0.004
 smile+0.004
 say+0.004
 big+0.004
 know+0.004
Neg logit contributions
ocol-0.011
ions-0.010
-0.010
ar-0.010
 situations-0.010
Token,
Token ID11
Feature activation+5.54e-02

Pos logit contributions
 said+0.371
 smile+0.307
 big+0.287
 say+0.286
 asked+0.277
Neg logit contributions
ocol-0.817
ions-0.726
 situations-0.723
orns-0.690
umm-0.684
Token "
Token ID366
Feature activation+1.00e-01

Pos logit contributions
ocol+0.138
ions+0.120
 situations+0.119
orns+0.113
ipping+0.111
Neg logit contributions
 said-0.081
 smile-0.071
 say-0.068
 big-0.066
 asked-0.065
TokenLet
Token ID5756
Feature activation+4.90e-02

Pos logit contributions
ocol+0.212
 situations+0.183
ions+0.172
igs+0.172
 degrees+0.162
Neg logit contributions
 said-0.187
 asked-0.164
 say-0.160
 saw-0.149
 understand-0.145
Token's
Token ID338
Feature activation-2.94e-02

Pos logit contributions
ocol+0.124
 situations+0.113
+0.111
ions+0.109
ipping+0.107
Neg logit contributions
 said-0.045
 smile-0.043
 surprised-0.038
 picture-0.036
 say-0.036
Token go
Token ID467
Feature activation+8.62e-02

Pos logit contributions
 said+0.038
 surprised+0.032
 smile+0.031
 says+0.030
 answered+0.030
Neg logit contributions
ocol-0.066
ions-0.059
 situations-0.058
-0.058
-0.056
Token
Token ID198
Feature activation-7.78e-03

Pos logit contributions
 said+0.169
 smile+0.150
 say+0.144
 big+0.140
 picture+0.137
Neg logit contributions
ocol-0.351
ions-0.312
 situations-0.299
-0.298
-0.289
Token
Token ID198
Feature activation-7.19e-03

Pos logit contributions
 said+0.009
 smile+0.007
 say+0.007
 asked+0.007
 big+0.007
Neg logit contributions
ocol-0.021
ions-0.019
 situations-0.018
-0.018
ipping-0.017
TokenJack
Token ID14295
Feature activation-9.94e-03

Pos logit contributions
 said+0.013
 say+0.011
 asked+0.011
 smile+0.011
 big+0.011
Neg logit contributions
ocol-0.015
ions-0.013
 situations-0.013
umm-0.013
ubb-0.012
Token's
Token ID338
Feature activation-3.39e-02

Pos logit contributions
 picture+0.009
 big+0.008
 dad+0.008
 understand+0.007
 say+0.007
Neg logit contributions
ocol-0.022
-0.021
ipping-0.021
ar-0.021
ions-0.020
Token mom
Token ID1995
Feature activation-1.82e-01

Pos logit contributions
 said+0.034
 whispered+0.029
 say+0.027
 know+0.027
 asked+0.027
Neg logit contributions
ocol-0.081
-0.068
ubb-0.068
umm-0.067
 situations-0.067
Token said
Token ID531
Feature activation-3.03e-01

Pos logit contributions
 picture+0.110
 big+0.108
 understand+0.103
 dad+0.102
 know+0.096
Neg logit contributions
ocol-0.502
ions-0.462
ipping-0.449
-0.441
-0.427
Token,
Token ID11
Feature activation+5.75e-02

Pos logit contributions
 said+0.353
 smile+0.296
 say+0.274
 big+0.264
 asked+0.258
Neg logit contributions
ocol-0.828
ions-0.739
 situations-0.738
orns-0.711
umm-0.698
Token "
Token ID366
Feature activation+1.02e-01

Pos logit contributions
ocol+0.143
ions+0.125
 situations+0.125
orns+0.119
 pleasures+0.116
Neg logit contributions
 said-0.085
 smile-0.073
 say-0.070
 asked-0.068
 dad-0.067
TokenNo
Token ID2949
Feature activation+1.10e-01

Pos logit contributions
ocol+0.228
 situations+0.190
ions+0.184
ipping+0.173
ubb+0.172
Neg logit contributions
 said-0.181
 say-0.153
 asked-0.152
 smile-0.148
 saw-0.141
Token,
Token ID11
Feature activation+5.80e-02

Pos logit contributions
ocol+0.302
ions+0.270
+0.262
 situations+0.256
ubb+0.254
Neg logit contributions
 said-0.133
 smile-0.110
 say-0.108
 big-0.100
 says-0.096
Token I
Token ID314
Feature activation+1.45e-02

Pos logit contributions
ocol+0.162
orns+0.142
ions+0.141
 situations+0.138
+0.137
Neg logit contributions
 said-0.069
 smile-0.058
 say-0.057
 dad-0.056
 big-0.053
Token
Token ID198
Feature activation-1.47e-02

Pos logit contributions
 said+0.016
 asked+0.012
 big+0.012
 say+0.011
 smile+0.010
Neg logit contributions
ocol-0.049
 situations-0.044
ions-0.044
umm-0.043
ipping-0.042
TokenHer
Token ID9360
Feature activation-1.31e-04

Pos logit contributions
 said+0.025
 asked+0.022
 big+0.021
 understand+0.021
 say+0.020
Neg logit contributions
ocol-0.033
umm-0.028
 situations-0.028
ions-0.027
ubb-0.026
Token mum
Token ID25682
Feature activation-1.77e-01

Pos logit contributions
 said+0.000
 say+0.000
 smile+0.000
 asked+0.000
 surprised+0.000
Neg logit contributions
ocol-0.000
 situations-0.000
-0.000
ions-0.000
-0.000
Token smiled
Token ID13541
Feature activation-1.76e-01

Pos logit contributions
 said+0.080
 smile+0.065
 big+0.059
 say+0.057
 picture+0.057
Neg logit contributions
ocol-0.554
ions-0.508
-0.493
 situations-0.485
ipping-0.484
Token and
Token ID290
Feature activation-3.24e-02

Pos logit contributions
 said+0.220
 picture+0.195
 say+0.194
 know+0.193
 says+0.181
Neg logit contributions
ocol-0.398
-0.349
ipping-0.344
ions-0.339
 situations-0.337
Token said
Token ID531
Feature activation-3.03e-01

Pos logit contributions
 picture+0.010
 smile+0.009
 big+0.006
 understand+0.006
 know+0.004
Neg logit contributions
ocol-0.096
ions-0.089
-0.087
ipping-0.085
ar-0.084
Token,
Token ID11
Feature activation+5.86e-02

Pos logit contributions
 said+0.357
 smile+0.277
 big+0.261
 say+0.259
 asked+0.258
Neg logit contributions
ocol-0.832
 situations-0.750
ions-0.747
orns-0.719
umm-0.712
Token
Token ID6184
Feature activation+1.66e-01

Pos logit contributions
ocol+0.149
ions+0.132
 situations+0.131
orns+0.125
ipping+0.121
Neg logit contributions
 said-0.084
 smile-0.071
 say-0.069
 asked-0.067
 big-0.066
Token
Token ID95
Feature activation+1.54e-01

Pos logit contributions
ocol+0.285
ions+0.262
 situations+0.259
orns+0.249
 pleasures+0.226
Neg logit contributions
 said-0.345
 say-0.309
 smile-0.305
 replied-0.296
 open-0.294
Token
Token ID26391
Feature activation+1.85e-02

Pos logit contributions
ocol+0.325
ions+0.282
ubb+0.267
 situations+0.262
orns+0.248
Neg logit contributions
 said-0.280
 say-0.250
 asked-0.234
 smile-0.233
 says-0.233
Token
Token ID129
Feature activation+2.17e-01

Pos logit contributions
ocol+0.035
ions+0.029
 situations+0.028
ubb+0.026
umm+0.026
Neg logit contributions
 said-0.039
 smile-0.035
 say-0.034
 big-0.033
 dad-0.033
TokenJimmy
Token ID40335
Feature activation-4.00e-02

Pos logit contributions
 said+0.012
 asked+0.011
 say+0.011
 smile+0.010
 big+0.010
Neg logit contributions
ocol-0.015
ions-0.013
umm-0.013
 situations-0.012
ipping-0.012
Token took
Token ID1718
Feature activation+1.24e-02

Pos logit contributions
 said+0.038
 smile+0.034
 say+0.033
 picture+0.033
 big+0.032
Neg logit contributions
ocol-0.103
ions-0.092
-0.092
 situations-0.089
ipping-0.089
Token a
Token ID257
Feature activation-7.89e-02

Pos logit contributions
ocol+0.032
ions+0.028
 situations+0.027
+0.026
ipping+0.026
Neg logit contributions
 said-0.015
 smile-0.013
 say-0.013
 asked-0.012
 surprised-0.012
Token sip
Token ID31145
Feature activation+5.79e-02

Pos logit contributions
 said+0.092
 asked+0.086
 says+0.074
 wondered+0.074
 whispered+0.073
Neg logit contributions
ocol-0.180
-0.156
ions-0.149
ar-0.148
 mor-0.147
Token and
Token ID290
Feature activation-3.26e-02

Pos logit contributions
ocol+0.158
ions+0.140
 situations+0.137
+0.135
ipping+0.131
Neg logit contributions
 said-0.065
 smile-0.054
 say-0.052
 big-0.050
 asked-0.048
Token said
Token ID531
Feature activation-3.03e-01

Pos logit contributions
 picture+0.021
 smile+0.021
 say+0.019
 big+0.019
 said+0.019
Neg logit contributions
ocol-0.086
ions-0.079
 situations-0.073
-0.072
ipping-0.072
Token,
Token ID11
Feature activation+5.71e-02

Pos logit contributions
 said+0.373
 smile+0.319
 say+0.302
 big+0.292
 asked+0.285
Neg logit contributions
ocol-0.792
ions-0.708
 situations-0.689
-0.671
ipping-0.656
Token "
Token ID366
Feature activation+1.02e-01

Pos logit contributions
ocol+0.138
ions+0.121
 situations+0.119
+0.111
orns+0.111
Neg logit contributions
 said-0.085
 smile-0.074
 say-0.072
 asked-0.069
 big-0.069
TokenThis
Token ID1212
Feature activation+8.39e-02

Pos logit contributions
ocol+0.199
ions+0.171
 situations+0.168
ipping+0.151
igs+0.150
Neg logit contributions
 said-0.197
 say-0.175
 asked-0.174
 smile-0.173
 understand-0.164
Token is
Token ID318
Feature activation+6.10e-02

Pos logit contributions
ocol+0.191
ions+0.167
 situations+0.160
+0.157
ipping+0.153
Neg logit contributions
 said-0.129
 smile-0.114
 say-0.112
 big-0.107
 asked-0.105
Token the
Token ID262
Feature activation-2.82e-03

Pos logit contributions
ocol+0.159
ions+0.141
 situations+0.140
+0.139
ipping+0.136
Neg logit contributions
 said-0.079
 say-0.065
 smile-0.065
 big-0.065
 asked-0.060
Token
Token ID220
Feature activation-7.54e-02

Pos logit contributions
 smile+0.027
 said+0.027
 understand+0.026
 say+0.026
 picture+0.025
Neg logit contributions
-0.036
ocol-0.035
-0.035
ions-0.032
 situations-0.032
Token
Token ID198
Feature activation-1.63e-02

Pos logit contributions
 said+0.097
 smile+0.088
 say+0.087
 picture+0.082
 open+0.081
Neg logit contributions
ocol-0.177
ions-0.160
-0.151
-0.149
ipping-0.146
Token
Token ID198
Feature activation-1.57e-02

Pos logit contributions
 said+0.019
 smile+0.016
 say+0.015
 big+0.014
 asked+0.014
Neg logit contributions
ocol-0.044
ions-0.039
 situations-0.038
-0.037
ipping-0.036
TokenL
Token ID43
Feature activation+2.53e-01

Pos logit contributions
 said+0.027
 asked+0.024
 say+0.023
 smile+0.023
 big+0.023
Neg logit contributions
ocol-0.035
 situations-0.029
ions-0.029
umm-0.028
ipping-0.027
Tokenily
Token ID813
Feature activation-2.06e-02

Pos logit contributions
ocol+0.590
+0.527
 situations+0.526
+0.505
ions+0.505
Neg logit contributions
 said-0.369
 say-0.313
 smile-0.299
 asked-0.288
 understand-0.279
Token said
Token ID531
Feature activation-3.02e-01

Pos logit contributions
 picture+0.018
 smile+0.018
 said+0.018
 big+0.018
 say+0.018
Neg logit contributions
ocol-0.047
-0.046
ions-0.043
ipping-0.043
 situations-0.042
Token,
Token ID11
Feature activation+5.59e-02

Pos logit contributions
 said+0.394
 smile+0.340
 say+0.332
 big+0.312
 asked+0.310
Neg logit contributions
ocol-0.760
ions-0.674
 situations-0.649
-0.638
ipping-0.622
Token "
Token ID366
Feature activation+1.03e-01

Pos logit contributions
ocol+0.129
ions+0.112
 situations+0.109
+0.104
ipping+0.103
Neg logit contributions
 said-0.087
 smile-0.076
 say-0.074
 big-0.071
 asked-0.071
TokenI
Token ID40
Feature activation+3.85e-02

Pos logit contributions
ocol+0.223
ions+0.187
 situations+0.182
ipping+0.169
ubb+0.168
Neg logit contributions
 said-0.188
 say-0.165
 asked-0.162
 smile-0.161
 understand-0.152
Token don
Token ID836
Feature activation+1.72e-01

Pos logit contributions
ocol+0.088
ions+0.075
+0.073
 situations+0.072
+0.071
Neg logit contributions
 said-0.051
 smile-0.046
 dad-0.045
 big-0.044
 picture-0.044
Token't
Token ID470
Feature activation+2.47e-02

Pos logit contributions
ocol+0.410
ions+0.351
 situations+0.347
+0.329
 pleasures+0.325
Neg logit contributions
 said-0.255
 smile-0.220
 say-0.220
 big-0.215
 asked-0.208
Token
Token ID220
Feature activation-7.54e-02

Pos logit contributions
ocol+0.192
umm+0.167
ions+0.165
ubb+0.165
 situations+0.165
Neg logit contributions
 said-0.070
 say-0.053
 big-0.052
 smile-0.052
 asked-0.049
Token
Token ID198
Feature activation-1.63e-02

Pos logit contributions
 said+0.097
 smile+0.088
 say+0.087
 picture+0.082
 open+0.081
Neg logit contributions
ocol-0.177
ions-0.160
-0.151
-0.149
ipping-0.146
Token
Token ID198
Feature activation-1.57e-02

Pos logit contributions
 said+0.019
 smile+0.016
 say+0.015
 big+0.014
 asked+0.014
Neg logit contributions
ocol-0.044
ions-0.039
 situations-0.038
-0.037
ipping-0.036
TokenL
Token ID43
Feature activation+2.53e-01

Pos logit contributions
 said+0.027
 asked+0.024
 say+0.023
 smile+0.023
 big+0.023
Neg logit contributions
ocol-0.035
 situations-0.029
ions-0.029
umm-0.028
ipping-0.027
Tokenily
Token ID813
Feature activation-2.06e-02

Pos logit contributions
ocol+0.590
+0.527
 situations+0.526
+0.505
ions+0.505
Neg logit contributions
 said-0.369
 say-0.313
 smile-0.299
 asked-0.288
 understand-0.279
Token said
Token ID531
Feature activation-3.02e-01

Pos logit contributions
 picture+0.018
 smile+0.018
 said+0.018
 big+0.018
 say+0.018
Neg logit contributions
ocol-0.047
-0.046
ions-0.043
ipping-0.043
 situations-0.042
Token,
Token ID11
Feature activation+5.59e-02

Pos logit contributions
 said+0.394
 smile+0.340
 say+0.332
 big+0.312
 asked+0.310
Neg logit contributions
ocol-0.760
ions-0.674
 situations-0.649
-0.638
ipping-0.622
Token "
Token ID366
Feature activation+1.03e-01

Pos logit contributions
ocol+0.129
ions+0.112
 situations+0.109
+0.104
ipping+0.103
Neg logit contributions
 said-0.087
 smile-0.076
 say-0.074
 big-0.071
 asked-0.071
TokenWhy
Token ID5195
Feature activation+1.78e-01

Pos logit contributions
ocol+0.223
ions+0.187
 situations+0.182
ipping+0.169
ubb+0.168
Neg logit contributions
 said-0.188
 say-0.165
 asked-0.162
 smile-0.161
 understand-0.152
Token won
Token ID1839
Feature activation+5.59e-02

Pos logit contributions
ocol+0.394
ions+0.344
 situations+0.329
+0.323
ipping+0.313
Neg logit contributions
 said-0.284
 smile-0.252
 say-0.247
 big-0.236
 asked-0.236
Token't
Token ID470
Feature activation+2.27e-02

Pos logit contributions
ocol+0.125
ions+0.109
 situations+0.104
+0.102
ipping+0.099
Neg logit contributions
 said-0.089
 smile-0.078
 say-0.077
 big-0.074
 asked-0.073
Token me
Token ID502
Feature activation-3.13e-02

Pos logit contributions
ocol+0.140
+0.128
umm+0.126
ions+0.125
ipping+0.123
Neg logit contributions
 said-0.083
 dad-0.072
 big-0.071
 say-0.070
 asked-0.070
Token".
Token ID1911
Feature activation+1.82e-02

Pos logit contributions
 said+0.037
 smile+0.031
 say+0.030
 picture+0.029
 big+0.029
Neg logit contributions
ocol-0.078
ions-0.071
 situations-0.068
-0.066
ipping-0.066
Token
Token ID198
Feature activation+9.69e-04

Pos logit contributions
ocol+0.048
ions+0.043
 situations+0.042
+0.041
ipping+0.040
Neg logit contributions
 said-0.022
 smile-0.018
 say-0.018
 asked-0.017
 big-0.016
Token
Token ID198
Feature activation+1.49e-03

Pos logit contributions
ocol+0.003
ions+0.002
 situations+0.002
+0.002
ipping+0.002
Neg logit contributions
 said-0.001
 smile-0.001
 say-0.001
 big-0.001
 asked-0.001
TokenSarah
Token ID29284
Feature activation-4.91e-02

Pos logit contributions
ocol+0.003
 situations+0.003
umm+0.003
ions+0.003
ipping+0.003
Neg logit contributions
 said-0.003
 asked-0.002
 say-0.002
 understand-0.002
 smile-0.002
Token said
Token ID531
Feature activation-3.02e-01

Pos logit contributions
 picture+0.041
 smile+0.041
 big+0.040
 say+0.040
 said+0.040
Neg logit contributions
ocol-0.119
-0.110
ions-0.110
ipping-0.106
 situations-0.105
Token,
Token ID11
Feature activation+5.57e-02

Pos logit contributions
 said+0.372
 smile+0.317
 say+0.299
 big+0.289
 asked+0.283
Neg logit contributions
ocol-0.804
ions-0.716
 situations-0.697
-0.669
orns-0.663
Token "
Token ID366
Feature activation+1.02e-01

Pos logit contributions
ocol+0.129
ions+0.113
 situations+0.109
+0.104
ipping+0.103
Neg logit contributions
 said-0.086
 smile-0.077
 say-0.074
 big-0.071
 asked-0.071
TokenIt
Token ID1026
Feature activation-2.50e-02

Pos logit contributions
ocol+0.228
 situations+0.189
ions+0.188
ipping+0.173
ubb+0.173
Neg logit contributions
 said-0.178
 asked-0.154
 say-0.154
 smile-0.150
 saw-0.142
Token's
Token ID338
Feature activation-2.79e-02

Pos logit contributions
 said+0.024
 smile+0.020
 say+0.019
 big+0.017
 asked+0.017
Neg logit contributions
ocol-0.070
ions-0.063
 situations-0.062
-0.061
ipping-0.059
Token OK
Token ID7477
Feature activation+9.41e-02

Pos logit contributions
 said+0.040
 say+0.034
 big+0.034
 smile+0.033
 asked+0.032
Neg logit contributions
ocol-0.070
ions-0.062
-0.060
ipping-0.060
 situations-0.059
Token"
Token ID1
Feature activation+1.30e-01

Pos logit contributions
 said+0.007
 say+0.006
 smile+0.006
 asked+0.006
 big+0.006
Neg logit contributions
ocol-0.013
ions-0.011
 situations-0.011
umm-0.011
ipping-0.011
TokenPoor
Token ID43920
Feature activation+8.62e-02

Pos logit contributions
ocol+0.273
 situations+0.225
ions+0.220
ipping+0.210
ubb+0.205
Neg logit contributions
 said-0.258
 say-0.221
 asked-0.221
 smile-0.210
 saw-0.204
Token kitten
Token ID42143
Feature activation+3.10e-02

Pos logit contributions
ocol+0.210
ions+0.187
 situations+0.179
+0.178
ipping+0.174
Neg logit contributions
 said-0.121
 smile-0.105
 say-0.103
 big-0.101
 dad-0.099
Token,"
Token ID553
Feature activation-1.47e-01

Pos logit contributions
ocol+0.080
ions+0.072
 situations+0.070
+0.068
ipping+0.066
Neg logit contributions
 said-0.035
 smile-0.031
 say-0.029
 big-0.027
 asked-0.027
Token Sarah
Token ID10490
Feature activation-8.99e-02

Pos logit contributions
 said+0.158
 smile+0.150
 say+0.137
 picture+0.137
 big+0.134
Neg logit contributions
ocol-0.370
ions-0.329
 situations-0.313
-0.311
-0.299
Token said
Token ID531
Feature activation-3.02e-01

Pos logit contributions
 picture+0.066
 big+0.056
 letter+0.047
 smile+0.044
 pretty+0.044
Neg logit contributions
ocol-0.193
 situations-0.191
pill-0.190
ions-0.183
ar-0.182
Token.
Token ID13
Feature activation-2.11e-02

Pos logit contributions
 said+0.377
 say+0.347
 picture+0.343
 smile+0.336
 says+0.331
Neg logit contributions
ocol-0.694
ions-0.615
ipping-0.588
-0.587
-0.582
Token
Token ID198
Feature activation-6.75e-03

Pos logit contributions
 said+0.028
 smile+0.025
 say+0.024
 big+0.023
 asked+0.022
Neg logit contributions
ocol-0.051
ions-0.046
 situations-0.044
-0.043
-0.042
Token
Token ID198
Feature activation-6.72e-03

Pos logit contributions
 said+0.008
 smile+0.007
 say+0.007
 big+0.006
 picture+0.006
Neg logit contributions
ocol-0.017
ions-0.015
 situations-0.015
-0.015
-0.014
TokenThe
Token ID464
Feature activation+4.11e-02

Pos logit contributions
 said+0.012
 say+0.010
 asked+0.010
 smile+0.010
 big+0.010
Neg logit contributions
ocol-0.015
ions-0.012
 situations-0.012
umm-0.012
ubb-0.012
Token kitten
Token ID42143
Feature activation+3.84e-02

Pos logit contributions
ocol+0.097
ions+0.085
 situations+0.083
+0.082
+0.079
Neg logit contributions
 said-0.057
 smile-0.050
 say-0.049
 asked-0.045
 know-0.045
Token had
Token ID550
Feature activation-6.10e-02

Pos logit contributions
ocol+0.111
ions+0.105
+0.102
 situations+0.099
+0.098
Neg logit contributions
 said-0.045
 say-0.042
 smile-0.041
 says-0.039
 picture-0.037
Token zero
Token ID6632
Feature activation+5.75e-02

Pos logit contributions
 said+0.078
 smile+0.069
 say+0.068
 says+0.062
 big+0.062
Neg logit contributions
ocol-0.145
ions-0.130
 situations-0.125
-0.123
ipping-0.119
Token balloons
Token ID35485
Feature activation-8.80e-02

Pos logit contributions
ocol+0.129
ions+0.111
 situations+0.108
+0.106
ipping+0.103
Neg logit contributions
 said-0.091
 smile-0.080
 say-0.078
 big-0.076
 asked-0.074
Token.
Token ID13
Feature activation-5.40e-03

Pos logit contributions
 said+0.105
 smile+0.090
 say+0.090
 asked+0.085
 big+0.083
Neg logit contributions
ocol-0.218
ions-0.201
 situations-0.187
-0.184
-0.181
Token<|endoftext|>
Token ID50256
Feature activation-1.50e-02

Pos logit contributions
 said+0.008
 say+0.006
 asked+0.006
 big+0.006
 dad+0.006
Neg logit contributions
ocol-0.014
ions-0.012
ipping-0.012
 situations-0.012
ubb-0.011
TokenL
Token ID43
Feature activation+2.67e-01

Pos logit contributions
 said+0.025
 big+0.023
 asked+0.021
 saw+0.021
 says+0.021
Neg logit contributions
ocol-0.034
 situations-0.028
umm-0.028
ubb-0.028
 accompl-0.027
Tokenily
Token ID813
Feature activation-1.24e-02

Pos logit contributions
ocol+0.458
ions+0.430
orns+0.337
 situations+0.329
ipping+0.321
Neg logit contributions
 smile-0.521
 said-0.521
 say-0.475
 understand-0.472
 surprised-0.469
Token and
Token ID290
Feature activation-2.36e-02

Pos logit contributions
 said+0.012
 smile+0.011
 say+0.011
 big+0.010
 picture+0.010
Neg logit contributions
ocol-0.032
ions-0.029
-0.028
 situations-0.028
ipping-0.027
Token Tom
Token ID4186
Feature activation-3.14e-02

Pos logit contributions
 picture+0.032
bell+0.031
 said+0.028
 surprised+0.028
 blank+0.027
Neg logit contributions
ocol-0.044
-0.040
ions-0.039
 situations-0.036
mm-0.036
Token were
Token ID547
Feature activation+1.51e-02

Pos logit contributions
 said+0.037
 smile+0.032
 say+0.031
 picture+0.031
 big+0.030
Neg logit contributions
ocol-0.077
ions-0.068
-0.067
 situations-0.066
ipping-0.063
Token playing
Token ID2712
Feature activation+1.01e-01

Pos logit contributions
ocol+0.030
ions+0.026
ipping+0.024
 situations+0.023
+0.023
Neg logit contributions
 said-0.026
 smile-0.024
 picture-0.023
 say-0.023
 understand-0.022
Token too
Token ID1165
Feature activation+2.24e-02

Pos logit contributions
 said+0.045
 smile+0.038
 say+0.036
 big+0.035
 asked+0.034
Neg logit contributions
ocol-0.110
ions-0.098
 situations-0.095
-0.094
ipping-0.091
Token.
Token ID13
Feature activation-1.80e-02

Pos logit contributions
ocol+0.054
ions+0.049
 situations+0.047
+0.046
+0.046
Neg logit contributions
 said-0.028
 say-0.025
 smile-0.025
 says-0.023
 asked-0.022
Token
Token ID220
Feature activation-7.27e-02

Pos logit contributions
 said+0.022
 smile+0.019
 say+0.018
 big+0.017
 picture+0.017
Neg logit contributions
ocol-0.045
ions-0.041
-0.039
 situations-0.039
-0.039
Token
Token ID198
Feature activation-7.40e-04

Pos logit contributions
 said+0.098
 smile+0.094
 say+0.088
 picture+0.085
 open+0.081
Neg logit contributions
ocol-0.162
ions-0.152
-0.143
-0.141
 situations-0.139
Token
Token ID198
Feature activation-7.09e-04

Pos logit contributions
 smile+0.001
 said+0.001
 say+0.001
 picture+0.001
 letter+0.001
Neg logit contributions
ocol-0.001
ions-0.001
-0.001
Sp-0.001
-0.001
TokenL
Token ID43
Feature activation+2.63e-01

Pos logit contributions
 said+0.001
 smile+0.001
 say+0.001
 big+0.001
 asked+0.001
Neg logit contributions
ocol-0.001
ions-0.001
 situations-0.001
-0.001
ipping-0.001
Tokenily
Token ID813
Feature activation-1.57e-02

Pos logit contributions
ocol+0.546
ions+0.475
 situations+0.449
+0.436
ipping+0.427
Neg logit contributions
 said-0.459
 smile-0.415
 say-0.405
 big-0.388
 asked-0.387
Token decided
Token ID3066
Feature activation-7.63e-02

Pos logit contributions
 smile+0.018
 say+0.017
 picture+0.017
 dad+0.017
 big+0.016
Neg logit contributions
-0.031
ar-0.030
Too-0.029
ipping-0.029
we-0.028
Token to
Token ID284
Feature activation-5.40e-02

Pos logit contributions
 said+0.076
 smile+0.070
 say+0.067
 says+0.062
 picture+0.060
Neg logit contributions
ocol-0.180
-0.169
 situations-0.166
ions-0.164
Too-0.162
Token go
Token ID467
Feature activation+8.95e-02

Pos logit contributions
 said+0.066
 smile+0.055
 say+0.052
 big+0.051
 surprised+0.051
Neg logit contributions
ocol-0.135
ions-0.121
 situations-0.117
-0.115
-0.113
Token home
Token ID1363
Feature activation-4.56e-02

Pos logit contributions
ocol+0.259
ions+0.236
 situations+0.227
+0.224
ipping+0.222
Neg logit contributions
 said-0.079
 smile-0.062
 say-0.059
 asked-0.054
 big-0.054
Token nodded
Token ID14464
Feature activation-2.04e-01

Pos logit contributions
 said+0.037
 say+0.035
 smile+0.033
 big+0.032
 know+0.032
Neg logit contributions
ocol-0.131
ions-0.120
-0.119
ipping-0.114
-0.112
Token and
Token ID290
Feature activation-2.50e-02

Pos logit contributions
 said+0.223
 say+0.183
 smile+0.182
 big+0.168
 asked+0.166
Neg logit contributions
ocol-0.544
ions-0.487
 situations-0.467
-0.465
ipping-0.454
Token said
Token ID531
Feature activation-2.91e-01

Pos logit contributions
 said+0.018
 smile+0.015
 say+0.014
 big+0.013
 picture+0.013
Neg logit contributions
ocol-0.073
ions-0.067
 situations-0.064
-0.064
ipping-0.063
Token,
Token ID11
Feature activation+6.45e-02

Pos logit contributions
 said+0.338
 smile+0.296
 big+0.268
 say+0.267
 asked+0.251
Neg logit contributions
ocol-0.795
ions-0.699
 situations-0.697
orns-0.672
-0.660
Token "
Token ID366
Feature activation+1.09e-01

Pos logit contributions
ocol+0.167
 situations+0.147
ions+0.146
orns+0.139
rim+0.134
Neg logit contributions
 said-0.091
 smile-0.079
 say-0.075
 asked-0.074
 big-0.072
TokenBl
Token ID3629
Feature activation+2.79e-01

Pos logit contributions
ocol+0.257
 situations+0.219
ions+0.218
ipping+0.205
+0.203
Neg logit contributions
 said-0.168
 smile-0.143
 say-0.143
 asked-0.138
 big-0.134
Tokenank
Token ID962
Feature activation+4.49e-02

Pos logit contributions
ocol+0.469
 situations+0.415
ions+0.399
orns+0.367
pill+0.363
Neg logit contributions
 said-0.585
 smile-0.567
 say-0.539
 dad-0.514
 know-0.511
Tokenie
Token ID494
Feature activation-3.91e-02

Pos logit contributions
ocol+0.112
ions+0.099
 situations+0.095
+0.093
+0.092
Neg logit contributions
 said-0.061
 smile-0.053
 say-0.051
 big-0.049
 asked-0.047
Token!".
Token ID48220
Feature activation-3.76e-02

Pos logit contributions
 said+0.051
 smile+0.044
 say+0.042
 big+0.040
 asked+0.039
Neg logit contributions
ocol-0.100
ions-0.088
 situations-0.086
-0.083
ipping-0.081
Token The
Token ID383
Feature activation+6.77e-02

Pos logit contributions
 said+0.052
 smile+0.044
 say+0.043
 asked+0.041
 big+0.040
Neg logit contributions
ocol-0.095
ions-0.082
 situations-0.081
-0.078
ubb-0.076
Token mom
Token ID1995
Feature activation-1.75e-01

Pos logit contributions
ocol+0.156
+0.140
ions+0.138
umm+0.138
+0.135
Neg logit contributions
 said-0.080
 know-0.069
 say-0.069
 understand-0.066
 says-0.066
Token smart
Token ID4451
Feature activation+4.83e-03

Pos logit contributions
 said+0.063
 big+0.057
 smile+0.057
 say+0.054
 dad+0.051
Neg logit contributions
ocol-0.146
ions-0.134
 situations-0.127
-0.123
ipping-0.122
Token dog
Token ID3290
Feature activation+8.01e-03

Pos logit contributions
ocol+0.014
ions+0.012
 situations+0.012
+0.012
ipping+0.011
Neg logit contributions
 said-0.005
 smile-0.004
 say-0.004
 big-0.004
 asked-0.003
Token!"
Token ID2474
Feature activation-6.89e-02

Pos logit contributions
ocol+0.022
ions+0.019
 situations+0.019
+0.018
ipping+0.018
Neg logit contributions
 said-0.009
 smile-0.008
 say-0.007
 big-0.007
 asked-0.007
Token
Token ID198
Feature activation+1.26e-02

Pos logit contributions
 said+0.082
 smile+0.073
 say+0.069
 big+0.066
 picture+0.065
Neg logit contributions
ocol-0.174
ions-0.157
 situations-0.149
-0.148
ipping-0.143
Token
Token ID198
Feature activation+1.38e-02

Pos logit contributions
ocol+0.029
ions+0.027
 situations+0.024
+0.024
orns+0.024
Neg logit contributions
 said-0.017
 smile-0.015
 say-0.014
 picture-0.014
 open-0.013
TokenL
Token ID43
Feature activation+2.74e-01

Pos logit contributions
ocol+0.031
 situations+0.026
umm+0.025
ions+0.025
ubb+0.024
Neg logit contributions
 said-0.024
 say-0.021
 big-0.020
 asked-0.020
 smile-0.020
Tokenily
Token ID813
Feature activation-1.10e-02

Pos logit contributions
ocol+0.639
+0.583
 situations+0.570
ions+0.553
+0.552
Neg logit contributions
 said-0.395
 say-0.339
 smile-0.322
 says-0.310
 asked-0.308
Token and
Token ID290
Feature activation-1.45e-02

Pos logit contributions
 said+0.012
 picture+0.010
 smile+0.010
 say+0.010
 dad+0.010
Neg logit contributions
ocol-0.026
ions-0.024
-0.024
 situations-0.023
ipping-0.023
Token Ben
Token ID3932
Feature activation+7.82e-03

Pos logit contributions
 said+0.024
 surprised+0.022
 smile+0.022
 picture+0.022
 big+0.021
Neg logit contributions
ocol-0.029
ions-0.024
 situations-0.023
-0.022
ubb-0.021
Token get
Token ID651
Feature activation+1.16e-01

Pos logit contributions
ocol+0.018
+0.016
+0.016
ions+0.016
ipping+0.016
Neg logit contributions
 said-0.008
 picture-0.008
 dad-0.007
 big-0.007
 mother-0.007
Token their
Token ID511
Feature activation+9.38e-02

Pos logit contributions
ocol+0.285
ions+0.262
 situations+0.242
ipping+0.240
ubb+0.234
Neg logit contributions
 said-0.125
 smile-0.111
 picture-0.107
 say-0.103
 know-0.096
Token was
Token ID373
Feature activation+5.02e-02

Pos logit contributions
ocol+0.000
ions+0.000
+0.000
 situations+0.000
+0.000
Neg logit contributions
 said-0.000
 smile-0.000
 say-0.000
 big-0.000
 asked-0.000
Token not
Token ID407
Feature activation-7.82e-03

Pos logit contributions
ocol+0.132
ions+0.117
 situations+0.114
+0.112
ipping+0.110
Neg logit contributions
 said-0.061
 smile-0.052
 say-0.050
 big-0.048
 asked-0.047
Token bad
Token ID2089
Feature activation+5.66e-02

Pos logit contributions
 said+0.010
 smile+0.009
 say+0.009
 big+0.008
 asked+0.008
Neg logit contributions
ocol-0.020
ions-0.018
 situations-0.017
-0.017
-0.016
Token.
Token ID13
Feature activation-7.32e-03

Pos logit contributions
ocol+0.171
ions+0.152
+0.147
+0.146
rim+0.143
Neg logit contributions
 said-0.053
 say-0.040
 smile-0.040
 asked-0.037
 big-0.037
Token<|endoftext|>
Token ID50256
Feature activation-2.07e-02

Pos logit contributions
 said+0.008
 smile+0.006
 say+0.006
 big+0.006
 asked+0.006
Neg logit contributions
ocol-0.021
ions-0.018
 situations-0.018
ipping-0.017
-0.017
TokenL
Token ID43
Feature activation+2.67e-01

Pos logit contributions
 said+0.031
 big+0.027
 asked+0.026
 smile+0.026
 say+0.025
Neg logit contributions
ocol-0.051
 situations-0.042
ions-0.041
-0.041
ubb-0.041
Tokenily
Token ID813
Feature activation-1.36e-02

Pos logit contributions
ocol+0.473
ions+0.442
orns+0.374
 situations+0.360
ipping+0.350
Neg logit contributions
 said-0.485
 smile-0.481
 say-0.449
 understand-0.442
 surprised-0.434
Token and
Token ID290
Feature activation-2.41e-02

Pos logit contributions
 said+0.013
 smile+0.012
 say+0.012
 big+0.011
 picture+0.010
Neg logit contributions
ocol-0.034
ions-0.032
-0.031
 situations-0.031
ipping-0.030
Token Ben
Token ID3932
Feature activation+2.85e-03

Pos logit contributions
bell+0.034
 surprised+0.034
 picture+0.033
 said+0.032
 smile+0.031
Neg logit contributions
ocol-0.041
ions-0.038
-0.038
mm-0.034
 Wed-0.033
Token were
Token ID547
Feature activation+1.38e-02

Pos logit contributions
ocol+0.008
ions+0.007
+0.007
 situations+0.007
ipping+0.007
Neg logit contributions
 said-0.003
 smile-0.002
 say-0.002
 big-0.002
 picture-0.002
Token playing
Token ID2712
Feature activation+1.01e-01

Pos logit contributions
ocol+0.025
ions+0.022
ipping+0.020
+0.019
 situations+0.018
Neg logit contributions
 said-0.026
 smile-0.024
 say-0.023
 picture-0.023
 understand-0.023
Token said
Token ID531
Feature activation-2.91e-01

Pos logit contributions
 picture+0.007
 understand+0.006
 smile+0.006
 know+0.005
 big+0.005
Neg logit contributions
ocol-0.072
ions-0.069
-0.066
-0.065
ipping-0.065
Token,
Token ID11
Feature activation+6.35e-02

Pos logit contributions
 said+0.326
 smile+0.277
 big+0.251
 say+0.247
 asked+0.235
Neg logit contributions
ocol-0.810
 situations-0.723
ions-0.718
orns-0.688
rim-0.683
Token "
Token ID366
Feature activation+1.08e-01

Pos logit contributions
ocol+0.162
 situations+0.145
ions+0.142
orns+0.136
 pleasures+0.131
Neg logit contributions
 said-0.094
 smile-0.082
 say-0.077
 asked-0.076
 big-0.074
TokenOh
Token ID5812
Feature activation+1.02e-01

Pos logit contributions
ocol+0.239
 situations+0.201
ions+0.193
ipping+0.182
igs+0.176
Neg logit contributions
 said-0.191
 say-0.165
 smile-0.162
 asked-0.160
 big-0.156
Token!
Token ID0
Feature activation+3.26e-02

Pos logit contributions
ocol+0.252
ions+0.223
 situations+0.215
+0.211
ipping+0.205
Neg logit contributions
 said-0.137
 smile-0.119
 say-0.116
 big-0.109
 asked-0.108
Token That
Token ID1320
Feature activation+1.23e-01

Pos logit contributions
ocol+0.071
 situations+0.063
rim+0.061
orns+0.058
ions+0.058
Neg logit contributions
 said-0.056
 say-0.049
 smile-0.047
 asked-0.047
 says-0.046
Token's
Token ID338
Feature activation-2.58e-02

Pos logit contributions
ocol+0.252
+0.248
 situations+0.240
ions+0.237
 pleasures+0.228
Neg logit contributions
 said-0.148
 smile-0.140
 dad-0.134
 say-0.133
 surprised-0.129
Token very
Token ID845
Feature activation-5.26e-02

Pos logit contributions
 said+0.026
 say+0.021
 big+0.020
 smile+0.020
 dad+0.018
Neg logit contributions
ocol-0.076
ions-0.067
 situations-0.067
ipping-0.065
-0.065
Token nice
Token ID3621
Feature activation+3.91e-02

Pos logit contributions
 said+0.055
 smile+0.046
 say+0.044
 asked+0.041
 picture+0.039
Neg logit contributions
ocol-0.144
ions-0.127
 situations-0.122
-0.121
ubb-0.118
Token!
Token ID0
Feature activation+3.27e-02

Pos logit contributions
ocol+0.101
ions+0.089
 situations+0.086
+0.085
ipping+0.083
Neg logit contributions
 said-0.050
 smile-0.043
 say-0.042
 big-0.040
 picture-0.038
Token You
Token ID921
Feature activation+1.49e-01

Pos logit contributions
ocol+0.070
 situations+0.061
rim+0.058
ipping+0.058
orns+0.057
Neg logit contributions
 said-0.057
 say-0.051
 asked-0.050
 says-0.049
 know-0.047
Token find
Token ID1064
Feature activation-1.99e-02

Pos logit contributions
 said+0.164
 smile+0.134
 surprised+0.134
 big+0.129
 says+0.127
Neg logit contributions
ocol-0.306
ions-0.265
 situations-0.263
-0.257
-0.248
Token things
Token ID1243
Feature activation+1.96e-01

Pos logit contributions
 said+0.033
 smile+0.030
 say+0.028
 asked+0.027
 picture+0.026
Neg logit contributions
ocol-0.042
ions-0.037
 situations-0.036
-0.033
ipping-0.032
Token to
Token ID284
Feature activation-5.56e-02

Pos logit contributions
ocol+0.522
ions+0.463
 situations+0.449
+0.442
ipping+0.432
Neg logit contributions
 said-0.227
 smile-0.191
 say-0.186
 big-0.175
 asked-0.171
Token mark
Token ID1317
Feature activation-5.16e-03

Pos logit contributions
 said+0.066
 smile+0.056
 surprised+0.053
 asked+0.050
 says+0.050
Neg logit contributions
ocol-0.141
ions-0.122
 situations-0.121
-0.115
 pleasures-0.112
Token that
Token ID326
Feature activation-1.54e-01

Pos logit contributions
 said+0.007
 big+0.006
 say+0.006
 smile+0.006
 says+0.006
Neg logit contributions
ocol-0.013
-0.011
umm-0.011
-0.011
ions-0.011
Token you
Token ID345
Feature activation+8.49e-02

Pos logit contributions
 said+0.256
 smile+0.225
 say+0.225
 big+0.213
 asked+0.211
Neg logit contributions
ocol-0.340
ions-0.296
-0.277
 situations-0.277
ipping-0.273
Token can
Token ID460
Feature activation+3.31e-02

Pos logit contributions
ocol+0.186
ions+0.161
 situations+0.158
+0.156
+0.149
Neg logit contributions
 said-0.116
 smile-0.104
 dad-0.103
 say-0.097
 big-0.097
Token remove
Token ID4781
Feature activation+1.40e-01

Pos logit contributions
ocol+0.070
ions+0.061
+0.060
 situations+0.059
+0.058
Neg logit contributions
 said-0.048
 dad-0.041
 smile-0.040
 surprised-0.040
 says-0.040
Token easily
Token ID3538
Feature activation+1.77e-02

Pos logit contributions
ocol+0.344
+0.305
+0.304
ions+0.299
ipping+0.296
Neg logit contributions
 said-0.191
 big-0.164
 say-0.160
 smile-0.158
 open-0.151
Token and
Token ID290
Feature activation-2.55e-02

Pos logit contributions
ocol+0.044
ions+0.039
 situations+0.038
+0.037
ipping+0.036
Neg logit contributions
 said-0.023
 smile-0.020
 say-0.020
 big-0.019
 asked-0.018
Token that
Token ID326
Feature activation-1.54e-01

Pos logit contributions
 said+0.032
 smile+0.027
 say+0.026
 big+0.025
 asked+0.024
Neg logit contributions
ocol-0.064
ions-0.056
 situations-0.055
-0.054
ipping-0.052
Token
Token ID198
Feature activation+8.01e-03

Pos logit contributions
ocol+0.015
ions+0.014
 situations+0.013
+0.013
+0.013
Neg logit contributions
 said-0.009
 smile-0.009
 say-0.008
 picture-0.008
 lost-0.008
Token"
Token ID1
Feature activation+1.50e-01

Pos logit contributions
ocol+0.020
ions+0.016
 situations+0.016
umm+0.016
ubb+0.015
Neg logit contributions
 said-0.013
 asked-0.011
 say-0.011
 big-0.011
 understand-0.010
TokenOh
Token ID5812
Feature activation+1.07e-01

Pos logit contributions
ocol+0.334
 situations+0.277
ions+0.268
ipping+0.262
igs+0.250
Neg logit contributions
 said-0.275
 asked-0.240
 say-0.231
 saw-0.224
 big-0.221
Token no
Token ID645
Feature activation+5.24e-02

Pos logit contributions
ocol+0.250
ions+0.219
 situations+0.210
+0.207
ipping+0.200
Neg logit contributions
 said-0.160
 smile-0.140
 say-0.138
 big-0.130
 asked-0.130
Token,
Token ID11
Feature activation+7.09e-02

Pos logit contributions
ocol+0.133
ions+0.117
 situations+0.113
+0.111
ipping+0.109
Neg logit contributions
 said-0.068
 smile-0.058
 say-0.057
 big-0.054
 asked-0.053
Token you
Token ID345
Feature activation+9.23e-02

Pos logit contributions
ocol+0.166
ions+0.146
 situations+0.144
+0.142
ipping+0.140
Neg logit contributions
 said-0.106
 smile-0.093
 say-0.092
 big-0.090
 dad-0.088
Token are
Token ID389
Feature activation+6.34e-02

Pos logit contributions
ocol+0.190
ions+0.168
+0.165
+0.164
 situations+0.158
Neg logit contributions
 said-0.124
 smile-0.122
 say-0.116
 dad-0.106
 understand-0.105
Token a
Token ID257
Feature activation-6.13e-02

Pos logit contributions
ocol+0.156
ions+0.137
 situations+0.130
ubb+0.125
+0.125
Neg logit contributions
 said-0.082
 smile-0.073
 say-0.070
 understand-0.065
 says-0.065
Token bad
Token ID2089
Feature activation+6.07e-02

Pos logit contributions
 said+0.093
 smile+0.080
 say+0.080
 asked+0.076
 says+0.075
Neg logit contributions
ocol-0.138
ions-0.116
 situations-0.116
-0.113
ubb-0.110
Token pirate
Token ID25868
Feature activation+1.16e-01

Pos logit contributions
ocol+0.126
 situations+0.105
ions+0.102
+0.100
ipping+0.096
Neg logit contributions
 said-0.101
 smile-0.090
 say-0.087
 asked-0.085
 surprised-0.085
Token!"
Token ID2474
Feature activation-7.42e-02

Pos logit contributions
ocol+0.281
ions+0.250
 situations+0.244
+0.235
ar+0.232
Neg logit contributions
 said-0.138
 smile-0.124
 say-0.116
 asked-0.111
 big-0.111
Token hammer
Token ID15554
Feature activation+4.97e-03

Pos logit contributions
ocol+0.006
ions+0.005
 situations+0.005
+0.005
ipping+0.005
Neg logit contributions
 said-0.006
 smile-0.005
 say-0.005
 asked-0.005
 says-0.005
Token and
Token ID290
Feature activation-2.66e-02

Pos logit contributions
ocol+0.011
ions+0.011
ipping+0.010
+0.010
 situations+0.010
Neg logit contributions
 said-0.006
 say-0.005
 says-0.005
 asked-0.005
 know-0.005
Token pretended
Token ID36766
Feature activation+9.66e-03

Pos logit contributions
 said+0.026
 smile+0.024
 say+0.023
 understand+0.021
 big+0.021
Neg logit contributions
ocol-0.069
ions-0.064
 situations-0.060
-0.060
ipping-0.058
Token to
Token ID284
Feature activation-5.48e-02

Pos logit contributions
ocol+0.024
ions+0.022
 situations+0.021
+0.020
ipping+0.020
Neg logit contributions
 said-0.011
 smile-0.010
 say-0.010
 asked-0.009
 big-0.009
Token fix
Token ID4259
Feature activation+3.37e-02

Pos logit contributions
 said+0.065
 say+0.055
 smile+0.055
 big+0.052
 asked+0.051
Neg logit contributions
ocol-0.147
ions-0.131
 situations-0.125
-0.124
ipping-0.123
Token things
Token ID1243
Feature activation+1.95e-01

Pos logit contributions
ocol+0.092
ions+0.080
+0.080
rim+0.080
+0.079
Neg logit contributions
 said-0.038
 big-0.032
 say-0.029
 smile-0.029
 open-0.028
Token around
Token ID1088
Feature activation-1.90e-01

Pos logit contributions
ocol+0.507
ions+0.454
 situations+0.436
+0.430
ipping+0.419
Neg logit contributions
 said-0.232
 smile-0.199
 say-0.193
 big-0.180
 asked-0.178
Token the
Token ID262
Feature activation+3.42e-03

Pos logit contributions
 said+0.160
 big+0.136
 open+0.124
 asked+0.119
 smile+0.116
Neg logit contributions
ocol-0.566
Too-0.501
umm-0.499
rim-0.496
ipping-0.494
Token yard
Token ID12699
Feature activation+6.16e-03

Pos logit contributions
ocol+0.009
ions+0.008
 situations+0.007
ubb+0.007
+0.007
Neg logit contributions
 said-0.005
 big-0.004
 smile-0.004
 say-0.004
 saw-0.004
Token.
Token ID13
Feature activation-1.97e-02

Pos logit contributions
ocol+0.018
ions+0.015
 situations+0.015
+0.015
ipping+0.015
Neg logit contributions
 said-0.006
 smile-0.005
 say-0.005
 big-0.005
 asked-0.005
Token Suddenly
Token ID24975
Feature activation+3.81e-02

Pos logit contributions
 said+0.026
 smile+0.023
 say+0.022
 big+0.021
 asked+0.021
Neg logit contributions
ocol-0.048
ions-0.043
 situations-0.041
-0.041
-0.040
Token had
Token ID550
Feature activation-6.61e-02

Pos logit contributions
 said+0.045
 smile+0.044
 say+0.044
 dad+0.042
 picture+0.041
Neg logit contributions
ocol-0.087
ions-0.080
-0.078
 situations-0.077
ar-0.075
Token been
Token ID587
Feature activation+5.65e-02

Pos logit contributions
 said+0.080
 say+0.072
 smile+0.071
 says+0.064
 picture+0.063
Neg logit contributions
ocol-0.161
ions-0.144
 situations-0.141
-0.136
 pleasures-0.132
Token very
Token ID845
Feature activation-5.48e-02

Pos logit contributions
ocol+0.140
ions+0.124
 situations+0.119
+0.116
ipping+0.114
Neg logit contributions
 said-0.075
 smile-0.065
 say-0.064
 big-0.059
 says-0.058
Token brave
Token ID14802
Feature activation-2.50e-02

Pos logit contributions
 said+0.070
 say+0.061
 smile+0.061
 picture+0.060
 says+0.057
Neg logit contributions
ocol-0.134
ions-0.113
 situations-0.113
ubb-0.110
-0.108
Token and
Token ID290
Feature activation-2.30e-02

Pos logit contributions
 said+0.031
 smile+0.027
 say+0.027
 big+0.025
 asked+0.024
Neg logit contributions
ocol-0.063
ions-0.056
 situations-0.054
-0.053
ipping-0.052
Token faced
Token ID7452
Feature activation+9.51e-02

Pos logit contributions
 said+0.027
 smile+0.023
 say+0.023
 big+0.021
 asked+0.021
Neg logit contributions
ocol-0.059
ions-0.052
 situations-0.051
-0.050
ipping-0.048
Token his
Token ID465
Feature activation+9.04e-03

Pos logit contributions
ocol+0.250
ions+0.221
+0.216
 situations+0.213
+0.209
Neg logit contributions
 said-0.114
 smile-0.095
 say-0.093
 big-0.091
 asked-0.089
Token fear
Token ID3252
Feature activation+4.74e-02

Pos logit contributions
ocol+0.021
 situations+0.018
ions+0.017
+0.017
ubb+0.016
Neg logit contributions
 said-0.014
 say-0.011
 smile-0.011
 asked-0.011
 surprised-0.011
Token with
Token ID351
Feature activation+1.17e-01

Pos logit contributions
ocol+0.125
ions+0.112
 situations+0.108
+0.106
ipping+0.104
Neg logit contributions
 said-0.054
 smile-0.045
 say-0.044
 big-0.041
 asked-0.041
Token courage
Token ID11917
Feature activation-5.35e-02

Pos logit contributions
ocol+0.306
ions+0.273
+0.270
 situations+0.263
ipping+0.260
Neg logit contributions
 said-0.143
 smile-0.122
 say-0.117
 big-0.117
 asked-0.110
Token.
Token ID13
Feature activation-1.51e-02

Pos logit contributions
 said+0.055
 smile+0.045
 say+0.043
 big+0.040
 asked+0.039
Neg logit contributions
ocol-0.151
ions-0.135
 situations-0.130
-0.130
ipping-0.126
Token dirty
Token ID11841
Feature activation+4.28e-02

Pos logit contributions
ocol+0.026
ions+0.023
 situations+0.022
+0.021
ubb+0.021
Neg logit contributions
 said-0.012
 say-0.011
 smile-0.010
 says-0.010
 picture-0.009
Token from
Token ID422
Feature activation+6.02e-02

Pos logit contributions
ocol+0.136
ions+0.119
+0.118
Too+0.114
 situations+0.114
Neg logit contributions
 said-0.032
 big-0.026
 smile-0.025
 open-0.024
 say-0.021
Token carrying
Token ID6872
Feature activation+8.86e-02

Pos logit contributions
ocol+0.155
ions+0.137
+0.136
 situations+0.133
ipping+0.130
Neg logit contributions
 said-0.074
 smile-0.065
 big-0.063
 say-0.062
 asked-0.059
Token the
Token ID262
Feature activation+1.00e-03

Pos logit contributions
ocol+0.239
+0.229
umm+0.223
Too+0.221
mm+0.219
Neg logit contributions
 said-0.095
 big-0.088
 open-0.077
 lost-0.073
 wide-0.071
Token rock
Token ID3881
Feature activation-3.97e-02

Pos logit contributions
ocol+0.002
ions+0.002
 situations+0.002
+0.002
ipping+0.002
Neg logit contributions
 said-0.001
 say-0.001
 smile-0.001
 says-0.001
 asked-0.001
Token.
Token ID13
Feature activation-2.08e-02

Pos logit contributions
 said+0.048
 smile+0.042
 say+0.041
 asked+0.038
 says+0.038
Neg logit contributions
ocol-0.098
ions-0.090
 situations-0.086
ipping-0.083
-0.082
Token Twe
Token ID24205
Feature activation+1.69e-01

Pos logit contributions
 said+0.033
 smile+0.029
 say+0.028
 big+0.027
 asked+0.027
Neg logit contributions
ocol-0.047
ions-0.041
 situations-0.039
-0.038
ipping-0.037
Tokenety
Token ID2963
Feature activation-1.58e-02

Pos logit contributions
ocol+0.364
ions+0.318
 situations+0.300
+0.299
orns+0.293
Neg logit contributions
 said-0.277
 smile-0.255
 say-0.248
 big-0.236
 dad-0.229
Token said
Token ID531
Feature activation-2.93e-01

Pos logit contributions
 smile+0.012
 say+0.011
 picture+0.010
 understand+0.010
 dad+0.010
Neg logit contributions
ocol-0.038
-0.036
ions-0.036
pill-0.036
ar-0.035
Token,
Token ID11
Feature activation+6.26e-02

Pos logit contributions
 said+0.405
 say+0.352
 smile+0.351
 asked+0.326
 picture+0.325
Neg logit contributions
ocol-0.696
ions-0.614
-0.588
 situations-0.584
ipping-0.569
Token "
Token ID366
Feature activation+1.08e-01

Pos logit contributions
ocol+0.140
ions+0.122
 situations+0.117
+0.114
ipping+0.111
Neg logit contributions
 said-0.099
 smile-0.088
 say-0.086
 big-0.082
 asked-0.082
Token cartoon
Token ID16251
Feature activation+2.47e-02

Pos logit contributions
ocol+0.012
ions+0.011
+0.010
 situations+0.010
ipping+0.010
Neg logit contributions
 said-0.007
 smile-0.007
 big-0.006
 say-0.006
 dad-0.006
Token machine
Token ID4572
Feature activation+7.81e-02

Pos logit contributions
ocol+0.070
ions+0.060
+0.059
 situations+0.059
ubb+0.058
Neg logit contributions
 said-0.028
 smile-0.022
 say-0.022
 big-0.020
 asked-0.020
Token that
Token ID326
Feature activation-1.52e-01

Pos logit contributions
ocol+0.217
ions+0.193
 situations+0.186
+0.185
ipping+0.180
Neg logit contributions
 said-0.084
 smile-0.069
 say-0.067
 big-0.062
 asked-0.061
Token he
Token ID339
Feature activation-4.19e-02

Pos logit contributions
 said+0.197
 say+0.155
 smile+0.155
 big+0.153
 asked+0.152
Neg logit contributions
ocol-0.410
ions-0.360
-0.352
ipping-0.350
 situations-0.348
Token did
Token ID750
Feature activation-9.62e-03

Pos logit contributions
 said+0.043
 smile+0.038
 say+0.037
 big+0.033
 picture+0.033
Neg logit contributions
ocol-0.111
ions-0.100
 situations-0.096
-0.095
ipping-0.093
Token not
Token ID407
Feature activation-9.51e-03

Pos logit contributions
 said+0.010
 smile+0.009
 say+0.009
 big+0.008
 asked+0.008
Neg logit contributions
ocol-0.027
ions-0.024
 situations-0.023
-0.023
umm-0.022
Token hear
Token ID3285
Feature activation-4.69e-02

Pos logit contributions
 said+0.011
 says+0.009
 surprised+0.009
 replied+0.009
 dad+0.008
Neg logit contributions
ocol-0.023
 situations-0.020
ions-0.020
ubb-0.019
 pleasures-0.019
Token his
Token ID465
Feature activation+5.97e-03

Pos logit contributions
 said+0.068
 smile+0.060
 say+0.058
 big+0.057
 asked+0.055
Neg logit contributions
ocol-0.111
ions-0.098
-0.095
 situations-0.093
-0.091
Token mom
Token ID1995
Feature activation-1.72e-01

Pos logit contributions
ocol+0.015
 situations+0.013
ions+0.013
ubb+0.013
ipping+0.012
Neg logit contributions
 said-0.007
 asked-0.006
 say-0.005
 smile-0.005
 saw-0.005
Token call
Token ID869
Feature activation-2.51e-02

Pos logit contributions
 said+0.234
 smile+0.203
 say+0.194
 big+0.191
 asked+0.186
Neg logit contributions
ocol-0.414
ions-0.364
 situations-0.350
-0.341
ipping-0.338
Token him
Token ID683
Feature activation+7.66e-03

Pos logit contributions
 said+0.035
 smile+0.031
 say+0.031
 big+0.030
 dad+0.029
Neg logit contributions
ocol-0.061
ions-0.054
-0.053
-0.052
 situations-0.052
Token searched
Token ID16499
Feature activation+2.89e-02

Pos logit contributions
 picture+0.009
 smile+0.009
 dad+0.009
 say+0.009
 big+0.008
Neg logit contributions
ocol-0.016
-0.015
Too-0.015
ar-0.014
 situations-0.014
Token the
Token ID262
Feature activation+4.40e-03

Pos logit contributions
ocol+0.062
ions+0.059
 situations+0.054
orns+0.050
ipping+0.050
Neg logit contributions
 said-0.041
 say-0.035
 exclaimed-0.035
 picture-0.034
 says-0.034
Token house
Token ID2156
Feature activation-1.45e-02

Pos logit contributions
ocol+0.010
ions+0.009
 situations+0.008
+0.008
ipping+0.008
Neg logit contributions
 said-0.007
 say-0.006
 smile-0.006
 says-0.006
 know-0.005
Token,
Token ID11
Feature activation+6.07e-02

Pos logit contributions
 said+0.020
 say+0.018
 smile+0.017
 asked+0.016
 says+0.016
Neg logit contributions
ocol-0.033
ions-0.030
 situations-0.029
-0.028
ipping-0.028
Token but
Token ID475
Feature activation+8.52e-03

Pos logit contributions
ocol+0.125
ions+0.111
 situations+0.106
+0.103
ipping+0.101
Neg logit contributions
 said-0.096
 say-0.086
 smile-0.085
 says-0.082
 big-0.081
Token could
Token ID714
Feature activation-5.28e-02

Pos logit contributions
ocol+0.021
ions+0.019
 situations+0.018
+0.018
ipping+0.017
Neg logit contributions
 said-0.011
 smile-0.010
 say-0.009
 big-0.009
 asked-0.009
Token not
Token ID407
Feature activation-1.20e-02

Pos logit contributions
 said+0.081
 says+0.074
 picture+0.073
 say+0.071
 smile+0.069
Neg logit contributions
ocol-0.095
-0.091
-0.091
ions-0.086
Too-0.083
Token find
Token ID1064
Feature activation-1.70e-02

Pos logit contributions
 said+0.014
 says+0.013
 surprised+0.012
 replied+0.011
 smile+0.011
Neg logit contributions
ocol-0.027
-0.025
 situations-0.025
-0.024
ions-0.024
Token the
Token ID262
Feature activation+3.50e-03

Pos logit contributions
 said+0.027
 smile+0.025
 say+0.024
 asked+0.022
 surprised+0.022
Neg logit contributions
ocol-0.037
ions-0.033
 situations-0.031
-0.029
orns-0.028
Token cord
Token ID15050
Feature activation-2.19e-02

Pos logit contributions
ocol+0.008
ions+0.007
 situations+0.007
+0.007
 pleasures+0.007
Neg logit contributions
 said-0.005
 say-0.004
 smile-0.004
 asked-0.004
 says-0.004
Token.
Token ID13
Feature activation-1.81e-02

Pos logit contributions
 said+0.027
 smile+0.024
 say+0.023
 asked+0.022
 says+0.022
Neg logit contributions
ocol-0.053
ions-0.049
 situations-0.047
-0.045
ipping-0.044
Token to
Token ID284
Feature activation-5.50e-02

Pos logit contributions
ocol+0.065
ions+0.059
 situations+0.057
+0.057
ipping+0.055
Neg logit contributions
 said-0.017
 smile-0.013
 say-0.012
 big-0.011
 asked-0.011
Token his
Token ID465
Feature activation+3.30e-03

Pos logit contributions
 said+0.064
 say+0.056
 smile+0.055
 big+0.052
 dad+0.052
Neg logit contributions
ocol-0.148
ions-0.135
-0.130
ipping-0.127
-0.126
Token mom
Token ID1995
Feature activation-1.77e-01

Pos logit contributions
ocol+0.009
 situations+0.008
+0.008
 pleasures+0.008
+0.008
Neg logit contributions
 said-0.003
 say-0.002
 asked-0.002
 says-0.002
 exclaimed-0.002
Token.
Token ID13
Feature activation-2.00e-02

Pos logit contributions
 said+0.178
 smile+0.150
 say+0.140
 big+0.139
 asked+0.130
Neg logit contributions
ocol-0.477
ions-0.429
 situations-0.418
-0.408
ipping-0.401
Token
Token ID198
Feature activation-3.56e-03

Pos logit contributions
 said+0.029
 smile+0.027
 say+0.026
 big+0.025
 picture+0.025
Neg logit contributions
ocol-0.043
ions-0.038
-0.037
 situations-0.037
-0.036
Token
Token ID198
Feature activation-3.29e-03

Pos logit contributions
 smile+0.006
 said+0.005
 open+0.005
 say+0.005
 picture+0.005
Neg logit contributions
ocol-0.006
ions-0.006
-0.006
rim-0.005
Sp-0.005
TokenFrom
Token ID4863
Feature activation+8.41e-02

Pos logit contributions
 said+0.007
 say+0.006
 big+0.006
 smile+0.006
 asked+0.006
Neg logit contributions
ocol-0.007
ions-0.005
 situations-0.005
ubb-0.005
ipping-0.005
Token that
Token ID326
Feature activation-1.52e-01

Pos logit contributions
ocol+0.154
ions+0.147
orns+0.131
mm+0.131
ipping+0.130
Neg logit contributions
 smile-0.136
 know-0.124
 picture-0.121
 gasp-0.120
 understand-0.119
Token day
Token ID1110
Feature activation+3.13e-02

Pos logit contributions
 said+0.245
 smile+0.219
 say+0.215
 asked+0.202
 surprised+0.202
Neg logit contributions
ocol-0.313
ions-0.280
-0.271
 situations-0.268
-0.258
Token on
Token ID319
Feature activation-1.02e-02

Pos logit contributions
ocol+0.084
ions+0.073
 situations+0.073
+0.069
pill+0.069
Neg logit contributions
 said-0.037
 smile-0.031
 say-0.030
 big-0.030
 asked-0.028
Token,
Token ID11
Feature activation+6.39e-02

Pos logit contributions
 said+0.009
 mom+0.007
 smile+0.007
 big+0.007
 dad+0.006
Neg logit contributions
ocol-0.032
ions-0.029
pill-0.029
ubb-0.029
 situations-0.028
Token time
Token ID640
Feature activation+1.18e-01

Pos logit contributions
 said+0.130
 smile+0.117
 say+0.116
 asked+0.111
 says+0.108
Neg logit contributions
ocol-0.137
ions-0.116
 situations-0.111
-0.111
-0.107
Token,
Token ID11
Feature activation+6.55e-02

Pos logit contributions
ocol+0.326
ions+0.287
 situations+0.284
+0.274
umm+0.273
Neg logit contributions
 said-0.134
 smile-0.111
 big-0.107
 say-0.103
 dad-0.096
Token there
Token ID612
Feature activation-5.08e-02

Pos logit contributions
ocol+0.116
ions+0.109
+0.099
 situations+0.090
+0.090
Neg logit contributions
 said-0.111
 smile-0.107
 say-0.107
 picture-0.102
 says-0.098
Token was
Token ID373
Feature activation+4.21e-02

Pos logit contributions
 said+0.059
 smile+0.052
 say+0.050
 asked+0.046
 big+0.046
Neg logit contributions
ocol-0.131
ions-0.118
 situations-0.113
-0.111
-0.109
Token a
Token ID257
Feature activation-7.18e-02

Pos logit contributions
ocol+0.104
ions+0.097
+0.088
 situations+0.088
+0.087
Neg logit contributions
 said-0.050
 smile-0.045
 say-0.043
 says-0.040
 asked-0.038
Token little
Token ID1310
Feature activation-2.19e-02

Pos logit contributions
 said+0.063
 say+0.048
 smile+0.046
 says+0.043
 asked+0.043
Neg logit contributions
ocol-0.202
ions-0.178
-0.177
-0.173
 situations-0.172
Token girl
Token ID2576
Feature activation-9.03e-02

Pos logit contributions
 said+0.022
 smile+0.018
 say+0.018
 asked+0.016
 big+0.016
Neg logit contributions
ocol-0.060
ions-0.054
 situations-0.052
-0.052
ipping-0.051
Token named
Token ID3706
Feature activation+4.79e-02

Pos logit contributions
 said+0.156
 smile+0.142
 say+0.138
 big+0.131
 says+0.131
Neg logit contributions
ocol-0.180
ions-0.158
 situations-0.147
-0.146
ipping-0.139
Token Lily
Token ID20037
Feature activation+4.41e-02

Pos logit contributions
ocol+0.096
ions+0.086
+0.079
 situations+0.078
+0.076
Neg logit contributions
 said-0.082
 smile-0.075
 say-0.072
 surprised-0.068
 understand-0.068
Token.
Token ID13
Feature activation-1.68e-02

Pos logit contributions
ocol+0.074
ions+0.073
we+0.069
 situations+0.068
orns+0.066
Neg logit contributions
 said-0.066
 smile-0.066
 surprised-0.062
 say-0.060
 says-0.059
Token Lily
Token ID20037
Feature activation+4.46e-02

Pos logit contributions
 said+0.020
 smile+0.017
 say+0.017
 big+0.016
 asked+0.015
Neg logit contributions
ocol-0.045
ions-0.039
 situations-0.038
-0.038
ipping-0.037
Token happy
Token ID3772
Feature activation-9.13e-02

Pos logit contributions
ocol+0.112
ions+0.103
ubb+0.096
 accompl+0.095
ipping+0.095
Neg logit contributions
 said-0.049
 say-0.040
 smile-0.039
 says-0.038
 picture-0.038
Token and
Token ID290
Feature activation-2.24e-02

Pos logit contributions
 said+0.099
 smile+0.082
 say+0.079
 big+0.074
 asked+0.073
Neg logit contributions
ocol-0.252
ions-0.225
 situations-0.218
-0.214
ipping-0.209
Token it
Token ID340
Feature activation-4.19e-02

Pos logit contributions
 said+0.022
 smile+0.019
 picture+0.019
 say+0.019
 understand+0.018
Neg logit contributions
ocol-0.054
ions-0.050
-0.048
ipping-0.047
 situations-0.047
Token made
Token ID925
Feature activation-1.73e-02

Pos logit contributions
 said+0.045
 smile+0.038
 say+0.037
 big+0.034
 asked+0.034
Neg logit contributions
ocol-0.112
ions-0.101
 situations-0.097
-0.096
ipping-0.094
Token her
Token ID607
Feature activation-5.52e-02

Pos logit contributions
 said+0.018
 smile+0.015
 say+0.014
 big+0.014
 asked+0.013
Neg logit contributions
ocol-0.049
ions-0.043
-0.043
 situations-0.043
-0.041
Token mom
Token ID1995
Feature activation-1.73e-01

Pos logit contributions
 said+0.054
 asked+0.043
 whispered+0.040
 answered+0.039
 says+0.038
Neg logit contributions
ocol-0.132
 situations-0.125
-0.124
ions-0.122
ubb-0.121
Token smile
Token ID8212
Feature activation-1.74e-01

Pos logit contributions
 said+0.142
 says+0.107
 lost+0.107
 dad+0.106
 whispered+0.104
Neg logit contributions
ocol-0.434
ions-0.394
ipping-0.392
ubb-0.381
 pleasures-0.375
Token too
Token ID1165
Feature activation+2.33e-02

Pos logit contributions
 said+0.202
 smile+0.171
 say+0.167
 big+0.155
 asked+0.154
Neg logit contributions
ocol-0.459
ions-0.410
 situations-0.395
-0.390
ipping-0.380
Token.
Token ID13
Feature activation-1.66e-02

Pos logit contributions
ocol+0.068
ions+0.061
 situations+0.059
+0.058
orns+0.056
Neg logit contributions
 said-0.023
 smile-0.019
 say-0.018
 big-0.017
 asked-0.016
Token<|endoftext|>
Token ID50256
Feature activation-1.99e-02

Pos logit contributions
 said+0.019
 smile+0.016
 say+0.016
 big+0.015
 asked+0.015
Neg logit contributions
ocol-0.044
ions-0.039
 situations-0.038
-0.037
ipping-0.037
TokenOnce
Token ID7454
Feature activation+3.12e-02

Pos logit contributions
 said+0.031
 big+0.026
 smile+0.026
 asked+0.026
 say+0.025
Neg logit contributions
ocol-0.048
 situations-0.040
ions-0.039
ubb-0.038
umm-0.038
Token was
Token ID373
Feature activation+4.75e-02

Pos logit contributions
 smile+0.005
 say+0.004
 said+0.004
 picture+0.004
 big+0.004
Neg logit contributions
ocol-0.010
ions-0.010
-0.009
rim-0.009
ar-0.009
Token glad
Token ID9675
Feature activation-1.06e-01

Pos logit contributions
ocol+0.118
ions+0.108
 accompl+0.101
ubb+0.099
ipping+0.097
Neg logit contributions
 said-0.052
 say-0.045
 says-0.044
 smile-0.044
 picture-0.042
Token that
Token ID326
Feature activation-1.48e-01

Pos logit contributions
 said+0.115
 smile+0.093
 say+0.090
 asked+0.084
 big+0.083
Neg logit contributions
ocol-0.304
ions-0.267
 situations-0.262
-0.260
ipping-0.251
Token her
Token ID607
Feature activation-4.91e-02

Pos logit contributions
 said+0.153
 smile+0.119
 say+0.117
 asked+0.109
 big+0.108
Neg logit contributions
ocol-0.444
ions-0.384
rim-0.374
ipping-0.374
-0.373
Token mom
Token ID1995
Feature activation-1.70e-01

Pos logit contributions
 said+0.034
 say+0.025
 smile+0.025
 asked+0.023
 understand+0.021
Neg logit contributions
ocol-0.147
ions-0.133
 situations-0.127
-0.126
-0.124
Tokenmy
Token ID1820
Feature activation-1.54e-01

Pos logit contributions
 said+0.144
 big+0.140
 dad+0.138
 smile+0.138
 say+0.132
Neg logit contributions
ocol-0.409
ions-0.408
we-0.371
-0.368
ipping-0.367
Token was
Token ID373
Feature activation+4.74e-02

Pos logit contributions
 said+0.160
 smile+0.149
 say+0.145
 big+0.140
 dad+0.138
Neg logit contributions
ocol-0.372
ions-0.352
-0.333
 situations-0.323
ipping-0.323
Token feeling
Token ID4203
Feature activation+2.40e-02

Pos logit contributions
ocol+0.117
ions+0.106
+0.098
 situations+0.098
ipping+0.097
Neg logit contributions
 said-0.058
 smile-0.050
 say-0.049
 says-0.048
 picture-0.045
Token better
Token ID1365
Feature activation-6.63e-03

Pos logit contributions
ocol+0.068
ions+0.061
 situations+0.059
+0.058
ipping+0.057
Neg logit contributions
 said-0.025
 smile-0.020
 say-0.019
 big-0.018
 asked-0.018
Token,
Token ID11
Feature activation+6.63e-02

Pos logit contributions
 said+0.008
 smile+0.006
 say+0.006
 asked+0.006
 big+0.006
Neg logit contributions
ocol-0.018
ions-0.016
 situations-0.016
-0.015
pill-0.015
Token and
Token ID290
Feature activation-2.27e-02

Pos logit contributions
ocol+0.185
ions+0.164
 situations+0.158
+0.156
ipping+0.151
Neg logit contributions
 said-0.076
 smile-0.065
 say-0.062
 big-0.057
 asked-0.057
Token what
Token ID644
Feature activation+1.29e-02

Pos logit contributions
 said+0.252
 mom+0.201
 big+0.201
 say+0.199
 smile+0.199
Neg logit contributions
ocol-0.561
-0.506
umm-0.503
-0.494
rim-0.481
Token to
Token ID284
Feature activation-5.58e-02

Pos logit contributions
ocol+0.038
ions+0.034
 situations+0.033
+0.033
ipping+0.032
Neg logit contributions
 said-0.012
 smile-0.009
 say-0.009
 big-0.008
 asked-0.008
Token do
Token ID466
Feature activation+4.31e-02

Pos logit contributions
 said+0.074
 surprised+0.069
 confused+0.066
 smile+0.065
 dad+0.064
Neg logit contributions
ocol-0.112
 situations-0.098
ions-0.097
ipping-0.095
mm-0.092
Token.
Token ID13
Feature activation-2.14e-02

Pos logit contributions
ocol+0.136
ions+0.119
+0.115
 situations+0.115
ubb+0.115
Neg logit contributions
 said-0.037
 say-0.026
 smile-0.026
 asked-0.025
 big-0.024
Token But
Token ID887
Feature activation+8.36e-02

Pos logit contributions
 said+0.037
 smile+0.033
 say+0.032
 big+0.031
 asked+0.031
Neg logit contributions
ocol-0.045
ions-0.039
 situations-0.037
-0.036
ipping-0.035
Token then
Token ID788
Feature activation-1.28e-01

Pos logit contributions
ocol+0.229
ions+0.200
 situations+0.199
+0.191
umm+0.190
Neg logit contributions
 said-0.102
 say-0.081
 smile-0.080
 asked-0.078
 big-0.075
Token,
Token ID11
Feature activation+6.36e-02

Pos logit contributions
 said+0.144
 smile+0.111
 asked+0.109
 saw+0.107
 mom+0.105
Neg logit contributions
ocol-0.370
ions-0.327
umm-0.322
 situations-0.320
rim-0.316
Token Lily
Token ID20037
Feature activation+4.36e-02

Pos logit contributions
ocol+0.169
ions+0.148
 situations+0.146
+0.141
ipping+0.139
Neg logit contributions
 said-0.080
 smile-0.066
 say-0.065
 big-0.062
 asked-0.062
Token's
Token ID338
Feature activation-2.87e-02

Pos logit contributions
+0.089
ipping+0.084
ocol+0.084
ar+0.084
Sp+0.081
Neg logit contributions
 smile-0.044
 say-0.044
 picture-0.042
 know-0.041
 understand-0.041
Token mom
Token ID1995
Feature activation-1.78e-01

Pos logit contributions
 said+0.033
 says+0.028
 exclaimed+0.027
 say+0.027
 asked+0.025
Neg logit contributions
ocol-0.065
 situations-0.061
ubb-0.059
ipping-0.059
 environment-0.058
Token came
Token ID1625
Feature activation+6.45e-02

Pos logit contributions
 smile+0.209
 understand+0.195
 picture+0.194
 know+0.193
 gasp+0.190
Neg logit contributions
ocol-0.365
ions-0.341
ipping-0.335
-0.332
we-0.323
Token be
Token ID307
Feature activation+1.16e-01

Pos logit contributions
 said+0.037
 smile+0.031
 say+0.031
 big+0.029
 asked+0.028
Neg logit contributions
ocol-0.079
ions-0.071
 situations-0.068
-0.067
ipping-0.066
Token very
Token ID845
Feature activation-5.52e-02

Pos logit contributions
ocol+0.322
ions+0.285
+0.277
 situations+0.273
ipping+0.270
Neg logit contributions
 said-0.133
 smile-0.108
 big-0.106
 say-0.105
 asked-0.102
Token nice
Token ID3621
Feature activation+3.87e-02

Pos logit contributions
 said+0.066
 smile+0.060
 picture+0.059
 say+0.059
 says+0.055
Neg logit contributions
ocol-0.129
 situations-0.109
ions-0.108
-0.105
ubb-0.105
Token to
Token ID284
Feature activation-5.64e-02

Pos logit contributions
ocol+0.110
ions+0.100
 situations+0.097
+0.095
ipping+0.093
Neg logit contributions
 said-0.035
 smile-0.029
 say-0.028
 big-0.025
 asked-0.025
Token go
Token ID467
Feature activation+8.63e-02

Pos logit contributions
 said+0.064
 smile+0.054
 say+0.054
 big+0.050
 asked+0.049
Neg logit contributions
ocol-0.153
ions-0.137
 situations-0.131
-0.131
ipping-0.127
Token up
Token ID510
Feature activation-1.27e-01

Pos logit contributions
ocol+0.234
ions+0.209
 situations+0.202
+0.199
ipping+0.194
Neg logit contributions
 said-0.096
 smile-0.081
 say-0.078
 big-0.073
 asked-0.072
Token the
Token ID262
Feature activation-3.34e-04

Pos logit contributions
 said+0.112
 big+0.089
 smile+0.087
 say+0.080
 asked+0.080
Neg logit contributions
ocol-0.395
ions-0.349
-0.336
umm-0.335
 situations-0.332
Token hill
Token ID12788
Feature activation+7.55e-02

Pos logit contributions
 said+0.001
 smile+0.001
 say+0.001
 asked+0.001
 dad+0.001
Neg logit contributions
ocol-0.001
ions-0.000
 situations-0.000
-0.000
-0.000
Token.
Token ID13
Feature activation-2.06e-02

Pos logit contributions
ocol+0.196
ions+0.177
 situations+0.170
+0.167
ipping+0.164
Neg logit contributions
 said-0.087
 smile-0.074
 say-0.072
 big-0.067
 asked-0.066
Token
Token ID198
Feature activation-3.30e-03

Pos logit contributions
 said+0.027
 smile+0.025
 say+0.024
 big+0.023
 picture+0.022
Neg logit contributions
ocol-0.048
ions-0.043
 situations-0.042
-0.042
-0.041
Token
Token ID198
Feature activation-3.47e-03

Pos logit contributions
 smile+0.005
 said+0.005
 picture+0.004
 say+0.004
 open+0.004
Neg logit contributions
ocol-0.006
ions-0.006
-0.006
F-0.005
ground-0.005
Token today
Token ID1909
Feature activation+6.31e-02

Pos logit contributions
ocol+0.180
ions+0.158
 situations+0.153
+0.150
ubb+0.149
Neg logit contributions
 said-0.076
 smile-0.062
 say-0.061
 big-0.057
 asked-0.057
Token.
Token ID13
Feature activation-1.70e-02

Pos logit contributions
ocol+0.190
ions+0.165
+0.160
 situations+0.160
ubb+0.159
Neg logit contributions
 said-0.061
 smile-0.046
 big-0.045
 say-0.045
 asked-0.043
Token She
Token ID1375
Feature activation-3.15e-03

Pos logit contributions
 said+0.020
 smile+0.017
 say+0.017
 big+0.016
 asked+0.016
Neg logit contributions
ocol-0.045
ions-0.040
 situations-0.038
-0.038
ipping-0.037
Token was
Token ID373
Feature activation+4.09e-02

Pos logit contributions
 smile+0.003
 say+0.003
 said+0.003
 understand+0.003
 dad+0.003
Neg logit contributions
ocol-0.007
-0.006
ions-0.006
pill-0.006
Too-0.006
Token very
Token ID845
Feature activation-5.27e-02

Pos logit contributions
ocol+0.093
ions+0.085
 accompl+0.083
 managing+0.077
 centimeters+0.077
Neg logit contributions
 said-0.047
 smile-0.041
 say-0.041
 says-0.040
 replied-0.038
Token excited
Token ID6568
Feature activation-1.29e-01

Pos logit contributions
 said+0.057
 say+0.049
 picture+0.049
 smile+0.049
 replied+0.046
Neg logit contributions
ocol-0.129
ubb-0.109
 accompl-0.108
ions-0.107
-0.105
Token to
Token ID284
Feature activation-5.36e-02

Pos logit contributions
 said+0.143
 smile+0.128
 say+0.121
 understand+0.111
 big+0.111
Neg logit contributions
ocol-0.324
ions-0.300
 situations-0.292
-0.278
ipping-0.277
Token show
Token ID905
Feature activation-2.21e-02

Pos logit contributions
 said+0.074
 surprised+0.063
 lost+0.058
 says+0.058
 exclaimed+0.057
Neg logit contributions
ocol-0.112
 situations-0.099
ions-0.097
-0.097
 occasions-0.095
Token her
Token ID607
Feature activation-5.60e-02

Pos logit contributions
 said+0.031
 saw+0.028
 smile+0.027
 surprised+0.026
 lost+0.026
Neg logit contributions
ocol-0.046
ions-0.042
 situations-0.041
 occasions-0.037
ipping-0.036
Token teacher
Token ID4701
Feature activation-7.70e-02

Pos logit contributions
 said+0.063
 say+0.057
 asked+0.050
 smile+0.050
 whispered+0.049
Neg logit contributions
ocol-0.123
ions-0.119
 situations-0.118
-0.109
 Wed-0.108
Token how
Token ID703
Feature activation+1.53e-02

Pos logit contributions
 said+0.101
 smile+0.088
 say+0.085
 big+0.080
 asked+0.080
Neg logit contributions
ocol-0.190
ions-0.170
 situations-0.162
-0.159
ipping-0.155