Discussion: syncrepl consumer is slow
Howard Chu
2015-01-29 03:12:17 UTC
Permalink

One thing I just noticed, while testing replication with 3 servers on my
laptop - during a refresh, the provider gets blocked waiting to write to
the consumers after writing about 4000 entries. I.e., the consumers
aren't processing fast enough to keep up with the search running on the
provider.

(That's actually not too surprising since reads are usually faster than
writes anyway.)

The consumer code has lots of problems as it is, just adding this note
to the pile.

I'm considering adding an option to the consumer to write its entries
with dbnosync during the refresh phase. The rationale being, there's
nothing to lose anyway if the refresh is interrupted. I.e., the consumer
can't update its contextCSN until the very end of the refresh, so any
partial refresh that gets interrupted is wasted effort - the consumer
will always have to start over from the beginning on its next refresh
attempt. As such, there's no point in safely/synchronously writing any
of the received entries - they're useless until the final contextCSN update.

The implementation approach would be to define a new control e.g. "fast
write" for the consumer to pass to the underlying backend on any write
op. We would also have to e.g. add an MDB_TXN_NOSYNC flag to
mdb_txn_begin() (BDB already has the equivalent flag).
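
To sketch the idea (illustrative only - MDB_TXN_NOSYNC is the proposed
addition, not something LMDB has today, and the control plumbing is
omitted):

    /* Sketch: MDB_TXN_NOSYNC is the proposed per-txn flag; stock LMDB
     * only has MDB_NOSYNC as an environment-wide flag set at
     * mdb_env_open() time. */
    #include <lmdb.h>

    int consumer_store(MDB_env *env, MDB_dbi dbi,
                       MDB_val *key, MDB_val *data, int refreshing)
    {
        MDB_txn *txn;
        int rc;

        /* During refresh, skip the fsync on commit: if we crash, the
         * whole refresh restarts anyway, since contextCSN is only
         * written at the very end. */
        rc = mdb_txn_begin(env, NULL, refreshing ? MDB_TXN_NOSYNC : 0, &txn);
        if (rc)
            return rc;
        rc = mdb_put(txn, dbi, key, data, 0);
        if (rc) {
            mdb_txn_abort(txn);
            return rc;
        }
        return mdb_txn_commit(txn);
    }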

This would only be used for writes that are part of a refresh phase. In
persist mode the provider and consumers' write speeds should be more
closely matched so it wouldn't be necessary or useful.

Comments?
--
-- Howard Chu
CTO, Symas Corp. http://www.symas.com
Director, Highland Sun http://highlandsun.com/hyc/
Chief Architect, OpenLDAP http://www.openldap.org/project/
Quanah Gibson-Mount
2015-01-29 04:20:55 UTC
Permalink
Post by Howard Chu
This would only be used for writes that are part of a refresh phase. In
persist mode the provider and consumers' write speeds should be more
closely matched so it wouldn't be necessary or useful.
I've had a few cases on extremely busy systems with multiple replicas/mmr
nodes where they literally never catch up. Only way I've been able to
resolve those cases is to stop them, slapcat the master, slapadd, and
restart. Hopefully this change would alleviate that scenario.

--Quanah
--
Quanah Gibson-Mount
Platform Architect
Zimbra, Inc
--------------------
Zimbra :: the leader in open source messaging and collaboration
Howard Chu
2015-01-29 04:34:28 UTC
Permalink
Post by Quanah Gibson-Mount
Post by Howard Chu
This would only be used for writes that are part of a refresh phase. In
persist mode the provider and consumers' write speeds should be more
closely matched so it wouldn't be necessary or useful.
I've had a few cases on extremely busy systems with multiple
replicas/mmr nodes where they literally never catch up. Only way I've
been able to resolve those cases is to stop them, slapcat the master,
slapadd, and restart. Hopefully this change would alleviate that scenario.
Yes, I'm seeing the same thing. And yes, that's my hope as well. Not
sure if it's enough; like I said there are other performance issues in
the consumer code.
--
-- Howard Chu
CTO, Symas Corp. http://www.symas.com
Director, Highland Sun http://highlandsun.com/hyc/
Chief Architect, OpenLDAP http://www.openldap.org/project/
Emmanuel Lécharny
2015-01-29 07:10:44 UTC
Permalink
Post by Howard Chu
One thing I just noticed, while testing replication with 3 servers on
my laptop - during a refresh, the provider gets blocked waiting to
write to the consumers after writing about 4000 entries. I.e., the
consumers aren't processing fast enough to keep up with the search
running on the provider.
(That's actually not too surprising since reads are usually faster
than writes anyway.)
The consumer code has lots of problems as it is, just adding this note
to the pile.
I'm considering adding an option to the consumer to write its entries
with dbnosync during the refresh phase. The rationale being, there's
nothing to lose anyway if the refresh is interrupted. I.e., the
consumer can't update its contextCSN until the very end of the
refresh, so any partial refresh that gets interrupted is wasted effort
- the consumer will always have to start over from the beginning on
its next refresh attempt. As such, there's no point in
safely/synchronously writing any of the received entries - they're
useless until the final contextCSN update.
The implementation approach would be to define a new control e.g.
"fast write" for the consumer to pass to the underlying backend on any
write op. We would also have to e.g. add an MDB_TXN_NOSYNC flag to
mdb_txn_begin() (BDB already has the equivalent flag).
This would only be used for writes that are part of a refresh phase.
In persist mode the provider and consumers' write speeds should be
more closely matched so it wouldn't be necessary or useful.
Comments?
The proposal sounds sane.

Speaking of which, we had a discussion about some other features that
would be nice to have: when a consumer reconnects to a provider, the
consumer has no idea how many entries it will receive. It would be
valuable to pass an extra piece of information in the exchanged cookie,
namely the number of updated entries. That could provide a hint for
users or admins who would like to know how long the update will take on
a consumer (assuming we log such information). Also, batching the
updates in the backend, i.e. grouping the updates before syncing them,
could be interesting to have, still associated with some logs, again
allowing the admin/user to follow the update progression.

Something like:

syncrepl : 1240 entries to update
syncrepl : 200/1240 entries updated
syncrepl : 400/1240 entries updated
...
syncrepl : server up to date.
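
As an illustration, the consumer-side logging for that could be as
small as this (the expected count coming from the hypothetical cookie
field described above; not existing code):

    #include <stdio.h>

    /* Sketch: progress logging, assuming the sync cookie carried a
     * hypothetical expected-update count. */
    void log_refresh_progress(int received, int expected)
    {
        if (expected > 0 && received % 200 == 0)
            fprintf(stderr, "syncrepl : %d/%d entries updated\n",
                    received, expected);
    }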
Hallvard Breien Furuseth
2015-01-30 09:21:40 UTC
Permalink
Post by Howard Chu
I'm considering adding an option to the consumer to write its entries with
dbnosync during the refresh phase. The rationale being, there's nothing to
lose anyway if the refresh is interrupted. I.e., the consumer can't update
its contextCSN until the very end of the refresh, so any partial refresh that
gets interrupted is wasted effort - the consumer will always have to start
over from the beginning on its next refresh attempt.
dbnosync loses consistency after a system crash, and it loses the knowledge
that the DB may be inconsistent. At least with back-mdb. The safe thing
to do after such a crash is to throw away the DB and fetch the entire thing
from the provider. Which I gather would need to happen automatically
with such an option.
--
Hallvard
Michael Ströder
2015-01-30 09:30:31 UTC
Permalink
Post by Hallvard Breien Furuseth
Post by Howard Chu
I'm considering adding an option to the consumer to write its entries with
dbnosync during the refresh phase. The rationale being, there's nothing to
lose anyway if the refresh is interrupted. I.e., the consumer can't update
its contextCSN until the very end of the refresh, so any partial refresh that
gets interrupted is wasted effort - the consumer will always have to start
over from the beginning on its next refresh attempt.
dbnosync loses consistency after a system crash, and it loses the knowledge
that the DB may be inconsistent. At least with back-mdb. The safe thing
to do after such a crash is to throw away the DB and fetch the entire thing
from the provider. Which I gather would need to happen automatically
with such an option.
From my purely operational standpoint:

The consumer does not have a valid contextCSN before being fully synced. This
must be ensured. Everything else can be handled separately. In a serious
deployment the monitoring will have the red light on for this replica, and a
decent health check in the load balancers will disable using this replica.

=> don't over-engineer too many things to happen automagically, especially if
you're not 100% sure that this auto-magic is rock-solid on every supported OS
platform and in every exotic operational situation.

Ciao, Michael.
Howard Chu
2015-02-03 04:11:50 UTC
Permalink
Post by Hallvard Breien Furuseth
Post by Howard Chu
I'm considering adding an option to the consumer to write its entries with
dbnosync during the refresh phase. The rationale being, there's nothing to
lose anyway if the refresh is interrupted. I.e., the consumer can't update
its contextCSN until the very end of the refresh, so any partial refresh that
gets interrupted is wasted effort - the consumer will always have to start
over from the beginning on its next refresh attempt.
dbnosync loses consistency after a system crash, and it loses the knowledge
that the DB may be inconsistent. At least with back-mdb. The safe thing
to do after such a crash is to throw away the DB and fetch the entire thing
from the provider. Which I gather would need to happen automatically
with such an option.
Another option here is simply to perform batching. Now that we have the
TXN API exposed in the backend interface, we could just batch up e.g.
500 entries per txn, much like slapadd -q already does. Ultimately we
ought to be able to get syncrepl refresh to occur at nearly the same
speed as slapadd -q.
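
For illustration, a minimal sketch of that batching pattern with plain
LMDB calls (the real change would go through the backend TXN interface;
the names here are made up):

    /* Sketch: accumulate BATCH entries per write transaction, the way
     * slapadd -q does, instead of one txn per entry. */
    #include <lmdb.h>

    #define BATCH 500

    static MDB_txn *batch_txn;
    static int batch_count;

    int refresh_store(MDB_env *env, MDB_dbi dbi, MDB_val *key, MDB_val *data)
    {
        int rc;

        if (batch_txn == NULL) {
            rc = mdb_txn_begin(env, NULL, 0, &batch_txn);
            if (rc)
                return rc;
        }
        rc = mdb_put(batch_txn, dbi, key, data, 0);
        if (rc) {
            mdb_txn_abort(batch_txn);
            batch_txn = NULL;
            batch_count = 0;
            return rc;
        }
        if (++batch_count == BATCH) {
            rc = mdb_txn_commit(batch_txn);
            batch_txn = NULL;
            batch_count = 0;
        }
        return rc;
    }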
--
-- Howard Chu
CTO, Symas Corp. http://www.symas.com
Director, Highland Sun http://highlandsun.com/hyc/
Chief Architect, OpenLDAP http://www.openldap.org/project/
Emmanuel Lécharny
2015-02-03 05:13:44 UTC
Permalink
Post by Howard Chu
Post by Hallvard Breien Furuseth
Post by Howard Chu
I'm considering adding an option to the consumer to write its entries with
dbnosync during the refresh phase. The rationale being, there's nothing to
lose anyway if the refresh is interrupted. I.e., the consumer can't update
its contextCSN until the very end of the refresh, so any partial refresh that
gets interrupted is wasted effort - the consumer will always have to start
over from the beginning on its next refresh attempt.
dbnosync loses consistency after a system crash, and it loses the knowledge
that the DB may be inconsistent. At least with back-mdb. The safe thing
to do after such a crash is to throw away the DB and fetch the entire thing
from the provider. Which I gather would need to happen automatically
with such an option.
Another option here is simply to perform batching. Now that we have
the TXN api exposed in the backend interface, we could just batch up
e.g. 500 entries per txn. much like slapadd -q already does.
Ultimately we ought to be able to get syncrepl refresh to occur at
nearly the same speed as slapadd -q.
Batching is OK, except that you never know how many entries you're going
to have, thus you will have to actually write the data after a period of
time, even if you don't have the 500 entries.

This is where it would be cool to extend the cookie to carry the
expected number of updates you are going to receive (which will
obviously be 1 in a normal running refresh-and-persist replication, but
> 1 most of the time when reconnecting). In this case, you can
anticipate the batching operation without having to take care of the
time issue.

My 2 cts.
Howard Chu
2015-02-03 08:41:30 UTC
Permalink
Post by Emmanuel Lécharny
Post by Howard Chu
Another option here is simply to perform batching. Now that we have
the TXN API exposed in the backend interface, we could just batch up
e.g. 500 entries per txn, much like slapadd -q already does.
Ultimately we ought to be able to get syncrepl refresh to occur at
nearly the same speed as slapadd -q.
Batching is OK, except that you never know how many entries you're going
to have, thus you will have to actually write the data after a period of
time, even if you don't have the 500 entries.
This isn't a problem - we know exactly when refresh completes, so we can
finish the batch regardless of how many entries are left over.
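
Continuing the illustrative batching sketch from earlier in the thread,
the end-of-refresh handling is just this (refreshDone being the
syncInfo message that ends the refresh):

    /* Sketch: on the refreshDone indication, commit whatever is left
     * in the open batch, however few entries it holds. */
    int refresh_finish(void)
    {
        int rc = 0;

        if (batch_txn) {
            rc = mdb_txn_commit(batch_txn);
            batch_txn = NULL;
            batch_count = 0;
        }
        /* ... then write the final contextCSN ... */
        return rc;
    }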

Testing this out with the experimental ITS#8040 patch - with lazy commit
the 2.8M entries (2.5GB data) takes ~10 minutes for the refresh to pull
them across. With batching 500 entries/txn+lazy commit it takes ~7
minutes, a decent improvement. It's still 2x slower than slapadd -q
though, which loads the data in 3-1/2 minutes.
--
-- Howard Chu
CTO, Symas Corp. http://www.symas.com
Director, Highland Sun http://highlandsun.com/hyc/
Chief Architect, OpenLDAP http://www.openldap.org/project/
Howard Chu
2015-02-03 08:54:26 UTC
Permalink
Post by Howard Chu
Post by Emmanuel Lécharny
Post by Howard Chu
Another option here is simply to perform batching. Now that we have
the TXN API exposed in the backend interface, we could just batch up
e.g. 500 entries per txn, much like slapadd -q already does.
Ultimately we ought to be able to get syncrepl refresh to occur at
nearly the same speed as slapadd -q.
Batching is OK, except that you never know how many entries you're going
to have, thus you will have to actually write the data after a period of
time, even if you don't have the 500 entries.
This isn't a problem - we know exactly when refresh completes, so we can
finish the batch regardless of how many entries are left over.
Testing this out with the experimental ITS#8040 patch - with lazy commit
the 2.8M entries (2.5GB data) takes ~10 minutes for the refresh to pull
them across. With batching 500 entries/txn+lazy commit it takes ~7
minutes, a decent improvement. It's still 2x slower than slapadd -q
though, which loads the data in 3-1/2 minutes.
In case anyone else wants to try this out, patch attached.
--
-- Howard Chu
CTO, Symas Corp. http://www.symas.com
Director, Highland Sun http://highlandsun.com/hyc/
Chief Architect, OpenLDAP http://www.openldap.org/project/
Emmanuel Lécharny
2015-02-03 09:42:19 UTC
Permalink
Post by Howard Chu
Post by Emmanuel Lécharny
Post by Howard Chu
Another option here is simply to perform batching. Now that we have
the TXN API exposed in the backend interface, we could just batch up
e.g. 500 entries per txn, much like slapadd -q already does.
Ultimately we ought to be able to get syncrepl refresh to occur at
nearly the same speed as slapadd -q.
Batching is OK, except that you never know how many entries you're going
to have, thus you will have to actually write the data after a period of
time, even if you don't have the 500 entries.
This isn't a problem - we know exactly when refresh completes, so we
can finish the batch regardless of how many entries are left over.
True for Refresh. I was thinking more specifically of updates when we
are connected.

The idea of pushing the expected number of updates within the cookie is
for information purposes: having this number traced in the logs and
monitored could help in cases where the refresh phase takes a long
time: the users will not stop the server thinking it has stalled.
Post by Howard Chu
Testing this out with the experimental ITS#8040 patch - with lazy
commit the 2.8M entries (2.5GB data) takes ~10 minutes for the refresh
to pull them across. With batching 500 entries/txn+lazy commit it
takes ~7 minutes, a decent improvement. It's still 2x slower than
slapadd -q though, which loads the data in 3-1/2 minutes.
Not bad at all. What makes it 2x slower, btw?
Howard Chu
2015-02-03 09:54:39 UTC
Permalink
Post by Emmanuel Lécharny
Post by Howard Chu
Post by Howard Chu
Another option here is simply to perform batching. Now that we have
the TXN API exposed in the backend interface, we could just batch up
e.g. 500 entries per txn, much like slapadd -q already does.
Ultimately we ought to be able to get syncrepl refresh to occur at
nearly the same speed as slapadd -q.
Batching is OK, except that you never know how many entries you're going
to have, thus you will have to actually write the data after a period of
time, even if you don't have the 500 entries.
This isn't a problem - we know exactly when refresh completes, so we
can finish the batch regardless of how many entries are left over.
True for Refresh. I was thinking more specifically of updates when we
are connected.
None of this is for Persist phase, I have only been talking about refresh.
Post by Emmanuel Lécharny
Post by Howard Chu
Testing this out with the experimental ITS#8040 patch - with lazy
commit the 2.8M entries (2.5GB data) takes ~10 minutes for the refresh
to pull them across. With batching 500 entries/txn+lazy commit it
takes ~7 minutes, a decent improvement. It's still 2x slower than
slapadd -q though, which loads the data in 3-1/2 minutes.
Not bad at all. What makes it 2x slower, btw?
Still looking into it. slapadd -q uses 2 threads, one to parse the LDIF
and one to write to the DB. syncrepl consumer only uses 1 thread.
Probably if we split reading from the network apart from writing to the
DB, that would make the difference.
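
A minimal sketch of that split - one thread reads and decodes entries
from the network and enqueues them, another dequeues and writes to the
DB. All the names and the entry type are invented for illustration;
this is not actual slapd code:

    #include <pthread.h>

    #define QSIZE 1024

    typedef struct entry_msg { void *entry; } entry_msg;

    static entry_msg queue[QSIZE];
    static int qhead, qtail, qcount, done;
    static pthread_mutex_t qlock = PTHREAD_MUTEX_INITIALIZER;
    static pthread_cond_t notempty = PTHREAD_COND_INITIALIZER;
    static pthread_cond_t notfull = PTHREAD_COND_INITIALIZER;

    /* network thread: hand a decoded entry over to the writer */
    void enqueue(entry_msg m)
    {
        pthread_mutex_lock(&qlock);
        while (qcount == QSIZE)
            pthread_cond_wait(&notfull, &qlock);
        queue[qtail] = m;
        qtail = (qtail + 1) % QSIZE;
        qcount++;
        pthread_cond_signal(&notempty);
        pthread_mutex_unlock(&qlock);
    }

    /* network thread: signal end of refresh */
    void enqueue_done(void)
    {
        pthread_mutex_lock(&qlock);
        done = 1;
        pthread_cond_broadcast(&notempty);
        pthread_mutex_unlock(&qlock);
    }

    /* writer thread: drain the queue, writing to the DB */
    void *db_writer(void *arg)
    {
        (void)arg;
        for (;;) {
            pthread_mutex_lock(&qlock);
            while (qcount == 0 && !done)
                pthread_cond_wait(&notempty, &qlock);
            if (qcount == 0) {      /* done and drained */
                pthread_mutex_unlock(&qlock);
                break;
            }
            entry_msg m = queue[qhead];
            qhead = (qhead + 1) % QSIZE;
            qcount--;
            pthread_cond_signal(&notfull);
            pthread_mutex_unlock(&qlock);
            /* ... store m.entry here, e.g. with the batched write
             * sketched earlier ... */
        }
        return NULL;
    }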
--
-- Howard Chu
CTO, Symas Corp. http://www.symas.com
Director, Highland Sun http://highlandsun.com/hyc/
Chief Architect, OpenLDAP http://www.openldap.org/project/
Emmanuel Lécharny
2015-02-03 13:16:34 UTC
Permalink
Post by Howard Chu
Post by Emmanuel Lécharny
Post by Emmanuel Lécharny
Post by Howard Chu
Another option here is simply to perform batching. Now that we have
the TXN API exposed in the backend interface, we could just batch up
e.g. 500 entries per txn, much like slapadd -q already does.
Ultimately we ought to be able to get syncrepl refresh to occur at
nearly the same speed as slapadd -q.
Batching is OK, except that you never know how many entries you're going
to have, thus you will have to actually write the data after a period of
time, even if you don't have the 500 entries.
This isn't a problem - we know exactly when refresh completes, so we
can finish the batch regardless of how many entries are left over.
True for Refresh. I was thinking more specifically of updates when we
are connected.
None of this is for Persist phase, I have only been talking about refresh.
Thanks for the clarification.
Post by Howard Chu
Post by Emmanuel Lécharny
Testing this out with the experimental ITS#8040 patch - with lazy
commit the 2.8M entries (2.5GB data) takes ~10 minutes for the refresh
to pull them across. With batching 500 entries/txn+lazy commit it
takes ~7 minutes, a decent improvement. It's still 2x slower than
slapadd -q though, which loads the data in 3-1/2 minutes.
Not bad at all. What makes it 2x slower, btw?
Still looking into it. slapadd -q uses 2 threads, one to parse the LDIF
and one to write to the DB. syncrepl consumer only uses 1 thread.
Probably if we split reading from the network apart from writing to the
DB, that would make the difference.
That would be worth a try. Although I expect the disk access to be the
bottleneck here, and using two threads might swamp the memory, up to a
point. Interesting problem, interesting benchmark to conduct ;-)

Emmanuel.
Emmanuel Lécharny
2015-05-11 17:15:49 UTC
Permalink

Restarting this thread...

we have had some interesting discussion today that I wanted to share.

Hypothesis: one server has been down for a long time, and its contextCSN
is older than that of the other servers, forcing a refresh with more
than the content of the accesslog.

Quanah said that on some heavily loaded servers, the only way for the
consumer to catch up is to stop it, slapcat the master, slapadd, and
restart. I wonder whether that could be a way to deal with servers that
are too far behind the running servers, but as a mechanism included in
the refresh phase (i.e., the restarted server would detect that it has
to grab the full set of entries and load them, as if a human being were
doing a slapcat/slapadd/restart).

More specifically, is there a way to know how many entries we will have
to update, and is there a way to know when it will be faster to be
brutal (the Quanah way) compared to letting the refresh mechanism do its
job?

Another point: as soon as the server is restarted, it can receive
incoming requests and will send back outdated responses until the
refresh is completed (and I'm not talking about updates that could also
be applied to an outdated base, with the consequences if some parents
are missing). In many cases that would be a real problem, typically if
the LDAP servers are part of a shared pool with a load-balancing
mechanism to spread the load. Wouldn't it be more realistic to simply
consider the server as unavailable until the refresh phase is completed?

Thanks!
Quanah Gibson-Mount
2015-05-11 17:47:35 UTC
Permalink
Content preview: --On Monday, May 11, 2015 8:15 PM +0200 Emmanuel Lécharny
<***@gmail.com> wrote: > Quanah said that in some heavily servers,
the only way for the consumer > to catch up is to slapcat/slapadd/restart
the consumer. I wonder if it > would not be a way to deal with server that
are to far behind the > running server, but as a mechanism that is included
in the refresh phase > (ie, the restarted server will detect that it has
to grab the set of > entries and load them, os if a human being was doing
a > slapcat/slapadd/restart). [...]

Content analysis details: (-4.3 points, 5.0 required)

pts rule name description
---- ---------------------- --------------------------------------------------
-2.3 RCVD_IN_DNSWL_MED RBL: Sender listed at http://www.dnswl.org/, medium
trust
[162.209.122.184 listed in list.dnswl.org]
-0.0 T_RP_MATCHES_RCVD Envelope sender domain matches handover relay
domain
-0.0 SPF_PASS SPF: sender matches SPF record
-1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1%
[score: 0.0000]
-0.1 DKIM_VALID_AU Message has a valid DKIM or DK signature from author's
domain
0.1 DKIM_SIGNED Message has a DKIM or DK signature, not necessarily valid
-0.1 DKIM_VALID Message has at least one valid DKIM or DK signature

--On Monday, May 11, 2015 8:15 PM +0200 Emmanuel Lécharny
Post by Emmanuel Lécharny
Quanah said that on some heavily loaded servers, the only way for the consumer
to catch up is to slapcat/slapadd/restart the consumer. I wonder if that
could be a way to deal with servers that are too far behind the
running server, but as a mechanism that is included in the refresh phase
(ie, the restarted server will detect that it has to grab the full set of
entries and load them, as if a human being were doing a
slapcat/slapadd/restart).
A specific example we had in the past was quarterly updates for students @
Stanford, which could push out tens of thousands of updates to the
single-node master. Generally, of the 6 slaves, 2-3 would remain current,
and the other 3 would fall hours or days behind. Since serving out
significantly out-of-date data was not an option, we'd generally have to
resort to reloading the ones that got stuck behind to get them sync'd up in
a timely fashion.
Post by Emmanuel Lécharny
Another point: as soon as the server is restarted, it can receive
incoming requests, and it will send back outdated responses until the
refresh is completed (and I'm not talking about updates that could also
be applied on an outdated base, with the consequences that follow if some
parents are missing). In many cases, that would be a real problem, typically
if the LDAP servers are part of a shared pool, with a load-balancing
mechanism to spread the load. Wouldn't it be more realistic to simply
consider the server as not available until the refresh phase is completed?
There's already an option for this, new for OpenLDAP 2.5 IIRC, that makes
it return LDAP_BUSY or some such until it is "caught up". However, if you
enable that option, it always returns this response, which is problematic,
because a server may routinely flip between "caught up" and not "caught
up". I.e., it is not unusual for a system to be a second or so behind
other masters. Here's real-world data from a check I just ran at a client
(a small comparison sketch follows the output):

[***@zm-mmr01 ~]$ ./libexec/zmreplchk
Master: ldap://zm-mmr01.client.net:389 ServerID: 1 Code: 6 Status: 0y 0M 0w
0d 0h 0m 1s behind CSNs:
20150504222317.897445Z#000000#001#000000
20150511174531.424005Z#000000#002#000000
20150501181032.360324Z#000000#00a#000000
20150511174535.964334Z#000000#00b#000000
Master: ldap://zm-mmr00.client.net:389 ServerID: 2 Code: 0 Status: In Sync
CSNs:
20150504222317.897445Z#000000#001#000000
20150511174531.424005Z#000000#002#000000
20150501181032.360324Z#000000#00a#000000
20150511174535.964334Z#000000#00b#000000
Master: ldap://nvl-mmr10.client.net:389 ServerID: 10 Code: 6 Status: 0y 0M
0w 0d 0h 0m 1s behind CSNs:
20150504222317.897445Z#000000#001#000000
20150511174531.424005Z#000000#002#000000
20150501181032.360324Z#000000#00a#000000
20150511174536.315403Z#000000#00b#000000
Master: ldap://nvl-mmr11.client.net:389 ServerID: 11 Code: 6 Status: 0y 0M
0w 0d 0h 0m 1s behind CSNs:
20150504222317.897445Z#000000#001#000000
20150511174531.424005Z#000000#002#000000
20150501181032.360324Z#000000#00a#000000
20150511174536.315403Z#000000#00b#000000
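
Worth noting how such a check can work: CSNs are fixed-width timestamp
strings, so lexicographic order matches chronological order and a plain
strcmp() per serverID is enough to tell whether one replica is behind
another. A minimal C sketch, using values from the output above (this is
not zmreplchk's actual code):

#include <stdio.h>
#include <string.h>

int main(void)
{
    /* contextCSN values for serverID 00b on two of the replicas above */
    const char *here  = "20150511174535.964334Z#000000#00b#000000";
    const char *there = "20150511174536.315403Z#000000#00b#000000";

    /* fixed-width fields: lexicographic order == chronological order */
    if (strcmp(here, there) < 0)
        printf("this replica is behind for serverID 00b\n");
    else
        printf("in sync (or ahead) for serverID 00b\n");
    return 0;
}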


--Quanah


--

Quanah Gibson-Mount
Platform Architect
Zimbra, Inc.
--------------------
Zimbra :: the leader in open source messaging and collaboration
Howard Chu
2015-05-11 20:17:59 UTC
Permalink
Content preview: Emmanuel Lécharny wrote: > Restarting this thread... > >
we have had some interesting discussion today that I wanted to share. > >
Hypothesis: one server has been down for a long time, and the contextCSN >
is older than that of the other servers, forcing a refresh mode with > more
than the content of the AccessLog. > > Quanah said that on some heavily
loaded servers, the only way for the consumer > to catch up is to
slapcat/slapadd/restart the consumer. I wonder if that > could be a way to
deal with servers that are too far behind the > running server, but as a
mechanism included in the refresh phase > (ie, the restarted server will
detect that it has to grab the full set of > entries and load them, as if a
human being were doing a > slapcat/slapadd/restart). > > More specifically,
is there a way to know how many entries we will have > to update, and is
there a way to know when it will be faster to be > brutal (the Quanah way)
compared to letting the refresh mechanism do its > job. [...]

Content analysis details: (-4.2 points, 5.0 required)

pts rule name description
---- ---------------------- --------------------------------------------------
-2.3 RCVD_IN_DNSWL_MED RBL: Sender listed at http://www.dnswl.org/, medium
trust
[69.43.206.106 listed in list.dnswl.org]
-1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1%
[score: 0.0000]
Post by Emmanuel Lécharny
Restarting this thread...
we have had some interesting discussion today that I wanted to share.
Hypothesis: one server has been down for a long time, and its contextCSN
is older than that of the other servers, forcing a refresh that covers
more than the content of the AccessLog.
Quanah said that on some heavily loaded servers, the only way for the consumer
to catch up is to slapcat/slapadd/restart the consumer. I wonder if that
could be a way to deal with servers that are too far behind the
running server, but as a mechanism that is included in the refresh phase
(ie, the restarted server will detect that it has to grab the full set of
entries and load them, as if a human being were doing a
slapcat/slapadd/restart).
More specifically, is there a way to know how many entries we will have
to update, and is there a way to know when it will be faster to be
brutal (the Quanah way) compared to letting the refresh mechanism do its
job?
Not a worthwhile direction to pursue. Doing the equivalent of a full
slapcat/slapadd across the network will use even more bandwidth than the
current syncrepl. None of this addresses the underlying causes of why the
consumer is slow, so the original problem will remain.

There are two main problems:
1) the AVL tree used for presentlist is still extremely inefficient in both
CPU and memory use.
2) the consumer does twice as much work for a single modification as the
provider. I.e., the consumer does a write op to the backend for the
modification, and then a second write op to update its contextCSN. The
provider only does the original modification, and caches the contextCSN update.

If we fix both of these issues, consumer speed should be much faster. Nothing
else is worth investigating until these two areas are reworked.

For (1) I've been considering a stripped down memory-only version of LMDB.
There are plenty of existing memory-only Btree implementations out there
already though, if anyone has a favorite it would probably save us some time
to use an existing library. The Linux kernel has one (lib/btree.c) but it's
under GPL so we can't use it directly.
Post by Emmanuel Lécharny
Another point: as soon as the server is restarted, it can receive
incoming requests, and it will send back outdated responses until the
refresh is completed (and I'm not talking about updates that could also
be applied on an outdated base, with the consequences that follow if some
parents are missing). In many cases, that would be a real problem, typically
if the LDAP servers are part of a shared pool, with a load-balancing
mechanism to spread the load. Wouldn't it be more realistic to simply
consider the server as not available until the refresh phase is completed?
This was ITS#7616. We tried it and it caused a lot of problems. It has been
reverted.
--
-- Howard Chu
CTO, Symas Corp. http://www.symas.com
Director, Highland Sun http://highlandsun.com/hyc/
Chief Architect, OpenLDAP http://www.openldap.org/project/
Emmanuel Lécharny
2015-05-11 22:34:34 UTC
Permalink
Content preview: On 11/05/15 22:17, Howard Chu wrote: > Emmanuel Lécharny
wrote: >> Restarting this thread... >> >> we have had some interesting
discussion today that I wanted to share. >> >> Hypothesis: one server has
been down for a long time, and the contextCSN >> is older than that of the
other servers, forcing a refresh mode with >> more than the content of the
AccessLog. >> Quanah said that on some heavily loaded servers, the only way
for the consumer >> to catch up is to slapcat/slapadd/restart the consumer.
I wonder if that >> could be a way to deal with servers that are too far
behind the >> running server, but as a mechanism included in the refresh
phase >> (ie, the restarted server will detect that it has to grab the full
set of >> entries and load them, as if a human being were doing a >>
slapcat/slapadd/restart). >> More specifically, is there a way to know how
many entries we will have >> to update, and is there a way to know when it
will be faster to be >> brutal (the Quanah way) compared to letting the
refresh mechanism do its >> job. > Not a worthwhile direction to pursue.
Doing the equivalent of a full > slapcat/slapadd across the network will
use even more bandwidth than > the current syncrepl. None of this addresses
the underlying causes of > why the consumer is slow, so the original
problem will remain. [...]

Content analysis details: (-2.7 points, 5.0 required)

pts rule name description
---- ---------------------- --------------------------------------------------
-0.7 RCVD_IN_DNSWL_LOW RBL: Sender listed at http://www.dnswl.org/, low
trust
[74.125.82.53 listed in list.dnswl.org]
0.0 FREEMAIL_FROM Sender email is commonly abused enduser mail provider
(elecharny[at]gmail.com)
-0.0 SPF_PASS SPF: sender matches SPF record
-1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1%
[score: 0.0000]
-0.1 DKIM_VALID_AU Message has a valid DKIM or DK signature from author's
domain
0.1 DKIM_SIGNED Message has a DKIM or DK signature, not necessarily valid
-0.1 DKIM_VALID Message has at least one valid DKIM or DK signature
Post by Howard Chu
Post by Emmanuel Lécharny
Restarting this thread...
we have had some interesting discussion today that I wanted to share.
Hypothesis: one server has been down for a long time, and its contextCSN
is older than that of the other servers, forcing a refresh that covers
more than the content of the AccessLog.
Quanah said that on some heavily loaded servers, the only way for the consumer
to catch up is to slapcat/slapadd/restart the consumer. I wonder if that
could be a way to deal with servers that are too far behind the
running server, but as a mechanism that is included in the refresh phase
(ie, the restarted server will detect that it has to grab the full set of
entries and load them, as if a human being were doing a
slapcat/slapadd/restart).
More specifically, is there a way to know how many entries we will have
to update, and is there a way to know when it will be faster to be
brutal (the Quanah way) compared to letting the refresh mechanism do its
job?
Not a worthwhile direction to pursue. Doing the equivalent of a full
slapcat/slapadd across the network will use even more bandwidth than
the current syncrepl. None of this addresses the underlying causes of
why the consumer is slow, so the original problem will remain.
IMHO, network congestion is not a real problem. Assuming you are running a
1Gb ethernet network, the time it takes to transmit 1 million 1KB entries
is only about 10 seconds (1M x 1KB = 1GB, i.e. 8Gb of traffic at 1Gb/s).
It will be barely noticeable compared to the time it will take to load
those 1M entries into your consumer. Even with a 100Gb ethernet network,
this is not a big part of the problem.
Post by Howard Chu
1) the AVL tree used for presentlist is still extremely inefficient
in both CPU and memory use.
2) the consumer does twice as much work for a single modification as
the provider. I.e., the consumer does a write op to the backend for
the modification, and then a second write op to update its contextCSN.
Updating the contextCSN is an extra operation on the consumer, but as
you have to update potentially tens of indexes when updating an entry
(on both the consumer and the producer), it's not really twice the
work. It's an additional operation, but it would not double the time
it costs compared to the producer.

The question would be: how do we update the contextCSN only
periodically, to mitigate this extra cost? It seems you proposed to
batch the updates for this reason. By using batches of 500 updates, this
extra cost becomes almost unnoticeable, and one would expect the work on
the consumer to be about the same as on the producer side, right?
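
To illustrate the batching idea, a minimal C sketch (names are
illustrative, not the actual slapd code): the consumer defers the
contextCSN write and flushes it once every BATCH_SIZE entry writes, with
one final flush at the end of the refresh:

#include <stdio.h>
#include <string.h>

#define BATCH_SIZE 500

struct consumer_state {
    char pending_csn[64];     /* newest CSN applied but not yet persisted */
    int  writes_since_flush;
};

/* stand-ins for the backend write ops; each is normally one synchronous txn */
static void backend_write_entry(const char *dn)     { printf("write %s\n", dn); }
static void backend_write_contextcsn(const char *c) { printf("flush CSN %s\n", c); }

static void consumer_apply(struct consumer_state *st, const char *dn,
                           const char *csn)
{
    backend_write_entry(dn);                        /* one txn per entry */
    strncpy(st->pending_csn, csn, sizeof(st->pending_csn) - 1);
    st->pending_csn[sizeof(st->pending_csn) - 1] = '\0';
    if (++st->writes_since_flush >= BATCH_SIZE) {
        backend_write_contextcsn(st->pending_csn);  /* 1 extra txn per 500 */
        st->writes_since_flush = 0;
    }
}

With BATCH_SIZE = 500, the contextCSN writes add 0.2% to the write count
instead of 100%.
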
Post by Howard Chu
The provider only does the original modification, and caches the
contextCSN update.
If we fix both of these issues, consumer speed should be much faster.
Nothing else is worth investigating until these two areas are reworked.
Agreed in most cases. Although for use cases where a large number
of updates have occurred while a consumer is offline, another
strategy might work. That this other strategy is to stop the consumer,
slapcat the producer, slapadd the result and restart the server, all
from the command line instead of having it implemented in the server
code, was what I was suggesting; but this is another story, for a corner
case that is not frequent. Plus we don't know at which point this would
be the correct strategy (ie, for how many updates should we consider it
a better strategy than the current implementation?).
Post by Howard Chu
For (1) I've been considering a stripped down memory-only version of
LMDB. There are plenty of existing memory-only Btree implementations
out there already though, if anyone has a favorite it would probably
save us some time to use an existing library. The Linux kernel has one
(lib/btree.c) but it's under GPL so we can't use it directly.
Q: do you need to keep the presentList in a BTree at all?
Post by Howard Chu
Post by Emmanuel Lécharny
Another point: as soon as the server is restarted, it can receive
incoming requests, and it will send back outdated responses until the
refresh is completed (and I'm not talking about updates that could also
be applied on an outdated base, with the consequences that follow if some
parents are missing). In many cases, that would be a real problem, typically
if the LDAP servers are part of a shared pool, with a load-balancing
mechanism to spread the load. Wouldn't it be more realistic to simply
consider the server as not available until the refresh phase is completed?
This was ITS#7616. We tried it and it caused a lot of problems. It has
been reverted.
The two options were to either send a referral (not ideal, as we have no
control whatsoever over the client API) or return LDAP_BUSY. A third option
would be possible: chaining the request to the server the replication
updates are coming from. Doing so would guarantee that the client gets an
updated version of the data, as the producer is up to date. There is still
an issue though if both servers are replicating each other (pretty much the
same problem as with referrals). OTOH, if the other server is also in
refresh mode, it should be possible to return LDAP_BUSY if it is capable of
detecting that the request comes from another server, not from a client.
Maybe it's far-fetched...
Howard Chu
2015-05-12 12:34:02 UTC
Permalink
Content preview: Emmanuel Lécharny wrote: > On 11/05/15 22:17, Howard Chu
wrote: >> There are two main problems: >> 1) the AVL tree used for presentlist
is still extremely inefficient >> in both CPU and memory use. >> 2) the consumer
does twice as much work for a single modification as >> the provider. I.e.,
the consumer does a write op to the backend for >> the modification, and
then a second write op to update its contextCSN. [...]

Content analysis details: (-4.2 points, 5.0 required)

pts rule name description
---- ---------------------- --------------------------------------------------
-2.3 RCVD_IN_DNSWL_MED RBL: Sender listed at http://www.dnswl.org/, medium
trust
[69.43.206.106 listed in list.dnswl.org]
-1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1%
[score: 0.0000]
Post by Emmanuel Lécharny
Post by Howard Chu
1) the AVL tree used for presentlist is still extremely inefficient
in both CPU and memory use.
2) the consumer does twice as much work for a single modification as
the provider. I.e., the consumer does a write op to the backend for
the modification, and then a second write op to update its contextCSN.
Updating the contextCSN is an extra operation on the consumer, but as
you have to update potentially tens of indexes when updating an entry
(on both the consumer and the producer), it's not really twice the
work. It's an additional operation, but it would not double the time
it costs compared to the producer.
You're forgetting one very important thing - each operation is a single
transaction in the backend, and transactions are synchronous by default. The
main cost is not the indexing, it's the txn fsync, and yes, it is twice the
cost when you're doing two txns instead of just one.
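
To make that concrete, a minimal LMDB sketch of what the consumer
effectively does per modification today (env/dbi setup and error handling
omitted; only the transaction boundaries matter). Each mdb_txn_commit() in
a synchronous environment ends in an fsync:

#include <lmdb.h>

static void apply_one_mod(MDB_env *env, MDB_dbi dbi,
                          MDB_val *ekey, MDB_val *edata,  /* the entry */
                          MDB_val *ckey, MDB_val *cdata)  /* the contextCSN */
{
    MDB_txn *txn;

    mdb_txn_begin(env, NULL, 0, &txn);
    mdb_put(txn, dbi, ekey, edata, 0);   /* the modification itself */
    mdb_txn_commit(txn);                 /* fsync #1 */

    mdb_txn_begin(env, NULL, 0, &txn);
    mdb_put(txn, dbi, ckey, cdata, 0);   /* the contextCSN update */
    mdb_txn_commit(txn);                 /* fsync #2 */
}

Folding the contextCSN put into the same transaction, or batching it as
discussed above, removes the second fsync.
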
Post by Emmanuel Lécharny
The question would be: how do we update the contextCSN only
periodically, to mitigate this extra cost? It seems you proposed to
batch the updates for this reason. By using batches of 500 updates, this
extra cost becomes almost unnoticeable, and one would expect the work on
the consumer to be about the same as on the producer side, right?
Yes.
Post by Emmanuel Lécharny
Post by Howard Chu
The provider only does the original modification, and caches the
contextCSN update.
If we fix both of these issues, consumer speed should be much faster.
Nothing else is worth investigating until these two areas are reworked.
Agreed in most cases. Although for use cases where a large number
of updates have occurred while a consumer is offline, another
strategy might work. That this other strategy is to stop the consumer,
slapcat the producer, slapadd the result and restart the server, all
from the command line instead of having it implemented in the server
code, was what I was suggesting; but this is another story, for a corner
case that is not frequent. Plus we don't know at which point this would
be the correct strategy (ie, for how many updates should we consider it
a better strategy than the current implementation?).
If the consumer has been offline for a long time, then this discussion is
moot. No clients will be looking at it, so the risk of serving out-of-date
information to clients is zero. In that case, it doesn't matter what strategy
you use, they'll all work.
Post by Emmanuel Lécharny
Post by Howard Chu
For (1) I've been considering a stripped down memory-only version of
LMDB. There are plenty of existing memory-only Btree implementations
out there already though, if anyone has a favorite it would probably
save us some time to use an existing library. The Linux kernel has one
(lib/btree.c) but it's under GPL so we can't use it directly.
Q: do you need to keep the presentList in a BTree at all?
Good question. We process it by doing a single search over the target range,
and removing presentlist entries for each entry returned by the search. Since
the search order is random, we want fast search access to the presentlist.

We could alternatively do a dynamic array and walk the presentlist in order,
doing (entryUUID=x) searches on each element. The overhead of doing X
individual searches is worse than doing one global search though.
Post by Emmanuel Lécharny
Post by Howard Chu
Post by Emmanuel Lécharny
Another point: as soon as the server is restarted, it can receive
incoming requests, and it will send back outdated responses until the
refresh is completed (and I'm not talking about updates that could also
be applied on an outdated base, with the consequences that follow if some
parents are missing). In many cases, that would be a real problem, typically
if the LDAP servers are part of a shared pool, with a load-balancing
mechanism to spread the load. Wouldn't it be more realistic to simply
consider the server as not available until the refresh phase is completed?
This was ITS#7616. We tried it and it caused a lot of problems. It has
been reverted.
The two options were to either send a referral (not ideal, as we have no
control whatsoever over the client API) or return LDAP_BUSY. A third option
would be possible: chaining the request to the server the replication
updates are coming from. Doing so would guarantee that the client gets an
updated version of the data, as the producer is up to date. There is still
an issue though if both servers are replicating each other (pretty much the
same problem as with referrals). OTOH, if the other server is also in
refresh mode, it should be possible to return LDAP_BUSY if it is capable of
detecting that the request comes from another server, not from a client.
Maybe it's far-fetched...
In practice, two MMR servers pointed at each other would never make progress.
--
-- Howard Chu
CTO, Symas Corp. http://www.symas.com
Director, Highland Sun http://highlandsun.com/hyc/
Chief Architect, OpenLDAP http://www.openldap.org/project/
Emmanuel Lécharny
2015-05-12 13:48:16 UTC
Permalink
Content preview: On 12/05/15 14:34, Howard Chu wrote: > Emmanuel Lécharny
wrote: >> On 11/05/15 22:17, Howard Chu wrote: >>> There are two main
problems: >>> 1) the AVL tree used for presentlist is still extremely
inefficient >>> in both CPU and memory use. >>> 2) the consumer does twice
as much work for a single modification as >>> the provider. I.e., the
consumer does a write op to the backend for >>> the modification, and then
a second write op to update its contextCSN. >> Updating the contextCSN is
an extra operation on the consumer, but as >> you have to update
potentially tens of indexes when updating an entry >> (on both the consumer
and the producer), it's not really twice the work. >> It's an additional
operation, but it would not double the time >> it costs compared to the
producer. > You're forgetting one very important thing - each operation is
a > single transaction in the backend, and transactions are synchronous by
> default. The main cost is not the indexing, it's the txn fsync, and >
yes, it is twice the cost when you're doing two txns instead of just one. [...]

Content analysis details: (-2.7 points, 5.0 required)

pts rule name description
---- ---------------------- --------------------------------------------------
-0.7 RCVD_IN_DNSWL_LOW RBL: Sender listed at http://www.dnswl.org/, low
trust
[74.125.82.47 listed in list.dnswl.org]
0.0 FREEMAIL_FROM Sender email is commonly abused enduser mail provider
(elecharny[at]gmail.com)
-0.0 SPF_PASS SPF: sender matches SPF record
-1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1%
[score: 0.0000]
-0.1 DKIM_VALID_AU Message has a valid DKIM or DK signature from author's
domain
0.1 DKIM_SIGNED Message has a DKIM or DK signature, not necessarily valid
-0.1 DKIM_VALID Message has at least one valid DKIM or DK signature
Post by Howard Chu
Post by Emmanuel Lécharny
1) the AVL tree used for presentlist is still extremely inefficient
in both CPU and memory use.
2) the consumer does twice as much work for a single modification as
the provider. I.e., the consumer does a write op to the backend for
the modification, and then a second write op to update its contextCSN.
Updating the contextCSN is an extra operation on the consumer, but as
you have to update potentially tens of indexes when updating an entry
(on both the consumer and the producer), it's not really twice the
work. It's an additional operation, but it would not double the time
it costs compared to the producer.
You're forgetting one very important thing - each operation is a
single transaction in the backend, and transactions are synchronous by
default. The main cost is not the indexing, it's the txn fsync, and
yes, it is twice the cost when you're doing two txns instead of just one.
Good point.
Post by Howard Chu
Post by Emmanuel Lécharny
The question would be: how do we update the contextCSN only
periodically, to mitigate this extra cost? It seems you proposed to
batch the updates for this reason. By using batches of 500 updates, this
extra cost becomes almost unnoticeable, and one would expect the work on
the consumer to be about the same as on the producer side, right?
Yes.
Post by Emmanuel Lécharny
The provider only does the original modification, and caches the
contextCSN update.
If we fix both of these issues, consumer speed should be much faster.
Nothing else is worth investigating until these two areas are reworked.
Agreed in most cases. Although for use cases where a large number
of updates have occurred while a consumer is offline, another
strategy might work. That this other strategy is to stop the consumer,
slapcat the producer, slapadd the result and restart the server, all
from the command line instead of having it implemented in the server
code, was what I was suggesting; but this is another story, for a corner
case that is not frequent. Plus we don't know at which point this would
be the correct strategy (ie, for how many updates should we consider it
a better strategy than the current implementation?).
If the consumer has been offline for a long time, then this discussion
is moot. No clients will be looking at it, so the risk of serving
out-of-date information to clients is zero. In that case, it doesn't
matter what strategy you use, they'll all work.
Another good point. It would require a user that sends a hell of a lot of
updates while no client has any activity, plus a disconnected consumer
- all three conditions at the same time - to face my scenario. Quite
rare. I had in mind the user who updates his database with
millions of updates once in a while (say, once a year), during the night,
and who finds that the consumer is not up and running in the morning.

Not sure then that it's worth the effort to find a way to mitigate such a
corner case.
Post by Howard Chu
Post by Emmanuel Lécharny
For (1) I've been considering a stripped down memory-only version of
LMDB. There are plenty of existing memory-only Btree implementations
out there already though, if anyone has a favorite it would probably
save us some time to use an existing library. The Linux kernel has one
(lib/btree.c) but it's under GPL so we can't use it directly.
Q: do you need to keep the presentList in a BTree at all?
Good question. We process it by doing a single search over the target
range, and removing presentlist entries for each entry returned by the
search. Since the search order is random, we want fast search access
to the presentlist.
We could alternatively do a dynamic array and walk the presentlist in
order, doing (entryUUID=x) searches on each element. The overhead of
doing X individual searches is worse than doing one global search though.
If the goal is to find all the entries that are not present in the DB,
wouldn't it be faster to simply quicksort the entryUUIDs we have
received? Both algorithms (AVL insertion and quicksort) are O(n log n)
- setting aside the possibility that quicksort degenerates to O(n^2),
of course - but quicksort is faster than an AVL tree when it comes to
ordering a set of values.
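
For illustration, a minimal C sketch of that sorted-array alternative
(illustrative only, not slapd's presentlist code): qsort() the received
entryUUIDs once, bsearch() for each entry returned by the refresh search,
and whatever is left unmarked at the end was not present and must be
deleted:

#include <stdlib.h>
#include <string.h>

#define UUID_LEN 16

struct present {
    unsigned char uuid[UUID_LEN];  /* key: must stay the first member */
    int seen;
};

static int cmp_uuid(const void *a, const void *b)
{
    return memcmp(a, b, UUID_LEN); /* compares only the leading uuid field */
}

/* O(n log n), done once when the presentlist is received */
static void sort_presentlist(struct present *list, size_t n)
{
    qsort(list, n, sizeof(*list), cmp_uuid);
}

/* O(log n) per entry returned by the refresh search */
static void mark_present(struct present *list, size_t n,
                         const unsigned char *uuid)
{
    struct present key;
    memset(&key, 0, sizeof(key));
    memcpy(key.uuid, uuid, UUID_LEN);
    struct present *hit = bsearch(&key, list, n, sizeof(*list), cmp_uuid);
    if (hit)
        hit->seen = 1;
}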
