[SAC] [OSGeo] #1982: osgeo6 disk full: 100% /var
OSGeo
trac_osgeo at osgeo.org
Mon Aug 14 14:24:21 PDT 2017
#1982: osgeo6 disk full: 100% /var
---------------------------+--------------------
Reporter: neteler | Owner: sac@…
Type: defect | Status: new
Priority: blocker | Milestone:
Component: Systems Admin | Resolution:
Keywords: osgeo6 |
---------------------------+--------------------
Comment (by neteler):
Replying to [comment:3 wildintellect]:
> So clearly the log files contribute, but I'm not sure that's the root
cause. I think the mailman was starting to throw errors and that cascaded
it filling it's own logs, which then caused more errors. Anyone know if
there's a way to limit how big a mailman log will get before it cycles,
rather than waiting for logrotate?
AFAIK it is postfix, not mailman writing those two log files.
> Can someone try to extract the head of the logs to figure out what
caused the initial issue?
Most messages are like this:
{{{
Aug 14 14:01:15 osgeo6 postfix/smtp[9601]: 3FFB36332EB7: host
aspmx.l.google.com[74.125.28.26] said: 450-4.2.1 The user you are trying
to contact is receiving mail at a rate that 450-4.2.1 prevents additional
messages from being delivered. Please resend your 450-4.2.1 message at a
later time. If the user is able to receive mail at that 450-4.2.1 time,
your message will be delivered. For more information, please 450-4.2.1
visit 450 4.2.1 https://support.google.com/mail/?p=ReceivingRate
d11si5262351pln.414 - gsmtp (in reply to RCPT TO command)
}}}
Apparently we are hammering some other servers too much. At time it is a
kind of endless loop with those servers (i.e. for some recipients).
For now, I have added some postfix throttling:
{{{
/etc/postfix/main.cf
# throttle, see https://trac.osgeo.org/osgeo/ticket/1982
smtp_destination_concurrency_limit = 2
smtp_destination_rate_delay = 1s
smtp_extra_recipient_limit = 10
}}}
Maybe that will help to calm down google and dtag mail servers. We can
comment out these lines once the mail queue is closer to empty. At time:
{{{
mailq | grep mailman-bounces at lists.osgeo.org | wc -l
111064
}}}
which is 30,000 less than 11hs ago.
> I'll check on the munin email setup, I'm not sure what it's set to do
when it hits a limit.
ok thx
--
Ticket URL: <https://trac.osgeo.org/osgeo/ticket/1982#comment:4>
OSGeo <http://www.osgeo.org/>
OSGeo committee and general foundation issue tracker.
More information about the Sac
mailing list