GNU bug report logs - #22209
name resolution failures in installer image

Previous Next

Package: guix;

Reported by: Florian Paul Schmidt <mista.tapas <at> gmx.net>

Date: Sat, 19 Dec 2015 08:05:01 UTC

Severity: serious

Done: ludo <at> gnu.org (Ludovic Courtès)

Bug is archived. No further changes may be made.

To add a comment to this bug, you must first unarchive it, by sending
a message to control AT debbugs.gnu.org, with unarchive 22209 in the body.
You can then email your comments to 22209 AT debbugs.gnu.org in the normal way.

Toggle the display of automated, internal messages from the tracker.

View this report as an mbox folder, status mbox, maintainer mbox


Report forwarded to bug-guix <at> gnu.org:
bug#22209; Package guix. (Sat, 19 Dec 2015 08:05:01 GMT) Full text and rfc822 format available.

Acknowledgement sent to Florian Paul Schmidt <mista.tapas <at> gmx.net>:
New bug report received and forwarded. Copy sent to bug-guix <at> gnu.org. (Sat, 19 Dec 2015 08:05:01 GMT) Full text and rfc822 format available.

Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):

From: Florian Paul Schmidt <mista.tapas <at> gmx.net>
To: bug-guix <at> gnu.org
Subject: name resolution failures in installer image
Date: Sat, 19 Dec 2015 09:04:41 +0100
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256


Hi,

this occurs over and over to me and I saw other people's reports on
IRC, too:

After booting the installer image, doing the ifconfig/dhclient dance,
preparing the target fs, mounting it and starting the guix system init
invocation at some point in time, though not predictably so, but
usually after a couple of minutes (about 10 or so) name resolution
will start to fail causing guix system init to fail.

killall dhclient; dhclient <your_interface_name_here>

and a subsequent invocation to guix system init will make the
installation continue and usually succeed.

On the first few occasions it seemed to me that my local network could
be at fault, but it's _almost_ reliably reproducable in qemu,
virtualbox and even on bare metal and is not limited to myself (check
the channel logs with some good search queries).

So it might be worth looking into.

The installed system does not have this problem after a reboot.

Regards,
Flo

- -- 
https://fps.io
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2

iQEcBAEBCAAGBQJWdQ+YAAoJEA5f4Coltk8ZgdoIAJ9rDLDhTztcfs4fHWBF+mcf
U6SYAX36nc7R+TiT7FCWQVxcp38bG5LrTInmP4C/ojyPHEYZcMQLQg2CoT4/q6Ys
kjMUKvBgFqMsjeZzvo9oXP1VDWygC9YeCg55BGPzpEW5x3oQHcVH0zw8acNrlZ6A
I9sVoQnaEDKhsNECBwPDMNujqjik6TilvddEl0AS7HYCbyNgPy+dR97G5z+gRt8w
Wrio0ua5QeyAW5kFCLX7LdDa44OBPoiAlLyWa9dXgFUOORgh88z/dtCPbV0vRNJc
FKpVmAzcovximrPGgDXs9DzERaSkROM05GGRb/uYFfs5GdoGja5YwjXcoWaHUzo=
=9+XX
-----END PGP SIGNATURE-----




Information forwarded to bug-guix <at> gnu.org:
bug#22209; Package guix. (Sat, 19 Dec 2015 08:46:01 GMT) Full text and rfc822 format available.

Message #8 received at 22209 <at> debbugs.gnu.org (full text, mbox):

From: Leo Famulari <leo <at> famulari.name>
To: Florian Paul Schmidt <mista.tapas <at> gmx.net>
Cc: 22209 <at> debbugs.gnu.org
Subject: Re: bug#22209: name resolution failures in installer image
Date: Sat, 19 Dec 2015 03:45:06 -0500
On Sat, Dec 19, 2015 at 09:04:41AM +0100, Florian Paul Schmidt wrote:
> After booting the installer image, doing the ifconfig/dhclient dance,
> preparing the target fs, mounting it and starting the guix system init
> invocation at some point in time, though not predictably so, but
> usually after a couple of minutes (about 10 or so) name resolution
> will start to fail causing guix system init to fail.

A little more information:

I was installing on i686 bare metal from the current USB installation
image.

I installed vim in the installation environment, and then when I
tried to run `guix system init...` I couldn't resolve hydra.gnu.org,
google.com, and a few other sites. I could resolve the domain
famulari.name, but not another domain that points to the same machine.

I installed dig and could successfully query the nameserver described in
/etc/resolv.conf for hydra.gnu.org.

I never have name resolution problems on this network with my other
machines.

I decided to edit /etc/hosts so I could continue with the installation.
If anyone has any ideas about how to debug this, I'm happy to help.




Information forwarded to bug-guix <at> gnu.org:
bug#22209; Package guix. (Sat, 19 Dec 2015 10:26:01 GMT) Full text and rfc822 format available.

Message #11 received at 22209 <at> debbugs.gnu.org (full text, mbox):

From: Ricardo Wurmus <rekado <at> elephly.net>
To: Leo Famulari <leo <at> famulari.name>
Cc: 22209 <at> debbugs.gnu.org, Florian Paul Schmidt <mista.tapas <at> gmx.net>
Subject: Re: bug#22209: name resolution failures in installer image
Date: Sat, 19 Dec 2015 11:25:24 +0100
Leo Famulari <leo <at> famulari.name> writes:

> On Sat, Dec 19, 2015 at 09:04:41AM +0100, Florian Paul Schmidt wrote:
>> After booting the installer image, doing the ifconfig/dhclient dance,
>> preparing the target fs, mounting it and starting the guix system init
>> invocation at some point in time, though not predictably so, but
>> usually after a couple of minutes (about 10 or so) name resolution
>> will start to fail causing guix system init to fail.
>
> A little more information:
>
> I was installing on i686 bare metal from the current USB installation
> image.
>
> I installed vim in the installation environment, and then when I
> tried to run `guix system init...` I couldn't resolve hydra.gnu.org,
> google.com, and a few other sites. I could resolve the domain
> famulari.name, but not another domain that points to the same machine.
>
> I installed dig and could successfully query the nameserver described in
> /etc/resolv.conf for hydra.gnu.org.

Could this be related to cached resolution failures in nscd?  Have you
tried restarting the nscd service?

~~ Ricardo





Information forwarded to bug-guix <at> gnu.org:
bug#22209; Package guix. (Sat, 19 Dec 2015 18:10:01 GMT) Full text and rfc822 format available.

Message #14 received at 22209 <at> debbugs.gnu.org (full text, mbox):

From: ludo <at> gnu.org (Ludovic Courtès)
To: Ricardo Wurmus <rekado <at> elephly.net>
Cc: 22209 <at> debbugs.gnu.org, Florian Paul Schmidt <mista.tapas <at> gmx.net>,
 Leo Famulari <leo <at> famulari.name>
Subject: Re: bug#22209: name resolution failures in installer image
Date: Sat, 19 Dec 2015 19:09:32 +0100
Ricardo Wurmus <rekado <at> elephly.net> skribis:

> Leo Famulari <leo <at> famulari.name> writes:
>
>> On Sat, Dec 19, 2015 at 09:04:41AM +0100, Florian Paul Schmidt wrote:
>>> After booting the installer image, doing the ifconfig/dhclient dance,
>>> preparing the target fs, mounting it and starting the guix system init
>>> invocation at some point in time, though not predictably so, but
>>> usually after a couple of minutes (about 10 or so) name resolution
>>> will start to fail causing guix system init to fail.
>>
>> A little more information:
>>
>> I was installing on i686 bare metal from the current USB installation
>> image.
>>
>> I installed vim in the installation environment, and then when I
>> tried to run `guix system init...` I couldn't resolve hydra.gnu.org,
>> google.com, and a few other sites. I could resolve the domain
>> famulari.name, but not another domain that points to the same machine.
>>
>> I installed dig and could successfully query the nameserver described in
>> /etc/resolv.conf for hydra.gnu.org.
>
> Could this be related to cached resolution failures in nscd?  Have you
> tried restarting the nscd service?

That sounds likely.

Nscd caches negative lookups, but normally only for 20 seconds (see
%nscd-minimal-caches in (gnu system install)).  So if you look up
hydra.gnu.org before the network is set up, it’ll remember for 20s that
this is a lookup failure, even if you’ve set up the network in the
meantime.

However, nscd is reportedly behaving strangely:
<http://bugs.gnu.org/20816>.

Do you initially get successful lookups for hydra.gnu.org, and
eventually lookup failures?

Ludo’.




Information forwarded to bug-guix <at> gnu.org:
bug#22209; Package guix. (Sun, 20 Dec 2015 10:13:02 GMT) Full text and rfc822 format available.

Message #17 received at 22209 <at> debbugs.gnu.org (full text, mbox):

From: Florian Paul Schmidt <mista.tapas <at> gmx.net>
To: Ludovic Courtès <ludo <at> gnu.org>,
 Ricardo Wurmus <rekado <at> elephly.net>
Cc: 22209 <at> debbugs.gnu.org, Leo Famulari <leo <at> famulari.name>
Subject: Re: bug#22209: name resolution failures in installer image
Date: Sun, 20 Dec 2015 11:12:32 +0100
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA256

On 19.12.2015 19:09, Ludovic Courtès wrote:

> However, nscd is reportedly behaving strangely: 
> <http://bugs.gnu.org/20816>.
> 
> Do you initially get successful lookups for hydra.gnu.org, and 
> eventually lookup failures?

Yes, that's at least in my case precisely what happens. It works for
the first few packages during system init, and then at some point in
time fails.

I'll try to reproduce and then restart the nscd to see if it helps
(when I find time to do it)..

Flo


- -- 
https://fps.io
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2

iQEcBAEBCAAGBQJWdn8QAAoJEA5f4Coltk8ZrSIH/3YKmBO2ZDCqxxZGWYJDRgQc
Q74OFkSrw56ov29uSU+Xs63njem19PLxyECLoICex2tOMOuELpowKt0ee8hS/7PT
eYyRlMiUfIk4WC2qpFGQtfURDOmsAiWCGP2LzBO1RqhgU5CtgnChH5Z7ZVmlNloy
EqVKZ0tpquagf1k6FkMsk65fY6gPa2D+2Ova1SkwtEiIRwMDMNkguadHXu9LShKu
9fSOnP7BXwAZfPC6hPyrB8tjUWtr8PUlzYNsLsW6PTqtRNry5QSSa2I4wsx9Gjse
HQ3NeRjp3RPvOx/LwP5+nGaTsUN8R8aPSX3eMumpBfzrImLqANExhcOvkFtkL6U=
=R54m
-----END PGP SIGNATURE-----




Severity set to 'serious' from 'normal' Request was from ludo <at> gnu.org (Ludovic Courtès) to control <at> debbugs.gnu.org. (Sat, 16 Jan 2016 13:49:02 GMT) Full text and rfc822 format available.

Information forwarded to bug-guix <at> gnu.org:
bug#22209; Package guix. (Wed, 10 Feb 2016 20:55:01 GMT) Full text and rfc822 format available.

Message #22 received at 22209 <at> debbugs.gnu.org (full text, mbox):

From: ludo <at> gnu.org (Ludovic Courtès)
To: Florian Paul Schmidt <mista.tapas <at> gmx.net>,
 Leo Famulari <leo <at> famulari.name>
Cc: 22209 <at> debbugs.gnu.org, Ricardo Wurmus <rekado <at> elephly.net>
Subject: Re: bug#22209: name resolution failures in installer image
Date: Wed, 10 Feb 2016 21:54:40 +0100
ludo <at> gnu.org (Ludovic Courtès) skribis:

> Ricardo Wurmus <rekado <at> elephly.net> skribis:

[...]

>> Could this be related to cached resolution failures in nscd?  Have you
>> tried restarting the nscd service?
>
> That sounds likely.
>
> Nscd caches negative lookups, but normally only for 20 seconds (see
> %nscd-minimal-caches in (gnu system install)).  So if you look up
> hydra.gnu.org before the network is set up, it’ll remember for 20s that
> this is a lookup failure, even if you’ve set up the network in the
> meantime.

Commit c96ba2c disables caching of lookup failures altogether.

Could you try again to reproduce it?  (I think Florian wrote in the
initial bug report that this was reproducible in QEMU, so using
‘guix system vm gnu/system/install.scm’ might suffice.)

Alternately we could remove nscd altogether, but I think it’s still
somewhat useful to cache successful lookups.

TIA,
Ludo’.




Reply sent to ludo <at> gnu.org (Ludovic Courtès):
You have taken responsibility. (Tue, 03 May 2016 20:28:02 GMT) Full text and rfc822 format available.

Notification sent to Florian Paul Schmidt <mista.tapas <at> gmx.net>:
bug acknowledged by developer. (Tue, 03 May 2016 20:28:03 GMT) Full text and rfc822 format available.

Message #27 received at 22209-done <at> debbugs.gnu.org (full text, mbox):

From: ludo <at> gnu.org (Ludovic Courtès)
To: Florian Paul Schmidt <mista.tapas <at> gmx.net>
Cc: 22209-done <at> debbugs.gnu.org, Ricardo Wurmus <rekado <at> elephly.net>,
 Leo Famulari <leo <at> famulari.name>
Subject: Re: bug#22209: name resolution failures in installer image
Date: Tue, 03 May 2016 22:26:31 +0200
ludo <at> gnu.org (Ludovic Courtès) skribis:

> ludo <at> gnu.org (Ludovic Courtès) skribis:
>
>> Ricardo Wurmus <rekado <at> elephly.net> skribis:
>
> [...]
>
>>> Could this be related to cached resolution failures in nscd?  Have you
>>> tried restarting the nscd service?
>>
>> That sounds likely.
>>
>> Nscd caches negative lookups, but normally only for 20 seconds (see
>> %nscd-minimal-caches in (gnu system install)).  So if you look up
>> hydra.gnu.org before the network is set up, it’ll remember for 20s that
>> this is a lookup failure, even if you’ve set up the network in the
>> meantime.
>
> Commit c96ba2c disables caching of lookup failures altogether.

0.10.0 uses this fix and we haven’t heard from name resolution failures
in the installer image, so I’m closing it.

Ludo’.




bug archived. Request was from Debbugs Internal Request <help-debbugs <at> gnu.org> to internal_control <at> debbugs.gnu.org. (Wed, 01 Jun 2016 11:24:04 GMT) Full text and rfc822 format available.

This bug report was last modified 7 years and 323 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.