emo/ndhc - ndhc - Project Segfault Git

emo/ndhc

Author	SHA1	Message	Date
Nicholas J. Kain	f3766990f9	Speed up interface carrier checking. This is done by performing one synchronous query for carrier state at the start of the program; after that, we just monitor the nlsocket for carrier state changes and update the cached state accordingly. The benefit is that ifchd needs to do a lot less work and this should reduce the CPU cycle consumption; prior to this commit, the CPU time ends up being a few CPU-minutes per month.	2021-04-25 05:55:01 -04:00
Nicholas J. Kain	918555219b	Make fingerprinting more robust with rapid carrier state changes. The existing code tracked the fingerprinting completion state with a bool, which is insufficient; what is required is a tristate (none\|inprogress\|done) where the inprogress state reverts to none on carrier down, but done state stays done on carrier down.	2020-10-27 14:14:18 -04:00
Nicholas J. Kain	4575f74164	Remove legacy support for exiting after obtaining a DHCP lease.	2020-10-20 06:55:04 -04:00
Nicholas J. Kain	ade4e988af	Remove legacy support for forking to background.	2020-10-20 06:55:04 -04:00
Nicholas J. Kain	58067200d6	Remove legacy support for writing a pidfile.	2020-10-20 06:55:04 -04:00
Nicholas J. Kain	4d33c00e04	Use poll() instead of epoll() for ndhc-master.	2020-10-20 05:58:29 -04:00
Nicholas J. Kain	06a541261e	Stop using signalfd and audit signal handling code. There's really no advantage to using signalfd in ndhc, particularly since the normal POSIX signal API is now used for handling SIGCHLD in ndhc-master. So just use the tried and true volatile sig_atomic_t set and check approach. The only intended behavior change is in the dhcp RELEASE state -- before there would be a spurious attempt at renewing a nonexistent lease when the RENEW signal was received.	2020-10-20 04:42:58 -04:00
Nicholas J. Kain	32bc422d0e	Add a heuristic to detect when server ignores dhcp renews. If we get no response to three renews (unicast), switch to sending rebinds (broadcast). Servers are supposed to always reply with a DHCPACK or DHCPNAK even if the server doesn't update its internal lease duration database, so this behavior should be RFC compliant.	2020-10-19 07:03:03 -04:00
Nicholas J. Kain	f4365897bc	Make renew and rebinding directly track whether DHCPREQUEST was sent. Before it was inferred by examining timeouts. Also, simplify the associated timeout code so that there are no longer effectively two redundant paths.	2020-10-19 06:36:26 -04:00
Nicholas J. Kain	9b8c63d998	Give up on fingerprinting router/gateway if it doesn't reply.	2020-10-19 04:28:06 -04:00
Nicholas J. Kain	7bd551d564	Give up on fingerprinting relay agent/server if it doesn't reply. Try to send/wait three times; then if there's no response, then assume that the relay agent is ignoring or firewalled from receiving ARP requests.	2020-10-19 04:28:06 -04:00
Nicholas J. Kain	05a075aeb2	Replace '(c)' with 'Copyright'. '(c)' may not be a valid substitute for 'Copyright' in some legal domains/interpretations. So be safe, since I obviously am asserting copyright on my legal work.	2018-10-26 07:11:16 -04:00
Nicholas J. Kain	8983df3c86	Update copyright dates.	2018-02-18 08:25:10 -05:00
Nicholas J. Kain	e8d97205e9	Compile cleanly with -Wsign-conversion. I didn't notice anything that worried me.	2018-02-09 03:16:59 -05:00
Nicholas J. Kain	759b6bd831	Update to the new ncmlib random API.	2017-08-24 02:36:31 -04:00
Nicholas J. Kain	ed44a90114	state: Faster recovery when carrier lost during DHCP init. If carrier is lost before network fingerprinting is complete, we have a few problems; first, we don't know whether the network has changed underneath us. Second, we've not yet configured the interface properties, and it is not unlikely that doing so will fail as the underlying network device may have been destroyed and recreated during this time (eg, if ethtool has been run at start-up time). Thus, the safest reaction is to terminate and force a supervisor respawn. It is best to do this once carrier recovers, not when the carrier is lost, as it is more likely to minimize delays.	2017-04-10 10:07:34 -04:00
Nicholas J. Kain	34a8cd7ad9	arp: Handle initial announcement asynchronously and retry failures. We need to send two ARP announcements, so these are now done via a timeout callback so that failures can be handled properly.	2017-02-24 06:57:10 -05:00
Nicholas J. Kain	1c2a39c544	Reorder client_config_t members to reduce structure padding.	2017-01-19 05:20:29 -05:00
Nicholas J. Kain	4fdde404aa	Remove unused client_config_t foreground variable.	2017-01-19 05:18:04 -05:00
Nicholas J. Kain	c38fd2be9b	Convert logical booleans in client_config_t to bool type.	2017-01-19 05:13:30 -05:00
Nicholas J. Kain	571b22c4b2	Rename client_state_t init variable to program_init. Easier to grep. No functional change.	2017-01-19 05:05:35 -05:00
Nicholas J. Kain	931530786b	Convert logically boolean client_state_t variables from uint8_t to bool.	2017-01-19 05:01:23 -05:00
Nicholas J. Kain	b8ee0bd5c2	Update copyright dates to 2017.	2017-01-13 20:15:27 -05:00
Nicholas J. Kain	29498f5341	Remove ifsPrevState and set non-infinite timeout on a send error. We instead check carrier status as needed. This approach is more robust. For a simple example, imagine link state changes that happen while the machine is suspended.	2017-01-13 20:15:27 -05:00
Nicholas J. Kain	04ec7c8f4b	Update to latest write_pid semantics and don't write pidfile by default. There was no way to disable writing pidfiles before. pidfiles are an unreliable method of tracking processes, anyway; process supervisors are strongly recommended. If a pidfile is really needed, it can be explicitly specified.	2016-05-06 15:00:31 -04:00
Nicholas J. Kain	12d8af4c67	Convert remaining ull time-types to ll. Thus signed/unsigned conversion issues can't be a problem. The extra range provided by ull isn't useful, either, since we're dealing with uint32_t time_t seconds converted to ns, which doesn't come close to exhausting the amount of post-epoch ns that will be consumed until after 2100AD.	2015-05-27 23:40:28 -04:00
Nicholas J. Kain	e02d30dcc5	renewTime, rebindTime should be unsigned long long. While we're at it, rearrange the client state struct to save some space lost to padding.	2015-05-27 13:11:23 -04:00
Nicholas J. Kain	99ce918a31	Use a coroutine instead of several callback state machines. This change makes it much easier to reason about ndhc's behavior and properly handle errors. It is a very large changeset, but there is no way to make this sort of change incrementally. Lease acquisition is tested to work. It is highly likely that some bugs were both introduced and squashed here. Some obvious code cleanups will quickly follow.	2015-02-18 05:31:13 -05:00
Nicholas J. Kain	44175bd77c	Make ifch requests synchronous just like sockd requests. This change paves the way for allowing ifch to notify the core ndhc about failures. It would be far too difficult to reason about the state machine if the requests to ifch were asynchronous. Currently ndhc assumes that ifch requests never fail, but this is not always true because of eg, rfkill.	2015-02-14 16:49:50 -05:00
Nicholas J. Kain	702d8b0c5b	Mark pointer arguments that cannot ever be null as [static 1]. Also constify some cases, too.	2015-02-13 23:14:08 -05:00
Nicholas J. Kain	911d4cc58e	Fix the dhcp state bootstrapping when rfkill is set #3 .	2015-02-13 19:08:50 -05:00
Nicholas J. Kain	2e679ed491	Fix the dhcp state bootstrapping when rfkill is set #2 .	2015-02-13 18:35:44 -05:00
Nicholas J. Kain	5b050ba498	If the rfkill switch is enabled, do not react to netlink notifications until the rfkill is disabled. They will mostly fail outright.	2015-02-13 17:20:36 -05:00
Nicholas J. Kain	e3d4d4c1aa	rfkill: Add support for reacting to radio kill switch events. In order for this to work, the correct rfkill index must be specified with the rfkill-idx option. It might be possible to auto-detect the corresponding rfkill-idx option, but I'm not sure if there's a guaranteed mapping between rfkill name and interface name, as it seems that rfkills should represent phy devices and not wlan devices. The rfkill indexes can be found by checking /sys/class/rfkill/rfkill<IDX>.	2015-02-13 16:25:36 -05:00
Nicholas J. Kain	c58a071f52	Update copyright dates.	2015-02-13 01:54:57 -05:00
Nicholas J. Kain	27c9e2c553	Improve fingerprinting to support DHCP relay agents. Mostly reverts the previous commit and instead teaches ndhc to properly handle the case when it is communicating with a DHCP relay agent on its local segment rather than directly with a DHCP server.	2015-02-12 23:28:54 -05:00
Nicholas J. Kain	a395234a67	Support networks with relay agents that have the DHCP server on a different segment. The network fingerprinting would never complete if the DHCP server was on a different segment before this change, since it would be impossible for the ARP messages sent by ndhc to ever reach the DHCP server (and vice-versa). Now just give up trying to find the hardware address after two tries and assume that the DHCP server cannot be reached by ARP. An alternative would be to fingerprint the relay agent instead, but to do so would require a lot more work as the giaddr field is only meaningful in the client->server message path, not in the server->client path. Thus it would require gathering the source IP for DHCP replies sent by unicast or broadcast and ferrying along this information to the ARP checking code where it would be used in place of the DHCP server address. This is entirely possible to do, but is quite a bit more work.	2015-02-12 20:49:40 -05:00
Nicholas J. Kain	07cbd88049	Just use raw sockets for listening to DHCP requests. A UDP SO_BROADCAST socket was previously used only for receiving RENEWING packets, and it added needless complexity and was somewhat fragile.	2014-04-16 01:00:36 -04:00
Nicholas J. Kain	0884d96d1e	PR_SET_PDEATHSIG is not fully reliable, so instead maintain a pair of AF_UNIX SOCK_STREAM sockets between the master processes and each subprocess, and poll for the HUP event. At the same time, be specific about the events that are checked in epoll when dispatching on an event.	2014-04-15 23:19:24 -04:00
Nicholas J. Kain	a501789e04	Parse config options with ragel and support a configuration file.	2014-04-14 15:06:31 -04:00
Nicholas J. Kain	5fa2030bab	Use a socketpair rather than a pair of pipes for communication between ndhc and ifch, similar to sockd. A single pipe is also maintained so that SIGPIPE can bound the lifetime of an orphaned ifch process.	2014-04-07 03:44:02 -04:00
Nicholas J. Kain	b761889025	Move source from ndhc/ to src/ since ifchd is no longer a separate program.	2014-04-06 16:57:06 -04:00

42 Commits