Puneet Varma (Editor)

Getaddrinfo

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit

The functions getaddrinfo() and getnameinfo() convert domain names, hostnames, and IP addresses between human-readable text representations and structured binary formats for the operating system's networking API. Both functions are contained in the POSIX standard application programming interface (API).

Contents

getaddrinfo and getnameinfo are inverse functions of each other. They are network protocol agnostic, and support both IPv4 and IPv6. It is the recommended interface for name resolution in building protocol independent applications and for transitioning legacy IPv4 code to the IPv6 Internet.

Internally, the functions perform resolutions using the Domain Name System (DNS) by calling other, lower level functions, such as gethostbyname().

On February 16 2016 a security bug was announced in the glibc implementation of getaddrinfo(), using a buffer overflow technique, that may allow execution of arbitrary code by the attacker.

struct addrinfo

The C data structure used to represent addresses and hostnames within the networking API is the following:

In some operating systems the type of ai_addrlen was changed from size_t to socklen_t. Most socket functions, such as accept and getpeername, require the parameter to have type socklen_t* and programmers often pass the address to the ai_addrlen element of the addrinfo structure. If the types are incompatible, e.g., on a big-endian 64-bit Solaris 9 system where size_t is 8 bytes and socklen_t is 4 bytes, then run-time errors may result.

The structure contains structures ai_family and sockaddr with its own sa_family field. These are set to the same value when the structure is created with function getaddrinfo in some implementations.

getaddrinfo()

getaddrinfo() converts human-readable text strings representing hostnames or IP addresses into a dynamically allocated linked list of struct addrinfo structures. The function prototypes for these functions are specified as follows:

  • hostname can be either a domain name, such as "example.com", an address string, such as "127.0.0.1", or NULL, in which case the address 0.0.0.0 or 127.0.0.1 is assigned depending on the hints flags.
  • service can be a port number passed as string, such as "80", or a service name, e.g. "echo". In the latter case, gethostbyname() is used to query the file /etc/services to resolve the service to a port number.
  • hints can be either NULL or an addrinfo structure with the type of service requested.
  • res is a pointer that points to a new addrinfo structure with the information requested after successful completion of the function.
  • The function returns 0 upon success and negative if it fails.

    Although implementations vary among platforms, the function first attempts to obtain a port number usually by branching on service. If the string value is a number, it converts it to an integer and calls htons(). If it is a service name, such as www, the service is lookup with getservbyname(), using the protocol derived from hints->ai_socktype as the second parameter to that function. Then, if hostname is given (not NULL), a call to gethostbyname() resolves it, or otherwise the address 0.0.0.0 is used, if hints->ai_flags is set to AI_PASSIVE, and 127.0.0.1 otherwise. It calls malloc_ai in one of these conditions and passes the port retrieved at the beginning to allocate an addrinfo structure filled with the appropriate sockaddr_in. Finally, the **res parameter is dereferenced to make it point to a newly allocated addrinfo structure. In some implementations, such as the Unix version for Mac OS, the hints->ai_protocol overrides the hints->ai_socktype value while in others it is the opposite, so both need to be defined with equivalent values for the code to be work across multiple platforms.

    getnameinfo()

    getnameinfo() converts the internal binary representation of an IP address in the form of a struct sockaddr pointer into text strings consisting of the hostname or, if the address cannot be resolved into a name, a textual IP address representation, as well as the service port name or number. The function prototype is specified as follows:

    freeaddrinfo()

    This function frees the memory allocated by the getaddrinfo() function. As the result of the latter is a link list of addrinfo structures, freeaddrinfo() loops through the list and free each one it turn

  • ai is the head of the addrinfo list
  • Example

    The following example uses getaddrinfo() to resolve the domain name www.example.com into its list of addresses and then calls getnameinfo() on each result to return the canonical name for the address. In general, this produces the original hostname, unless the particular address has multiple names, in which case the canonical name is returned. In this example, the domain name is printed three times, once for each of the three results obtained.

    References

    Getaddrinfo Wikipedia