Nagios is a popular open-source monitoring system. It keeps an inventory of your servers and monitors them so you know your critical services are up and running. Using a monitoring system like Nagios is an essential tool for any production environment, because by monitoring uptime, CPU usage, or disk space, you can head off problems before they occur, or before your users call you.
In this tutorial, you’ll install Nagios 4 and configure it so you can monitor host resources via Nagios’ web interface. You’ll also set up the Nagios Remote Plugin Executor (NRPE), which runs as an agent on remote hosts so you can monitor their resources.
To complete this tutorial, you will need the following:
This tutorial assumes that your servers have private networking enabled so that monitoring happens on the private network rather than the public network. If you don’t have private networking enabled, you can still follow this tutorial by replacing all the references to private IP addresses with public IP addresses.
There are multiple ways to install Nagios, but we’ll install Nagios and its components from source to ensure we get the latest features, security updates, and bug fixes.
Log into your server that runs Apache. We’ll call this the Nagios server.
Create a nagios user and nagcmd group. You’ll use these to run the Nagios process.
Then add the user to the group:
Because we are building Nagios and its components from source, we must install a few development libraries to complete the build, including compilers, development headers, and OpenSSL.
Update your package lists to ensure you can download the latest versions of the prerequisites:
Then install the required packages:
With the prerequisites installed, we can install Nagios itself. Download the source code for the latest stable release of Nagios Core. Go to the Nagios downloads page, and click the Skip to download link below the form. Copy the link address for the latest stable release so you can download it to your Nagios server.
Download the release to your home directory with the curl
command:
Extract the Nagios archive:
Then change to the extracted directory:
Before building Nagios, run the configure
script to specify the user and group you want Nagios to use. Use the nagios user and nagcmd group you created:
If you want Nagios to send emails using Postfix, you must install Postfix and configure Nagios to use it by adding --with-mail=/usr/sbin/sendmail
to the configure
command. We won’t cover Postfix in this tutorial, but if you choose to use Postfix and Nagios later, you’ll need to reconfigure and reinstall Nagios to use Postfix support.
You’ll see the following output from the configure
command:
Output*** Configuration summary for nagios 4.3.4 2017-08-24 ***:
General Options:
-------------------------
Nagios executable: nagios
Nagios user/group: nagios,nagios
Command user/group: nagios,nagcmd
Event Broker: yes
Install ${prefix}: /usr/local/nagios
Install ${includedir}: /usr/local/nagios/include/nagios
Lock file: /run/nagios.lock
Check result directory: ${prefix}/var/spool/checkresults
Init directory: /etc/init.d
Apache conf.d directory: /etc/apache2/sites-available
Mail program: /bin/mail
Host OS: linux-gnu
IOBroker Method: epoll
Web Interface Options:
------------------------
HTML URL: http://localhost/nagios/
CGI URL: http://localhost/nagios/cgi-bin/
Traceroute (used by WAP):
Review the options above for accuracy. If they look okay,
type 'make all' to compile the main program and CGIs.
Now compile Nagios with this command:
Now run these make
commands to install Nagios, its init scripts, and its default configuration files:
You’ll use Apache to serve Nagios’ web interface, so copy the sample Apache configuration file to the /etc/apache2/sites-available
folder:
In order to issue external commands via the web interface to Nagios, add the web server user, www-data, to the nagcmd group:
Nagios is now installed. Let’s install a plugin which will allow Nagios to collect data from various hosts.
Nagios monitors remote hosts using the Nagios Remote Plugin Executor, or NRPE. It consists of two pieces:
check_nrpe
plugin which is used by Nagios server.Let’s install the check_nrpe
plugin on our Nagios server.
Find the download URL for the latest stable release of NRPE at the Nagios Exchange site.
Download it to your home directory with curl
:
Extract the NRPE archive:
Then change to the extracted directory:
Configure the check_nrpe
plugin:
Now build and install check_nrpe
:
Let’s configure the Nagios server next.
Now let’s perform the initial Nagios configuration, which involves editing some configuration files and configuring Apache to serve the Nagios web interface. You only need to perform this section once on your Nagios server.
Open the main Nagios configuration file in your text editor:
Find this line in the file:
...
#cfg_dir=/usr/local/nagios/etc/servers
...
Uncomment this line by deleting the #
character from the front of the line:
cfg_dir=/usr/local/nagios/etc/servers
Save the file and exit the editor.
Now create the directory that will store the configuration file for each server that you will monitor:
Open the Nagios contacts configuration in your text editor:
Find the email
directive and replace its value with your own email address:
...
define contact{
contact_name nagiosadmin ; Short name of user
use generic-contact ; Inherit default values from generic-contact template (defined above)
alias Nagios Admin ; Full name of user
email your_email@your_domain.com ; <<***** CHANGE THIS TO YOUR EMAIL ADDRESS ******
...
Save and exit the editor.
Next, add a new command to your Nagios configuration that lets you use the check_nrpe
command in Nagios service definitions. Open the file /usr/local/nagios/etc/objects/commands.cfg
in your editor:
Add the following to the end of the file to define a new command called check_nrpe
:
...
define command{
command_name check_nrpe
command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -c $ARG1$
}
This defines the name and specifies the command-line options to execute the plugin. You’ll use this command in Step 5.
Save and exit the editor.
Now configure Apache to serve the Nagios user interface. Enable the Apache rewrite
and cgi
modules with the a2enmod
command:
Use the htpasswd
command to create an admin user called nagiosadmin that can access the Nagios web interface:
sudo htpasswd -c /usr/local/nagios/etc/htpasswd.users nagiosadmin
Enter a password at the prompt. Remember this password, as you will need it to access the Nagios web interface.
Note: If you create a user with a name other than nagiosadmin, you will need to edit /usr/local/nagios/etc/cgi.cfg
and change all the nagiosadmin references to the user you created.
Now create a symbolic link for nagios.conf
to the sites-enabled
directory. This enables the Nagios virtual host.
Next, open the Apache configuration file for Nagios.
If you’ve configured Apache to serve pages over HTTPS, locate both occurrences of this line:
# SSLRequireSSL
Uncomment both occurrances by removing the #
symbol.
If you want to restrict the IP addresses that can access the Nagios web interface so that only certain IP addresses can access the interface, find the following two lines:
Order allow,deny
Allow from all
Comment them out by adding #
symbols in front of them:
# Order allow,deny
# Allow from all
Then find the following lines:
# Order deny,allow
# Deny from all
# Allow from 127.0.0.1
Uncomment them by deleting the #
symbols, and add the IP addresses or ranges (space delimited) that you want to allow to in the Allow from
line:
Order deny,allow
Deny from all
Allow from 127.0.0.1 your_ip_address
These lines appear twice in the configuration file, so ensure you change both occurrences. Then save and exit the editor.
Restart Apache to load the new Apache configuration:
With the Apache configuration in place, you can set up the service for Nagios. Nagios does not provide a Systemd unit file to manage the service, so let’s create one. Create the nagios.service
file and open it in your editor:
Enter the following definition into the file. This definition specifies when Nagios should start and where Systemd can find the Nagios application. Learn more about Systemd unit files in the tutorial Understanding Systemd Units and Unit Files
[Unit]
Description=Nagios
BindTo=network.target
[Install]
WantedBy=multi-user.target
[Service]
Type=simple
User=nagios
Group=nagios
ExecStart=/usr/local/nagios/bin/nagios /usr/local/nagios/etc/nagios.cfg
Save the file and exit your editor.
Then start Nagios and enable it to start when the server boots:
Nagios is now running, so let’s log in to its web interface.
Open your favorite web browser, and go to your Nagios server by visiting http://nagios_server_public_ip/nagios
.
Enter the login credentials for the web interface in the popup that appears. Use nagiosadmin for the username, and the password you created for that user.
After authenticating, you will see the default Nagios home page. Click on the Hosts link in the left navigation bar to see which hosts Nagios is monitoring:
As you can see, Nagios is monitoring only “localhost”, or itself.
Let’s monitor our other server with Nagios,
Let’s add a new host so Nagios can monitor it. We’ll install the Nagios Remote Plugin Executor (NRPE) on the remote host, install some plugins, and then configure the Nagios server to monitor this host.
Log in to the second server, which we’ll call the monitored server.
First create create a “nagios” user which will run the NRPE agent.
We’ll install NRPE from source, which means you’ll need the same development libraries you installed on the Nagios server in Step 1. Update your package sources and install the NRPE prerequisites:
NRPE requires that Nagios plugins is installed on the remote host. Let’s install this package from source.
Find the latest release of Nagios Plugins from the Nagios Plugins Download page. Copy the link address for the latest version, and copy the link address so you can download it to your Nagios server.
Download Nagios Plugins to your home directory with curl
:
Extract the Nagios Plugins archive:
Change to the extracted directory:
Before building Nagios Plugins, configure it to use the nagios user and group, and configure OpenSSL support:
Now compile the plugins:
Then install them:
Next, install NRPE. Find the download URL for the latest stable release of NRPE at the Nagios Exchange site just like you did in Step 1. Download the latest stable release of NRPE to your monitored server’s home directory with curl
:
Extract the NRPE archive with this command:
Then change to the extracted directory:
Configure NRPE by specifying the Nagios user and group, and tell it you want SSL support:
Now build and install NRPE and its startup script with these commands:
Next, let’s update the NRPE configuration file:
Find the allowed_hosts
directive, and add the private IP address of your Nagios server to the comma-delimited list:
This configures NRPE to accept requests from your Nagios server via its private IP address.
Save and exit your editor. Now you can start NRPE:
Ensure that the service is running by checking its status:
You’ll see the following output:
Output...
Oct 16 07:10:00 nagios systemd[1]: Started Nagios Remote Plugin Executor.
Oct 16 07:10:00 nagios nrpe[14653]: Starting up daemon
Oct 16 07:10:00 nagios nrpe[14653]: Server listening on 0.0.0.0 port 5666.
Oct 16 07:10:00 nagios nrpe[14653]: Server listening on :: port 5666.
Oct 16 07:10:00 nagios nrpe[14653]: Listening for connections on port 5666
Oct 16 07:10:00 nagios nrpe[14653]: Allowing connections from: 127.0.0.1,::1,207.154.249.232
Next, allow access to port 5666
through the firewall. If you are using UFW, configure it to allow TCP connections to port 5666
:
You can learn more about UFW in How To Set Up a Firewall with UFW on Ubuntu 16.04.
Now you can check the communication with the remote NRPE server. Run the following command on the Nagios server:
You’ll see the following output:
OutputNRPE v3.2.1
Now let’s configure some basic checks that Nagios can monitor.
First, let’s monitor the disk usage of this server. Use the df -h
command to look for the root filesystem. You’ll use this filesystem name in the NRPE configuration:
You’ll see output similar to this:
OutputFilesystem Size Used Avail Use% Mounted on
udev 490M 0 490M 0% /dev
tmpfs 100M 3.1M 97M 4% /run
/dev/vda1 29G 1.4G 28G 5% /
tmpfs 497M 0 497M 0% /dev/shm
tmpfs 5.0M 0 5.0M 0% /run/lock
tmpfs 497M 0 497M 0% /sys/fs/cgroup
/dev/vda15 105M 3.4M 102M 4% /boot/efi
tmpfs 100M 0 100M 0% /run/user/0
Locate the filesystem associated with /
. On a Droplet, the filesystem you want is probably /dev/vda1
.
Now open /usr/local/nagios/etc/nrpe.cfg
file in your editor:
The NRPE configuration file is very long and full of comments. There are a few lines that you will need to find and modify:
/dev/hda1
to whatever your root filesystem is calledLocate these settings and alter them appropriately:
...
server_address=monitored_server_private_ip
...
command[check_vda1]=/usr/lib/nagios/plugins/check_disk -w 20% -c 10% -p /dev/vda1
...
Save and exit the editor.
Restart the NRPE service to put the change into effect:
Repeat the steps in this section for each additional server you want to monitor.
Once you are done installing and configuring NRPE on the hosts that you want to monitor, you will have to add these hosts to your Nagios server configuration before it will start monitoring them. Let’s do that next.
To monitor your hosts with Nagios, you’ll add configuration files for each host specifying what you want to monitor. You can then view those hosts in the Nagios web interface.
On your Nagios server, create a new configuration file for each of the remote hosts that you want to monitor in /usr/local/nagios/etc/servers/
. Replace the highlighted word, monitored_server_host_name
with the name of your host:
Add the following host definition, replacing the host_name
value with your remote hostname, the alias
value with a description of the host, and the address
value with the private IP address of the remote host:
define host {
use linux-server
host_name your_monitored_server_host_name
alias My client server
address your_monitored_server_private_ip
max_check_attempts 5
check_period 24x7
notification_interval 30
notification_period 24x7
}
With this configuration, Nagios will only tell you if the host is up or down. Let’s add some services to monitor.
First, add this block to monitor CPU usage:
define service {
use generic-service
host_name your_monitored_server_host_name
service_description CPU load
check_command check_nrpe!check_load
}
The use generic-service
directive tells Nagios to inherit the values of a service template called generic-service which is predefined by Nagios.
Next, add this block to monitor disk usage:
define service {
use generic-service
host_name your_monitored_server_host_name
service_description /dev/vda1 free space
check_command check_nrpe!check_vda1
}
Now save and quit. Restart the Nagios service to put any changes into effect:
After several minutes, Nagios will check the new hosts and you’ll see them in the Nagios web interface. Click on the Services link in the left navigation bar to see all of your monitored hosts and services.
You’ve installed Nagios on a server and configured it to monitor CPU and disk usage of at least one remote machine.
Now that you’re monitoring a host and some of its services, you can start using Nagios to monitor your mission-critical services. You can use Nagios to set up notifications for critical events. For example, you can receive an email when your disk utilization reaches a warning or critical threshold, or a notification when your main website is down. This way you can resolve the situation promptly, or even before a problem even occurs.
Thanks for learning with the DigitalOcean Community. Check out our offerings for compute, storage, networking, and managed databases.
This textbox defaults to using Markdown to format your answer.
You can type !ref in this text area to quickly search our full set of tutorials, documentation & marketplace offerings and insert the link!
Remote Client: Showing host is not allowed
Apache is not starting after changes on sites enabled, ScriptAlias /nagios/cgi-bin “/usr/local/nagios/sbin”
<Directory “/usr/local/nagios/sbin”> SSLRequireSSL Options ExecCGI AllowOverride None <IfVersion >= 2.3> <RequireAll> Require all granted
Require host 127.0.0.1
</IfVersion> <IfVersion < 2.3>
Order allow,deny
Allow from all
</IfVersion> </Directory>
Alias /nagios “/usr/local/nagios/share”
<Directory “/usr/local/nagios/share”> SSLRequireSSL Options None AllowOverride None <IfVersion >= 2.3> <RequireAll> Require all granted Require all granted
Require host 127.0.0.1
</IfVersion> <IfVersion < 2.3>
Order allow,deny
Allow from all
</IfVersion> </Directory>
I have completed the steps upto Step-3. When accessing the Nagios web interface in step-4, I am getting blank screen after putting user name and password. What is breaking in the configuration?
Thanks
Thank you so much for this tutorial! I have 1 question about when you install the check_nrpe plugin… I need to modify the check_nrpe.cfg because a couple of my servers on my network are old and in order for it to work I need to modify the check_nrpe command to use -n. I can’t find this file so I am wondering how to go about this.
To be specific: my issue is in my
define service{ check_nrpe!something }
I would need check_nrpe to use the -n… since to get it to work I need to do the commands like this from the nagios server:
/usr/local/nagios$ libexec/check_nrpe -H 192.168.0.21 -n -c check_mem
How do I go about that? :)
Thanks!
I am stuck on the end of Step 3. I have made it through all steps until this without problems. When I try to enable the service:
sudo systemctl enable /etc/systemd/system/nagios.service
I get this error:
Failed to execute operation: File exists
I am still able to start the Nagios, however I just get a blank white screen after authentication.
Any ideas?
Hello go followed the guide without having any mistake. I can run from the terminal to start the nagios service without problems, but when I access nagios through we the system says “nagios not found”. I also opened port 80 for the ip address of the server on my router. what is the problem?. server apache2 is ready. if i do access ti my addres ip with web browser it is ready
Thank you so much for writing these things in a very clear manner. we found this site very useful for all our searches :)
Hi, Thank you for the very good article !! I just wanted to add that the installation of the Nagios tools is missing:
In short the missing commands: cd ~ wget https://nagios-plugins.org/download/nagios-plugins-2.2.1.tar.gz tar -xzf nagios-plugins*.tar.gz cd nagios-plugins-2.2.1/
./configure --with-nagios-user=nagios --with-nagios-group=nagios make make install
Thanks and best regards Manfred
I get this error in the web page Error: Could not read object configuration data!
and in console gilmata@ServI:$ service nagios start Starting nagios:Contraseña: su: Fallo de autenticación ERROR: Could not create or update ‘/usr/local/nagios/var/nagios.configtest’