=head1 NAME

rrdtool - round robin database tool

=for html <div align="right"><a href="rrdtool.pdf">PDF</a> version.</div>

=head1 SYNOPSIS

B<rrdtool> B<-> [workdir]| I<function>

=head1 DESCRIPTION

=head2 OVERVIEW

It is pretty easy to gather status information from all sorts of
things, ranging from the temperature in your office to the number of
octets which have passed through the FDDI interface of your
router. But it is not so trivial to store this data in a efficient and
systematic manner. This is where B<rrdtool> kicks in. It lets you
I<log and analyze> the data you gather from all kinds of data-sources
(B<DS>). The data analysis part of rrdtool is based on the ability to
quickly generate graphical representations of the data values
collected over a definable time period.

In this man page you will find general information on the design and
functionality of the Round Robin Database Tool (rrdtool). For a more
detailed description of how to use the individual functions of the
B<rrdtool> check the corresponding man page.

For an introduction to the usage of rrdtool make sure you check L<rrdtutorial>.

=head2 FUNCTIONS

While the man pages talk of command line switches you have to set in
order to make B<rrdtool> work it is important to note that the
B<rrdtool> can be 'remote controlled' through a set of pipes. This
saves a considerable amount of startup time when you plan to make
B<rrdtool> do a lot of things quickly. Check the section on L<"Remote
Control"> further down. There is also a number of language bindings
for rrdtool which allow you to use it directly from perl, python, tcl,
php, ...

=over 8

=item B<create>

Set up a new Round Robin Database (RRD). Check L<rrdcreate>.

=item B<update>

Store new data values into an RRD. Check L<rrdupdate>.

=item B<updatev>

Operation equivalent to B<update> except for output. Check L<rrdupdate>.

=item B<graph>

Create a graph from data stored in one or several RRD. Apart from
generating graphs, data can also be extracted to stdout. Check L<rrdgraph>.

=item B<dump>

Dump the contents of an RRD in plain ASCII. In connection with 
restore you can use it to transport an rrd from one architecture to another.
Check L<rrddump>.

=item B<restore>

Restore an RRD in XML format to a binary rrd ... Check L<rrdrestore>

=item B<fetch>

Get data for a certain time period from a RRD. The graph function
uses fetch to retrieve its data from an rrd. Check L<rrdfetch>.

=item B<tune>

Alter setup of an RRD. Check L<rrdtune>.

=item B<last>

Find last update time of an RRD. Check L<rrdlast>.

=item B<info>

Get information about an RRD. Check L<rrdinfo>.

=item B<rrdresize>

Change the size of individual RRAs ... Dangerous! Check L<rrdresize>.

=item B<xport>

Export data retrieved from one or several RRD. Check L<rrdxport>

=item B<rrdcgi>

This is a standalone tool for producing rrd graphs on the fly. Check
L<rrdcgi>.

=back

=head2 HOW DOES RRDTOOL WORK?

=over 8

=item Data acquisition

When monitoring the state of a system, it is convenient to have the
data available at a constant interval. Unfortunately you may not
always be able to fetch data at exactly the time you want
to. Therefore B<rrdtool> lets you update the logfile at any time you
want. It will automatically interpolate the value of the data-source
(B<DS>) at the latest official time-slot and write this value to the
log. The value you have supplied is stored as well and is also taken
into account when interpolating the next log entry.

=item Consolidation

You may log data at a 1 minute interval, but you are also be
interested to know the development of the data over the last year. You
could do this by simply storing the data in 1 minute interval, for one
year. While this would take considerable disk space it would also take
a lot of time to analyze the data when you wanted to create a graph
covering the whole year. B<rrdtool> offers a solution to this of this
problem through its data consolidation feature. When setting up
an Round Robin Database (B<RRD>), you can define at which interval
this consolidation should occur, and what consolidation function
(B<CF>) (average, minimum, maximum, total, last) should be used to
build the consolidated values (see rrdcreate). You can define any
number of different consolidation setups within one B<RRD>. They will
all be maintained on the fly when new data is loaded into the B<RRD>.

=item Round Robin Archives

Data values of the same consolidation setup are stored into Round
Robin Archives (B<RRA>). This is a very efficient manner to store data
for a certain amount of time, while using a known amount of storage
space. 

It works like this: If you want to store 1000 values in 5 minute
interval, B<rrdtool> will allocate space for 1000 data values and a
header area. In the header it will store a pointer telling
which one of the values in the storage area was last written to. New
values are written to the Round Robin Archive in a ...  you guess it
... round robin manner. This automatically limits the history to the last
1000 values. Because you can define several B<RRA>s within a single B<RRD>,
you can setup another one, storing 750 data values at a 2 hour interval
and thus keeping a log for the last two months although at a lower
resolution.

The use of B<RRA>s guarantees that the B<RRD> does not grow over
time and that old data is automatically eliminated. By using the
consolidation feature, you can still keep data for a very long time,
while gradually reducing the resolution of the data along the time
axis. Using different consolidation functions (B<CF>) allows you to
store exactly the type of information that actually interests
you. (Maximum one minute traffic on the LAN, minimum temperature of
the wine cellar, total minutes down time ...)

=item Unknown Data

As mentioned earlier, the B<RRD> stores data at a constant
interval. Now it may happen that no new data is available when a
value has to be written to the B<RRD>. Data acquisition may not be
possible for one reason or an other. The B<rrdtool> handles these
situations by storing an I<*UNKNOWN*> value into the database. The
value 'I<*UNKNOWN*>' is supported through all the functions of the
database. When consolidating the amount of I<*UNKNOWN*> data is
accumulated and when a new consolidated value is ready to be written
to its Round Robin Archive (B<RRA>) a validity check is performed to
make sure that the percentage of unknown data in the new value is
below a configurable level. If so, an I<*UNKNOWN*> value will be
written to the B<RRA>.

=item Graphing

The B<rrdtool> also allows one to generate reports in numerical and
graphical form based on the data stored in one or several
B<RRD>s. The graphing feature is fully configurable. Size, color and
contents of the graph can be defined freely. Check L<rrdgraph>
for more information on this.

=item Aberrant Behavior Detection

by Jake Brutlag E<lt>jakeb@corp.webtv.netE<gt>

The  B<rrdtool> also provides the building blocks for near real-time
aberrant behavior detection. These components include:

=over 12

=item *

An algorithm for predicting the values time series one time step into the future.

=item *

A measure of deviation between the predicted values and the observed values.

=item *

A mechanism to decide if and when an observed value
or sequence of observed values is I<too deviant> from the predicted value(s).

=back

Each of these components is briefly described:

Holt-Winters Time Series Forecasting Algorithm is an online, or incremental, 
algorithm that adaptively predicts future observations in a time series. It's 
forecast is the sum of three components: a baseline (or intercept), a linear 
trend over time (or slope), and a seasonal coefficient (a periodic effect, 
such as a daily cycle). There is one seasonal coefficient for each time point 
in the period (cycle). After a value is observed, each of these components is 
updated via exponential smoothing. So the algorithm learns from past values 
and uses them to predict the future. The rate of adaptation is governed by 
3 parameters, alpha (intercept), beta (slope), and gamma (seasonal). The prediction 
can also be viewed as a smoothed value for the time series.

The measure of deviation is a seasonal weighted absolute deviation. The term 
I<seasonal> means deviation is measured separately for each time point in the 
seasonal cycle. As with Holt-Winters Forecasting, deviation is predicted using 
the measure computed from past values (but only at that point in the seasonal cycle). 
After the value is observed, the algorithm learns from the observed value via 
exponential smoothing. Confidence bands for the observed time series are generated 
by scaling the sequence of predicted deviation values (we usually think of the sequence 
as a continuous line rather than a set of discrete points).

Aberrant behavior (a potential failure) is reported whenever the number of 
times the observed value violates the confidence bands meets or exceeds a 
specified threshold within a specified temporal window (i.e. 5 violations 
during the past 45 minutes with a value observed every 5 mintues).

This functionality is embedded in a set of related B<RRAs>. In particular, a FAILURES
B<RRA> logs potential failures. Presumably a front-end application to B<rrdtool> can
utilize this B<RRA> to initiate real-time alerts if that is desired.

You can find a detailed description of how to set this up in L<rrdcreate>.

=back

=head2 REMOTE CONTROL

When you start B<rrdtool> with the command line option 'B<->', it waits
for input via standard in. With this feature you can improve
performance by attaching B<rrdtool> to another process (mrtg is one
example) through a set of pipes. Over the pipes B<rrdtool> accepts the
same arguments as on the command line and some spezial commands like
B<quit, cd, mkdir> and B<ls>. For detail helps about the server commands
type :

   rrdtool help cd|mkdir|ls|quit

When a command is completed, rrdtool will print the string  'C<OK>', 
followed by timing information of the form B<u:>I<usertime> 
B<s:>I<systemtime> both values are running totals of seconds 
since rrdtool was started. If an error occurs, a line of the 
form 'C<ERROR:> I<Description of error>' will be printed. B<rrdtool>
will not abort if possible, but follow the ERROR line with an OK line.
If a B<workdir> is spezified and the UID is 0, rrdtool will do a 
chroot to the workdir. If the UID is not 0, rrdtool only changes the
current directory to B<workdir>.

=head2 RRD Server

If you want to create a RRD-Server, you must choose a TCP/IP Service 
number and add them to I</etc/services> like this:

 rrdsrv      13900/tcp                       # rrd server

Attention: the tcp port 13900 isn't official registered for rrdsrv. You
can use any unused port in your services, but the server an the client
system must use the same port of curse.
After this you can add the rrdtool as meta-server to I</etc/inetd.conf>
for example:
  
 rrdsrv stream tcp nowait root /opt/rrd/bin/rrdtool rrdtool - /var/rrd

Don't forget to create the database directory /var/rrd and reinitialize
your inetd.
If all was correct, you can access the server with perl sockets, tools
like netcat or a quickhack test 'telnet localhost rrdsrv'.


=head1 SEE ALSO

rrdcreate, rrdupdate, rrdgraph, rrddump, rrdfetch, rrdtune, rrdlast, rrdxport

=head1 BUGS

Bugs ? Features !

=head1 AUTHOR

Tobias Oetiker <oetiker@ee.ethz.ch>