Sophie: buildbot-doc-0.8.5-2.mga2 noarch

buildbot-doc-0.8.5-2.mga2.noarch.rpm

.. _Introduction:

Introduction
============

BuildBot is a system to automate the compile/test cycle required by most
software projects to validate code changes. By automatically rebuilding and
testing the tree each time something has changed, build problems are
pinpointed quickly, before other developers are inconvenienced by the
failure. The guilty developer can be identified and harassed without human
intervention. By running the builds on a variety of platforms, developers
who do not have the facilities to test their changes everywhere before
checkin will at least know shortly afterwards whether they have broken the
build or not. Warning counts, lint checks, image size, compile time, and
other build parameters can be tracked over time, are more visible, and
are therefore easier to improve.

The overall goal is to reduce tree breakage and provide a platform to
run tests or code-quality checks that are too annoying or pedantic for
any human to waste their time with. Developers get immediate (and
potentially public) feedback about their changes, encouraging them to
be more careful about testing before checkin.

Features:

* run builds on a variety of slave platforms
* arbitrary build process: handles projects using C, Python, whatever
* minimal host requirements: python and Twisted
* slaves can be behind a firewall if they can still do checkout
* status delivery through web page, email, IRC, other protocols
* track builds in progress, provide estimated completion time
* flexible configuration by subclassing generic build process classes
* debug tools to force a new build, submit fake :class:`Change`\s,
query slave status
* released under the `GPL <http://opensource.org/licenses/gpl-2.0.php>`_

.. _History-and-Philosophy:

History and Philosophy
----------------------

The Buildbot was inspired by a similar project built for a development
team writing a cross-platform embedded system. The various components
of the project were supposed to compile and run on several flavors of
unix (linux, solaris, BSD), but individual developers had their own
preferences and tended to stick to a single platform. From time to
time, incompatibilities would sneak in (some unix platforms want to
use :file:`string.h`, some prefer :file:`strings.h`), and then the tree
would compile for some developers but not others. The buildbot was
written to automate the human process of walking into the office,
updating a tree, compiling (and discovering the breakage), finding the
developer at fault, and complaining to them about the problem they had
introduced. With multiple platforms it was difficult for developers to
do the right thing (compile their potential change on all platforms);
the buildbot offered a way to help.

Another problem was when programmers would change the behavior of a
library without warning its users, or change internal aspects that
other code was (unfortunately) depending upon. Adding unit tests to
the codebase helps here: if an application's unit tests pass despite
changes in the libraries it uses, you can have more confidence that
the library changes haven't broken anything. Many developers
complained that the unit tests were inconvenient or took too long to
run: having the buildbot run them reduces the developer's workload to
a minimum.

In general, having more visibility into the project is always good,
and automation makes it easier for developers to do the right thing.
When everyone can see the status of the project, developers are
encouraged to keep the tree in good working order. Unit tests that
aren't run on a regular basis tend to suffer from bitrot just like
code does: exercising them on a regular basis helps to keep them
functioning and useful.

The current version of the Buildbot is additionally targeted at
distributed free-software projects, where resources and platforms are
only available when provided by interested volunteers. The buildslaves
are designed to require an absolute minimum of configuration, reducing
the effort a potential volunteer needs to expend to be able to
contribute a new test environment to the project. The goal is for
anyone who wishes that a given project would run on their favorite
platform should be able to offer that project a buildslave, running on
that platform, where they can verify that their portability code
works, and keeps working.

.. _System-Architecture:

System Architecture
-------------------

The Buildbot consists of a single *buildmaster* and one or more
*buildslaves*, connected in a star topology. The buildmaster
makes all decisions about what, when, and how to build. It sends
commands to be run on the build slaves, which simply execute the
commands and return the results. (certain steps involve more local
decision making, where the overhead of sending a lot of commands back
and forth would be inappropriate, but in general the buildmaster is
responsible for everything).

The buildmaster is usually fed :class:`Change`\s by some sort of version control
system (:ref:`change-sources`), which may cause builds to be run. As the
builds are performed, various status messages are produced, which are then sent
to any registered :ref:`status-targets`.

.. image:: _images/overview.*
:alt: Overview Diagram

The buildmaster is configured and maintained by the *buildmaster
admin*, who is generally the project team member responsible for
build process issues. Each buildslave is maintained by a *buildslave
admin*, who do not need to be quite as involved. Generally slaves are
run by anyone who has an interest in seeing the project work well on
their favorite platform.

.. _BuildSlave-Connections:

BuildSlave Connections
~~~~~~~~~~~~~~~~~~~~~~

The buildslaves are typically run on a variety of separate machines,
at least one per platform of interest. These machines connect to the
buildmaster over a TCP connection to a publically-visible port. As a
result, the buildslaves can live behind a NAT box or similar
firewalls, as long as they can get to buildmaster. The TCP connections
are initiated by the buildslave and accepted by the buildmaster, but
commands and results travel both ways within this connection. The
buildmaster is always in charge, so all commands travel exclusively
from the buildmaster to the buildslave.

To perform builds, the buildslaves must typically obtain source code
from a CVS/SVN/etc repository. Therefore they must also be able to
reach the repository. The buildmaster provides instructions for
performing builds, but does not provide the source code itself.

.. image:: _images/slaves.*
:alt: BuildSlave Connections

.. _Buildmaster-Architecture:

Buildmaster Architecture
~~~~~~~~~~~~~~~~~~~~~~~~

The buildmaster consists of several pieces:

.. image:: _images/master.*
:alt: Buildmaster Architecture

Change Sources
Which create a Change object each time something is
modified in the VC repository. Most :class:`ChangeSource`\s listen for messages
from a hook script of some sort. Some sources actively poll the
repository on a regular basis. All :class:`Change`\s are fed to the
:class:`Scheduler`\s.

Schedulers
Which decide when builds should be performed. They collect
:class:`Change`\s into :class:`BuildRequest`\s, which are then queued for delivery to
:class:`Builders` until a buildslave is available.

Builders
Which control exactly *how* each build is performed
(with a series of :class:`BuildStep`\s, configured in a :class:`BuildFactory`). Each
:class:`Build` is run on a single buildslave.

Status plugins
Which deliver information about the build results
through protocols like HTTP, mail, and IRC.

Each :class:`Builder` is configured with a list of :class:`BuildSlave`\s that it will use
for its builds. These buildslaves are expected to behave identically:
the only reason to use multiple :class:`BuildSlave`\s for a single :class:`Builder` is to
provide a measure of load-balancing.

Within a single :class:`BuildSlave`, each :class:`Builder` creates its own :class:`SlaveBuilder`
instance. These :class:`SlaveBuilder`\s operate independently from each other.
Each gets its own base directory to work in. It is quite common to
have many :class:`Builder`\s sharing the same buildslave. For example, there
might be two buildslaves: one for i386, and a second for PowerPC.
There may then be a pair of :class:`Builder`\s that do a full compile/test run,
one for each architecture, and a lone :class:`Builder` that creates snapshot
source tarballs if the full builders complete successfully. The full
builders would each run on a single buildslave, whereas the tarball
creation step might run on either buildslave (since the platform
doesn't matter when creating source tarballs). In this case, the
mapping would look like:

.. code-block:: none

Builder(full-i386) -> BuildSlaves(slave-i386)
Builder(full-ppc) -> BuildSlaves(slave-ppc)
Builder(source-tarball) -> BuildSlaves(slave-i386, slave-ppc)

and each :class:`BuildSlave` would have two :class:`SlaveBuilders` inside it, one for a
full builder, and a second for the source-tarball builder.

Once a :class:`SlaveBuilder` is available, the :class:`Builder` pulls one or more
:class:`BuildRequest`\s off its incoming queue. (It may pull more than one if it
determines that it can merge the requests together; for example, there
may be multiple requests to build the current *HEAD* revision). These
requests are merged into a single :class:`Build` instance, which includes the
:class:`SourceStamp` that describes what exact version of the source code
should be used for the build. The :class:`Build` is then randomly assigned to a
free :class:`SlaveBuilder` and the build begins.

The behaviour when :class:`BuildRequest`\s are merged can be customized,
:ref:`Merging-Build-Requests`.

.. _Status-Delivery-Architecture:

Status Delivery Architecture
~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The buildmaster maintains a central :class:`Status` object, to which various
status plugins are connected. Through this :class:`Status` object, a full
hierarchy of build status objects can be obtained.

.. image:: _images/status.*
:alt: Status Delivery

The configuration file controls which status plugins are active. Each
status plugin gets a reference to the top-level :class:`Status` object. From
there they can request information on each :class:`Builder`, :class:`Build`, :class:`Step`, and
:class:`LogFile`. This query-on-demand interface is used by the ``html.Waterfall``
plugin to create the main status page each time a web browser hits the
main URL.

The status plugins can also subscribe to hear about new :class:`Build`\s as they
occur: this is used by the :class:`MailNotifier` to create new email messages
for each recently-completed :class:`Build`.

The :class:`Status` object records the status of old builds on disk in the
buildmaster's base directory. This allows it to return information
about historical builds.

There are also status objects that correspond to :class:`Scheduler`\s and
:class:`BuildSlave`\s. These allow status plugins to report information about
upcoming builds, and the online/offline status of each buildslave.

.. _Control-Flow:

Control Flow
------------

A day in the life of the buildbot:

* A developer commits some source code changes to the repository. A hook
script or commit trigger of some sort sends information about this
change to the buildmaster through one of its configured Change
Sources. This notification might arrive via email, or over a network
connection (either initiated by the buildmaster as it *subscribes*
to changes, or by the commit trigger as it pushes :class:`Change`\s towards the
buildmaster). The :class:`Change` contains information about who made the
change, what files were modified, which revision contains the change,
and any checkin comments.

* The buildmaster distributes this change to all of its configured
:class:`Scheduler`\s. Any ``important`` changes cause the ``tree-stable-timer``
to be started, and the :class:`Change` is added to a list of those that will go
into a new :class:`Build`. When the timer expires, a :class:`Build` is started on each
of a set of configured Builders, all compiling/testing the same source
code. Unless configured otherwise, all :class:`Build`\s run in parallel on the
various buildslaves.

* The :class:`Build` consists of a series of :class:`Step`\s. Each :class:`Step` causes some number
of commands to be invoked on the remote buildslave associated with
that :class:`Builder`. The first step is almost always to perform a checkout of
the appropriate revision from the same VC system that produced the
:class:`Change`. The rest generally perform a compile and run unit tests. As
each :class:`Step` runs, the buildslave reports back command output and return
status to the buildmaster.

* As the :class:`Build` runs, status messages like "Build Started", "Step
Started", "Build Finished", etc, are published to a collection of
Status Targets. One of these targets is usually the HTML ``Waterfall``
display, which shows a chronological list of events, and summarizes
the results of the most recent build at the top of each column.
Developers can periodically check this page to see how their changes
have fared. If they see red, they know that they've made a mistake and
need to fix it. If they see green, they know that they've done their
duty and don't need to worry about their change breaking anything.

* If a :class:`MailNotifier` status target is active, the completion of a build
will cause email to be sent to any developers whose :class:`Change`\s were
incorporated into this :class:`Build`. The :class:`MailNotifier` can be configured to
only send mail upon failing builds, or for builds which have just
transitioned from passing to failing. Other status targets can provide
similar real-time notification via different communication channels,
like IRC.