diff -r 000000000000 -r 8083d21c0020 web/html/abc.html~ --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/web/html/abc.html~ Mon Jan 25 18:56:45 2010 +0530 @@ -0,0 +1,530 @@ + +Chapter 9. Finding and fixing mistakes + + + + + + + + + + + +
+

+Chapter 14. Adding functionality with extensions

+
+

Table of Contents

+
+
14.1. Improve performance with the inotify extension
+
14.2. Flexible diff support with the extdiff extension
+
14.2.1. Defining command aliases
+
14.3. Cherrypicking changes with the transplant extension
+
14.4. Send changes via email with the patchbomb extension
+
14.4.1. Changing the behavior of patchbombs
+
+
+

While the core of Mercurial is quite complete from a + functionality standpoint, it's deliberately shorn of fancy + features. This approach of preserving simplicity keeps the + software easy to deal with for both maintainers and users.

+

However, Mercurial doesn't box you in with an inflexible + command set: you can add features to it as + extensions (sometimes known as + plugins). We've already discussed a few of + these extensions in earlier chapters.

+

When you provide a directory name, Mercurial will interpret + this as “operate on every file in this directory and its + subdirectories”. Mercurial traverses the files and + subdirectories in a directory in alphabetical order. When it + encounters a subdirectory, it will traverse that subdirectory + before continuing with the current directory.

+ + + +
+

In this chapter, we'll cover some of the other extensions that + are available for Mercurial, and briefly touch on some of the + machinery you'll need to know about if you want to write an + extension of your own.

+
+
+

+14.1. Improve performance with the inotify extension

+

Are you interested in having some of the most common + Mercurial operations run as much as a hundred times faster? + Read on!

+

Mercurial has great performance under normal circumstances. + For example, when you run the hg + status command, Mercurial has to scan almost every + directory and file in your repository so that it can display + file status. Many other Mercurial commands need to do the same + work behind the scenes; for example, the hg diff command uses the status + machinery to avoid doing an expensive comparison operation on + files that obviously haven't changed.

+

Because obtaining file status is crucial to good + performance, the authors of Mercurial have optimised this code + to within an inch of its life. However, there's no avoiding the + fact that when you run hg + status, Mercurial is going to have to perform at + least one expensive system call for each managed file to + determine whether it's changed since the last time Mercurial + checked. For a sufficiently large repository, this can take a + long time.

+

To put a number on the magnitude of this effect, I created a + repository containing 150,000 managed files. I timed hg status as taking ten seconds to + run, even when none of those files had been + modified.

+

Many modern operating systems contain a file notification + facility. If a program signs up to an appropriate service, the + operating system will notify it every time a file of interest is + created, modified, or deleted. On Linux systems, the kernel + component that does this is called + inotify.

+

Mercurial's inotify + extension talks to the kernel's inotify + component to optimise hg status + commands. The extension has two components. A daemon sits in + the background and receives notifications from the + inotify subsystem. It also listens for + connections from a regular Mercurial command. The extension + modifies Mercurial's behavior so that instead of scanning the + filesystem, it queries the daemon. Since the daemon has perfect + information about the state of the repository, it can respond + with a result instantaneously, avoiding the need to scan every + directory and file in the repository.

+

Recall the ten seconds that I measured plain Mercurial as + taking to run hg status on a + 150,000 file repository. With the inotify extension enabled, the time + dropped to 0.1 seconds, a factor of one + hundred faster.

+

Before we continue, please pay attention to some + caveats.

+
    +
  • The inotify + extension is Linux-specific. Because it interfaces directly + to the Linux kernel's inotify subsystem, + it does not work on other operating systems.

  • +
  • It should work on any Linux distribution that + was released after early 2005. Older distributions are + likely to have a kernel that lacks + inotify, or a version of + glibc that does not have the necessary + interfacing support.

  • +
  • Not all filesystems are suitable for use with + the inotify extension. + Network filesystems such as NFS are a non-starter, for + example, particularly if you're running Mercurial on several + systems, all mounting the same network filesystem. The + kernel's inotify system has no way of + knowing about changes made on another system. Most local + filesystems (e.g. ext3, XFS, ReiserFS) should work + fine.

  • +
+

The inotify extension is + not yet shipped with Mercurial as of May 2007, so it's a little + more involved to set up than other extensions. But the + performance improvement is worth it!

+

The extension currently comes in two parts: a set of patches + to the Mercurial source code, and a library of Python bindings + to the inotify subsystem.

+
+ + + + + +
[Note]Note

There are two Python + inotify binding libraries. One of them is + called pyinotify, and is packaged by some + Linux distributions as python-inotify. + This is not the one you'll need, as it is + too buggy and inefficient to be practical.

+

To get going, it's best to already have a functioning copy + of Mercurial installed.

+
+ + + + + +
[Note]Note

If you follow the instructions below, you'll be + replacing and overwriting any existing + installation of Mercurial that you might already have, using + the latest bleeding edge Mercurial code. Don't + say you weren't warned!

+
    +
  1. +

    Clone the Python inotify + binding repository. Build and install it.

    +
    hg clone http://hg.kublai.com/python/inotify
    +cd inotify
    +python setup.py build --force
    +sudo python setup.py install --skip-build
    +
  2. +
  3. +

    Clone the crew Mercurial repository. + Clone the inotify patch + repository so that Mercurial Queues will be able to apply + patches to your cope of the crew repository.

    +
    hg clone http://hg.intevation.org/mercurial/crew
    +hg clone crew inotify
    +hg clone http://hg.kublai.com/mercurial/patches/inotify inotify/.hg/patches
    +
  4. +
  5. Make sure that you have the Mercurial Queues + extension, mq, enabled. If + you've never used MQ, read Section 12.5, “Getting started with Mercurial Queues” to get started + quickly.

  6. +
  7. +

    Go into the inotify repo, and apply all + of the inotify patches + using the hg + -a option to the qpush command.

    +
    cd inotify
    +hg qpush -a
    +
  8. +
  9. If you get an error message from qpush, you should not continue. + Instead, ask for help.

  10. +
  11. +

    Build and install the patched version of + Mercurial.

    +
    python setup.py build --force
    +sudo python setup.py install --skip-build
    +
  12. +
+

Once you've build a suitably patched version of Mercurial, + all you need to do to enable the inotify extension is add an entry to + your ~/.hgrc.

+
[extensions] inotify =
+

When the inotify extension + is enabled, Mercurial will automatically and transparently start + the status daemon the first time you run a command that needs + status in a repository. It runs one status daemon per + repository.

+

The status daemon is started silently, and runs in the + background. If you look at a list of running processes after + you've enabled the inotify + extension and run a few commands in different repositories, + you'll thus see a few hg processes sitting + around, waiting for updates from the kernel and queries from + Mercurial.

+

The first time you run a Mercurial command in a repository + when you have the inotify + extension enabled, it will run with about the same performance + as a normal Mercurial command. This is because the status + daemon needs to perform a normal status scan so that it has a + baseline against which to apply later updates from the kernel. + However, every subsequent command that does + any kind of status check should be noticeably faster on + repositories of even fairly modest size. Better yet, the bigger + your repository is, the greater a performance advantage you'll + see. The inotify daemon makes + status operations almost instantaneous on repositories of all + sizes!

+

If you like, you can manually start a status daemon using + the inserve command. + This gives you slightly finer control over how the daemon ought + to run. This command will of course only be available when the + inotify extension is + enabled.

+

When you're using the inotify extension, you should notice + no difference at all in Mercurial's + behavior, with the sole exception of status-related commands + running a whole lot faster than they used to. You should + specifically expect that commands will not print different + output; neither should they give different results. If either of + these situations occurs, please report a bug.

+
+
+

+14.2. Flexible diff support with the extdiff extension

+

Mercurial's built-in hg + diff command outputs plaintext unified diffs.

+
$ hg diff
+diff -r 80997726a0ea myfile
+--- a/myfile	Wed Jan 06 06:50:18 2010 +0000
++++ b/myfile	Wed Jan 06 06:50:18 2010 +0000
+@@ -1,1 +1,2 @@
+ The first line.
++The second line.
+
+

If you would like to use an external tool to display + modifications, you'll want to use the extdiff extension. This will let you + use, for example, a graphical diff tool.

+

The extdiff extension is + bundled with Mercurial, so it's easy to set up. In the extensions section of your + ~/.hgrc, simply add a + one-line entry to enable the extension.

+
[extensions]
+extdiff =
+

This introduces a command named extdiff, which by default uses + your system's diff command to generate a + unified diff in the same form as the built-in hg diff command.

+
$ hg extdiff
+--- a.80997726a0ea/myfile	2010-01-06 06:50:18.613674526 +0000
++++ /tmp/extdiffNErQlu/a/myfile	2010-01-06 06:50:18.437687076 +0000
+@@ -1 +1,2 @@
+ The first line.
++The second line.
+
+

The result won't be exactly the same as with the built-in + hg diff variations, because the + output of diff varies from one system to + another, even when passed the same options.

+

As the making snapshot + lines of output above imply, the extdiff command works by + creating two snapshots of your source tree. The first snapshot + is of the source revision; the second, of the target revision or + working directory. The extdiff command generates + these snapshots in a temporary directory, passes the name of + each directory to an external diff viewer, then deletes the + temporary directory. For efficiency, it only snapshots the + directories and files that have changed between the two + revisions.

+

Snapshot directory names have the same base name as your + repository. If your repository path is /quux/bar/foo, then foo will be the name of each + snapshot directory. Each snapshot directory name has its + changeset ID appended, if appropriate. If a snapshot is of + revision a631aca1083f, the directory will be + named foo.a631aca1083f. + A snapshot of the working directory won't have a changeset ID + appended, so it would just be foo in this example. To see what + this looks like in practice, look again at the extdiff example above. Notice + that the diff has the snapshot directory names embedded in its + header.

+

The extdiff command + accepts two important options. The hg -p option + lets you choose a program to view differences with, instead of + diff. With the hg -o option, + you can change the options that extdiff passes to the program + (by default, these options are + -Npru, which only make sense + if you're running diff). In other respects, + the extdiff command + acts similarly to the built-in hg + diff command: you use the same option names, syntax, + and arguments to specify the revisions you want, the files you + want, and so on.

+

As an example, here's how to run the normal system + diff command, getting it to generate context + diffs (using the -c option) + instead of unified diffs, and five lines of context instead of + the default three (passing 5 as the argument + to the -C option).

+
$ hg extdiff -o -NprcC5
+*** a.80997726a0ea/myfile	Wed Jan  6 06:50:18 2010
+--- /tmp/extdiffNErQlu/a/myfile	Wed Jan  6 06:50:18 2010
+***************
+*** 1 ****
+--- 1,2 ----
+  The first line.
++ The second line.
+
+

Launching a visual diff tool is just as easy. Here's how to + launch the kdiff3 viewer.

+
hg extdiff -p kdiff3 -o
+

If your diff viewing command can't deal with directories, + you can easily work around this with a little scripting. For an + example of such scripting in action with the mq extension and the + interdiff command, see Section 13.9.2, “Viewing the history of a patch”.

+
+

+14.2.1. Defining command aliases

+

It can be cumbersome to remember the options to both the + extdiff command and + the diff viewer you want to use, so the extdiff extension lets you define + new commands that will invoke your diff + viewer with exactly the right options.

+

All you need to do is edit your ~/.hgrc, and add a section named + extdiff. Inside this + section, you can define multiple commands. Here's how to add + a kdiff3 command. Once you've defined + this, you can type hg kdiff3 + and the extdiff extension + will run kdiff3 for you.

+
[extdiff]
+cmd.kdiff3 =
+

If you leave the right hand side of the definition empty, + as above, the extdiff + extension uses the name of the command you defined as the name + of the external program to run. But these names don't have to + be the same. Here, we define a command named + hg wibble, which runs + kdiff3.

+
[extdiff]
+ cmd.wibble = kdiff3
+

You can also specify the default options that you want to + invoke your diff viewing program with. The prefix to use is + opts., followed by the name + of the command to which the options apply. This example + defines a hg vimdiff command + that runs the vim editor's + DirDiff extension.

+
[extdiff]
+ cmd.vimdiff = vim
+opts.vimdiff = -f '+next' '+execute "DirDiff" argv(0) argv(1)'
+
+
+
+

+14.3. Cherrypicking changes with the transplant extension

+

Need to have a long chat with Brendan about this.

+
+
+

+14.4. Send changes via email with the patchbomb extension

+

Many projects have a culture of change + review, in which people send their modifications to a + mailing list for others to read and comment on before they + commit the final version to a shared repository. Some projects + have people who act as gatekeepers; they apply changes from + other people to a repository to which those others don't have + access.

+

Mercurial makes it easy to send changes over email for + review or application, via its patchbomb extension. The extension is + so named because changes are formatted as patches, and it's usual + to send one changeset per email message. Sending a long series + of changes by email is thus much like bombing the + recipient's inbox, hence patchbomb.

+

As usual, the basic configuration of the patchbomb extension takes just one or + two lines in your + /.hgrc.

+
[extensions]
+patchbomb =
+

Once you've enabled the extension, you will have a new + command available, named email.

+

The safest and best way to invoke the email command is to + always run it first with the hg -n option. + This will show you what the command would + send, without actually sending anything. Once you've had a + quick glance over the changes and verified that you are sending + the right ones, you can rerun the same command, with the hg -n option + removed.

+

The email command + accepts the same kind of revision syntax as every other + Mercurial command. For example, this command will send every + revision between 7 and tip, inclusive.

+
hg email -n 7:tip
+

You can also specify a repository to + compare with. If you provide a repository but no revisions, the + email command will + send all revisions in the local repository that are not present + in the remote repository. If you additionally specify revisions + or a branch name (the latter using the hg -b option), + this will constrain the revisions sent.

+

It's perfectly safe to run the email command without the + names of the people you want to send to: if you do this, it will + just prompt you for those values interactively. (If you're + using a Linux or Unix-like system, you should have enhanced + readline-style editing capabilities when + entering those headers, too, which is useful.)

+

When you are sending just one revision, the email command will by + default use the first line of the changeset description as the + subject of the single email message it sends.

+

If you send multiple revisions, the email command will usually + send one message per changeset. It will preface the series with + an introductory message, in which you should describe the + purpose of the series of changes you're sending.

+
+

+14.4.1. Changing the behavior of patchbombs

+

Not every project has exactly the same conventions for + sending changes in email; the patchbomb extension tries to + accommodate a number of variations through command line + options.

+
    +
  • You can write a subject for the introductory + message on the command line using the hg -s + option. This takes one argument, the text of the subject + to use.

  • +
  • To change the email address from which the + messages originate, use the hg -f + option. This takes one argument, the email address to + use.

  • +
  • The default behavior is to send unified diffs + (see Section 12.4, “Understanding patches” for a + description of the + format), one per message. You can send a binary bundle + instead with the hg -b + option.

  • +
  • Unified diffs are normally prefaced with a + metadata header. You can omit this, and send unadorned + diffs, with the hg + --plain option.

  • +
  • Diffs are normally sent inline, + in the same body part as the description of a patch. This + makes it easiest for the largest number of readers to + quote and respond to parts of a diff, as some mail clients + will only quote the first MIME body part in a message. If + you'd prefer to send the description and the diff in + separate body parts, use the hg -a + option.

  • +
  • Instead of sending mail messages, you can + write them to an mbox-format mail + folder using the hg -m + option. That option takes one argument, the name of the + file to write to.

  • +
  • If you would like to add a + diffstat-format summary to each patch, + and one to the introductory message, use the hg -d + option. The diffstat command displays + a table containing the name of each file patched, the + number of lines affected, and a histogram showing how much + each file is modified. This gives readers a qualitative + glance at how complex a patch is.

  • +
+
+
+
+ + +