Editor Note: This essay, originally published in 2000, was rewritten in March 2003.
What is index depth? A good question. The depth of an index refers to how many entries and subentries there are, relative to the number of page numbers in the index or documentation. An index that has subentry after subentry after subentry is a deep index. An index that has only primary-level entries and page ranges (and thus little detail) is a “shallow” index.
Index depth is subjective. The documentation by itself might be a very “shallow,” in which case index depth is not going to be high anyway. On the other hand, the documentation might be quite detailed, in which case the indexer has to decide how detailed (deep) the index should be.
There is no ideal depth for the index, but there are certain rules of thumb that I follow in my own indexes. In each of these guidelines, it’s important to consider the book’s audience. Are these people looking for a quick reference or for some in-depth information? Are they highly educated on the subject matter, or are they novices who want decoration for their coffee tables? This knowledge needs to be provided in advance by the author, editor, or publisher.
Below are four rules of thumb, with examples.
1. It’s almost impossible to provide too much access.
If you’re even remotely concerned that your readers won’t know the terminology, you must do something about it. How many different ways are there to say “software”? Programs, applications, code, and so on. These aren’t synonyms, but they’re close enough for some readers. Then come the kinds of software, like utilities, games, and drivers. Software-related information could be made accessible through any of these terms. Choose your favorite term (in this case, software), and for the rest either use cross references or repeat the page numbers.
applications. See also software
code. See software
programs. See software
software, 10–20
The above example is simple, but information can be more complicated. In the example below, the term drivers is important enough to merit additional access. Notice how the page number for 15 is repeated under the software entry.
applications. See also software
code. See software
drivers, 15, 180
hardware, 1-10
drivers for, 15, 180
programs. See software
software, 10–20
drivers, 15, 180
There are two limits. One of them is absurdity: “zeros and ones. See software.” The other is caused by alphabetization, such that two entries are so close to each other in the list that one is enough:
World Wide Web (WWW), 41–46
WWW (World Wide Web), 41–46
2. Don’t use cross references for entries with one or two locators.
drugs. See medicine
medicine, 5
With a construction like this, you’re wasting the reader’s time. Avoid the cross reference in favor of the more direct “drugs, 5.”
3. If there are enough page numbers to be annoying, use subentries.
If a page range is too huge, readers are forced to go digging among those pages to find more specific content. Similarly, if you create long lists of undifferentiated page numbers, readers have to check each number for their desired information. Here are examples:
Both of these call out for the use of subentries. For long ranges, create subentries you’re your subsections; the result might resemble your table of contents, although in alphabetical order. For long series of page numbers, use subentries to provide at least some differentiation. In both cases, some page number repetition may occur.
coffee, history of, 133–178
agricultural research, 169-171
beans as currency, 140-149
caffeine as medical stimulant, 172–178
international trade, 150–157
smuggling, 157–169
tea, medicinal effects of, 93
caffeinated tea, 134, 149, 355
decaffeinated tea, 149, 206, 211, 217
It is important not to overdo this. Creating too many subentries is just as bad. In the example below, you’re forcing the reader to make all sorts of choices even though the information is clearly within pages 15, 30–33, or 101. The indexing effort required to create these subentries ends up putting the burden back on the reader.
blood, drawing
disposal of needles, 32. See also biological waste
eating before, 30
equipment for, 30–33
fainting behavior, 33
fear of, 30-31
HIV risk, 32
from left arm, 30
phlebotomists, 15, 101
right- vs. left-arm drawing, 30
vampire myths, 15
volunteer donation, 15, 30
Consider how much easier this is to read and understand:
blood, drawing, 15, 30–33
phlebotomists, 15, 101
See also biological waste
ven this variation below, which has placed greater importance on several concepts, is less burdensome.
blood, drawing, 15, 30–33
disposal of needles, 32. See also biological waste
fainting behavior, 33
HIV risk, 32
phlebotomists, 15, 101
4. When page numbers are almost sequential, use ranges anyway.
I know something feels wrong about doing this, but it demonstrates one kind of compromise between accuracy and usability. Trust that your readers know how to turn the page.