<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="article-html.xsl"?>

<article ratings="auto" toc="auto">

   <seriestitle></seriestitle>
   <papertitle>Graphics programming with libtiff, Part 2</papertitle>
   <subtitle>And now for a little color</subtitle>
   <author company="Tower Software Engineering" email="mikal@stillhq.com" jobtitle="Senior Software Engineer" name="Michael Still">Michael has been working in the image processing field for several years, including a couple of years managing and developing large image databases for an Australian government department. He currently works for Tower Software, which manufactures a world leading EDMS and records management package called TRIM. Michael is also the developer of Panda, an open source PDF generation API, as well as the maintainer of the comp.text.pdf USENET frequently asked questions document. You can contact Michael at <a href="mailto:mikal@stillhq.com">mikal@stillhq.com</a>.</author>

   <date month="June" year="2002"></date>
   <zone name="linux"></zone>
   <pdf url="ftp://www6.software.ibm.com/software/developer/library/[xxx.pdf]" size="xxKB"></pdf>
   <zip url="ftp://www6.software.ibm.com/software/developer/library/xxx.zip"></zip>

   <meta name="KEYWORDS" content="Michael Mikal Still tiff libtiff image imaging raster bitmap color colour" />	
   <meta name="LASTUPDATE" content="20020603 ty"/>

   <relatedlist>
   <a href="http://www-106.ibm.com/developerworks/linux/library/l-libtiff/">Graphics programming with libtiff</a>
   </relatedlist>

   <abstract>TIFF is an extremely common but quite complex raster image format. Libtiff is a standard implementation of the TIFF specification, is free, and works on many operating systems. This article provides examples on how to use libtiff for grayscale and color imaging.
</abstract>

<p>This article is a follow up to my <a href="http://www-106.ibm.com/developerworks/linux/library/l-libtiff/">previous article</a> on black and white graphics programming with libtiff. This article covers grayscale and color imaging, but it assumes that you have read and understand the code from the black and white article. It might be helpful to refer to that article before going too far here.</p>

<p>Here I'll discuss some of the theory required to understand how the image data is stored for color and grayscale. This theory applies to all imaging formats. I'll then discuss some of the specifics of using libtiff.</p>

<p><heading refname="h1" type="major" toc="yes">A bit of terminology</heading>
Images are made up of pixels. In black and white imaging, each pixel has one of two values, 0 or 1, which can be represented in a single bit. For grayscale and color images, however, the pixel needs to store a much greater range of values; if a pixel is to have 256 levels of gray, we need 8 bits to store it. Each of these values is called a sample. TIFF expresses the size of a sample in a tag called TIFFTAG_BITSPERSAMPLE. This will be 1 for black and white, and some larger number (commonly 8) for grayscale.</p>

<p>For color images, we need to store even more information. For each pixel we will need to store a red, green, and blue value. Each of these values is stored in a separate "sample." Therefore, we will need to define TIFFTAG_SAMPLESPERPIXEL. This will be 1 for black and white or grayscale, but will normally be 3 for color images. We also need to define the size of each sample, so we'll still need to set a value for TIFFTAG_BITSPERSAMPLE.</p>
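<p>To make the relationship between these two tags concrete, here is a minimal sketch of how they combine to give the size of one uncompressed row of pixels. The function name row_bytes is my own invention for illustration and is not part of libtiff.</p>

<code type="section"><![CDATA[
```c
#include <stddef.h>

/* Hypothetical helper (not a libtiff call): the number of bytes needed to
   hold one uncompressed row of pixels, rounded up to a whole byte. */
size_t row_bytes(size_t width, unsigned bits_per_sample,
                 unsigned samples_per_pixel)
{
  size_t bits = width * bits_per_sample * samples_per_pixel;

  return (bits + 7) / 8;
}
```
]]></code>

<p>For a row that is 42 pixels wide, this gives 6 bytes for black and white (1 bit, 1 sample) and 126 bytes for 24 bit RGB (8 bits, 3 samples). For a file that is already open, libtiff's TIFFScanlineSize() reports the row size for you.</p>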

<p><heading refname="h2" type="major" toc="yes">Theory of color and grayscale storage</heading>
The first thing we need to understand to be able to support color and grayscale images is the format of the image data within memory. There are two main representations for color and grayscale images. I'll explain these by describing grayscale, and then extend it to color.</p>

<p><heading refname="h2a" type="minor" toc="no">Direct storage of pixel data</heading>
If you remember the way pixel information was stored in the black and white images from the previous article, the information was just in the strips. You can also do this with grayscale and color images, but this representation of image data is quite inefficient. For example, in an image with a solid background, many pixels have the same value. If the pixel data is stored in the strips, then repeating that full value for every pixel wastes a large amount of space.
</p>

<p>Thankfully, there is a more efficient way to store image data. Imagine a simple four color, 24 bit per pixel image. If we build a lookup table of the four color values (the 24 bit values which represent those colors), then we just need to store the relevant entry number of the color in the image strip itself. This can be done in only two bits, instead of the full 24.</p>

<p>The math looks something like this: A 24 bit color image that is 1,000 by 1,000 pixels will take 24 million bits to store. The same image, if it was a four color image, would take 2 million bits for the strip data (two bits for each of the million pixels), and 96 bits for the color table (four entries of 24 bits each). Neither of these numbers includes header and footer information for the file format, and the numbers are for uncompressed bitmaps. The advantages of the lookup table should be obvious. There is a name for this style of lookup table: it is called a <i>palette,</i> probably because of those things painters carry around.</p>

<p>This concept works for grayscale images as well. The only difference is that the "colors" in the palette are just shades of gray.</p>
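<p>The palette arithmetic can be sketched in a few lines of C. This is only an illustration of the idealized math for uncompressed data; the function names are hypothetical, and a real TIFF palette is somewhat larger because the format stores 16 bit colormap entries.</p>

<code type="section"><![CDATA[
```c
/* Smallest number of bits that can index `colors` palette entries. */
unsigned index_bits(unsigned long colors)
{
  unsigned bits = 1;

  while ((1UL << bits) < colors)
    bits++;
  return bits;
}

/* Bits needed when every pixel stores its full color value. */
unsigned long direct_bits(unsigned long pixels, unsigned bits_per_pixel)
{
  return pixels * bits_per_pixel;
}

/* Bits needed when every pixel stores a palette entry number, plus the
   bits for the palette itself. */
unsigned long paletted_bits(unsigned long pixels, unsigned long colors,
                            unsigned bits_per_color)
{
  return pixels * index_bits(colors) + colors * bits_per_color;
}
```
]]></code>

<p>For the 1,000 by 1,000 pixel example, direct_bits(1000000, 24) is 24,000,000, while paletted_bits(1000000, 4, 24) is 2,000,096.</p>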

<p><heading refname="h2b" type="minor" toc="no">Compression algorithms in libtiff</heading>
Several compression algorithms are available within libtiff. The table below helps sort them out.</p>

<b>Libtiff compression algorithms</b>
<htmlinsert><![CDATA[

<table width="100%" cellspacing="0" cellpadding="3" border="1">
<tr valign="top">
<td><b>Compression algorithm</b></td>
<td><b>Well suited for</b></td>
<td><b>TIFFTAG</b></td>
</tr>

<tr valign="top">
<td>CCITT Group 4 Fax and Group 3 Fax</td>
<td>This entry is here for completeness. If you're coding for black and white images, then you're probably using the CCITT fax compression methods. These compression algorithms don't support color.</td>
<td>COMPRESSION_CCITTFAX3, COMPRESSION_CCITTFAX4</td>
</tr>

<tr valign="top">
<td>JPEG</td>
<td>JPEG compression is great for large images such as photos. However, the compression is normally lossy (in that image data is thrown away as part of the compression process). This makes JPEG very poor for compressing text which needs to remain readable. The other thing to bear in mind is that the loss is cumulative -- see the sidebar for more information about this.</td>
<td>COMPRESSION_JPEG</td>
</tr>

<tr valign="top">
<td>LZW</td>
<td><i>This is the compression algorithm used in GIF images. Because of the licensing requirements from Unisys, support for this compression codec has been removed from libtiff. There are patches available if you would like to add it back, but the majority of programs your code will integrate with no longer support LZW.</i></td>
<td>COMPRESSION_LZW</td>
</tr>

<tr valign="top">
<td>Deflate</td>
<td>This is the gzip compression algorithm, which is also used for PNG. It is the compression algorithm I would recommend for color images.</td>
<td>COMPRESSION_DEFLATE</td>
</tr>
</table>

]]></htmlinsert>

<sidebar>
<heading refname="loss" type="sidebar" toc="no">Accumulating loss?</heading>
<p>Why does the loss in lossy compression algorithms such as JPEG accumulate? Imagine that you compress an image using JPEG. You then need to add, say, a barcode to the image, so you uncompress the image, add the barcode, and recompress it. When the recompression occurs, a new set of loss is introduced. You can imagine that if you do this enough, then you'll end up with an image which is a big blob.</p>

<p>Whether this is a problem depends on the type of your data. To test how much of a problem this is, I wrote a simple libtiff program that repeatedly uncompresses and recompresses an image. What I found was that pictures are much more resilient to repeated compression than text.</p>

<figure>
<heading refname="picture-start" type="figure" toc="no">Figure 1. The picture before compression</heading>
<img src="picture-start.jpg" width="160" height="47" alt="IBM logo"/>
</figure>

<figure>
<heading refname="text-start" type="figure" toc="no">Figure 2. The sample text before compression</heading>
<img src="text-start.jpg" width="292" height="258" alt="Some sample text"/>
</figure>

<p>The code I used had a "quality" rating of 25% on the JPEG compression, which is a way of tweaking the loss of the compression algorithm. The lower the quality, the higher the compression ratio. The default is 75%.</p>

<figure>
<heading refname="picture-200" type="figure" toc="no">Figure 3. The picture after it has been recompressed 200 times</heading>
<img src="picture-200.jpg" width="160" height="47" alt="The IBM logo, having been recompressed 200 times"/>
</figure>

<figure>
<heading refname="text-200" type="figure" toc="no">Figure 4. The text after it has been recompressed 200 times</heading>
<img src="text-200.jpg" width="292" height="258" alt="A picture of some text, having been compressed 200 times"/>
</figure>
</sidebar>

<p><heading refname="h3" type="major" toc="yes">Writing a color image</heading>
It's time to show how to write a color image to disk. Remember that this is a simple example and can be elaborated on greatly.
</p>

<code type="section">
<heading refname="write-example-simple" type="code" toc="no">Writing a color image</heading>
#include &lt;tiffio.h&gt;
#include &lt;stdio.h&gt;
#include &lt;stdlib.h&gt;

int main(int argc, char *argv[]){
  TIFF *output;
  uint32 width, height;
  char *raster;

  // Open the output image
  if((output = TIFFOpen("output.tif", "w")) == NULL){
    fprintf(stderr, "Could not open outgoing image\n");
    exit(42);
  }

  // We need to know the width and the height before we can malloc
  width = 42;
  height = 42;

  if((raster = (char *) malloc(sizeof(char) * width * height * 3)) == NULL){
    fprintf(stderr, "Could not allocate enough memory\n");
    exit(42);
  }

  // Magical stuff for creating the image
  // ...

  // Write the tiff tags to the file
  TIFFSetField(output, TIFFTAG_IMAGEWIDTH, width);
  TIFFSetField(output, TIFFTAG_IMAGELENGTH, height);
  TIFFSetField(output, TIFFTAG_COMPRESSION, COMPRESSION_DEFLATE);
  TIFFSetField(output, TIFFTAG_PLANARCONFIG, PLANARCONFIG_CONTIG);
  <b>TIFFSetField(output, TIFFTAG_PHOTOMETRIC, PHOTOMETRIC_RGB);</b>
  TIFFSetField(output, TIFFTAG_BITSPERSAMPLE, 8);
  TIFFSetField(output, TIFFTAG_SAMPLESPERPIXEL, 3);

  // Actually write the image
  // TIFFWriteEncodedStrip() returns -1 when it fails
  if(TIFFWriteEncodedStrip(output, 0, raster, width * height * 3) == -1){
    fprintf(stderr, "Could not write image\n");
    exit(42);
  }

  free(raster);
  TIFFClose(output);
  return 0;
}
</code>

<p>You can see from this code some of the things we've discussed in theory. The image has three samples per pixel, each of eight bits, which makes it a 24 bit RGB image. If this were a black and white or grayscale image, the samples per pixel value would be one. Setting TIFFTAG_PHOTOMETRIC to PHOTOMETRIC_RGB says that the image data is stored within the strips themselves (as opposed to being paletted) -- more about this in a minute.</p>

<sidebar>
<heading refname="values" type="sidebar" toc="no">Other values for samples per pixel?</heading>
<p>In my example, I have three samples per pixel. If this was a black and white image, or a grayscale image, then we would have one sample per pixel. There are other valid values as well; for instance, sometimes people will store a transparency value for a given pixel, an <i>alpha channel.</i> This would result in having four samples per pixel. It is possible to have an arbitrary number of samples per pixel, which is good if you need to pack in extra information about a pixel. <i>Note that doing this can break image viewers that make silly assumptions -- I once had to write code for a former employer to strip out alpha channels and the like so that their PDF generator wouldn't crash.</i></p>
</sidebar>

<p>The other interesting thing to discuss here is the planar configuration of the image. Here I've specified PLANARCONFIG_CONTIG, which means that the red, green, and blue information for a given pixel is grouped together in the strips of image data. The other option is PLANARCONFIG_SEPARATE, where the red samples for the image are stored together, then the green samples, and finally the blue samples.</p>
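<p>The difference between the two planar configurations is easiest to see as a reordering of samples in memory. The following is a minimal sketch only; contig_to_separate is a hypothetical helper of my own, not a libtiff function, and it ignores how the planes are then divided into strips in the file.</p>

<code type="section"><![CDATA[
```c
#include <stddef.h>

/* Hypothetical helper: reorder interleaved samples (RGBRGB...) into one
   plane per sample (RRR... GGG... BBB...). Both buffers must hold
   pixels * samples_per_pixel bytes, one byte per sample. */
void contig_to_separate(const unsigned char *contig, unsigned char *planes,
                        size_t pixels, size_t samples_per_pixel)
{
  size_t p, s;

  for (p = 0; p < pixels; p++)
    for (s = 0; s < samples_per_pixel; s++)
      planes[s * pixels + p] = contig[p * samples_per_pixel + s];
}
```
]]></code>

<p>With two RGB pixels stored as 1, 2, 3, 4, 5, 6, the separate layout comes out as 1, 4, 2, 5, 3, 6: both red samples, then both green samples, then both blue samples.</p>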

<p><heading refname="h4" type="major" toc="yes">Writing a paletted color image</heading>
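So how do we write a paletted version of this image? The starting point is to change the value of TIFFTAG_PHOTOMETRIC to PHOTOMETRIC_PALETTE. A paletted image also stores one sample per pixel (the palette entry number) rather than three, so TIFFTAG_SAMPLESPERPIXEL becomes 1, and the palette itself has to be supplied in the TIFFTAG_COLORMAP tag. The rest of the writing code is unchanged, so I haven't included a full example here.</p>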
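<p>A paletted TIFF carries its palette in the TIFFTAG_COLORMAP tag as three arrays of 16 bit values (red, green, and blue), each with one entry for every possible pixel value. As a rough sketch, and assuming you start from an ordinary 8 bit RGB palette, the arrays could be prepared like this. The helper name build_colormap and its parameters are hypothetical.</p>

<code type="section"><![CDATA[
```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: expand an 8 bit palette, stored as R, G, B triples,
   into three 16 bit arrays. map_entries is the length of each output array
   (2 to the power of the bits per sample); unused entries are set to 0. */
void build_colormap(const unsigned char *palette, size_t colors,
                    uint16_t *red, uint16_t *green, uint16_t *blue,
                    size_t map_entries)
{
  size_t i;

  for (i = 0; i < map_entries; i++) {
    if (i < colors) {
      /* Multiplying by 257 maps 0..255 onto 0..65535 exactly */
      red[i]   = (uint16_t) (palette[i * 3] * 257);
      green[i] = (uint16_t) (palette[i * 3 + 1] * 257);
      blue[i]  = (uint16_t) (palette[i * 3 + 2] * 257);
    } else {
      red[i] = green[i] = blue[i] = 0;
    }
  }
}

/* With libtiff, the arrays would then be handed over with something like:
     TIFFSetField(output, TIFFTAG_COLORMAP, red, green, blue);           */
```
]]></code>

<p>The raster written to the strips then holds palette entry numbers, one per pixel, instead of red, green, and blue samples.</p>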

<p><heading refname="h5" type="major" toc="yes">Reading a color image</heading>
Now all we have to do is work out how to read other people's color and grayscale images reliably, and we're home free. Initially, I was very tempted not to tell you about the TIFFReadRGBAStrip() and TIFFReadRGBATile() calls, which hide some of the potential ugliness from the caller. These functions have some limitations I'm not very happy with. To quote the TIFFReadRGBAStrip() man page:</p>

<code type="section">
   TIFFReadRGBAStrip reads a single strip of a strip-based image into memory,
   storing  the  result  in  the  user supplied RGBA raster. The raster is
   assumed to be an array of width times   rowsperstrip   32-bit   entries,
   where   width   is  the  width  of  the  image (TIFFTAG_IMAGEWIDTH) and
   rowsperstrip is the maximum lines in a strip (TIFFTAG_ROWSPERSTRIP).

   The  strip  value  should  be  the  strip  number  (strip  zero  is the
   first) as returned by the TIFFComputeStrip function, but always for sample 0.

   Note  that  the  raster  is  assume  to  be  organized  such  that the pixel
   at location (x,y) is raster[y*width+x];  with  the  raster  origin in the
   lower-left hand corner of the strip. That is bottom  to  top  organization.
   When reading a partial last strip in the file the last line of the image
   will begin at the beginning of the buffer.

   Raster  pixels  are  8-bit packed red, green, blue, alpha samples. The
   macros TIFFGetR, TIFFGetG, TIFFGetB,  and  TIFFGetA  should  be used to
   access individual samples. Images without Associated Alpha matting
   information have a constant Alpha of 1.0 (255).

   See  the TIFFRGBAImage(3T) page for more details on how various image types
   are converted to RGBA values.

<b>NOTES</b>

   Samples must be either 1, 2, 4, 8, or 16 bits. Colorimetric samples/pixel
   must be either 1, 3, or 4 (i.e. SamplesPerPixel minus ExtraSamples).

   Palette  image  colormaps that appear to be incorrectly written as 8-bit
   values are automatically scaled to 16-bits.

   TIFFReadRGBAStrip  is  just  a wrapper around the more general
   TIFFRGBAImage(3T) facilities. It's main  advantage  over  the similar
   TIFFReadRGBAImage() function is that for large images a single buffer
   capable  of  holding  the  whole  image doesn't need to be allocated, only
   enough for one strip. The TIFFReadRGBATile() function does a similar
   operation for tiled images.
</code>

<p>There are a couple of odd things about this function. First, it defines (0, 0) to be in a different location than all the other code that we have been writing. In the previous code, the (0, 0) point has been in the top left of the image. This call defines (0, 0) to be in the bottom left. The other limitation to be aware of is that not all valid values for bits per sample are supported. If you find these quirks unacceptable, then remember that you can still use TIFFReadEncodedStrip() in the same manner that I did for the black and white images in the previous article.</p>

<code type="section">
<heading refname="read-example-simple" type="code" toc="no">Reading a color image with TIFFReadRGBAStrip()</heading>
#include &lt;stdio.h&gt;
#include &lt;stdlib.h&gt;
#include &lt;tiffio.h&gt;

int main(int argc, char *argv[]){
  TIFF *image;
  uint32 width, height, *raster;
  tsize_t stripSize;
  unsigned long imagesize, c, d, e;

  // Open the TIFF image
  if((image = TIFFOpen(argv[1], "r")) == NULL){
    fprintf(stderr, "Could not open incoming image\n");
    exit(42);
  }

  // Find the width and height of the image
  TIFFGetField(image, TIFFTAG_IMAGEWIDTH, &amp;width);
  TIFFGetField(image, TIFFTAG_IMAGELENGTH, &amp;height);
  imagesize = height * width + 1;
  
  if((raster = (uint32 *) malloc(sizeof(uint32) * imagesize)) == NULL){
    fprintf(stderr, "Could not allocate enough memory\n");
    exit(42);
  }

  // Read the first strip into the memory buffer. This only covers the whole
  // image when the file stores all of its rows in a single strip
  if(TIFFReadRGBAStrip(image, 0, raster) == 0){
    fprintf(stderr, "Could not read image\n");
    exit(42);
  }

<b>// Here I fix the reversal of the image (vertically) and show you how to get the color values from each pixel</b>
  // The raster is stored bottom to top, so walk from the last raster row to the first
  for(e = height; e &gt; 0; e--){
    for(c = 0; c &lt; width; c++){
      // Red = TIFFGetR(raster[(e - 1) * width + c]);
      // Green = TIFFGetG(raster[(e - 1) * width + c]);
      // Blue = TIFFGetB(raster[(e - 1) * width + c]);
    }
  }

  free(raster);
  TIFFClose(image);
}
</code>
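<p>If you would rather have the raster itself in top to bottom order than adjust every loop that walks it, the rows can be swapped in place. This is a minimal sketch that assumes the raster holds width times height 32 bit pixels; flip_rows is a hypothetical helper, not part of libtiff.</p>

<code type="section"><![CDATA[
```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: reverse the order of the rows in a raster of
   32 bit pixels, turning a bottom to top image into a top to bottom one. */
void flip_rows(uint32_t *raster, size_t width, size_t height)
{
  size_t top, bottom, x;

  if (height < 2)
    return;   /* Nothing to swap */

  for (top = 0, bottom = height - 1; top < bottom; top++, bottom--) {
    for (x = 0; x < width; x++) {
      uint32_t tmp = raster[top * width + x];

      raster[top * width + x] = raster[bottom * width + x];
      raster[bottom * width + x] = tmp;
    }
  }
}
```
]]></code>

<p>After the flip, the pixel at (x, y) with (0, 0) in the top left is simply raster[y * width + x], which matches the convention used in the rest of the code.</p>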

<p><heading refname="h6" type="major" toc="yes">Advanced topics</heading>
Well, now that we understand how to read and write basically any image format we can think of, there are two other things we should quickly discuss.</p>

<p><heading refname="h6a" type="minor" toc="no">Storing TIFF data in places other than files</heading>
All the examples I've included to this point have read and written with files. There are many scenarios in which you wouldn't want to store your image data in a file, but would still want to use libtiff and TIFF. For example, you might have customer pictures for ID cards, and these would be stored in a database.</p>

<p>The example I am most familiar with is PDF documents. In PDF files, you can embed images into the document. These images can be in a subset of TIFF if desired, and TIFF is quite clearly the choice for black and white images.</p>

<sidebar>
<heading refname="expanded" type="sidebar" toc="no">An expanded example</heading>
<p>If you need more information about hooking the file input and output functions within libtiff, have a look at the images.c file in Panda, my PDF library. The Web pages for Panda can be found in <a href="#resources">Resources</a>.</p>
</sidebar>

<p>Libtiff allows you to replace the file input and output functions in the library with your own. This is done with the TIFFClientOpen() function. Here's an example (please note that this code won't compile, and is shown only to describe the main concepts):</p>

<code type="section">
<heading refname="re" type="code" toc="no">Using TIFFClientOpen</heading>
#include &lt;stdio.h&gt;
#include &lt;stdlib.h&gt;
#include &lt;string.h&gt;
#include &lt;tiffio.h&gt;
#include &lt;pthread.h&gt;

// Function prototypes
static tsize_t libtiffDummyReadProc (thandle_t fd, tdata_t buf, tsize_t size);
static tsize_t libtiffDummyWriteProc (thandle_t fd, tdata_t buf, tsize_t size);
static toff_t libtiffDummySeekProc (thandle_t fd, toff_t off, int i);
static int libtiffDummyCloseProc (thandle_t fd);

// We need globals because of the callbacks (they don't allow us to pass state)
char *globalImageBuffer;
unsigned long globalImageBufferOffset;

// This mutex keeps the globals safe by ensuring only one user at a time
pthread_mutex_t convMutex = PTHREAD_MUTEX_INITIALIZER;

<b>...</b>

TIFF *conv;

// Lock the mutex
pthread_mutex_lock (&amp;convMutex);

globalImageBuffer = NULL;
globalImageBufferOffset = 0;

// Open the dummy document (which actually only exists in memory)
conv = TIFFClientOpen ("dummy", "w", (thandle_t) - 1, libtiffDummyReadProc,
            libtiffDummyWriteProc, libtiffDummySeekProc,
            libtiffDummyCloseProc, NULL, NULL, NULL);

// Setup the image as if it was any other tiff image here, including setting tags

<b>...</b>

// Actually write the image data (this is what triggers the write callback)
TIFFWriteEncodedStrip (conv, 0, stripBuffer, imageOffset);

// Unlock the mutex
pthread_mutex_unlock (&amp;convMutex);

...

/////////////////// Callbacks to libtiff

<b>...</b>

static tsize_t
libtiffDummyReadProc (thandle_t fd, tdata_t buf, tsize_t size)
{
  // Return the amount of data read, which we will always set as 0 because
  // we only need to be able to write to these in-memory tiffs
  return 0;
}

static tsize_t
libtiffDummyWriteProc (thandle_t fd, tdata_t buf, tsize_t size)
{
  // libtiff will try to write an 8 byte header into the tiff file. We need
  // to ignore this because PDF does not use it...
  if ((size == 8) &amp;&amp; (((char *) buf)[0] == 'I') &amp;&amp; (((char *) buf)[1] == 'I')
     &amp;&amp; (((char *) buf)[2] == 42))
    {
    // Skip the header -- little endian
    }
  // A big endian header is 'M', 'M', 0, 42, so the 42 is in the fourth byte
  else if ((size == 8) &amp;&amp; (((char *) buf)[0] == 'M') &amp;&amp;
       (((char *) buf)[1] == 'M') &amp;&amp; (((char *) buf)[3] == 42))
    {
    // Skip the header -- big endian
    }
  else
    {
    // Have we done anything yet?
    if (globalImageBuffer == NULL)
    {
      if ((globalImageBuffer = (char *) malloc (size * sizeof (char))) == NULL)
        {
          fprintf(stderr, "Memory allocation error\n");
          exit(42);
        }
    }

    // Otherwise, we need to grow the memory buffer
    else
    {
      if ((globalImageBuffer = (char *) realloc (globalImageBuffer,
                             (size * sizeof (char)) +
                             globalImageBufferOffset)) == NULL)
        {
          fprintf(stderr, "Could not grow the tiff conversion memory buffer\n");
          exit(42);
        }
    }

    // Now move the image data into the buffer
    memcpy (globalImageBuffer + globalImageBufferOffset, buf, size);
    globalImageBufferOffset += size;
    }

  return (size);
}

static toff_t
libtiffDummySeekProc (thandle_t fd, toff_t off, int i)
{
  // This appears to return the location that it went to
  return off;
}

static int
libtiffDummyCloseProc (thandle_t fd)
{
  // Return a zero meaning all is well
  return 0;
}
</code>
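<p>The heart of the write callback above is a memory buffer that grows as data arrives. Here is that idea on its own as a small sketch that compiles without libtiff. The MemBuffer type and membuffer_append function are hypothetical names of my own. Each libtiff callback also receives the thandle_t value that was given to TIFFClientOpen() as its first argument, so a pointer to a structure like this could probably be passed that way instead of using globals; treat that as an assumption to check against your libtiff version.</p>

<code type="section"><![CDATA[
```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical growable buffer for an in-memory image */
typedef struct {
  char *data;    /* NULL until the first append */
  size_t used;   /* Number of bytes stored so far */
} MemBuffer;

/* Append size bytes to the buffer. Returns 0 on success, or -1 if memory
   could not be allocated (the existing contents are left untouched). */
int membuffer_append(MemBuffer *mb, const void *buf, size_t size)
{
  char *grown;

  if (size == 0)
    return 0;

  /* realloc() on a NULL pointer behaves like malloc(), so the first
     write needs no special case */
  grown = realloc(mb->data, mb->used + size);
  if (grown == NULL)
    return -1;

  memcpy(grown + mb->used, buf, size);
  mb->data = grown;
  mb->used += size;
  return 0;
}
```
]]></code>

<p>Keeping the result of realloc() in a temporary pointer means that the original buffer is not lost if the allocation fails, and returning an error code leaves the decision about what to do next to the caller.</p>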

<p><heading refname="h6b" type="minor" toc="no">Converting color to grayscale</heading>
How do you convert color images to grayscale? When I first had to do this, my first answer was to just average the red, green, and blue values. That answer is wrong. The reality is that the human eye is much better at seeing some colors than others. To get an accurate grayscale representation, you need to apply different coefficients to the color samples. Appropriate coefficients are 0.299 for red, 0.587 for green and 0.114 for blue.</p>
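<p>As a short sketch, the weighted conversion for one 8 bit pixel might look like the following. The function name rgb_to_gray is hypothetical, and the coefficients are scaled by 1,000 so that the arithmetic can be done with integers.</p>

<code type="section"><![CDATA[
```c
/* Hypothetical helper: convert one 8 bit RGB pixel to an 8 bit gray level
   using the weights 0.299, 0.587, and 0.114, rounded to the nearest value. */
unsigned char rgb_to_gray(unsigned char red, unsigned char green,
                          unsigned char blue)
{
  unsigned long weighted = 299UL * red + 587UL * green + 114UL * blue;

  return (unsigned char) ((weighted + 500) / 1000);
}
```
]]></code>

<p>Pure red, green, and blue come out as gray levels 76, 150, and 29 respectively, whereas a plain average would give 85 for all three.</p>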

<p><heading refname="h7" type="major" toc="yes">Conclusion</heading>
In this article I've discussed how to program with libtiff for grayscale and color images. I've shown you some sample code which should help to get you started. You should now know enough to have a great time coding with libtiff.</p>

<resourcelist>
<ul>
<li>Download the source files for performing the tasks mentioned in this article:
   <ul>
   <li><a href=".../read.c">read.c</a>: a reading example</li>
   <li><a href=".../write.c">write.c</a>: a writing example</li>
   <li><a href=".../client.c">client.c</a>: a client example</li>
   <li><a href=".../recompress.c">recompress.c</a>: repeated compression source</li>
   <li><a href=".../recompress.sh">recompress.sh</a>: repeated compression script</li>
   </ul>
</li>

<li>For more information about hooking the file input and output functions within libtiff, take a look at the images.c file on Michael's <a href="http://www.stillhq.com/cgi-bin/getpage?area=panda&amp;page=index.htm">Panda page</a>.</li>

<li>Check out <a href="http://www.inforamp.net/~poynton/notes/colour_and_gamma/ColorFAQ.html">Poynton's Color FAQ</a> for a discussion of converting to grayscale.</li>

<li>The <a href="http://www.libtiff.org/">libtiff</a> Web site is a good place to download the libtiff source and perhaps find a binary package for your operating system of choice.</li>
</ul>
</resourcelist>

</article>
