1 files changed, 1169 insertions, 0 deletions
diff --git a/docbook/wsdg_src/WSDG_chapter_dissection.xml b/docbook/wsdg_src/WSDG_chapter_dissection.xml
new file mode 100644
index 0000000000..d22f04863a
--- /dev/null
+++ b/docbook/wsdg_src/WSDG_chapter_dissection.xml
@@ -0,0 +1,1169 @@
+<!-- WSDG Chapter Dissection -->
+<!-- $Id$ -->
+
+<chapter id="ChapterDissection">
+  <title>Packet dissection</title>
+  <!-- Julian Onions additions -->
+  <section id="ChDissectWorks">
+	<title>How it works</title>
+	<para>
+	Each dissector decodes it's part of the protocol, and then hand off
+	decoding to subsequent dissectors for an encapsulated protocol.
+	</para>
+	<para>
+	So it might all start with a Frame dissector which dissects the packet details 
+	of the capture file itself (e.g. timestamps), passes the data on to an 
+	Ethernet frame dissector that decodes the Ethernet header, 
+	and then passes the payload to the next dissector (e.g. IP) and so on. 
+	At each stage, details of the packet will be decoded and displayed.
+	</para>
+	<para>
+	Dissection can be implemented in two possible ways. One is to have a dissector
+	module compiled into the main program, which means its always available.
+	Another way is to make a plugin (a shared library/DLL) that registers itself
+	to handle dissection. - XXX add a special explanation section for this?
+	</para>
+  </section>
+
+  <section id="ChDissectAdd">
+	<title>Adding a basic dissector</title>
+	<para>
+	Lets step through adding a basic dissector. We'll start with the made up
+	"foo" protocol. It consists of the following basic items.
+	</para>
+	<itemizedlist>
+	<listitem><para>
+	A packet type - 8 bits, possible values: 1 - initialisation, 2 - terminate, 3 - data.
+	</para></listitem>
+	<listitem><para>
+	A set of flags stored in 8 bits, 0x01 - start packet, 0x02 - end packet, 0x04 - priority packet.
+	</para></listitem>
+	<listitem><para>
+	A sequence number - 16 bits.
+	</para></listitem>
+	<listitem><para>
+	An IP address.
+	</para></listitem>
+	</itemizedlist>
+	<section id="ChDissectSetup">
+	<title>Setting up the dissector</title>
+	   <para>
+	   	The first decision you need to make is if this dissector will be a 
+	   	built in dissector, included in the main program, or a plugin.
+	   </para>
+	   <para>
+	   	Plugins are the easiest to write initially as they don't need
+	   	write permission on the main code base. So lets start with that.
+	   	With a little care, the plugin can be made to run as a built in
+	   	easily too - so we haven't lost anything.
+	   </para>
+	   <example><title>Basic Plugin setup.</title>
+	   <programlisting>
+	   <![CDATA[
+#ifdef HAVE_CONFIG_H
+# include "config.h"
+#endif
+
+#include <gmodule.h>
+#include <epan/packet.h>
+#include <epan/prefs.h>
+
+/* forward reference */
+void proto_register_foo();
+void proto_reg_handoff_foo();
+void dissect_foo(tvbuff_t *tvb, packet_info *pinfo, proto_tree *tree);
+
+/* Define version if we are not building Wireshark statically */
+#ifndef ENABLE_STATIC
+G_MODULE_EXPORT const gchar version[] = "0.0";
+#endif
+
+static int proto_foo = -1;
+static int global_foo_port = 1234;
+static dissector_handle_t foo_handle;
+
+#ifndef ENABLE_STATIC
+G_MODULE_EXPORT void
+plugin_register(void)
+{
+	/* register the new protocol, protocol fields, and subtrees */
+	if (proto_foo == -1) { /* execute protocol initialization only once */
+		proto_register_foo();
+	}
+}
+
+G_MODULE_EXPORT void
+plugin_reg_handoff(void){
+	proto_reg_handoff_foo();
+}
+#endif]]>
+	   </programlisting>
+	   </example>
+	<para>
+	Lets go through this a bit at a time. First we have some boiler plate
+	include files. These will be pretty constant to start with. Here we also
+	pre-declare some functions that we'll be writing shortly.
+	</para>
+	<para>
+	Next we have a section surrounded by #ifdef ENABLE_STATIC. This is what
+	makes this a plugin rather than a built in dissector.
+	</para>
+	<para>
+	The version is a simple string that is used to report on the version of this
+	dissector. You should increase this number each time you make changes that you
+	need to keep track of.
+	</para>	
+	<para>
+	Next we have an int that is initialised to -1 that records our protocol.
+	This will get updated when we register this plugin with the main program.
+	We can use this as a handy way to detect if we've been initialised yet.
+	Its good practice to make all variables and functions that aren't exported
+	static to keep name space pollution. Normally this isn't a problem unless your
+	dissector gets so big it has to span multiple files.
+	</para>
+	<para>
+	Then a global variable which contains the UDP port that we'll assume we are dissecting traffic for.
+	</para>
+	<para>
+	Next a dissector reference that we'll initialise later.
+	</para>
+	<para>
+	Next, the first plugin entry point. The function plugin_register() is called
+	when the plugin is loaded and allows you to do some initialisation stuff,
+	which will include communicating with the main program what you're plugins
+	capabilities are.
+	</para>
+	<para>
+	The plugin_reg_handoff routine is used when dissecting sub protocols. As our
+	hypothetical protocol will be hypothetically carried over UDP then we will
+	need to do this.
+	</para>
+	<para>
+	Now we have the basics in place to interact with the main program, we had
+	better fill in those missing functions. Lets start with register function.
+	</para>
+	   <example><title>Plugin Initialisation.</title>
+   <programlisting>
+   <![CDATA[
+void
+proto_register_foo(void)
+{
+	module_t *foo_module;
+
+	if (proto_foo == -1) {
+		proto_foo = proto_register_protocol (
+			"FOO Protocol",		/* name */
+			"FOO",		/* short name */
+			"foo"		/* abbrev */
+			);
+	}
+	foo_module	= prefs_register_protocol(proto_foo, proto_reg_handoff_foo);
+}]]>	
+   </programlisting></example>
+	<para>
+	First a call to proto_register_protocol that
+	registers the protocol. We can give it three names that
+	will be used in various places to display it.
+	- XXX explain where, this can be confusing
+	</para>
+	<para>
+	Then we call the preference register function. At the moment we have
+	no specific protocol preferences so this will be all that we need.
+	This takes a function parameter which is our handoff function.
+	I guess we'd better write that next.
+	</para>
+	   <example><title>Plugin Handoff.</title>
+   <programlisting>
+   <![CDATA[
+void
+proto_reg_handoff_foo(void)
+{
+	static int Initialized=FALSE;
+
+	if (!Initialized) {
+		foo_handle = create_dissector_handle(dissect_foo, proto_foo);
+		dissector_add("udp.port", global_foo_port, foo_handle);
+	}
+}]]>	
+   </programlisting></example>
+	<para>
+	What's happening here? We are initialising the dissector if it hasn't
+	been initialised yet. 
+	First we create the dissector. This registers a routine 
+	to be called to do the actual dissecting.
+	Then we associate it with a udp port number
+	so that the main program will know to call us when it gets UDP traffic on that port.
+	</para>
+	<para>
+	Now at last we finally get to write some dissecting code. For the moment we'll
+	leave it as a basic placeholder.
+	</para>
+	   <example><title>Plugin Dissection.</title>
+   <programlisting>
+   <![CDATA[
+static void
+dissect_foo(tvbuff_t *tvb, packet_info *pinfo, proto_tree *tree)
+{
+
+	if (check_col(pinfo->cinfo, COL_PROTOCOL))
+		col_set_str(pinfo->cinfo, COL_PROTOCOL, "FOO");
+	/* Clear out stuff in the info column */
+	if(check_col(pinfo->cinfo,COL_INFO)){
+		col_clear(pinfo->cinfo,COL_INFO);
+	}
+}]]>	
+   </programlisting></example>
+	<para>
+	This function is called to dissect the packets presented to it.
+	The packet data is held in a special buffer referenced here as tvb.
+	We shall become fairly familiar with this as we get deeper into the details
+	of the protocol.
+	The packet info structure contains general data about the protocol, and we
+	can update information here.
+	The tree parameter is where the detail dissection takes place.
+	</para>
+	<para>
+	For now we'll do the minimum we can get away with. 
+	The first two lines check to see if the Protocol column is being displayed in
+	the UI. If it is, we set the text of this to our protocol, so everyone
+	can see its been recognised.
+	The only other thing we do is to clear out any data in the INFO column
+	if its being displayed. 
+	</para>
+	<para>
+	At this point we should have a basic dissector ready to compile and install.
+	It doesn't do much at present, than identify the protocol and label it.
+	Compile the dissector to a dll or shared library, and copy it into the plugin
+	directory of the installation. To finish this off a Makefile of some sort will be 
+	required. A Makefile.nmake for Windows platforms and a Makefile.am for unix/linux
+	types.
+	</para>
+
+	   <example><title>Makefile.nmake for Windows.</title>
+   <programlisting>
+   <![CDATA[
+include ..\..\config.nmake
+
+############### no need to modify below this line #########
+
+CFLAGS=/DHAVE_CONFIG_H /I../.. /I../../wiretap $(GLIB_CFLAGS) \
+	/I$(PCAP_DIR)\include -D_U_="" $(LOCAL_CFLAGS)
+
+LDFLAGS = /NOLOGO /INCREMENTAL:no /MACHINE:I386 $(LOCAL_LDFLAGS)
+
+!IFDEF ENABLE_LIBWIRESHARK
+LINK_PLUGIN_WITH=..\..\epan\libwireshark.lib
+CFLAGS=/DHAVE_WIN32_LIBWIRESHARK_LIB /D_NEED_VAR_IMPORT_ $(CFLAGS)
+
+OBJECTS=foo.obj 
+
+foo.dll foo.exp foo.lib : $(OBJECTS) $(LINK_PLUGIN_WITH)
+	link -dll /out:foo.dll $(LDFLAGS) $(OBJECTS) $(LINK_PLUGIN_WITH) \
+	$(GLIB_LIBS)
+
+!ENDIF
+
+clean:
+	rm -f $(OBJECTS) foo.dll foo.exp foo.lib *.pdb
+
+distclean: clean
+
+maintainer-clean: distclean]]>	
+   </programlisting></example>
+   <example><title>Makefile.am for unix/linux.</title>
+   <programlisting>
+   <![CDATA[
+INCLUDES = -I$(top_srcdir)
+
+plugindir = @plugindir@
+
+plugin_LTLIBRARIES = foo.la
+foo_la_SOURCES = foo.c moduleinfo.h
+foo_la_LDFLAGS = -module -avoid-version
+foo_la_LIBADD = @PLUGIN_LIBS@
+
+# Libs must be cleared, or else libtool won't create a shared module.
+# If your module needs to be linked against any particular libraries,
+# add them here.
+LIBS =
+
+CLEANFILES = \
+	foo \
+	*~
+
+EXTRA_DIST = \
+	Makefile.nmake
+]]>	
+   </programlisting></example>
+   </section>
+
+	<section id="ChDissectDetails">
+	<title>Dissecting the details of the protocol</title>
+	   <para>
+	   Now we have our basic dissector up and running, lets do something with it.
+	   The simplest thing to do to start with is to just label the payload.
+	   This will allow us to set up some of the parts we will need.
+	   </para>
+	   <para>
+	   The first thing we will do is to build a subtree to decode our results into.
+	   This helps to keep things looking nice in the detailed display.
+	   Now the dissector is called in two different cases. In one case
+	   it is called to get a summary of the packet, in the other case it is
+	   called to look into details of the packet. These two cases can be
+	   distinguished by the tree pointer. If the tree pointer is NULL, then
+	   we are being asked for a summary. If it is non null, we can pick apart
+	   the protocol for display. So with that in mind, lets enhance our dissector.
+	   </para>
+	   <example><title>Plugin Packet Dissection.</title>
+   <programlisting>
+   <![CDATA[
+static void
+dissect_foo(tvbuff_t *tvb, packet_info *pinfo, proto_tree *tree)
+{
+
+	if (check_col(pinfo->cinfo, COL_PROTOCOL))
+		col_set_str(pinfo->cinfo, COL_PROTOCOL, "FOO");
+	/* Clear out stuff in the info column */
+	if(check_col(pinfo->cinfo,COL_INFO)){
+		col_clear(pinfo->cinfo,COL_INFO);
+	}
+	
+	if (tree) { /* we are being asked for details */
+		proto_item *ti = NULL;
+		ti = proto_tree_add_item(tree, proto_foo, tvb, 0, -1, FALSE);
+	}
+}]]>	
+   </programlisting></example>
+	<para>
+	What we're doing here is adding a subtree to the dissection.
+	This subtree will hold all the details of this protocol and so not clutter
+	up the display when not required.
+	</para>
+	<para>
+	We are also marking the area of data that is being consumed by this
+	protocol. In our cases its all that has been passed to us, as we're assuming
+	this protocol does not encapsulate another.
+	Therefore, we add the new tree node with proto_tree_add_item, adding
+	it to the passed in tree, label it with the protocol, use the passed in
+	tvb buffer as the data, and consume from 0 to the end (-1) of this data.
+	The FALSE we'll ignore for now.
+	</para>
+	<para>
+	After this change, there should be a label in the detailed display for the protocol,
+	and selecting this will highlight the remaining contents of the packet.
+	</para>
+	<para>
+	Now lets go to the next step and add some protocol dissection.
+	For this step we'll need to construct a couple of tables that help with dissection.
+	This needs some changes to proto_register_foo. First a couple of statically
+	declare arrays.
+	</para>
+	   <example><title>Plugin Registering data structures.</title>
+   <programlisting>
+   <![CDATA[
+static hf_register_info hf[] = {
+	{ &hf_foo_pdu_type,
+	{ "FOO PDU Type",           "foo.type",
+	FT_UINT8, BASE_DEC, NULL, 0x0,
+	"", HFILL }
+	},
+
+};
+
+
+/* Setup protocol subtree array */
+static gint *ett[] = {
+	&ett_foo,
+};
+]]>	
+   </programlisting></example>
+   	<para>
+   	Then, after the registration code, we register these arrays.
+   	</para>
+	   <example><title>Plugin Registering data structures.</title>
+   <programlisting>
+   <![CDATA[
+
+	proto_register_field_array(proto_foo, hf, array_length(hf));
+	proto_register_subtree_array(ett, array_length(ett));
+]]>	
+   </programlisting></example>
+   	<para>
+   	The variables hf_foo_pdu_type and ett_foo also need to be declared
+   	somewhere near the top of the file.
+   	</para>
+	   <example><title>Plugin data structure globals.</title>
+   <programlisting>
+   <![CDATA[
+static int hf_foo_pdu_type = -1;
+
+static gint ett_foo = -1;
+]]>	
+   </programlisting></example>
+	<para>
+	Now we can enhance the protocol display with some detail.
+	</para>
+	   <example><title>Plugin starting to dissect the packets.</title>
+   <programlisting>
+   <![CDATA[
+	if (tree) { /* we are being asked for details */
+		proto_item *ti = NULL;
+		proto_tree *foo_tree = NULL;
+
+		ti = proto_tree_add_item(tree, proto_foo, tvb, 0, -1, FALSE);
+		foo_tree = proto_item_add_subtree(ti, ett_foo);
+		proto_tree_add_item(foo_tree, hf_foo_pdu_type, tvb, 0, 1, FALSE);
+	}
+]]>	
+   </programlisting></example>
+	<para>
+	Now the dissection is starting to look more interesting. We have picked apart
+	our first bit of the protocol. One byte of data at the start of the packet
+	that defines the packet type for foo protocol.
+	</para>
+	<para>
+	The proto_item_add_subtree call has added a child node to the protocol tree
+	which is where we will do our detail dissection. The expansion of this
+	node is controlled by the ett_foo variable. This remembers if the node should
+	be expanded or not as you move between packets.
+	All subsequent dissection will be added to this tree, as you can see from the next call.
+	A call to proto_tree_add_item in the foo_tree, this time using the 
+	hf_foo_pdu_type to control the formatting of the item. The pdu type
+	is one byte of data, starting at 0. We assume it is in network order,
+	so that is why we use FALSE. Although for 1 byte there is no order issues
+	its best to keep right.
+	</para>
+	<para>
+	If we look in detail at the hf_foo_pdu_type declaration in the static array
+	we can see the details of the definition.
+	</para>
+	<itemizedlist>
+	<listitem><para>
+	hf_foo_pdu_type - the index for this node.
+	</para></listitem>
+	<listitem><para>
+	FOO PDU Type - the label for this item.
+	</para></listitem>
+	<listitem><para>
+	foo.type - this is the filter string. It enables us to type constructs such
+	as foo.type=1 into the filter box.
+	</para></listitem>
+	<listitem><para>
+	FT_UNIT8 - this specifies this item is an 8bit unsigned integer.
+	This tallies with our call above where we tell it to only look at one byte.
+	</para></listitem>
+	<listitem><para>
+	BASE_DEC - for an integer type, this tells it to be printed as a decimal
+	number. It could be BASE_HEX or BASE_OCT if that made more sense.
+	</para></listitem>
+	</itemizedlist>
+	<para>
+	We'll ignore the rest of the structure for now.
+	</para>
+	<para>
+	If you install this plugin and try it out, you'll see something that begins to look
+	useful. 
+	</para>
+	<para>
+	Now lets finish off dissecting the simple protocol. We need to add a few
+	more variables to the hf array, and a couple more procedure calls.
+	</para>
+	   <example><title>Plugin wrapping up the packet dissection.</title>
+   <programlisting>
+   <![CDATA[
+static int hf_foo_flags = -1;
+static int hf_foo_sequenceno = -1;
+static int hf_foo_initialip = -1;
+...
+	{ &hf_foo_flags,
+	{ "FOO PDU Flags",           "foo.flags",
+	FT_UINT8, BASE_HEX, NULL, 0x0,
+	"", HFILL }
+	},
+	{ &hf_foo_sequenceno,
+	{ "FOO PDU Sequence Number",           "foo.seqn",
+	FT_UINT16, BASE_DEC, NULL, 0x0,
+	"", HFILL }
+	},
+	{ &hf_foo_initialip,
+	{ "FOO PDU Initial IP",           "foo.initialip",
+	FT_IPv4, BASE_NONE, NULL, 0x0,
+	"", HFILL }
+	},
+...
+		gint offset = 0;
+
+		ti = proto_tree_add_item(tree, proto_foo, tvb, 0, -1, FALSE);
+		foo_tree = proto_item_add_subtree(ti, ett_foo);
+		proto_tree_add_item(foo_tree, hf_foo_pdu_type, tvb, offset, 1, FALSE); offset += 1;
+		proto_tree_add_item(foo_tree, hf_foo_flags, tvb, offset, 1, FALSE); offset += 1;
+		proto_tree_add_item(foo_tree, hf_foo_sequenceno, tvb, offset, 2, FALSE); offset += 2;
+		proto_tree_add_item(foo_tree, hf_foo_initialip, tvb, offset, 4, FALSE); offset += 4;
+
+]]>	
+   </programlisting></example>
+   <para>
+   This dissects all the bits of this simple hypothetical protocol. We've introduced a new 
+   variable offset into the mix to help keep track of where we are in the packet dissection.
+   With these extra bits in place, the whole protocol is now dissected. 
+   </para>
+   </section>
+   <section><title>Improving the dissection information</title>
+   <para>
+   We can certainly improve the display of the proto with a bit of extra data.
+   The first step is to add some text labels. Lets start by labelling the packet types.
+   There is some useful support for this sort of thing by adding a couple of extra things.
+   First we add a simple table of type to name.
+   </para>
+	   <example><title>Naming the packet types.</title>
+   <programlisting>
+   <![CDATA[
+static const value_string packettypenames[] = {
+	{ 1, "Initialise" },
+	{ 2, "Terminate" },
+	{ 3, "Data" },
+	{ 0, NULL }
+};
+]]>	
+   </programlisting></example>
+   <para>
+   This is a handy data structure that can be used to look up value to names.
+   There are routines to directly access this lookup table, but we don't need to
+   do that, as the support code already has that added in. We just have to give 
+   these details to the appropriate part of data, using the VALS macro.
+   </para>
+	   <example><title>Adding Names to the protocol.</title>
+   <programlisting>
+   <![CDATA[
+	{ &hf_foo_pdu_type,
+	{ "FOO PDU Type",           "foo.type",
+	FT_UINT8, BASE_DEC, VALS(packettypenames), 0x0,
+	"", HFILL }
+	},
+]]>	
+   </programlisting></example>
+    <para>
+    This helps in deciphering the packets, and we can do a similar thing for the
+    flags structure. For this we need to add some more data to the table though.
+    </para>
+	   <example><title>Adding Flags to the protocol.</title>
+   <programlisting>
+   <![CDATA[
+#define FOO_START_FLAG	0x01
+#define FOO_END_FLAG		0x02
+#define FOO_PRIORITY_FLAG	0x04
+
+static int hf_foo_startflag = -1;
+static int hf_foo_endflag = -1;
+static int hf_foo_priorityflag = -1;
+...
+	{ &hf_foo_startflag,
+	{ "FOO PDU Start Flags",           "foo.flags.start",
+	FT_BOOLEAN, 8, NULL, FOO_START_FLAG,
+	"", HFILL }
+	},
+	{ &hf_foo_endflag,
+	{ "FOO PDU End Flags",           "foo.flags.end",
+	FT_BOOLEAN, 8, NULL, FOO_END_FLAG,
+	"", HFILL }
+	},
+	{ &hf_foo_priorityflag,
+	{ "FOO PDU Priority Flags",           "foo.flags.priority",
+	FT_BOOLEAN, 8, NULL, FOO_PRIORITY_FLAG,
+	"", HFILL }
+	},
+...
+	proto_tree_add_item(foo_tree, hf_foo_flags, tvb, offset, 1, FALSE);
+	proto_tree_add_item(foo_tree, hf_foo_startflag, tvb, offset, 1, FALSE);
+	proto_tree_add_item(foo_tree, hf_foo_endflag, tvb, offset, 1, FALSE);
+	proto_tree_add_item(foo_tree, hf_foo_priorityflag, tvb, offset, 1, FALSE); offset += 1;
+]]>	
+   </programlisting></example>
+   <para>
+   Some things to note here. For the flags, as each bit is a different flag, we use
+   the type FT_BOOLEAN, as the flag is either on or off. Second, we include the flag
+   mask in the 7th field of the data, which allows the system to mask the relevant bit.
+   We've also changed the 5th field to 8, to indicate that we are looking at an 8 bit 
+   quantity when the flags are extracted. Then finally we add the extra constructs
+   to the dissection routine. Note we keep the same offset for each of the flags.
+   </para>
+   <para>
+   This is starting to look fairly full featured now, but there are a couple of other
+   things we can do to make things look even more pretty. At the moment our dissection
+   shows the packets as "Foo Protocol" which whilst correct is a little uninformative.
+   We can enhance this by adding a little more detail.
+   First, lets get hold of the actual value of the protocol type. We can use the handy
+   function tvb_get_guint8 to do this.
+   With this value in hand, there are a couple of things we can do.
+   First we can set the INFO column of the non-detailed view to show what sort of 
+   PDU it is - which is extremely helpful when looking at protocol traces.
+   Second, we can also display this information in the dissection window.
+   </para>
+	   <example><title>Enhancing the display.</title>
+   <programlisting>
+   <![CDATA[
+static void
+dissect_foo(tvbuff_t *tvb, packet_info *pinfo, proto_tree *tree)
+{
+	guint8 packet_type = tvb_get_guint8(tvb, 0);
+
+	if (check_col(pinfo->cinfo, COL_PROTOCOL))
+		col_set_str(pinfo->cinfo, COL_PROTOCOL, "FOO");
+	/* Clear out stuff in the info column */
+	if(check_col(pinfo->cinfo,COL_INFO)){
+		col_clear(pinfo->cinfo,COL_INFO);
+	}
+	if (check_col(pinfo->cinfo, COL_INFO)) {
+		col_add_fstr(pinfo->cinfo, COL_INFO, "Type %s",
+			   val_to_str(packet_type, packettypenames, "Unknown (0x%02x)"));
+	}
+
+	if (tree) { /* we are being asked for details */
+		proto_item *ti = NULL;
+		proto_tree *foo_tree = NULL;
+		gint offset = 0;
+
+		ti = proto_tree_add_item(tree, proto_foo, tvb, 0, -1, FALSE);
+		proto_item_append_text(ti, ", Type %s",
+			val_to_str(packet_type, packettypenames, "Unknown (0x%02x)"));
+		foo_tree = proto_item_add_subtree(ti, ett_foo);
+		proto_tree_add_item(foo_tree, hf_foo_pdu_type, tvb, offset, 1, FALSE); offset += 1;
+...
+]]>
+   </programlisting></example>
+   <para>
+   So here, after grabbing the value of the first 8 bits, we use it with one of the
+   built in utility routines val_to_str, to lookup the value. If the value isn't 
+   found we provide a fallback which just prints the value in hex. 
+   We use this twice, once in the INFO field of the columns - if its displayed, and
+   similarly we append this data to the base of our dissecting tree.
+   </para>
+
+   </section>
+  </section>
+
+  <section id="ChDissectTransformed">
+	<title>How to handle transformed data</title>
+	<para>
+	Some protocols do clever things with data. They might possibly 
+	encrypt the data, or compress data, or part of it. If you know
+	how these steps are taken it is possible to reverse them within the
+	dissector. 
+	</para>
+	<para> 
+	As encryption can be tricky, lets consider the case of compression.
+	These techniques can also work for other transformations of data,
+	where some step is required before the data can be examined.
+	</para>
+	<para> 
+	What basically needs to happen here, is to identify the data that
+	needs conversion, take that data and transform it into a new stream,
+	and then call a dissector on it.
+	Often this needs to be done "on-the-fly" based on clues in the packet.
+	Sometimes this needs to be used in conjunction with other techniques,
+	such as packet reassembly. The following shows a technique to
+	achieve this effect.
+	</para>
+	   <example><title>Decompressing data packets for dissection.</title>
+   <programlisting>
+   <![CDATA[
+   	guint8 flags = tvb_get_guint8(tvb, offset); offset ++;
+	if (flags & FLAG_COMPRESSED) { /* the remainder of the packet is compressed */
+	 	guint16 orig_size = tvb_get_ntohs(tvb, offset); offset += 2;
+		guchar *decompressed_buffer; /* Buffers for decompression */
+		decompressed_buffer = (guchar*) g_malloc (orig_size);
+		decompress_packet (tvb_get_ptr(tvb, offset, -1), tvb_length_remaining(tvb, offset),
+				decompressed_buffer, orig_size);
+		/* Now re-setup the tvb buffer to have the new data */
+		next_tvb = tvb_new_real_data(decompressed_buffer, orig_size, orig_size);
+		tvb_set_child_real_data_tvbuff(tvb, next_tvb);
+		add_new_data_source(pinfo, next_tvb, "Decompressed Data");
+		tvb_set_free_cb(next_tvb, g_free);
+	} else {
+		next_tvb = tvb_new_subset(tvb, offset, -1, -1);
+	}
+	offset = 0;
+	/* process next_tvb from here on */
+]]>
+   </programlisting></example>
+   	<para>
+   	The first steps here are to recognise the compression. In this case
+   	a flag byte alerts us to the fact the remainder of the packet is compressed.
+   	Next we retrieve the original size of the packet, which in this case
+   	is conveniently within the protocol. If its not, it may be part of the 
+   	compression routine to work it out for you, in which case the logic would
+   	be different.
+   	</para>
+   	<para>
+   	So armed with the size, a buffer is allocated to receive the uncompressed
+   	data using g_malloc, and the packet is decompressed into it.
+   	The tvb_get_ptr function is useful to get a pointer to the raw data of 
+   	the packet from the offset onwards. In this case the 
+   	decompression routine also needs to know the length, which is
+   	given by the tvb_length_remaining function.
+   	</para>
+   	<para>
+   	Next we build a new tvb buffer from this data, using the tvb_new_real_data
+   	call. This data is a child of our original data, so we acknowledge that
+   	in the next call to tvb_set_child_real_data_tvbuff. 
+   	Finally we add this data as a new data source, so that
+   	the detailed display can show the decompressed bytes as well as the original.
+   	One procedural step is to add a handler to free the data when its no longer needed.
+   	In this case as g_malloc was used to allocate the memory, g_free is the appropriate
+   	function.
+   	</para>
+	<para> 
+	After this has been set up the remainder of the dissector can dissect the
+	buffer next_tvb, as its a new buffer the offset needs to be 0 as we start
+	again from the beginning of this buffer. To make the rest of the dissector
+	work regardless of whether compression was involved or not, in the case that
+	compression was not signaled, we use the tvb_new_subset to deliver us
+	a new buffer based on the old one but starting at the current offset, and
+	extending to the end. This makes dissecting the packet from this point on
+	exactly the same regardless of compression.
+	</para>
+  </section>
+  <section id="ChDissectReassemble">
+	<title>How to reassemble split packets</title>
+	<para>
+	Some protocols have times when they have to split a large packet across
+	multiple other packets. In this case the dissection can't be carried out correctly
+	until you have all the data. The first packet doesn't have enough data,
+	and the subsequent packets don't have the expect format.
+	To dissect these packets you need to wait until all the parts have
+	arrived and then start the dissection.
+	</para>
+	  <section id="ChDissectReassembleUdp">
+		<title>How to reassemble split UDP packets</title>
+		<para>
+		As an example, lets examine a protocol that is layered on
+		top of UDP that splits up its own data stream.
+		If a packet is bigger than some given size, it will be split into
+		chunks, and somehow signaled within its protocol.
+		</para>
+		<para>
+		To deal with such streams, we need several things to trigger
+		from. We need to know that this packet is part of a multi-packet 
+		sequence. We need to know how many packets are in the sequence.
+		We also need to know when we have all the packets.
+		</para>
+		<para>
+		For this example we'll assume there is a simple in-protocol
+		signaling mechanism to give details. A flag byte that signals
+		the presence of a multi-packet sequence and also the last packet, 
+		followed by an ID of the sequence and a packet sequence number.
+		</para>
+                <programlisting>
+<![CDATA[
+msg_pkt ::= SEQUENCE {
+	.....
+	flags ::= SEQUENCE {
+		fragment	BOOLEAN,
+		last_fragment	BOOLEAN,
+	.....
+	}
+	msg_id	INTEGER(0..65535),
+	frag_id	INTEGER(0..65535),
+	.....
+}
+]]>
+                </programlisting>
+	   <example><title>Reassembling fragments - Part 1</title>
+   <programlisting>
+   <![CDATA[
+#include <epan/reassemble.h>
+   ...
+save_fragmented = pinfo->fragmented;
+flags = tvb_get_guint8(tvb, offset); offset++;
+if (flags & FL_FRAGMENT) { /* fragmented */
+	tvbuff_t* new_tvb = NULL;
+	fragment_data *frag_msg = NULL;
+	guint16 msg_seqid = tvb_get_ntohs(tvb, offset); offset += 2;
+	guint16 msg_num = tvb_get_ntohs(tvb, offset); offset += 2;
+
+	pinfo->fragmented = TRUE;
+	frag_msg = fragment_add_seq_check(tvb, offset, pinfo,
+		msg_seqid, /* ID for fragments belonging together */
+		msg_fragment_table, /* list of message fragments */
+		msg_reassembled_table, /* list of reassembled messages */
+		msg_num, /* fragment sequence number */
+		tvb_length_remaining(tvb, offset), /* fragment length - to the end */
+		flags & FL_FRAG_LAST); /* More fragments? */
+]]>
+   </programlisting></example>
+	<para>
+	We start by saving the fragmented state of this packet, so we can restore it later.
+	Next comes some protocol specific stuff, to dig the fragment data
+	out of the stream if it's present. Having decided it is present, we 
+	let the function fragment_add_seq_check do its work.
+	We need to provide this with a certain amount of data.
+	</para>
+	<itemizedlist>
+	<listitem><para>
+	The tvb buffer we are dissecting.
+	</para></listitem>
+	<listitem><para>
+	The offset where the partial packet starts.
+	</para></listitem>
+	<listitem><para>
+	The provided packet info.
+	</para></listitem>
+	<listitem><para>
+	The sequence number of the fragment stream. There may be several 
+	streams of fragments in flight, and this is used to key the 
+	relevant one to be used for reassembly.
+	</para></listitem>
+	<listitem><para>
+	The msg_fragment_table and the msg_reassembled_table are variables
+	we need to declare. We'll consider these in detail later.
+	</para></listitem>
+	<listitem><para>
+	msg_num is the packet number within the sequence. 
+	</para></listitem>
+	<listitem><para>
+	The length here is specified as the rest of the tvb as we want the rest of the packet data. 
+	</para></listitem>
+	<listitem><para>
+	Finally a parameter that signals if this is the last fragment or not.
+	This might be a flag as in this case, or there may be a counter in the
+	protocol.
+	</para></listitem>
+	</itemizedlist>
+	   <example><title>Reassembling fragments part 2</title>
+	   <programlisting>
+	   <![CDATA[
+	new_tvb = process_reassembled_data(tvb, offset, pinfo,
+		"Reassembled Message", frag_msg, &msg_frag_items,
+		NULL, msg_tree);
+
+	if (frag_msg) { /* Reassembled */
+		if (check_col(pinfo->cinfo, COL_INFO))
+			col_append_str(pinfo->cinfo, COL_INFO, 
+			" (Message Reassembled)");
+	} else { /* Not last packet of reassembled Short Message */
+		if (check_col(pinfo->cinfo, COL_INFO))
+			col_append_fstr(pinfo->cinfo, COL_INFO,
+			" (Message fragment %u)", msg_num);
+	}
+
+	if (new_tvb) { /* take it all */
+		next_tvb = new_tvb;
+	} else { /* make a new subset */
+	 	next_tvb = tvb_new_subset(tvb, offset, -1, -1);
+	}
+}
+else { /* Not fragmented */
+	next_tvb = tvb_new_subset(tvb, offset, -1, -1);
+}
+
+.....
+pinfo->fragmented = save_fragmented;
+	]]>
+   </programlisting></example>
+	   <para>
+	   Having passed the fragment data to the reassembly handler, we can
+	   now check if we have the whole message. If there is enough information, 
+	   this routine will return the newly reassembled data buffer. 
+	   </para>
+	   <para>
+	   After that, we add a couple of informative messages to the display
+	   to show that this is part of a sequence. Then a bit of manipulation
+	   of the buffers and the dissection can proceed.
+	   Normally you will probably not bother dissecting further unless the 
+	   fragments have been reassembled as there won't be much to find. Sometimes
+	   the first packet in the sequence can be partially decoded though if you wish.
+	   </para>
+	   <para>
+	   Now the mysterious data we passed into the fragment_add_seq_check.
+	   </para>
+	   <example><title>Reassembling fragments - Initialisation</title>
+	   <programlisting>
+	   <![CDATA[
+static GHashTable *msg_fragment_table = NULL;
+static GHashTable *msg_reassembled_table = NULL;
+
+
+static void
+msg_init_protocol(void)
+{
+	fragment_table_init (&msg_fragment_table);
+	reassembled_table_init(&msg_reassembled_table);
+}
+]]>
+   </programlisting></example>
+   	<para>
+   	First a couple of hash tables are declared, and these are initialised
+   	in the protocol initialisation routine.
+   	Following that, a fragment_items structure is allocated and filled 
+   	in with a series of ett items, hf data items, and a string tag.
+   	The ett and hf values should be included in the relevant tables like
+   	all the other variables your protocol may use. The hf variables
+   	need to be placed in the structure something like the following.
+   	Of course the names may need to be adjusted.
+   	</para>
+	   <example><title>Reassembling fragments - Data</title>
+	   <programlisting>
+	   <![CDATA[
+...
+static int hf_msg_fragments = -1;
+static int hf_msg_fragment = -1;
+static int hf_msg_fragment_overlap = -1;
+static int hf_msg_fragment_overlap_conflicts = -1;
+static int hf_msg_fragment_multiple_tails = -1;
+static int hf_msg_fragment_too_long_fragment = -1;
+static int hf_msg_fragment_error = -1;
+static int hf_msg_reassembled_in = -1;
+...
+static gint ett_msg_fragment = -1;
+static gint ett_msg_fragments = -1;
+...
+static const fragment_items msg_frag_items = {
+	/* Fragment subtrees */
+	&ett_msg_fragment,
+	&ett_msg_fragments,
+	/* Fragment fields */
+	&hf_msg_fragments,
+	&hf_msg_fragment,
+	&hf_msg_fragment_overlap,
+	&hf_msg_fragment_overlap_conflicts,
+	&hf_msg_fragment_multiple_tails,
+	&hf_msg_fragment_too_long_fragment,
+	&hf_msg_fragment_error,
+	/* Reassembled in field */
+	&hf_msg_reassembled_in,
+	/* Tag */
+	"Message fragments"
+};
+...
+static hf_register_info hf[] = 
+{
+...
+{&hf_msg_fragments,
+	{"Message fragments", "msg.fragments",
+	FT_NONE, BASE_NONE, NULL, 0x00,	NULL, HFILL } },
+{&hf_msg_fragment,
+	{"Message fragment", "msg.fragment",
+	FT_FRAMENUM, BASE_NONE, NULL, 0x00, NULL, HFILL } },
+{&hf_msg_fragment_overlap,
+	{"Message fragment overlap", "msg.fragment.overlap",
+	FT_BOOLEAN, BASE_NONE, NULL, 0x00, NULL, HFILL } },
+{&hf_msg_fragment_overlap_conflicts,
+	{"Message fragment overlapping with conflicting data",
+	"msg.fragment.overlap.conflicts",
+	FT_BOOLEAN, BASE_NONE, NULL, 0x00, NULL, HFILL } },
+{&hf_msg_fragment_multiple_tails,
+	{"Message has multiple tail fragments",
+	"msg.fragment.multiple_tails", 
+	FT_BOOLEAN, BASE_NONE, NULL, 0x00, NULL, HFILL } },
+{&hf_msg_fragment_too_long_fragment,
+	{"Message fragment too long", "msg.fragment.too_long_fragment",
+	FT_BOOLEAN, BASE_NONE, NULL, 0x00, NULL, HFILL } },
+{&hf_msg_fragment_error,
+	{"Message defragmentation error", "msg.fragment.error",
+	FT_FRAMENUM, BASE_NONE, NULL, 0x00, NULL, HFILL } },
+{&hf_msg_reassembled_in,
+	{"Reassembled in", "msg.reassembled.in",
+	FT_FRAMENUM, BASE_NONE, NULL, 0x00, NULL, HFILL } },
+...
+static gint *ett[] = 
+{
+...
+&ett_msg_fragment,
+&ett_msg_fragments
+...
+]]>
+   </programlisting></example>
+   <para>
+   These hf variables are used internally within the reassembly routines
+   to make useful links, and to add data to the dissection. It produces 
+   links from one packet to another - such as a partial packet having 
+   a link to the fully reassembled packet. Likewise there are back pointers
+   to the individual packets from the reassembled one.
+   The other variables are used for flagging up errors.
+   </para>
+	</section>
+  </section>
+  <section id="ChDissectTap">
+	<title>How to tap protocols</title>
+	<para>
+	Adding a Tap interface to a protocol allows it to do some useful things.
+	In particular you can produce protocol statistics from teh tap interface.
+	</para>
+	<para>
+	A tap is basically a way of allowing other items to see whats happening as
+	a protocol is dissected. A tap is registered with the main program, and
+	then called on each dissection. Some arbritary protocol specific data
+	is provided with the routine that can be used.
+	</para>
+	<para>
+	To create a tap, you first need to register a tap. 
+	A tap is registered with an integer handle, and registered 
+	with the routine register_tap. This takes a string name
+	with which to find it again.
+	</para>
+	   <example><title>Initialising a tap</title>
+	   <programlisting>
+	   <![CDATA[
+#include <epan/tap.h>
+
+static int foo_tap = -1;
+
+struct FooTap {
+	gint packet_type;
+	gint priorty;
+	...
+};
+...
+	foo_tap = register_tap("foo");
+]]>
+   </programlisting></example>
+	<para>
+	Whilst you can program a tap without protocol specific data, it
+	is generally not very useful. Therefore its a good idea
+	to declare a structure that can be passed through the tap.
+	This needs to be a static structure as it will be used after the
+	dissection routine has returned. Its generally best to pick out some
+	generic parts of the protocol you are dissecting into the tap data.
+	A packet type, a priority, a status code maybe.
+	The structure really needs to be included in a header file so
+	that it can be included by other components that want to listen in
+	to the tap.
+	</para>
+	<para>
+	Once you have these defined, its simply a case of populating the
+	protocol specific structure and then calling tap_queue_packet probably
+	as the last part of the dissector.
+	</para>
+	   <example><title>Calling a protocol tap</title>
+	   <programlisting>
+	   <![CDATA[
+	static struct FooTap pinfo;
+	
+	pinfo.packet_type = tvb_get_guint8(tvb, 0);
+	pinfo.priority = tvb_get_ntohs(tvb, 8);
+	   ...
+	tap_queue_packet(foo_tap, pinfo, &pinfo);
+]]>
+   </programlisting></example>
+	<para>	
+	This now enables those interested parties to listen in on the details
+	of this protocol conversation.
+	</para>
+  </section>
+  <section id="ChDissectStats">
+	<title>How to produce protocol stats</title>
+	<para>
+	Given that you have a tap interface for the protocol, you can use this
+	to produce some interesting statistics (well presumably interesting!) from
+	protocol traces.
+	</para>
+	<para>
+	This can be done in a separate plugin, or in the same plugin that is 
+	doing the dissection. The latter scheme is better, as the tap and stats
+	module typically rely on sharing protocol specific data, which might get out
+	of step between two different plugins.
+	</para>
+	<para> 
+	Here is a mechanism to produce statistics from the above TAP interface.
+	</para>
+	   <example><title>Initialising a stats interface</title>
+	   <programlisting>
+	   <![CDATA[
+/* register all http trees */
+static void register_foo_stat_trees(void) {
+	stats_tree_register("foo","foo","Foo/Packet Types", 
+		foo_stats_tree_packet, foo_stats_tree_init, NULL );
+}
+#ifndef ENABLE_STATIC
+//G_MODULE_EXPORT const gchar version[] = "0.0";
+
+G_MODULE_EXPORT void plugin_register_tap_listener(void)
+{
+	register_foo_stat_trees();
+}
+
+#endif
+]]>
+   </programlisting></example>
+   	<para>
+   	Working from the bottom up, first the plugin interface entry point is defined,
+   	plugin_register_tap_listener. This simply calls the initialisation function
+   	register_foo_stat_trees.
+   	</para>
+	<para>
+	This in turn calls the stats_tree_register function, which takes
+	three strings, and three functions.
+	</para>
+	<orderedlist>
+	<listitem><para>
+	This is the tap name that is registered.
+	</para></listitem>
+	<listitem><para>
+	An abbreviation of the stats name.
+	</para></listitem>
+	<listitem><para>
+	The name of the stats module. A '/' character can be used to make sub menus.
+	</para></listitem>
+	<listitem><para>
+	The function that will called to generate the stats.
+	</para></listitem>
+	<listitem><para>
+	A function that can be called to initialise the stats data.
+	</para></listitem>
+	<listitem><para>
+	A function that will be called to clean up the stats data.
+	</para></listitem>
+	</orderedlist>
+	<para>
+	In this case we only need the first two functions, as there is nothing specific to clean up.
+	</para>
+	   <example><title>Initialising a stats session</title>
+	   <programlisting>
+	   <![CDATA[
+static const guint8* st_str_packets = "Total Packets";
+static const guint8* st_str_packet_types = "FOO Packet Types";
+static int st_node_packets = -1;
+static int st_node_packet_types = -1;
+
+static void foo_stats_tree_init(stats_tree* st) {
+	st_node_packets = stats_tree_create_node(st, st_str_packets, 0, TRUE);	
+		st_node_packet_types = stats_tree_create_pivot(st, st_str_packet_types, st_node_packets);
+}
+]]>
+   </programlisting></example>
+   	<para>
+   	In this case we create a new tree node, to handle the total packets,
+   	and as a child of that we create a pivot table to handle the stats about 
+   	different packet types.
+   	</para>
+	   <example><title>Generating the stats</title>
+	   <programlisting>
+	   <![CDATA[
+static int foo_stats_tree_packet(stats_tree* st, packet_info* pinfo , epan_dissect_t* edt , const void* p) {
+	struct FooTap *pi = (struct FooTap *)p;
+	tick_stat_node(st, st_str_packets, 0, FALSE);
+	stats_tree_tick_pivot(st, st_node_packet_types, 
+			val_to_str(pi->packet_type, msgtypevalues, "Unknown packet type (%d)"));
+	return 1;
+}	
+]]>
+   </programlisting></example>
+   	<para>
+   	In this case the processing of the stats is quite simple.
+   	First we call the tick_stat_node for the st_str_packets packet node, to count
+   	packets.
+   	Then a call to stats_tree_tick_pivot on the st_node_packet_types subtree
+   	allows us to record statistics by packet type.
+   	</para>
+  </section>
+
+  <section id="ChDissectConversation">
+	<title>How to use conversations</title>
+	<para>
+	Some info about how to use conversations in a dissector can be 
+	found in the file doc/README.developer.
+	</para>
+  </section>
+
+</chapter>
+<!-- End of WSDG Chapter Dissection -->