Benjamin Niemann
2005-05-03 18:07:04 UTC
Hello,
I'm working on a SGML parser and struggle with tag minimization. E.g.
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01//EN">
<html>
<title>foo</title>
<p>bar</p>
</html>
where <head>, </head>, <body> and </body> are implicitly inserted. Is the
algorithm used to determine how tags are implicitly opened and closed
documented anywhere? I guess it's in the ISO 8879:1986 document, but it is
not freely available. Does anyone has (online-) references to such a
documentation? I searched Google for a while, but without success.
Thanks in advance,
Benjamin Niemann
I'm working on a SGML parser and struggle with tag minimization. E.g.
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01//EN">
<html>
<title>foo</title>
<p>bar</p>
</html>
where <head>, </head>, <body> and </body> are implicitly inserted. Is the
algorithm used to determine how tags are implicitly opened and closed
documented anywhere? I guess it's in the ISO 8879:1986 document, but it is
not freely available. Does anyone has (online-) references to such a
documentation? I searched Google for a while, but without success.
Thanks in advance,
Benjamin Niemann
--
Benjamin Niemann
Email: pink at odahoda dot de
WWW: http://www.odahoda.de/
Benjamin Niemann
Email: pink at odahoda dot de
WWW: http://www.odahoda.de/