| Paketname | libhtmlcxx3 |
| Beschreibung | simple HTML parser library for C++ |
| Archiv/Repository | Offizielles Debian Archiv squeeze (main) |
| Version | 0.84-1 |
| Sektion | libs |
| Priorität | extra |
| Installierte Größe | 140 Byte |
| Hängt ab von | libc6 (>= 2.1.3), libgcc1 (>= 1:4.1.1), libstdc++6 (>= 4.4.0) |
| Empfohlene Pakete | |
| Paketbetreuer | Ludovico Cavedon |
| Quelle | htmlcxx |
| Paketgröße | 38000 Byte |
| Prüfsumme MD5 | ed5e02bacff0c421a8ec3158b8bb0589 |
| Prüfsumme SHA1 | 6e35c6da0553ebe8e478d9445026cb8bf7e61c71 |
| Prüfsumme SHA256 | 9cbd7e7b0a1047d5657843338caaa1cf073075529dbdcafd5267b1fb0e7e67d0 |
| Link zum Herunterladen | libhtmlcxx3_0.84-1_i386.deb |
| Ausführliche Beschreibung | htmlcxx is a simple non-validating CSS1 and HTML parser for C++. Although
there are several other html parsers available, htmlcxx has some
characteristics that make it unique:
.
* STL like navigation of DOM tree, using excellent tree.hh library from
Kasper Peeters
* It is possible to reproduce exactly, character by character, the original
document from the parse tree
* Bundled CSS parser
* Optional parsing of attributes
* C++ code that looks like C++ (not so true anymore)
* Offsets of tags/elements in the original document are stored in the nodes
of the DOM tree
.
The parsing politics of htmlcxx were created trying to mimic Mozilla Firefox
(http://www.mozilla.org) behavior. So you should expect parse trees similar to
those create by Firefox. However, differently from Firefox, htmlcxx does not
insert non-existent stuff in your html. Therefore, serializing the DOM tree
gives exactly the same bytes contained in the original HTML document.
|