66♥213

Markup languages for content management and Web publishing

Posted by: Ayanta on: December 14, 2008

Even though HyperText Markup Language has been the most extended markup language since the appearance of the Web, recent steps towards semantic integration brought up the necessity of a new tool capable of administrating data. This led to the creation of eXtended Marked Language.

While the sintaxis of XML and HTML is similar (they were both based on SGML), their functions and characteristics differ:

  1. Both HTML and XML place tags around an element to describe it. HTML uses tags to determine the visual display (e.g. font size), whereas tags in XML indicate the category of each element (e.g. “city”, “date”, “name”…). This helps structure the content of the text.
  2. Most HTML users put their attention only on getting the page look the way they want it to, even if the structure behind it is left disorganized. With XML, documents won’t show up unless they’re correctly constructed (well formed), thus forcing an arrangement to be respected. The format of XML documents makes them portable to different platforms and allows structured data coming from other sources to be combined easily.
  3. HTML tags are predefined and limited. Instead, XML lets users create their own tags to classify elements with more precision. As an example, if we had every book written by Shakespeare marked as such, we’d be able to access a list to all of them. With the current methods, however, performing a search about “books” and “Shakespeare” gives us mixed results between works written by him and about him.

The internet is now heading towards eXtended HyperText Markup Language. XHTML is a hybrid of XML and HTML, where information is described in one layer and given the format needed to present it in a browser separately.

References:

1 Response to "Markup languages for content management and Web publishing"

“Most HTML users put their attention only on getting the page look the way they want it to, even if the structure behind it is left disorganized” this is not because of HTML but of web browsers or HTML editing tools! Fine, in any case, it is true of a large number of HTML pages. It can happen with XHTML as well, for the very same reason.

Nice title, where did you get it from?

Leave a Reply

Categories

Blog Stats

  • 13,745 hits

RSS Books (Littera)

RSS Articles (Littera)

RSS Littera

  • Larga noche February 10, 2010
    3am… y aquí estoy yo, en un pu…ñetero bus, de camino a Bilbo, una vez más. Anda que no he hecho este recorrido veces… Actualizo desde el móvil, que con estas nuevas tecnologías megachachiguachis, servidora está conectada 24/7… con pausas publicitarias para disfrutar de las tierras javienses. ¿Y qué hago yo escribiendo a las 3 de la [. […]
    Yera Espinosa
  • Google books February 1, 2010
    Google Books (anteriormente conocido como Google Book Search y Google Print) es un servicio de Google que busca en el texto completo de libros que Google escanea, se convierte en texto usando el reconocimiento óptico de caracteres, y tiendas en su base de datos digital. El servicio era anteriormente conocida como Google Print, cuando se [...]
    Olatz
  • Un año ya January 31, 2010
    Porque eso de que la distancia es el olvido es la mentira más grande jamás contada. Te quiero, marido Suena: Comptine D’un Autre Été, L’après-Midi  -  Yann Tiersen
    Ainhize León Villate