Utilisez-le sur un serveur
universitaire canadien

Formatting Your Documents

As mentioned in the Limitations section, support for formatted text is minimal. On one hand, this frugality contributes to the relative simplicity of the software, the speed of command execution, and the absence of storage quotas (if you hadn't noticed). On the other hand, it can make browsing documents tedious.

This section provides some tips on how to format your documents.


Your Files Are Cleaned

If you're wondering how your files are cleaned, consult the documentation for the two tools on which LibreQDA is based:

  • Bleach cybersecures the files you upload
  • Calibre allows multiple formats to be accepted and converts them to plain text

Our cleaning process may cause tables, italics, hyperlinks, custom fonts, custom margins, or other formatting elements supported by other software to disappear.

This can result in a corpus that is difficult to browse and code.

For example, this excerpt from Hydro-Québec's 2023 annual report contains images and sometimes includes two columns of text:

extrait_document.png

Once loaded into LibreQDA, this document is nearly impossible to browse because its different columns blend together. Its complex formatting could also slow down your interface.

Tips for Improving the Readability of Your Files

If you have the flexibility to edit your files, a few rules of thumb should be followed:

Stick to **bold** character tags

Italics or underlined text may not be detected. Bolding passages can help you identify, for example, the speaker and the timestamp.

If needed, use special symbols to simulate section delimiters

For example =====. Using asterisks ( * ), underscores ( _ ) or other special characters may not work.

Finally, convert your PDFs to other text formats if they display poorly

Some of your PDF files may be more readable if you use a tool to convert them to another format, for example .docx. Issues related to two-column text may still persist.

This screenshot shows the result of applying these formatting tips:

ex-format.png


Note that more stylistic elements may be recognized by using the Markdown formatting language, which is partially supported.