Skip to content

Best practices for structuring tsv data prior to conversion #549

Answered by ilius
Victor20x asked this question in Q&A
Discussion options

You must be logged in to vote

If I have raw data that I can get into tsv format in a spreadsheet program, and want to convert to slob and StarDict formats, is there a way to tell pyglossary that some items are alternative headwords and not free-standing entries?

This is how we process CSV:

  • Column 1: main headword
  • Column 2: definition
  • Column 3 (optional): alternative headwords separated by ,

If you export from spreadsheet, it should automatically quote everything that needs quoting (For example if there are several alternative headwords, or your definition contains , or newlines)

Also, is there a standard practice for separating lines of the definition or is this just preference? I can experiment of course but if …

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@Victor20x
Comment options

Answer selected by ilius
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
2 participants