Sunday, October 19, 2014

Tatoeba update (October 19, 2014)

Search results sorted by sentence length

Shorter sentences now have higher priority than longer ones in the search results. A shorter sentence is not necessarily a better example sentence, but this should make the results more relevant overall.
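For the curious, the idea can be illustrated with a few lines of Python. This is only a minimal sketch of length-based ordering, not the actual search engine configuration; the function name rank_by_length and the sample sentences are made up for the example.

```python
def rank_by_length(matches):
    """Return the matching sentences with the shortest ones first.

    sorted() is stable, so sentences of equal length keep the order
    they had in the input list.
    """
    return sorted(matches, key=len)


# Hypothetical sentences that all matched the same query.
matches = [
    "Why did you decide to learn a foreign language?",
    "Why?",
    "Why not?",
]

ranked = rank_by_length(matches)
for sentence in ranked:
    print(sentence)
```

Length is measured here in characters, which is an assumption of the sketch; a real ranking could just as well count words or combine length with other relevance signals.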

Possibility to comment on deleted sentences

The comment form was displayed on deleted sentences, but the comment was not saved after submission. This has been fixed and it is now possible to post comments on deleted sentences.

Script to remove duplicate sentences

This is just a little note that there has been good progress on the deduplication script. We'll hopefully be able to clean up the corpus soon :)

Other fixes

  • Fixed truncation of long URLs containing non-Latin characters.
  • Long words or links that exceed their container box are now wrapped onto new lines.
  • Fixed a bug where part of a URL would be converted into a link to a sentence.
  • Fixed a bug where some Wall message previews were displayed as empty on the homepage.

Saturday, October 4, 2014

Tatoeba update (October 4th, 2014)

Sphinx 2.1.9

We have upgraded the search engine to Sphinx 2.1.9. This fixes an issue where searching for the word "why" would return no results, despite the fact that many sentences in the database use this word.


New sentences quickly available to search

You will no longer have to wait weeks before you can find, through the search, a sentence that you have added. We know that many people have been wondering why they cannot find a sentence that they recently added. The reason, in short, is that sentences need to be indexed before you can find them through the search. We couldn't index too often, because it would take too long and use too many resources.
But with the new server, and with gillux's work on implementing a "delta index", we can now provide search results that are much more up-to-date. New sentences will appear in the search results within an hour or less.
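To give a rough idea of how a delta index helps, here is a small Python sketch. It is a toy model under stated assumptions, not gillux's implementation and not Sphinx itself: the class DeltaIndexedCorpus, its method names, and the assumption that sentence ids only ever increase are all invented for the illustration. The large main index is rebuilt rarely, while a small delta index covering only the sentences added since then is rebuilt often, and a search looks in both.

```python
import re


def tokenize(text):
    """Split a sentence into a set of lowercase words."""
    return set(re.findall(r"\w+", text.lower()))


def build_index(sentences):
    """Map each word to the set of ids of the sentences containing it."""
    index = {}
    for sentence_id, text in sentences.items():
        for word in tokenize(text):
            index.setdefault(word, set()).add(sentence_id)
    return index


class DeltaIndexedCorpus:
    """Toy corpus with a big main index and a small delta index."""

    def __init__(self):
        self.sentences = {}    # sentence id -> text
        self.main_index = {}   # expensive to rebuild, covers old sentences
        self.delta_index = {}  # cheap to rebuild, covers recent sentences
        self.last_main_id = 0  # highest id covered by the main index

    def add(self, sentence_id, text):
        # Assumption: ids are assigned in increasing order.
        self.sentences[sentence_id] = text

    def rebuild_main(self):
        """Slow path: index everything, then start with an empty delta."""
        self.main_index = build_index(self.sentences)
        self.delta_index = {}
        self.last_main_id = max(self.sentences, default=0)

    def rebuild_delta(self):
        """Fast path: index only sentences newer than the main index."""
        recent = {
            sentence_id: text
            for sentence_id, text in self.sentences.items()
            if sentence_id > self.last_main_id
        }
        self.delta_index = build_index(recent)

    def search(self, word):
        """Look the word up in both indexes and merge the results."""
        word = word.lower()
        found = self.main_index.get(word, set()) | self.delta_index.get(word, set())
        return sorted(found)


corpus = DeltaIndexedCorpus()
corpus.add(1, "Why not?")
corpus.rebuild_main()               # done rarely
corpus.add(2, "Why are you here?")  # new sentence, not searchable yet
corpus.rebuild_delta()              # done often, only looks at sentence 2
print(corpus.search("why"))
```

The point of the design is that the frequent rebuild only touches the handful of recent sentences, so it stays cheap no matter how large the whole corpus is.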


Sentences of a user visible to everyone

We have fixed a bug where the page listing the sentences of a specific user was only accessible to logged-in users. The page is now visible to everyone, logged in or not.