Saturday, January 24, 2015

Tatoeba update addendum (January 24th, 2015)

Previously, on a "Browse by language" page, linking a sentence to a main sentence caused all translations of the main sentence to be displayed, even when "Show translations in" was set to "None" or a single language. Now the displayed translations are correctly filtered according to the value of "Show translations in".

Tatoeba update (January 24th, 2015)

Sentences with audio are locked

From now on, sentences cannot be deleted or edited if they have audio. We consider that a sentence that has audio can safely be considered as correct and has no reason to be changed or deleted.

Admins can still remove the audio from the sentence, and therefore make it editable and deletable again, if it turns out that the sentence did have a mistake after all.

UI fixes

  • For advanced contributors: the link/unlink button has moved to be placed before the arrow. Users reported that having the link/unlink button next to the text could lead them more often to link or unlink by mistake than to have the button completely to the left.
  • The sentence link (that is automatically generated when you type #123 for instance) now works also when you have punctuation before the #. For instance if your comment contains a parenthesis before the #, like (#123), it will now display the link properly.
  • Linking sentences from the "Browse by language" when a language is selected in "Show translations in" will no more display all translations after linking, but only show translations in the language selected.

New language

A language has been: Eastern Punjabi. We actually had the language "Punjabi" but it mistakenly contained sentences in Eastern Punjabi while it was meant to be Western Punjabi.
There fore "Punjabi" was renamed to "Punjabi (Western)" and "Punjabi (Eastern)" has been added.

Sunday, January 18, 2015

Tatoeba update (January 18th, 2015)

Sentences new design

We've redesigned a bit the sentences.

The initial goal was to make the texts non-clickable, in order to ease the process of copy-pasting a part of a sentence. In order to navigate to a sentence, you will now need to click on the # icon (for the main sentence), or the arrow icon (for the translations).

All the icons have been changed in order to use images in SVG. We're trying to slowly transition towards this format for all our icons.

The "adopt" button has been moved to be placed next to the name of the owner of the sentence.

There is still something left to do: at the moment if you are the owner of a sentence, clicking on it will display the form to edit it. We want to replace this mechanism and add an edit button on which you would click to edit the sentence.

Sentences deduplication

This wasn't easy work but the sentences deduplication script is finally ready to run on the real website.

You can thank saeb (aka. lool0 on IRC). He's the one who wrote most of the script.

Before we can run the script, there is one last thing to do: fix the time on the server. We will need to shut down Tatoeba for a few minutes. But we will do this when there is less traffic.

In any case you can expect the script to be running on Monday :)

Google Summer of Code 2015

We are preparing to apply for Google Summer of Code 2015. We participated last year (for the first time) and had 4 students working on projects related to Tatoeba. We hope to get to participate again this year.

We are in the phase of defining ideas of projects for students. If you have an idea, please don't hesitate to submit it to us. You can submit ideas also by replying to my message on the Wall.

On a side note, if you are interested in being a mentor (this requires to have programming knowledge), please let us know in the related thread on the Tatoeba Google group, or contact us at

Wednesday, December 17, 2014

Tatoeba update (December 17th, 2014)

Small update

  • We fixed the problem of the languages not being displayed on the translate page, in the list for random sentences.
  • We fixed an issue where a sentence was not part of the search results, even though it had been indexed previously. This happened when the sentence was recently translated, or for which the owner or correctness has changed.

UI translations

I'd like to mention that we now have the website interface translated 100% in 7 languages: Arabic, Esperanto, Finnish, French, German, Italian and Russian.
We have as well Marathi (97%), Japanese (92%) and Polish (90%) not far from being completed.

Sentences deduplication

We're delaying once more the sentences deduplication. There are still some details we'd like to fix. Even though they are not critical and the deduplication itself is working properly (as far as we know), it's better to fix them sooner than later.

When everything is fixed, there will be another round of deduplication on the dev website. We will leave a few days again for everyone to check that there's indeed no major issue. Then we will run the script finally on the real website.

Thank you for your patience.

Saturday, December 6, 2014

Tatoeba update (December 6th, 2014)

Bug fix for login redirection

The problem with the redirection to a random sentence's page when logging in is now fixed. When you log in you will now stay on the same page you were at.

UI translations

Gillux implement various improvements for the UI translations. You can read more details about it in his post on the Wall.

Sentences deduplication

We're not forgetting about the deduplication script. We have still a few issues to fix before we can run it. There are no big issue though, so we can probably start deduplicating sentences next weekend.

Friday, November 28, 2014

Tatoeba update (November 29th, 2014)


Tatoeba will be under maintenance for approximately at least 3 hours.
Scheduled time: 04:00 to 07:00 11:00 UTC.

As mentioned in the previous blog post, we need to shut down Tatoeba for a few hours in order to do some changes in the database. These changes are needed for us to run later on the sentences deduplication script.

While Tatoeba is down, if you feel like translating, we always need people to help us translate the website interface.

Edit: The maintenance is over now. It took more time than planned due to MySQL logs still being activated during the changes, but everything went okay.

More frequent indexation

We do not have a lot of new things for this update, but there is nonetheless one piece of good news. Thanks to some optimization that gillux did a couple of weeks ago, we can afford to index new sentences more often. The interval was previously set to 1 hour, and we reduced it to 15 minutes. In other words, you will never have to wait more than 15 minutes to be able to find your sentences via the search function.

Donations and thanks

We recently received a small donation from Stanislav. So thank you, Stanislav! And thanks again to everyone before him who donated to make sure that Tatoeba will be hosted on a stable and fast server for the next few years :)

Sunday, November 23, 2014

Tatoeba update (November 23rd, 2014)


  • Regarding the link feature for advanced contributors: it is now possible to drag-and-drop the icons (instead of the sentence text) into the link icon in the menu.
  • Our assets files (images, CSS, javascript) now have a timestamp, so that the browser knows whether or not it needs to update them. This means you should no more have to worry about clearing your browser's cache.

Development website

Gillux recently set up a development (dev) website. The purpose of the dev website is to let members test new features and check the interface translations BEFORE they get released into the production (prod) website, that is the actual Tatoeba website.


We are planning to disable Tatoeba temporarily next weekend (November 29) for maintenance.
The maintenance is about changing the engine of our MySQL database from MyISAM to InnoDB. For this operation we need to stop access to the database, that's why we need to shut down Tatoeba. It should take around 3 hours.
We need to do this change in order to run the sentences deduplication script. More about this below.

Sentences deduplication

First of all, note that the deduplication script will not be running during the maintenance, but after. The script can run with Tatoeba being available. It is still unsure whether we will run the script next weekend or later. We are still in the phase of debugging the script.

There was a first test of the script of the dev website. It took 9.5 hours to complete. You can help us make sure that the script works well by checking the dev website. Duplicates that were removed can be identified as they were deleted by Horus (it's the current name of the deduplication bot).
If you notice any issue such as sentences that were deleted while they shouldn't have, or information that was not re-linked properly, report the problem to us on the Wall of the real website (not on the dev please) or on our Google group.