Saturday, February 28, 2015

Tatoeba update (February 28th, 2015)

Top menu

  • The icons in the top menu have been updated.
  • The "Random sentence" under the "Browse" menu item has been moved to the 1st position in the list.


  • The page that lists all the tags now has pagination.
  • The tags can be sorted by name and by count.

Saturday, February 21, 2015

Tatoeba update (February 21st, 2015)

Sentences translation feature

The design of the translation form has been revised.
  • Labels have been added for each input field.
  • The buttons have been styled to match the ones in the comments form.
  • The corresponding language icon is displayed next to the dropdown list when selecting a language.

Sentences edit feature

  • Clicking on a sentence will no longer open the edit form.
  • An "edit" button has been added next to the "translate" button.
  • The button is displayed only on sentences that you can edit.
  • If the sentence has audio, the edit button will be displayed but disabled. You cannot edit sentences that have audio.
  • The buttons have also been styled to match the ones in the comments form, and the position of the textarea has been fixed (it used to appear one line below, instead of appearing where the sentence text is).


  • The shortcuts to navigate to the previous or next page (Ctrl + arrow key) are no longer triggered when typing in a message.
  • The logo has slightly changed. We now have an SVG version of the logo.

Saturday, February 14, 2015

Tatoeba update (February 14th, 2015)

User interface

  • On pages with pagination, such as the Wall, you can navigate to the previous or next page with [Ctrl + ←] or [Ctrl + →]. Note: on Mac it's [Ctrl + Shift + arrow key].
  • The "Browse" item in the top menu does not link to a random sentence anymore. We have added instead a sub-item ("Random sentence") for this purpose. This should make the sub-menu accessible for most people using a mobile device.
  • The icon that indicates whether a Chinese sentence is written with simplified or traditional characters has been restored. This icon has also been added for Cantonese and Shanghainese.
  • We fixed an issue where CJK characters were not displayed with the correct glyph.

Users can delete their own sentences

Anyone can now delete their own sentence, if (and only if) it doesn't have any translation.
You will not need to ask a corpus maintainer to delete your sentences anymore, when you have added a sentence by mistake.
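
The deletion rule can be sketched as a simple permission check. This is only an illustration, not Tatoeba's actual code: the `Sentence` class, its fields, and the `can_delete` function are hypothetical names. The audio condition comes from the January 24th update, which locks sentences that have audio.

```python
from dataclasses import dataclass


@dataclass
class Sentence:
    # Hypothetical model, for illustration only.
    owner: str
    translation_count: int = 0
    has_audio: bool = False


def can_delete(sentence: Sentence, user: str) -> bool:
    """Return True if `user` may delete `sentence` without a corpus maintainer."""
    return (
        sentence.owner == user               # only your own sentences
        and sentence.translation_count == 0  # and only if nothing is linked to it
        and not sentence.has_audio           # sentences with audio are locked
    )
```

For instance, `can_delete(Sentence(owner="alice"), "alice")` is true, while the same call on a sentence with one translation is false.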


  • When a sentence is tagged 'OK', the tag is removed when the sentence is modified. Note: if you are editing a sentence from the sentence's page, you may notice that the tag remains even after you have edited it. This is normal. You will need to refresh the page to see it disappear.
  • In some situations, logging in used to systematically redirect to the homepage. We changed the redirection so that it leads instead to the last page viewed before attempting to log in.
  • Email notification subjects are now encoded properly.

Wall messages re-imported

The Wall messages that were lost during a server crash have been re-imported.

Google Summer of Code 2015

We are preparing to apply as a mentoring organization for Google Summer of Code 2015. We will submit our application some time this weekend, and we will know on March 2 if we are accepted or not.

Even if we are not yet officially part of GSoC 2015, a wave of students has already started contacting us. Some of them have even contributed code that has been released:
  • the keyboard shortcuts to navigate from page to page
  • the "Random sentence" sub-item in the menu
  • the clickable arrow icon in the search bar, to swap languages (released last week)
And there's more to come :)

Saturday, February 7, 2015

Tatoeba update (February 7th, 2015)


  • It's now possible to search words with the dot, comma and apostrophe symbols in Lojban.
  • There is no longer any distinction between half-width and full-width characters. Searching half-width characters (for instance 10) and full-width characters (for instance １０) will return the same results. This covers Japanese kana, Korean hangul, and the Latin alphabet.
  • Non-CJK words are now searchable in sentences of CJK-based languages.
Note: The full/half-width and non-CJK search modifications are not applied yet. They will be applied tomorrow when the corpus is fully re-indexed.
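
To illustrate what ignoring the width distinction means, here is a minimal sketch that uses Unicode NFKC normalization to fold width variants before comparing. This is an assumption chosen for illustration: the update does not say how the search engine implements the folding, and `fold_width` is a hypothetical helper name.

```python
import unicodedata


def fold_width(text: str) -> str:
    """Fold full-width and half-width variants into one canonical form.

    NFKC normalization maps full-width Latin letters and digits to their
    ASCII counterparts, and half-width katakana to regular katakana.
    """
    return unicodedata.normalize("NFKC", text)


# A full-width query and a half-width query end up as the same search key.
assert fold_width("１０") == fold_width("10")
assert fold_width("ｶﾅ") == fold_width("カナ")
```

Applying the same folding to both the indexed sentences and the query is what makes the two spellings return the same results.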

UI fixes and improvements

  • Clicking on the arrow in the search bar will swap the "from" and "to" languages.
  • Pressing enter when adding a sentence while using an IME will no longer submit the sentence.

Re-importing the Wall messages lost in the crash

A while back we had a crash where all the messages on the Wall got lost. We had a backup, but the data in the backup was not structured in a way that would enable an easy restoration of the messages.

Gillux worked on a way to reimport them and the script is now ready. It is not scheduled yet, but you should be aware that Tatoeba will be under maintenance for a short time (around half an hour) when we reimport the messages.

Saturday, January 31, 2015

Tatoeba update (January 31st, 2015)

UI Improvements

  • We changed the position of the language icons. They are now displayed on the left of the sentence.
  • The link/unlink button (accessible only for advanced contributors) is displayed between the language icon and the sentence's text.
  • A button to hide/show replies on the Wall has been added.

Empty search

Searching for an empty string used to redirect you to a message saying "Please enter a nonempty search string".
The search feature has been modified so that searching for an empty string will instead display all the sentences.

This feature also works with the language filters, so you can search from Bulgarian, and it will display all the sentences in Bulgarian. You can also search from Bulgarian to English, and it will display all the sentences in Bulgarian that have a translation in English.

Note that there is still a limit of 1000 results. The count will display the total number of results, but you will not be able to browse through all of them, only the first 1000.
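
The behaviour described above can be sketched as follows. This is a toy in-memory model, not the actual search engine: the `Sentence` type, the `search` function, and the language codes are hypothetical, and only the 1000-result cap comes from the text.

```python
from typing import List, NamedTuple, Optional, Tuple

MAX_BROWSABLE = 1000  # only the first 1000 results can be browsed


class Sentence(NamedTuple):
    # Hypothetical model, for illustration only.
    text: str
    lang: str
    translation_langs: frozenset


def search(
    corpus: List[Sentence],
    query: str = "",
    from_lang: Optional[str] = None,
    to_lang: Optional[str] = None,
) -> Tuple[int, List[Sentence]]:
    """Return (total match count, browsable results).

    An empty query matches every sentence, so only the language
    filters narrow down the results.
    """
    matches = [
        s
        for s in corpus
        if (not query or query.lower() in s.text.lower())
        and (from_lang is None or s.lang == from_lang)
        and (to_lang is None or to_lang in s.translation_langs)
    ]
    # The count reflects every match; the list is capped.
    return len(matches), matches[:MAX_BROWSABLE]
```

With this model, `search(corpus, "", from_lang="bul", to_lang="eng")` would return every Bulgarian sentence that has an English translation.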

Saturday, January 24, 2015

Tatoeba update (January 24th, 2015)

Sentences with audio are locked

From now on, sentences cannot be deleted or edited if they have audio. We assume that a sentence that has audio is correct and has no reason to be changed or deleted.

Admins can still remove the audio from the sentence, and therefore make it editable and deletable again, if it turns out that the sentence did have a mistake after all.

UI fixes

  • For advanced contributors: the link/unlink button has been moved before the arrow. Users reported that having the link/unlink button next to the text made them link or unlink by mistake more often than when the button was at the far left.
  • The sentence link (that is automatically generated when you type #123 for instance) now also works when you have punctuation before the #. For instance, if your comment contains a parenthesis before the #, like (#123), it will now display the link properly.
  • Linking sentences from the "Browse by language" page when a language is selected in "Show translations in" will no longer display all translations after linking, but only the translations in the selected language.
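
As an illustration of the sentence link fix, here is a minimal sketch of a pattern that matches #123 regardless of what precedes the #. It is not the actual implementation: the `link_sentences` function and the `/sentences/show/` URL prefix are assumptions, and the comment text is assumed to be HTML-escaped already.

```python
import re

# Match "#" followed by digits, without requiring whitespace before the "#".
SENTENCE_REF = re.compile(r"#(\d+)")


def link_sentences(comment: str, base_url: str = "/sentences/show/") -> str:
    """Turn every #<number> in a comment into a link to that sentence."""

    def to_link(match: re.Match) -> str:
        sentence_id = match.group(1)
        return f'<a href="{base_url}{sentence_id}">#{sentence_id}</a>'

    return SENTENCE_REF.sub(to_link, comment)
```

A pattern that requires a space or the start of the line before the # would miss (#123), which is the kind of case the fix addresses.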

New language

A language has been added: Eastern Punjabi. We actually had the language "Punjabi", but it mistakenly contained sentences in Eastern Punjabi while it was meant to be Western Punjabi.
Therefore "Punjabi" was renamed to "Punjabi (Western)" and "Punjabi (Eastern)" has been added.

Sunday, January 18, 2015

Tatoeba update (January 18th, 2015)

Sentences new design

We've redesigned the sentences a bit.

The initial goal was to make the texts non-clickable, in order to ease the process of copy-pasting a part of a sentence. In order to navigate to a sentence, you will now need to click on the # icon (for the main sentence), or the arrow icon (for the translations).

All the icons have been changed to use SVG images. We're trying to slowly transition towards this format for all our icons.

The "adopt" button has been moved to be placed next to the name of the owner of the sentence.

There is still something left to do: at the moment if you are the owner of a sentence, clicking on it will display the form to edit it. We want to replace this mechanism and add an edit button on which you would click to edit the sentence.

Sentences deduplication

This wasn't easy work but the sentences deduplication script is finally ready to run on the real website.

You can thank saeb (aka. lool0 on IRC). He's the one who wrote most of the script.

Before we can run the script, there is one last thing to do: fix the time on the server. We will need to shut down Tatoeba for a few minutes. But we will do this when there is less traffic.

In any case you can expect the script to be running on Monday :)

Google Summer of Code 2015

We are preparing to apply for Google Summer of Code 2015. We participated last year (for the first time) and had 4 students working on projects related to Tatoeba. We hope to get to participate again this year.

We are in the phase of defining project ideas for students. If you have an idea, please don't hesitate to submit it to us. You can also submit ideas by replying to my message on the Wall.

On a side note, if you are interested in being a mentor (this requires programming knowledge), please let us know in the related thread on the Tatoeba Google group, or contact us at