Showing posts with label TomeRaider Files. Show all posts
Showing posts with label TomeRaider Files. Show all posts

Wednesday, 18 March 2009

We are in need of some beta testers for our forthcoming iPhone and Android applications

Hi Folks

We are in need of some beta testers for our forthcoming iPhone and Android applications including the long awaited TomeRaider 4 E- Reference Reader.

If you would like to help out that would be great, please email beta@yadabyte.com stating which device you have, along with your device serial number and we will start the ball rolling for getting you beta testing.

Thanks very much,

Mat
Yadabyte

Friday, 4 April 2008

New release: Full Spanish Wikipedia TomeRaider Ebook

TomeRaider Spanish Wikipedia ebook with over 570,000 articles.

This ebook is published as an Torrent, you can download TomeRaider Spanish Wikipedia ebook from the download link below

TomeRaider ebook español de Wikipedia, con más de 570000 artículos.

Este libro electrónico es una publicación como Torrent, puede descargar TomeRaider español de la Wikipedia ebook enlace de descarga de abajo.

Download URL :
http://www.tomeraider.com/ebooks/reference/exclusive/the_spanish_wikipedia_encyclopedia_tr3_ebook--BK1058.php

Wednesday, 12 March 2008

New release: Full English Wikipedia TomeRaider Ebook File

We have just released the latest version of our super popular TomeRaider Wikipedia file. This baby takes 2 weeks to make and comes in at under 4gigs, so it should fit on most devices with the right card.

We are trying to publish this by Torrent, which is a first for us.

Download here:
http://www.mininova.org/tor/1237296

Tuesday, 6 March 2007

Introducing the New Internet Movie Database (IMDB) TomeRaider files


Before the Wikipedia TomeRaider files started being released a few years back, the IMDB files were our most popular TomeRaider files by a long long shot.

When we first released the TR IMDB it was about ten megs – this was in the days when a 32 meg flash card was considered pretty chunky. At the time only the Psion version for TomeRaider was available and that had, as direct competition Halliwells from Palmtop Software (Who I used to work for Palmtop, great company, since gone on making a mint rebranding as TomTom.).

I remember how back then the IMDB really was considered inferior but now… jeepers… is Haliwell’s still going? Even St Mark Kermode makes many references to it.

We are going to start releasing the new versions of the IMDB TomeRaider files every month or two, at least. Its all automated in house now so there is no excuse not to.

The new IMDB TomeRaider files:

  • English Movies after 1960 – Plot summaries, actors actress etc. Lots of info. We have taken the
  • English Movies after 2005 – Plot summaries, actors actress etc. Lots of info. Ideal for those interested in new films rather than your full on moovos
  • English Movies Full – This is the big boy. 52 Megs
  • TV Shows – The IMDB has a huge TV show section and we have put this in a separate TomeRaider file.

Notes:

  • We have taken the quotes and goofs out now. It seems that people who contribute to the IMDB think that quotes mean anything from the film rather than the really memorable or special ones. They just make the files messy and "quote heavy". Let us know if this is not a good move.
  • The English language versions don’t have details on films like Apocalypto.

Introducing: The Wordnet Dictionary and Thesaurus TomeRaider file.


We are currently generating the brand new Wiktionary File. This is to dictionaries what the Wikipedia is to Encyclopaedias. It’s a pretty remarkable dictionary.

But one of the top files we have is the Wordnet Dictionary. This is taken from the Princeton Wordnet project. Our version has all 130k definitions and the hyperlinked Thesaurus embedded. It’s a great file if your looking for a very comprehensive Dictionary and Thesaurus.


You can download Wornet as a TomeRaider file it from here

Thursday, 1 March 2007

How we make the TomeRaider Wikipedia File

Thanks for all your suggestions and glitch finds in the Wikipedia released last week. We have just uploaded a new one that’s much purer. Also this one links to the print version of the Wikipedia article as this renders better on handhelds.

It generally takes about 40 hours to make one of these TomeRaider Wikipedia files, and that’s not including checking and glitch fixing.

The process:

  • We download the raw text data of the Wikipedia. Uncompressed this comes in at over ten gigs. One file. Ten Gigs. It’s a whopper!
  • Then we pass it through some simple filters we wrote in C to get the size down to something manageable. This filter truncates the articles and removes some other “known junk”.
  • Then we have to pass it through a second program we wrote that does a bazillion find and replaces using regular expressions.
  • Lastly, the output of this file should be ready to import into TomeRaider.
  • When a TomeRaider file of this size – it is now 1.5 million entries – is made three very processor intensive tasks are performed.
    • The text gets compressed using our own methods. This takes an age – but its so efficient to uncompress on your smartphone that the trade off is worth it..
    • All of the hyperlinks, in every page need to be checked and fixed or removed. Even using binary search this takes a long long time.
    • The file then needs to be sorted into the right order for TomeRaider. Again, a simple tasks but on such huge quantities of text takes a long long time.

And the result, when you take it out of the oven, is a perfectly baked TomeRaider file of the wonderful Wikipedia.

I remember a couple of years ago talking to Andrew Orlowski about the Wikipedia, he was pretty pessimistic about its potential and the validity of its content. But it does seem that peer review on such a huge scale is a method that can produce factual content with a new kind of reliability.

You can download the new version of the TomeRaider Wikipedia Here and the new TomeRaider PPC /Smartphone here

Sunday, 25 February 2007

Introducing the new Wikipedia TomeRaider file.

Hooray!

We have just finished the first cut of the new Wikipedia TR3 file and it really rocks. A few years ago Eric Zachte converted the entire Wikipedia text to TR2 and then to TR3. You can still download or get on CD the files from Eriks website (see below).


But as the wiki grows it becomes more and more cumbersome to handle and the returns of relevance diminish with this growth, it is also now over that SD important 1 gigabyte mark. So, last month we made a decision which is to supply regular cuts of the full Wikipedia text but only the introductory paragraph and then a link to the online article for those who want to read.

It works really well:

  • The files is less than 250 megs so it will fit on most SD cards.
  • It is super fast to browse and index search, even with 1.3+ million entries.

There are some text errors in the file, and to be honest there probably always will be – that’s a drawback of a content source written by so many people. But we will be doing regular updates and constantly removing glitches ion the text. If you find and “significant repeaters” then drop us an email at the Yadabyte support address.

We currently have these cuts of the Wikipedia data, all available for immediate download:

  1. Wikipedia (226 MB) - This is the new file which holds the first paragraph of the articles, all 1.3+ million of them. For most people this will be the most useful. (File Version 01)
  2. Wikipedia Complete (1.1 GB +) - various cuts made by Erik Zachte. Its, contains the full wikipedia with full articiles
  3. Wikipedia Compact (49MB) - For this one we took a number of indexes from various encylopedia sources, merged them and then compared against the wikipedia. Close to a traditional encyclopedia in terms of scope.
Download TomeRaider for Pocket PC, palm OS, WM5, Windows etc at www.tomeraider.com
Email support issues to support at www.yadabyte.com.

Thanks:)

Monday, 19 February 2007

The Forthcoming Wikipedia TomeRaider File

To accompany TomeRaider 3.5 we are going to release a new cut of the Wikipedia as a TomeRaider file.

The Problem With The Wikipedia from a Portable Point of View

When Erik Zachte first spent month converting the Wikipedia to TomeRaider format a few years back the Wiki was a huge but manageable beast. But in that time its grown exponentially both in terms of articles and individual article content. It’s a wonderful thing, but its pretty cumbersome, even when powered by the TR engine.

We did do some various reduced size versions but when your dealing with 1.3 million articles and you want to cut that down there is no clear point at which to “yay or nay” any given data.

What our research found for the vast majority of Wikipedia uses is that most “day to day” uses of the Wikipeda involve that first paragraph, the abstract that every article has. The bit that sums up the article in a nut shell. The new TomeRadier wiki uses just this initial summary with a link to the full article online.

It is hard to express just how blindingly fast and cool this is. All 1.3 million entries in your pocket and, in the new TR 3.5 interface, browsing is close to addictive.

The new TomeRaider Wikipedia will be released within days and comes in at a super compressed 270 Megs.