Skip to main content

MapReduce

Google's MapReduce is a new parallelism framework for processing large amounts of data. Some recommended links are:

Comments

  1. Hey Alex
    you may like this book about about large scale text processing with mapreduce
    http://www.umiacs.umd.edu/~jimmylin/book.html

    ReplyDelete
  2. Thanks, Andrei. Seems very good book.

    ReplyDelete

Post a Comment

Popular posts from this blog

SQLite Musicbrainz database

Here is a rar file with a stripped down SQLite Musicbrainz database. The compressed file is about 700 MB, so it will take around 10 min to download with a good connection.

This database is for educational purposes. I have not deleted tuples from the main tables, just removed some columns and tables I thought weren't necessary to experience and understand the data.

Here is a database schema diagram and a file with create table statements and explanatory comments. Also here is a Toad data model.

The raw data used for the database was downloaded from Musicbrainz on Oct 14, 2012.

Latent Semantic Analysis Tutorial

I wrote a tutorial on Latent Semantic Analysis (LSA). It can be accessed by following this link. I believe LSA is a very interesting method for ranking documents in response to a query.

LSA is a method for discovering hidden concepts in document data. Each document and term (word) is expressed as a vector with elements corresponding to these concepts. Each element in a vector gives the degree of participation of the document or term in the corresponding concept.

The goal is not to describe the concepts verbally, but to be able to represent the documents and terms in a unified way for exposing document-document, document-term, and term-term similarities or semantic relationship which are otherwise hidden.

An Example

Suppose we have the following set of five documents

d1 : Romeo and Juliet.
d2 : Juliet: O happy dagger!
d3 : Romeo died by dagger.
d4 : “Live free or die”, that’s the New-Hampshire’s motto.
d5 : Did you know, New-Hampshire is in New-England.

and search query: dies, dagger.

A classical…

JavaFX on Android

JavaFX is a great software framework for creating and delivering desktop applications. It is intended to replace Swing as the standard GUI library for Java SE. However, it is not quite easy to develop applications for mobile devices with the framework.

This video is great at explaining how to make a JavaFX application for Android:
How to make a JavaFX Android App - July 2017The main process consists of: Installing a specific plugin on your Java IDEInstalling the Android SDK (Software Development Kit)Building the final APK file which you can then upload to the Google Play Store