Depending on the use case, engineering an analytics solution (Raw Data -> Aggregated Data -> Contextual Intelligence -> Analytical Insights (reporting vs. prediction) -> Decisions (Human or Automated Downstream Actions)) requires choices along several dimensions.
Cascade 2.0 - To Go from Big Data to Big Insight, Start with a Visual
We are documenting every tweet, retweet, and click on every shortened URL from Twitter and Facebook that points back to New York Times content, and then combining that with the browsing logs of what those users do when they land at the Times. This project is a relative of the widely noted Cascade project. Think of it as Cascade 2.0.
EfraudBox addresses internet fraud (detection engine, investigative tools). EfraudBox is an ANR project in collaboration with the GIE Cartes Bancaires, Thales, KXEN, the Paris laboratories LIP6, LIP13 and LIPN, the National Gendarmerie and the Judicial Police. Technologies used by Altic: Hadoop, Mahout, Rhipe, SpagoBI, Palo GPU.
Differences in use case for Column and Row-Oriented Databases.
Column-oriented databases are best used for analytics against large data volumes, because an aggregate query reads only the columns it needs. Row-oriented databases are best suited for transaction processing with low-complexity, high-volume datasets, where each operation touches whole records. NoSQL databases are best suited for high-complexity, high-volume datasets.
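
The difference between the two layouts can be illustrated with a minimal Python sketch. It is not how any particular database stores data: the sample orders, the field names (order_id, customer, amount) and the helper functions are all hypothetical, chosen only to show why fetching one record favors a row layout while an aggregate over one field favors a column layout.

```python
# Hypothetical sample data: three orders stored row by row.
rows = [
    {"order_id": 1, "customer": "a", "amount": 10.0},
    {"order_id": 2, "customer": "b", "amount": 25.5},
    {"order_id": 3, "customer": "a", "amount": 4.5},
]


def to_columns(records):
    """Pivot a list of row dicts into a dict of column lists."""
    columns = {}
    for record in records:
        for field, value in record.items():
            columns.setdefault(field, []).append(value)
    return columns


# The same data stored column by column.
columns = to_columns(rows)


def get_order(records, order_id):
    """Transactional access: return one complete record, or None."""
    for record in records:
        if record["order_id"] == order_id:
            return record
    return None


def total_amount_row_layout(records):
    """Analytical access on rows: every record is visited in full."""
    return sum(record["amount"] for record in records)


def total_amount_column_layout(cols):
    """Analytical access on columns: only the 'amount' list is read."""
    return sum(cols["amount"])
```

Both total functions return the same value. The point of the sketch is what each one has to touch: the row version walks every record, including fields it does not need, while the column version reads a single contiguous list. Conversely, get_order returns a whole record in one step from the row layout, which the column layout would have to reassemble from several lists.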