Category Archives: data

Big Data Workshop – Squeezing more out of Data

The Laboratory for Web Science (LWS) at the Fernfachhochschule Schweiz is organizing the workshop “Squeezing more out of Data” on the topic of “Big Data”. The opportunities and risks associated with “Big Data” now fill the media. By linking different data sources, … Continue reading

Posted in Big Data, data, Data Science

One size fits all – no way!

I recently attended a Big Data conference (Euroforum Big Data, Switzerland). The conference was nicely organized and the topics were interesting. Participants were mostly from the IT industry, and judging from the list of participants one could expect rather executive-summary-style talks. Nothing … Continue reading

Posted in Big Data, data, Data Science

The Role of Trends in Evolving Networks

Modelling complex networks has been the focus of much research for over a decade [1]-[3]. Preferential attachment (PA) [4] is considered a common explanation for the self-organization of evolving networks, suggesting that new nodes prefer to attach to more … Continue reading

Posted in algos, complex_system, data

WebScience 13: our contribution

We are pleased to announce that our paper “Preferential Attachment in Online Networks: Measurement and Explanations” has been accepted for the ACM Web Science Conference (www.websci13.org). The paper is joint work with Jerome Kunegis from the University of Koblenz, … Continue reading

Posted in complex_system, data | 1 Comment

Sorting Cats with Hadoop and psort

This is my first “self” tutorial on Hadoop MapReduce streaming. If you are really IT-oriented you probably want to read http://hadoop.apache.org/docs/r0.15.2/streaming.html (or any newer version). This post doesn’t add much to that document with respect to Hadoop MapReduce streaming. Here I play a … Continue reading

Posted in data, misc

psort: Parallel sorting on the command line. An example.

I am in the process of understanding Hadoop and the MapReduce framework. This introductory line will be clarified in the next post, but keep in mind that in this post I am not looking for the fastest sort but a … Continue reading

Posted in data, misc

Recommender algorithm in Mathematica

This post is a bit geeky, but never mind! In our Lab we do not defend any particular tool or method for coding or analyzing data. We stick to a simple strategy: choose the tool that best fits the task! … Continue reading

Posted in algos, data | 3 Comments