Category Archives: Uncategorized

Thoughts about data operations

Very hard to give guidelines here since that each project have its own deployment process that depends on many factors such as the business context and practical issues associated with it. Continue reading

Advertisements
Posted in Big Data, Uncategorized | Tagged , | 1 Comment

The balance of exploratory analysis and development

Exploratory analysis should precede and follow any task from the modelling, design and development to the benchmarking. Major problem is how do you share, track and monitor your findings? How do you make your analysis repeatable and scrutinizable from the outside? This is still an open problem. Continue reading

Posted in Agile, Machine Learning, Uncategorized | Tagged , | 2 Comments

Mapping DataFrame to a typed RDD

I have recently published a blog post on DZone¬†“Making the Impossible Possible with Tachyon: Accelerate Spark Jobs from Hours to Seconds”¬†which describes the workflow and methodology that we use at Barclays to load data from the raw source (relational database) … Continue reading

Posted in Spark, Uncategorized | Tagged , , | Leave a comment

‘Companies will stop hiring data scientists when they realise that the majority bring no value’ says data scientist – Computing

via ‘Companies will stop hiring data scientists when they realise that the majority bring no value’ says data scientist – Computing.

Posted in Uncategorized | Leave a comment

The amount of digital data in the new era has grown exponentially in recent years and with the development of new technologies, is growing more rapidly than ever before. Simply recording data is one thing, whereas the ability to utilize it and turn it into a profit is another. Supposing we want to collect as many pieces of information as we can gather from any source, our database will be populated with a lot of sparse, unstructured, and not-explicitly-well-clear correlated data. In this essay we summarized the approach proposed in Chapter IV “Uncertain Knowledge and Representation” of the book “Artificial Intelligence: A Modern Approach” written by Russel S. and Norvig P., showing how the problem of reasoning under uncertainty is applied in data science, and in particular in the recent data revolution scenario. The proposed approach analyzes an extension of the Bayesian networks called Decisions networks that resulted to be a simple but elegant model for reasoning in presence of uncertainty. Continue reading

Link | Posted on by | Tagged , , , | Leave a comment

Ubiquitous Computing for Big Data Insight: Helpful Tool or Privacy Breaker? The next generation of devices able to access to the Internet and the Web may not be characterized by computers, smartphones, tablets or appliances designed for this specific purpose. … Continue reading

Link | Posted on by | Tagged , | Leave a comment

Space: Cisco Security Grand Challenge | NineSights

Space: Cisco Security Grand Challenge | NineSights.

Posted in Uncategorized | Leave a comment