Overview It’s been a few weeks since we posted something in our 5 Minute Analysis series, so we decided to do two quick analyses on two different datasets we found on Kaggle. Instead of doing the analysis locally using PivotBillions on Docker, we opted to run the analysis in the free cloud version available on […] continue reading »
Category: All
all blog posts
Taming 1.5 Billion Rows of “Big Apple” Data
The age of data has arrived. With it, more and more datasets are created and they just keep getting bigger. Whether dealing with private or open data, individuals and organizations across the world are realizing that there are enormous amounts of information and insights to be gained from massive data. The public NYC Taxi and […] continue reading »
R NewYorkers Feeling the Holiday Spirit? Here’s Your Tip
The holiday season brings with it a degree of cheer and joy that many claim makes people act friendlier towards each other. I wanted to see how this effect translates to action so I decided to look into tips for New York green taxis both during the holiday season and the rest of the year. […] continue reading »
Do the holidays mean bigger tips for NYC taxi drivers?
The holiday season brings with it a degree of cheer and joy that many claim makes people act friendlier towards each other. I wanted to see how this effect translates to action so I decided to look into tips for New York green taxis both during the holiday season and the rest of the year. […] continue reading »
Pivot Billions and Deep Learning enhanced trading models achieve 30% net profit
Deep Learning has revolutionized the fields of image classification, personal assistance, competitive board game play, and many more. However, the financial currency markets have been surprisingly stagnant. In our efforts to create a profitable and accurate trading model, we came upon the question: what if financial currency data could be represented as an image? The […] continue reading »
Health Data Analysis: CDC Behavioral Risk Factor data says eat your green veggies
Everyone wants to be healthy but there are many competing claims as to how you can achieve this. With so many contradictory diets, exercise routines that take enormous amounts of time and dedication, and many other perceived paths to a healthy body and mind; tying these claims to actual data becomes very necessary and useful. […] continue reading »
Completing the Picture: Who is the Fantasy Football GOAT for Offense?
Fantasy football can be a relaxing past time but for anyone who takes the competition seriously, data immediately becomes very necessary. While many people track their favorite players from their favorite teams, to truly put together a winning team you need to be able to explore and understand large amounts of data. Moreover, the data […] continue reading »
Completing the Picture: Uncovering NHL MVP’s in a pile of data
Data is rarely consistent. The most consistent attribute of data is that it is usually dispersed across many files and needs to be put back together again to truly understand it. Pivot Billions dramatically improves this process and makes it easy to merge your data and start to analyze it. As an example of this […] continue reading »
Finding Underutilized Kaggle Data
Overview In this 5 Minute Analysis we are exploring a Kaggle dataset about Kaggle datasets. This dataset lets us see a list of the datasets on Kaggle, and shows which ones have the most engagement and activity. Our goal is to explore and filter the data to find popular datasets with many downloads but very […] continue reading »
Powering Insight Through Massive Optimization
Data comes with a price. Accuracy comes with an even greater price. And the two together can demand enormous resources. That’s why it is important to achieve the greatest efficiency in your research process and make use of any tools that can help you. This is particularly true if you are trying to develop a […] continue reading »