The age of data has arrived. With it, more and more datasets are created and they just keep getting bigger. Whether dealing with private or open data, individuals and organizations across the world are realizing that there are enormous amounts of information and insights to be gained from massive data. The public NYC Taxi and […] continue reading »
Author: pivotteam
Do the holidays mean bigger tips for NYC taxi drivers?
The holiday season brings with it a degree of cheer and joy that many claim makes people act friendlier towards each other. I wanted to see how this effect translates to action, so I decided to look into tips for New York green taxis during both the holiday season and the rest of the year. […] continue reading »
Pivot Billions and Deep Learning enhanced trading models achieve 30% net profit
Deep Learning has revolutionized the fields of image classification, personal assistance, competitive board game play, and many more. However, the financial currency markets have been surprisingly stagnant. In our efforts to create a profitable and accurate trading model, we came upon the question: what if financial currency data could be represented as an image? The […] continue reading »
Health Data Analysis: CDC Behavioral Risk Factor data says eat your green veggies
Everyone wants to be healthy, but there are many competing claims as to how you can achieve this. With so many contradictory diets, exercise routines that take enormous amounts of time and dedication, and many other perceived paths to a healthy body and mind, tying these claims to actual data becomes both necessary and useful. […] continue reading »
Completing the Picture: Who is the Fantasy Football GOAT for Offense?
Fantasy football can be a relaxing pastime, but for anyone who takes the competition seriously, data quickly becomes essential. While many people track their favorite players from their favorite teams, to truly put together a winning team you need to be able to explore and understand large amounts of data. Moreover, the data […] continue reading »
Completing the Picture: Uncovering NHL MVPs in a pile of data
Data is rarely consistent. The most consistent attribute of data is that it is usually dispersed across many files and needs to be put back together again to truly understand it. Pivot Billions dramatically improves this process and makes it easy to merge your data and start to analyze it. As an example of this […] continue reading »
Finding Underutilized Kaggle Data
Overview: In this 5 Minute Analysis, we are exploring a Kaggle dataset about Kaggle datasets. This dataset lets us see a list of the datasets on Kaggle, and shows which ones have the most engagement and activity. Our goal is to explore and filter the data to find popular datasets with many downloads but very […] continue reading »
Powering Insight Through Massive Optimization
Data comes with a price. Accuracy comes with an even greater price. And the two together can demand enormous resources. That’s why it is important to achieve the greatest efficiency in your research process and make use of any tools that can help you. This is particularly true if you are trying to develop a […] continue reading »
Streamlining EDA (exploratory data analysis) with Pivot Billions enhances workflows in R.
Incorporating Pivot Billions into your R analysis workflow can dramatically improve the research cycle and your ability to get results. R is a great statistical analysis tool that a wide variety of data analysts use to analyze and model data. But R has limits on the data it can load onto your machine and tends […] continue reading »
Blazing Fast Financial Backtesting from R
As a data scientist, whenever I develop and test financial models in R, I consistently run into data size limitations, the need for large or distributed compute clusters, and long waits for my results to be processed and returned. That’s why I was genuinely impressed with how our recently released Docker image of Pivot Billions, […] continue reading »