Health Data Analysis – Pivot Billions https://pivotbillions.com Fri, 16 Sep 2022 23:21:15 +0000 en-US hourly 1 https://wordpress.org/?v=4.9.26 Indoor vs Outdoor Activities? CDC Health Data Shows Which Is Better For You https://pivotbillions.com/indoor-vs-outdoor-activities-cdc-health-data-shows-which-is-better-for-you/ Tue, 05 Mar 2019 01:41:44 +0000 https://www.pivotbillions.com/?p=2222 continue reading »]]>

 

Diving into CDC Behavioral Risk Factor data using Pivot Billions to learn what exercise behaviors are associated with improved health.

 

Motivating yourself to go outside and get some exercise or play a sport can be hard, but it is worth it. I had trouble with this myself but after looking into the CDC’s Behavioral Risk Factor Surveillance System data from Kaggle I decided to redouble my efforts. The CDC dataset I examined contains over 400,000 individuals’ responses to survey questions for each year from 2011 to 2015 and is an extremely comprehensive dataset containing answers to hundreds of questions regarding the individuals’ behaviors and their resulting health.

However, the size and structure of the data makes it hard to explore. Particularly, the survey schema changes each year and the english meaning of each survey question (and the corresponding answers) are found in a different pdf for each year of the data. I decided to use Pivot Billions to analyze the data since it has useful ETL and data joining features, allowing me to quickly narrow down on the data and survey questions I wanted to explore and discover some of the trends buried within.

One relationship I wanted to explore was whether various types of physical activity affect a person’s physical, mental, and emotional health before and after retirement. Using Pivot Billions I enhanced the data with three new columns based off of the existing survey questions determining the surveyed persons’ age, main sport over the 30 days prior to the survey, and degree of health limitation. I needed to use a lookup table I’d made to create the MainSport column (matching survey answer to the corresponding activity) so I was grateful that it was easy to combine into my data.

Now that I had a full view of my data along with its enhancements, I quickly filtered the data to eliminate “Refused” and blank answers to the experienced health limitations survey question. This way I had a clear metric for my data: the average degree of experienced physical, mental, or emotional health limitations on a scale from 0 to 1.

My data was now ready for more detailed analysis so I used Pivot Billions’ pivot function to reorganize my data by age category and main sport. Viewing the result as a Bar Graph a picture started to form of what types of activities were associated with better overall health.

What I noticed first in this visualization is most of the activities associated with worse health for both age groups were activities a person would do in or around the house. I wanted to see how this effect affected each age group so I quickly filtered the pivoted data to the 18 to 64 age group and sorted by decreasing health limitations (increasing overall health).

Sure enough, with a few exceptions such as swimming in laps, the most health limitations were experienced pre-retirement when the individuals exercised or did chores primarily around the house. There was a dramatic improvement in overall health when an individual consistently went outside, even if it was just to play Golf.

Now, I wanted to see whether there was any effect on the post-retirement age group as well so I quickly re-filtered the pivoted data.

Again we see staying inside of or working around the house associated with a much greater degree of physical, mental, and emotional limitations. In fact getting outside of the house, even to go bowling or golfing, brought an over 40% reduction in experienced limitations for the more than 900,000 people surveyed. It turns out that finding a physical activity that will consistently get you out of the house can help not only your physical fitness, but also dramatically improve your mental and emotional well being.

]]>
Health Data Analysis: CDC Behavioral Risk Factor data says eat your green veggies https://pivotbillions.com/health-data-analysis-cdc-behavioral-risk-factor-data-says-eat-your-green-veggies/ Wed, 19 Dec 2018 22:49:03 +0000 https://www.pivotbillions.com/?p=1621 continue reading »]]>

Everyone wants to be healthy but there are many competing claims as to how you can achieve this. With so many contradictory diets, exercise routines that take enormous amounts of time and dedication, and many other perceived paths to a healthy body and mind; tying these claims to actual data becomes very necessary and useful. That is why we decided to explore the CDC’s Behavioral Risk Factor Surveillance System data from Kaggle.

This data contains over 400,000 individuals’ responses to survey questions from 2011 to 2015. It is an extremely comprehensive dataset containing answers to hundreds of questions regarding the individuals’ behaviors and their resulting health. However, all of this data becomes quite tedious and can be arduous to dive into. With Pivot Billions we are able to easily load and enhance the dataset and immediately start to understand the data and pivot it in various ways to see the trends buried within.

For our Behavioral Risk Factor data, we created additional columns categorizing each individuals’ intake of fruit, fruit juice, and green or orange vegetables from the data using the column-creation feature. We can then pivot the data by some of these columns and the amount of activity limitation due to health problems to immediately see the average activity limitation by green vegetable and fruit intake.

The over 400,000 surveys clearly show health improvements due to either daily intake of green vegetables or fruit. Moreover, we can easily see that daily eating of fruit is better than not eating fruit or green vegetables, but daily eating of green vegetables is preferred over eating fruit and that eating both green vegetables and fruit at least once daily is the best option. In fact, the data reveals that daily intake of green vegetables and fruit leads to an over 40% reduction in experienced health limitations!

By pivoting the data by orange vegetable and fruit juice intake, we can now quickly analyze their impact.

It appears that orange vegetables and fruit juice do reduce experienced health limitations by approximately 16%, but green vegetables and fruit are still much more effective.

These are just some of the underlying trends in the data that we found using Pivot Billions. There are many more to explore in this dataset. Stay tuned for our next analysis or feel free to try it out yourself.

 

 

]]>