All posts filed under: Coursera

Data Management – Missing Values

Continuing my course in Data Management and Visualization with Coursera… For those that haven’t followed on my previous posts, I am currently enrolled in a Data Analysis Specialization with Wesleyan University through Coursera. With data from Gapminder, I am exploring a broad and basic question: does urbanization drive economic growth? For those of you interested in reading my literature review to gain a background in this project, please visit this page. Though I am working in SAS and Python in an attempt to learn both, I will only be presenting my work in SAS here (though I will also include my Python code for reference). The output format in SAS is easier on the eyes in my opinion. For this third assignment, I was originally going to try to calculate the urban population growth rate for 2010. Instead, I found the data available on Gapminder so I did not have to create a secondary variable for population growth. After inserting the latest variable, I decided to code out missing data. Since my focus is on GDP, …

Frequency Tables…The World is Mainly Poor…

I have always been hungry for knowledge and been ambitious about my goals. Having the chance to finally learn some programming, I decided I was going to learn both SAS and Python. Luckily, this isn’t my first exposure. While at ASU, I learned the basics of R programming and it really helped me to get an understanding of writing syntax in both SAS and Python. For my second assignment for the Data Management and Visualization course with Wesleyan University through Coursera, I am required to post the program I wrote and the frequency tables that it produced. Just as a reminder, I am working with Gapminder data on urbanization and economic growth. For the present analysis, I am using GDP per Capita, GDP Growth, Urban Population as a percentage of total population from 2010 and Urban Population in agglomerations over 1 million people from 2007. This is my code in SAS: FILENAME REFFILE “/home/wfhsu.taiwan0/my_courses/Data1.xlsx” TERMSTR=CR; PROC IMPORT DATAFILE=REFFILE DBMS=XLSX OUT=Gapminder2010; GETNAMES=YES; RUN; PROC CONTENTS DATA=Gapminder2010; RUN; /*to importat data from Excel file upload*/ LIBNAME mydata …

Data Visualization Assignment 1 – That Urbanization Thing

“When you are that curious about the world, scholarship never ends.” – October 18, 2015 As I mentioned in my last post, I started the Data Analysis and Interpretations Specialization with Coursera in order to gain more skills relevant to my pursuit of urban studies and interest in urban planning and development. Our assignment for the first week is to develop a research question based on the data sets provided by the course or another data set of our own choice. With my background and interest in cities, I looked through the code books for each data set looking for relevant data. I decided that the Gapminder data set had the information I needed to look into the effects of urbanization globally. Though the scale is on a national level, which can obscure many relationships, particularly the distinction between rural and urban areas and their respective economies, I look forward to comparing the general trends and effects of urbanization. The increasing rate of urbanization has been accompanied by a corresponding rise in number of urban …