2017 Columbia Data Science Hackathon
The Columbia Data Science Society proudly hosted the third annual Columbia Data Science Hackathon. We were excited by what you could do in collaboration with other students and mentors using datasets provided by our corporate sponsors. We hope you enjoyed the hackathon as much as we did, and we hope to see you again next year!
This dataset contains five hundred and seventeen thousand emails from the corporate email servers of the Enron Corporation. The Federal Energy Regulatory Commission released the corpus into the public domain after Enron filed for bankruptcy on December 2, 2001. Enron employed over twenty thousand people and was a major force in the American energy sector. Although Fortune magazine named it “America’s Most Innovative Company” for years, senior Enron employees were found to have engaged in systematic accounting fraud to inflate corporate assets and defraud investors. The company’s subsequent failure remains one of the most infamous events of the twenty-first century.
Economic, Financial and Energy Data
Dive into the time series data that Bloomberg customers can't live without!
Think you can beat the market? Backtest your model on 10 years of stock prices from thousands of tickers.
Think green tech is poised to challenge the conventional energy industry? Find relationships between Bloomberg New Energy Finance data (released for the first time at this hackathon!) and our other datasets to back up your claim.
Government Contract Data
In 2016 alone the U.S. government spent over $3 trillion. Government procurement is a sizeable part of the U.S. economy, just as procurement overall is a sizeable part of the global economy. This dataset includes over 15 years of contracts with the federal government, from 2000 to 2017. The data is quite granular, including not only the vendor who won the contract, the value of the contract, and the relevant federal agency, but also 200+ other columns of details. Digging into government procurement data, i.e. where the government is spending money, reveals a lot about government priorities, as well as the types of businesses that are receiving contracts.
Team Members: Arman Uygur, Moorissa Tjokro, Jonathan Galsurkar
Team Members: Adarsh Chavakula, Gaurav Singh, Somya Singhal, Vinay Kale
Team Members: Jason Cheuk Nam Liang, Chang Liu, Dawei Zhang, Wenjun Wang
Roone Arledge Auditorium
Friday, September 22, 2017