2017 Columbia Data Science Hackathon



The Columbia Data Science Society proudly hosted the third annual Columbia Data Science Hackathon. We were excited by what you could do in collaboration with other students and mentors using datasets provided by our corporate sponsors. We hope you enjoyed the hackathon as much as we did, and we hope to see you again next year!

Hackathon Datasets

Digital Reasoning



Enron Corpus

This dataset contains about 517,000 emails from the corporate email servers of the Enron Corporation. The Federal Energy Regulatory Commission released the corpus into the public domain after Enron filed for bankruptcy on December 2, 2001. Enron employed over twenty thousand people and was a major force in the American energy sector. Although Fortune magazine named Enron “America’s Most Innovative Company” for years, senior Enron employees were found to have engaged in systematic accounting fraud to inflate corporate assets and defraud investors. The company's subsequent collapse remains one of the most infamous corporate failures of the twenty-first century.

Economic, Financial and Energy Data

Dive into the time-series data that Bloomberg customers can't live without!

  • Think you can beat the market? Backtest your model on 10 years of stock prices from thousands of tickers.

  • Think green tech is poised to challenge the conventional energy industry? Find relationships between Bloomberg New Energy Finance data (released for the first time at this hackathon!) and our other datasets that back up your claim.


Government Contract Data

In 2016 alone, the U.S. government spent over $3 trillion. Government procurement is a sizeable part of the U.S. economy, just as procurement overall is a sizeable part of the global economy. This dataset includes over 15 years of federal government contracts, from 2000 to 2017. The data is quite granular: it includes the vendor who won the contract, the value of the contract, the relevant federal agency, and 200+ other columns of details. Digging into government procurement data, i.e., where the government is spending money, reveals a lot about government priorities, as well as the types of businesses that are receiving contracts.


Hackathon Winners


1st Place

Unusual Suspects

Team Members: Arman Uygur, Moorissa Tjokro, Jonathan Galsurkar


2nd Place

Noob Network

Team Members: Adarsh Chavakula, Gaurav Singh, Somya Singhal, Vinay Kale


3rd Place

The Hedgers

Team Members: Jason Cheuk Nam Liang, Chang Liu, Dawei Zhang, Wenjun Wang






Roone Arledge Auditorium
Lerner Hall Floor 1W
Columbia University
New York, NY 10027


Friday, September 22, 2017
Start: 4 pm
Saturday, September 23, 2017
End: 2 pm




Previous Hackathons

Organized By