BattleFin with capital B and F-1-1

Alternative Data
Science Courses

battlefin-discovery-new-york-logo-home

The Plaza
June 19 - 20th, 2019 



Duov75_WoAI7t5_

Learn more about the educational alternative data science courses that you will be available at Discovery Day New York, June 19- 20th. 

 

Refine your skills, while exploring actual alternative data types. All courses will be live coding courses where each participant will work over the Ensemble sandbox platform to code in sync with the instructor.

 
Learn what it takes to evaluate, work with and generate insight from alternative data. The courses are built for both quantitative researchers and fundamental analysts. 
 
Attendees of the educational track should have an intermediate understanding of programming, grasp the basics of python, and bring their laptops to the course.
 
 
 
 

 

Day 1: Alternative Data Analysis Basics

 

Course 1 — Introduction to Data Analysis and Tools 
8:30am - 10:30am 

Learn how to accelerate your data analyses using the Python language and Pandas, a library specifically designed for interactive data analysis. Work with specific samples of data to understand what specific Alternative Data Types are like.

  • Understand Python Pandas core functionality (loading, filtering, grouping and transforming data)
  • Explore alternative data (samples of web scraping data products)

Having completed this workshop, you will understand the fundamentals of Panda, be aware of common pitfalls and be ready to perform your own analyses on alternative data sets.

Course 2 — Advanced Tools of The Trade & Alternative Data Types 
10:50am - 12:20pm

Learn how to perform advanced analyses on alternative data using PySpark and Ensemble. We will focus on merging datasets too large to fit in memory together and running analyses at scale.

  • Conduct an advanced exploration and analysis of Alternative Data samples with PySpark
  • Learn how to work “out of memory” vs “in memory”
  • Use samples of B2B, ESG or Geo-Location data products

Having completed this workshop, you will be ready to compute on your own Big Data to generate new insights. Course 2, prepares participants for the Alternative Data Exploration Modules in day 2.

 

Day 2: Alternative Data Exploration Modules

 

Course 3 — Sentiment Data Exploration Module
10:50am - 12:20am 

Focus on using signals from multiple sentiment datasets to predict daily growth for a bucket of stocks. We will then evaluate which stocks had a stronger correlation to the sentiment indicators, and which indicators had the strongest influence on price movement.

  • Use a Panda dataframe to extract our bucket of stocks from the datasets and join them together
  • Use featexp to do an initial analysis on which features may have the strongest correlation with price movement
  • Build a model using PyTorch and FastAI to forecast price movement.

 

Meet our Instructor: Dan Gerlanc


With more than 15 years of experience creating data intensive software, Dan Gerlanc is a data scientist and technologists who specializes in projects at the intersection of data science and software development.

Dan spent five years as a quantitative analyst with two Boston hedge funds before starting Enplus Advisors Inc., a boutique data science and custom software firm, in 2011.

Additionally Dan teaches data science and software development both at conference seminars and for private clients. He's an author and contributor to several open source projects, speaks at industry conferences, has published articles in peer-reviewed journals and is a Williams College alum.

892459