Computer Visions | GenAI Speaker Series
Fall’23 Speakers
We will host a number of speakers from the world of generative media, all active artists, researchers and leaders that will share the actual methods they use for their work (including links to code and examples).
Prof. Ethan Mollick
Associate Professor at The Wharton School, University of Pennsylvania |
||
Guillermo Rauch
Co-Founder and CEO at Vercel |
||
Lior Sinclair
AI Eduactor, ML Engineer/Researcher, Founder AlphaSignal.ai |
||
Akshay Pachaar
Lead Data Scientist at TomTom |
||
Jonathan Whitaker
Data scientist and AI researcher |
||
Prof. Dan Novy
Assistant Professor of Emerging Media Arts at the University of Nebraska-Lincoln |
||
Rinon Gal
PhD Student at TAU Graphics, GenAI Researcher at Nvidia |
||
Doron Adler
Software engineer, GenAI artist and engineer |
||
Pat Pataranutaporn
PhD Candidate at MIT, GenAI researcher |
||
Santiago Valdarrama
ML educator and GenAI engineer |
MAS.S63 Fall 2023
The class portion of Computer Visions is open for student registrations for Fall’23.
Location: E15-359 (some sessions will be remote)
Time: Mondays, 4-6pm
Instructors: Dr. Roy Shilkrot, Prof. Pattie Maes
Units: 0-6-0
Topics Covered
- Students with use online materials to get familiar with the mechanics and concepts in Machine Learning and Deep Learning (model, classification, regression, supervision, train-test, loss, overfitting, regularization, linear models, neural nets, back-propagation optimization, convolution, recurrence, attention, transformers). While it will be mentioned in class, we will not cover them in depth.
- Working environments for generative media, e.g. Google Colab, Jupyter Notebooks, Streamlit/Gradio APIs, Hugginface Spaces, etc.
- Significant ML models and architectures of interest: GANs (pix2pixX, Style), Diffusion Models (e.g. Stable), LLMs (GPT-n, llama-n, etc.), Speech (SeamlessM4T, Whisper), Music & Audio (MuseFormer, AudioCraft, AudioLDM), and more.