Materials

Learning materials from Xiaohan
- LLM Hands-on Tutorial
- link: LLM tutorial
- Deploy your NLP Model as an Interactive Web Application
- link: Deploy NLP models
- Fine Tuning BERT for Question Answering (QA) Task
- link: Fine tuning BERT

Learning materials online
Listed in order of when the latest technology was updated.
- (NLP) Infographics using Large Language Models (08-28-2023)
- LIDA is a library for generating data visualizations and data-faithful infographics. LIDA is grammar agnostic (will work with any programming language and visualization libraries e.g. matplotlib, seaborn, altair, d3 etc) and works with multiple large language model providers (OpenAI, PaLM, Cohere, Huggingface).
- https://github.com/microsoft/lida
- (NLP) Graph of Thought (08-24-2023)
- (NLP) Code Llama (08-24-2023)
- Code Llama is a family of large language models for code based on Llama 2 providing state-of-the-art performance among open models, infilling capabilities, support for large input contexts, and zero-shot instruction following ability for programming tasks.
- https://github.com/facebookresearch/codellama
- (audio) Seamless M4T
- Bringing the world closer together with a foundational multimodal model for speech translation
- (NLP) LlamaGPT (08-21-2023)
- A self-hosted, offline, ChatGPT-like chatbot, powered by Llama 2. 100% private, with no data leaving your device.
- https://github.com/getumbrel/llama-gpt
- (multimodal) CoDeF: Content Deformation Fields for Temporally Consistent Video Processing (08-20-2023)
- (AI simulation) Generative Agents: Interactive Simulacra of Human Behavior (08-13-2023)
- (NLP) gpt-llm-trainer (08-07-2023)
- This project aims to explore an experimental new pipeline to train a high-performing task-specific model. We try to abstract away all the complexity, so it’s as easy as possible to go from idea -> performant fully-trained model.
- Simply input a description of your task, and the system will generate a dataset from scratch, parse it into the correct format, and fine-tune a LLaMA 2 model for you.
- link: https://github.com/mshumer/gpt-llm-trainer
- (multimodal) Awesome-Multimodal-Large-Language-Models
- link: github repo
- (audio) Whisper:
- OpenAI trained and are open-sourcing a neural net called Whisper that approaches human-level robustness and accuracy on English speech recognition.
- https://openai.com/research/whisper
- (NLP) Doctor GPT (08-06-2023)
- DoctorGPT is a Large Language Model that can pass the US Medical Licensing Exam.
- link: github repo

Practical/Interesting Information
- ⏰ TOP HCI Conference Deadlines: https://hci-deadlines.github.io/
- ⏰ TOP AI Conference Deadlines: https://aideadlin.es/?sub=ML,CV,CG,NLP,RO,SP,DM,AP,KR,HCI
- 🚨 2023 ACL Paper: https://aclanthology.org/events/acl-2023/#2023acl-long