Adapting Large Language Models via Reading Comprehension

Prof. Otto Nomos ∙ May 27, 2024 ∙ 2 min read

Abstract Commentary & Rating

OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from Scratch

Prof. Otto Nomos ∙ May 27, 2024 ∙ 2 min read

Abstract Commentary & Rating

PDFTriage: Question Answering over Long, Structured Documents

Prof. Otto Nomos ∙ May 27, 2024 ∙ 2 min read

Abstract Commentary & Rating

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Prof. Otto Nomos ∙ May 27, 2024 ∙ 2 min read

Abstract Commentary & Rating

An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models

Prof. Otto Nomos ∙ May 27, 2024 ∙ 2 min read

Abstract Commentary & Rating

MindAgent: Emergent Gaming Interaction

Prof. Otto Nomos ∙ May 27, 2024 ∙ 2 min read

Abstract Commentary & Rating

Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data?

Prof. Otto Nomos ∙ May 25, 2024 ∙ 2 min read

Abstract Commentary & Rating

Recovering from Privacy-Preserving Masking with Large Language Models

Prof. Otto Nomos ∙ May 25, 2024 ∙ 2 min read

Abstract Commentary & Rating

S3-DST: Structured Open-Domain Dialogue Segmentation and State Tracking in the Era of LLMs

Prof. Otto Nomos ∙ May 25, 2024 ∙ 2 min read

Abstract Commentary & Rating

Augmenting text for spoken language understanding with Large Language Models

Prof. Otto Nomos ∙ May 25, 2024 ∙ 2 min read

Abstract Commentary & Rating

Language Modeling Is Compression

Prof. Otto Nomos ∙ May 25, 2024 ∙ 2 min read

Abstract Commentary & Rating

Baichuan 2: Open Large-scale Language Models

Prof. Otto Nomos ∙ May 24, 2024 ∙ 2 min read

Abstract Commentary & Rating

Stabilizing RLHF through Advantage Model and Selective Rehearsal

Prof. Otto Nomos ∙ May 24, 2024 ∙ 2 min read

Abstract Commentary & Rating

Chain-of-Verification Reduces Hallucination in Large Language Models

Prof. Otto Nomos ∙ May 24, 2024 ∙ 1 min read

Abstract Commentary & Rating

LMDX: Language Model-based Document Information Extraction and Localization

Prof. Otto Nomos ∙ May 24, 2024 ∙ 2 min read

Abstract Commentary & Rating

SlimPajama-DC: Understanding Data Combinations for LLM Training

Prof. Otto Nomos ∙ Oct 04, 2023 ∙ 2 min read

Abstract Commentary & Rating

Contrastive Decoding Improves Reasoning in Large Language Models

Prof. Otto Nomos ∙ Oct 04, 2023 ∙ 2 min read

Abstract Commentary & Rating

CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

Prof. Otto Nomos ∙ Oct 04, 2023 ∙ 2 min read

Abstract Commentary & Rating

A Data Source for Reasoning Embodied Agents

Prof. Otto Nomos ∙ Oct 04, 2023 ∙ 2 min read

Abstract Commentary & Rating

Leveraging Contextual Information for Effective Entity Salience Detection

Prof. Otto Nomos ∙ Oct 04, 2023 ∙ 2 min read

Abstract Commentary & Rating

LASER: LLM Agent with State-Space Exploration for Web Navigation

Prof. Otto Nomos ∙ Oct 04, 2023 ∙ 2 min read

Abstract Commentary & Rating

Sparse Autoencoders Find Highly Interpretable Features in Language Models

Prof. Otto Nomos ∙ Oct 04, 2023 ∙ 2 min read

Abstract Commentary & Rating

Investigating Answerability of LLMs for Long-Form Question Answering

Prof. Otto Nomos ∙ Oct 04, 2023 ∙ 2 min read

Abstract Commentary & Rating

Scaling Laws for Sparsely-Connected Foundation Models

Prof. Otto Nomos ∙ Oct 04, 2023 ∙ 2 min read

Abstract Commentary & Rating

Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

Prof. Otto Nomos ∙ Oct 04, 2023 ∙ 2 min read

Abstract Commentary & Rating

Ambiguity-Aware In-Context Learning with Large Language Models

Prof. Otto Nomos ∙ Oct 04, 2023 ∙ 2 min read

Abstract Commentary & Rating

Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation?

Prof. Otto Nomos ∙ Oct 04, 2023 ∙ 2 min read

Abstract Commentary & Rating

Clinical Text Summarization: Adapting Large Language Models Can Outperform Human Experts

Prof. Otto Nomos ∙ Oct 04, 2023 ∙ 2 min read

Abstract Commentary & Rating

Agents: An Open-source Framework for Autonomous Language Agents

Prof. Otto Nomos ∙ Oct 04, 2023 ∙ 2 min read

Abstract Commentary & Rating

Statistical Rejection Sampling Improves Preference Optimization

Prof. Otto Nomos ∙ Oct 04, 2023 ∙ 2 min read

Abstract Commentary & Rating

Large Language Models for Compiler Optimization

Prof. Otto Nomos ∙ Oct 03, 2023 ∙ 2 min read

Abstract Commentary & Rating

AstroLLaMA: Towards Specialized Foundation Models in Astronomy

Prof. Otto Nomos ∙ Oct 03, 2023 ∙ 2 min read

Abstract Commentary & Rating

Large Language Model for Science: A Study on P vs. NP

Prof. Otto Nomos ∙ Oct 03, 2023 ∙ 2 min read

Abstract Commentary & Rating

Efficient Memory Management for Large Language Model Serving with PagedAttention

Prof. Otto Nomos ∙ Oct 03, 2023 ∙ 2 min read

Abstract Commentary & Rating

FIAT: Fusing learning paradigms with Instruction-Accelerated Tuning

Prof. Otto Nomos ∙ Oct 03, 2023 ∙ 2 min read

Abstract Commentary & Rating

Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs

Prof. Otto Nomos ∙ Oct 03, 2023 ∙ 2 min read

Abstract Commentary & Rating

When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale

Prof. Otto Nomos ∙ Oct 03, 2023 ∙ 2 min read

Abstract Commentary & Rating

Neurons in Large Language Models: Dead, N-gram, Positional

Prof. Otto Nomos ∙ Oct 03, 2023 ∙ 2 min read

Abstract Commentary & Rating

Textbooks Are All You Need II: phi-1.5 technical report

Prof. Otto Nomos ∙ Oct 03, 2023 ∙ 3 min read

Abstract Commentary & Rating

DrugChat: Towards Enabling ChatGPT-Like Capabilities on Drug Molecule Graphs

Prof. Otto Nomos ∙ Oct 03, 2023 ∙ 2 min read

Abstract Commentary & Rating

From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting

Prof. Otto Nomos ∙ Oct 03, 2023 ∙ 2 min read

Abstract Commentary & Rating

XGen-7B Technical Report

Prof. Otto Nomos ∙ Oct 03, 2023 ∙ 2 min read

Abstract Commentary & Rating

DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models

Prof. Otto Nomos ∙ Oct 03, 2023 ∙ 2 min read

Abstract Commentary & Rating

GPT Can Solve Mathematical Problems Without a Calculator

Prof. Otto Nomos ∙ Oct 03, 2023 ∙ 1 min read

Abstract Commentary & Rating

Large Language Models as Optimizers

Prof. Otto Nomos ∙ Oct 03, 2023 ∙ 2 min read

Abstract Commentary & Rating

Efficient RLHF: Reducing the Memory Usage of PPO

Prof. Otto Nomos ∙ Oct 03, 2023 ∙ 2 min read

Abstract Commentary & Rating

ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models

Prof. Otto Nomos ∙ Oct 03, 2023 ∙ 2 min read

Abstract Commentary & Rating

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback [Summary]

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 8 min read

Uncover the challenges, limitations, and future of Reinforcement Learning from Human Feedback (RLHF) in AI systems. Explore governance, safety, and more.

Llama 2: Open Foundation and Fine-Tuned Chat Models [Commentary]

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 5 min read

Deep dive into Llama 2: Explore the pretraining, fine-tuning, safety measures, and insightful discussions in our comprehensive summary.

Challenges and Applications of Large Language Models [Summary]

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 20 min read

Explore our summary and key insights of 'Challenges and Applications of Large Language Models', a research paper that delves into the potential, challenges, and applications of LLMs.

LoraHub: Efficient Cross-Task Generalization Via Dynamic LoRA Composition [Commentary]

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 8 min read

Explore the dynamic composition of LoRA modules with LoraHub for adaptable LLM performance. Dive into problem statements, methodology, evaluation, and more.

ToolLLM: Facilitating Large Language Models To Master 16000+ Real-World APIs [Commentary]

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 4 min read

Explore the power of Large Language Models (LLMs) in API interaction with our summary of 'ToolLLM'.

FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

Graph of Thoughts: Solving Elaborate Problems with Large Language Models

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

Efficient Guided Generation for Large Language Models

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

Predicting transcriptional outcomes of novel multigene perturbations with GEARS

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

A Survey on Model Compression for Large Language Models

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

LLM As DBA

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

Self-Alignment with Instruction Backtranslation

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

Large Content And Behavior Models To Understand, Simulate, And Optimize Content And Behavior

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

BioCoder: A Benchmark for Bioinformatics Code Generation with Contextual Pragmatic Knowledge

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

Can Programming Languages Boost Each Other via Instruction Tuning?

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

WeatherBench 2: A benchmark for the next generation of data-driven global weather models

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

MedAlign: A Clinician-Generated Dataset for Instruction Following with Electronic Medical Records

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

SoTaNa: The Open-Source Software Development Assistant

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

Teach LLMs to Personalize -- An Approach inspired by Writing Education

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

RAVEN: In-Context Learning with Retrieval Augmented Encoder-Decoder Language Models

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

The Devil is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

CausalLM is not optimal for in-context learning

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

Platypus: Quick, Cheap, and Powerful Refinement of LLMs

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

OctoPack: Instruction Tuning Code Large Language Models

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

Enhancing Network Management Using Code Generated by Large Language Models

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

Improving Joint Speech-Text Representations Without Alignment

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

PIPPA: A Partially Synthetic Conversational Dataset

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

OpenProteinSet: Training data for structural biology at scale

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

Accelerating LLM Inference with Staged Speculative Decoding

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 1 min read

Abstract Commentary & Rating

Shepherd: A Critic for Language Model Generation

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

Simple synthetic data reduces sycophancy in large language models

Prof. Otto Nomos ∙ Oct 02, 2023 ∙ 2 min read

Abstract Commentary & Rating

AI Research

MindAgent: Emergent Gaming Interaction

A Data Source for Reasoning Embodied Agents

LASER: LLM Agent with State-Space Exploration for Web Navigation

Agents: An Open-source Framework for Autonomous Language Agents

ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models

BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents

From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models

Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment

Simple synthetic data reduces sycophancy in large language models

Augmenting text for spoken language understanding with Large Language Models

Natural Language Supervision for General-Purpose Audio Representations

Improving Joint Speech-Text Representations Without Alignment

S3-DST: Structured Open-Domain Dialogue Segmentation and State Tracking in the Era of LLMs

Investigating Answerability of LLMs for Long-Form Question Answering

SoTaNa: The Open-Source Software Development Assistant

PIPPA: A Partially Synthetic Conversational Dataset

WeatherBench 2: A benchmark for the next generation of data-driven global weather models

Large Language Models for Compiler Optimization

LLM As DBA

BioCoder: A Benchmark for Bioinformatics Code Generation with Contextual Pragmatic Knowledge

Can Programming Languages Boost Each Other via Instruction Tuning?

SoTaNa: The Open-Source Software Development Assistant

Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification

OctoPack: Instruction Tuning Code Large Language Models

Enhancing Network Management Using Code Generated by Large Language Models

LoraHub: Efficient Cross-Task Generalization Via Dynamic LoRA Composition [Commentary]

ToolLLM: Facilitating Large Language Models To Master 16000+ Real-World APIs [Commentary]

Language Modeling Is Compression

A Survey on Model Compression for Large Language Models

SlimPajama-DC: Understanding Data Combinations for LLM Training

CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

A Data Source for Reasoning Embodied Agents

When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale

Platypus: Quick, Cheap, and Powerful Refinement of LLMs

PIPPA: A Partially Synthetic Conversational Dataset

Shepherd: A Critic for Language Model Generation

PDE-Refiner: Achieving Accurate Long Rollouts with Neural PDE Solvers

FLIRT: Feedback Loop In-context Red Teaming

Recovering from Privacy-Preserving Masking with Large Language Models

LMDX: Language Model-based Document Information Extraction and Localization