Börje Karlsson

tellarin

https://tellarin.com/borje/

AI & ML interests

Machine Learning Systems, Mobile Sensing, Knowledge Mining, Digital Entertainment

tellarin's activity

upvoted 3 papers 6 days ago

DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile Manipulation

Paper • 2411.04999 • Published 9 days ago • 16

Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos

Paper • 2410.16259 • Published 26 days ago • 5

MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks

Paper • 2410.10563 • Published Oct 14 • 37

upvoted a paper 26 days ago

Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation

Paper • 2410.13232 • Published about 1 month ago • 40

upvoted 5 papers about 1 month ago

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

Paper • 2410.02073 • Published Oct 2 • 40

WALL-E: World Alignment by Rule Learning Improves World Model-based LLM Agents

Paper • 2410.07484 • Published Oct 9 • 48

Everything Everywhere All at Once: LLMs can In-Context Learn Multiple Tasks in Superposition

Paper • 2410.05603 • Published Oct 8 • 11

Agent S: An Open Agentic Framework that Uses Computers Like a Human

Paper • 2410.08164 • Published Oct 10 • 24

Intriguing Properties of Large Language and Vision Models

Paper • 2410.04751 • Published Oct 7 • 16

upvoted a collection about 1 month ago

Foundation AI Papers

Curated List of Must-Reads on LLM reasoning at Temus AI team • 135 items • Updated Jun 15 • 26

upvoted 2 papers about 1 month ago

MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents

Paper • 2410.03450 • Published Oct 4 • 35

CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction

Paper • 2410.01273 • Published Oct 2 • 8

upvoted a paper about 2 months ago

EuroLLM: Multilingual Language Models for Europe

Paper • 2409.16235 • Published Sep 24 • 24

upvoted 2 papers 2 months ago

Open Language Data Initiative: Advancing Low-Resource Machine Translation for Karakalpak

Paper • 2409.04269 • Published Sep 6 • 9

Robot Utility Models: General Policies for Zero-Shot Deployment in New Environments

Paper • 2409.05865 • Published Sep 9 • 14

upvoted a paper 3 months ago

MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Paper • 2408.13257 • Published Aug 23 • 25

upvoted 4 papers 5 months ago

Cognitive Map for Language Models: Optimal Planning via Verbally Representing the World Model

Paper • 2406.15275 • Published Jun 21 • 10

Two Giraffes in a Dirt Field: Using Game Play to Investigate Situation Modelling in Large Multimodal Models

Paper • 2406.14035 • Published Jun 20 • 12

Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study

Paper • 2403.03186 • Published Mar 5 • 5

VideoGUI: A Benchmark for GUI Automation from Instructional Videos

Paper • 2406.10227 • Published Jun 14 • 9