2582 位用户此时在线
24小时点击排行 Top 10:
- 本站自动实时分享网络热点
- 24小时实时更新
- 所有言论不代表本站态度
- 欢迎对信息踊跃评论评分
- 评分越高,信息越新,排列越靠前
搜索数据不够精准? 请点击上面👆👆👆其它时间段筛选试试吧!
1
2
1
1
2
If you think OpenAI Sora is a creative toy like DALLE, ... think again. Sora is a data-driven physics engine. It is a simulation of many worlds, real or fantastical. The simulator learns intricate rendering, "intuitive" physics, long-horizon reasoning, and semantic grounding, all…
2
1
1
If you think OpenAI Sora is a creative toy like DALLE, ... think again. Sora is a data-driven physics engine. It is a simulation of many worlds, real or fantastical. The simulator learns intricate rendering, "intuitive" physics, long-horizon reasoning, and semantic grounding, all…
时政
(
twitter.com)
00:00:15
3
Gemini-1.5 Pro has its spotlight stolen today, and people are poking fun at Sora vs Google memes. Well, I think it's the biggest boost in LLM capability so far in 2024. v1.5's 10M token context (1) excels at retrieval; (2) generalizes zero-shot to extremely long instructions like…
2
1
1
Gemini-1.5 Pro has its spotlight stolen today, and people are poking fun at Sora vs Google memes. Well, I think it's the biggest boost in LLM capability so far in 2024. v1.5's 10M token context (1) excels at retrieval; (2) generalizes zero-shot to extremely long instructions like…
时政
(
twitter.com)
00:00:20
5
I was honored to share the TED AI stage with Ilya on Oct. 17. His speech video is out today (mine's still being edited). I think it provides relevant context tokens to the ongoing events. Transcript starting at ~10'20":
2
1
1
I was honored to share the TED AI stage with Ilya on Oct. 17. His speech video is out today (mine's still being edited). I think it provides relevant context tokens to the ongoing events. Transcript starting at ~10'20":
As AI continues to progress, as technology advances, [...]…
时政
(
twitter.com)
00:12:24
7
So many announcements today. Meta just dropped EmuVideo, generating 4-second short videos at 512x512 resolution and 16 FPS. Idea is quite straightforward: text -> image first, then do a "super-resolution" of the image along the temporal axis to synthesize motion.
2
1
1
So many announcements today. Meta just dropped EmuVideo, generating 4-second short videos at 512x512 resolution and 16 FPS. Idea is quite straightforward: text -> image first, then do a "super-resolution" of the image along the temporal axis to synthesize motion.
Long-form…
时政
(
twitter.com)
00:00:07
8
3
2
2
9
2
1
1
10
Autonomous driving with Chain of Thought - autopilot thinking out loud in text!
2
1
1
Autonomous driving with Chain of Thought - autopilot thinking out loud in text!
LINGO-1 is the most interesting work I've read in autodriving for a while.
Before: perception -> driving action
After: perception -> textual reasoning -> action
LINGO-1 trains a video-language…
时政
(
twitter.com)
00:00:35
11
This is "Sequential Dexterity", a neural network that controls a robot arm to build legos given a manual 🤖
2
1
1
This is "Sequential Dexterity", a neural network that controls a robot arm to build legos given a manual 🤖
To do this task, the robot needs to chain together multiple skills (grasping, re-orienting, pushing, etc.) and execute without compounding error.
I find some very simple…
时政
(
twitter.com)
00:00:46
12
2
1
1
13
This is an ape ("Kanzi") playing Minecraft! A fascinating experiment on non-human biological neural networks 🙉
3
2
2
This is an ape ("Kanzi") playing Minecraft! A fascinating experiment on non-human biological neural networks 🙉
I've been teaching AI to play Minecraft for too long. There're so many similar techniques that the ape trainers used:
- In-context reinforcement learning: Kanzi gets…
时政
(
twitter.com)
00:03:01