site stats
Had to stack up 8 Mac Minis to get it running.~5 tok/sec for now.First time running inference on 8 Mac Minis - performance can be improved a lot (theoretical limit is >10 tok/sec on this setup).
发布时间:
1
数据加载中
Markdown支持
评论加载中...
您可能感兴趣的: 更多