Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Large Git repos aren't able to be indexed #3715

Open
realhackcraft opened this issue Jan 16, 2025 · 4 comments
Open

Large Git repos aren't able to be indexed #3715

realhackcraft opened this issue Jan 16, 2025 · 4 comments

Comments

@realhackcraft
Copy link

Describe the bug

I'm not sure if this is an issue about my device, because it isn't very powerful.
When using tabby to index repos, it always shows some amount of logs like these:

2025-01-16T14:10:36.837159Z  WARN tabby_index::indexer: crates/tabby-index/src/indexer.rs:90: Failed to build chunk for document 'git:V1xYGx:::{"path":"/Users/me/.tabby/repositories/https_github.com_bevyengine_bevy/tools/example-showcase/src/main.rs","language":"rust","git_hash":"49afb15c2048c316cdc37d27390618f7a9f90055"}': Failed to embed chunk text: error decoding response body
2025-01-16T14:10:36.878975Z  WARN tabby_index::indexer: crates/tabby-index/src/indexer.rs:90: Failed to build chunk for document 'git:V1xYGx:::{"path":"/Users/me/.tabby/repositories/https_github.com_bevyengine_bevy/tools/example-showcase/src/main.rs","language":"rust","git_hash":"49afb15c2048c316cdc37d27390618f7a9f90055"}': Failed to embed chunk text: error decoding response body
2025-01-16T14:10:36.889699Z  WARN tabby_index::indexer: crates/tabby-index/src/indexer.rs:90: Failed to build chunk for document 'git:V1xYGx:::{"path":"/Users/me/.tabby/repositories/https_github.com_bevyengine_bevy/tools/example-showcase/src/main.rs","language":"rust","git_hash":"49afb15c2048c316cdc37d27390618f7a9f90055"}': Failed to embed chunk text: error decoding response body

This is especially apparent on /~https://github.com/bevyengine/bevy, since I left it overnight and it still didn't finish indexing. As stated, I'm not sure if it's just my computer not being able to index very fast, but it even overnight it didn't finish.

Information about your version
tabby 0.22.0

Information about your GPU
Apple M2 (4+4) @ 3.50 GHz

Additional context
Any repo will output those warning logs, but they still finish. For the bevy repo, it gets stuck on the example showcase main file

@realhackcraft
Copy link
Author

I'm using this to run: tabby serve --model Qwen2.5-Coder-3B --parallelism 16 --device metal

@wsxiaoys
Copy link
Member

We have set a timeout for each embedding model request to keep the indexing time manageable. As a result, some requests may fail and trigger warnings.

Please note that these warnings can be safely ignored. Tabby's background indexing job will incrementally re-index any failed chunks

@realhackcraft
Copy link
Author

some requests may fail and trigger warnings
That's what I thought, but even after 8 hours, it's still not finished. Do you reckon that is a thing to do with a M2 base GPU or not?

@wsxiaoys
Copy link
Member

if you have a really large repo, that's somewhat expected. The only way to speed it up is to use more powerful model serving backend (and more powerful hardware ) to reduce the time of a cold start.

Good thing is, once finished for the first run, the future incremental indexing shall be much faster.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants