-
Notifications
You must be signed in to change notification settings - Fork 30.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Explore using a native third-party search tool such as ripgrep or Silver Searcher #19983
Comments
I was going to look at The Silver Searcher this month, I had skipped ripgrep because it doesn't support multiline search. I know ripgrep can be faster though. Do you think there's enough difference between different search tools that it would be worth allowing people to hook up arbitrary ones to vscode? |
Yes, I was just thinking that maybe there should be an API that allows extension authors to hookup whatever they want. However, I'm not sure there is anything that interesting beyond ripgrep and Silver Searcher. When I think of awesome search tools, those are what come to mind. So, I just did some comparisons on my project. Searching for a particular term, I get these results:
If we can get back to 2 sec performance again, then there is less difference between vscode and ag, but ripgrep is still significantly faster. However, as you said, you do lose multiline searches. If vscode implements multiline searching, then I would be happy with doing multiline with vscode's builtin search and using ripgrep for most of my other searching as that would be most common. |
Ripgrep would be perfect, but I really want multiline search support. The problem with Silver Searcher right now is that it can't handle ignoring The ripgrep blog post you posted lists other tools, but I eliminate them on various other grounds, like lacking windows support or perf that apparently breaks down. There are also some specialist tools I've found, like ICgrep or Hyperscan, that focus on advanced unicode or regex features. Considering all this, we should either
Still leaning towards |
My thoughts are something like this in order of preference:
|
For 1., if we were using ripgrep, it would replace vscode search entirely, so it would be much faster than 1.8.1, and there would be no multiline search. |
Ahh, I didn't realize you were considering it as a replacement. So you would then bundle ripgrep or Silver Searcher as part of vscode? |
Yeah that's the idea, to have it drive the search viewlet behind the scenes. Possibly could also be involved in driving quick open. |
(ripgrep author here.) What do you folks use multiline search for? I've long considered it something I'd be unlikely to add support for, but I've been known to bend if there's strong demand for it. Alternatively, maybe there's a compromise that can be reached. Note that Are there other things you folks care about? What about Unicode support? Support for searching UTF-16 (planned, not actually available yet)? |
I'm also in the process of moving a lot of code in ripgrep out into distinct distinct Rust libraries, which would give you a lot more control over how search operates. But, you'd need to build out a C FFI for it, which wouldn't be especially hard, but it wouldn't be something someone could bang out in a day either. |
@BurntSushi here are some sample use cases of multiline search. The most often is simply Code statements often don't always exist as single lines
|
@dakaraphi If ripgrep asked you to use two distinct regexes, would that suffice? Or do you want to use one regex? |
@BurntSushi If I follow what you imply, then that would only help answer if 2 different terms exist in the same file. or example searching for xml tag with given id |
I'll do a writeup for this investigation next week, but for vscode's purposes, we're interested in multiline search, UTF-16 support, and also I like a search that returns results in sorted order by path, which ripgrep doesn't do right now. |
I don't think any search tool with parallelism returns results in sorted order. ripgrep does have the --sort-files option which I think will do what you want, but it disables parallelism. |
Silver Searcher does actually - I see why it could be a perf hit to order the results though. |
The silver searcher does not. I just tried. I can easily observe
non-deterministic ordering of output by running the same command a few
times.
Also, UTF-16 support is on my roadmap for ripgrep. :-)
…On Feb 19, 2017 21:04, "Rob Lourens" ***@***.***> wrote:
Silver Searcher does actually - I see why it could be a perf hit to order
the results though.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#19983 (comment)>,
or mute the thread
</~https://github.com/notifications/unsubscribe-auth/AAb34nz0Svq_rdev4-F85biLkJZ6osV_ks5rePSygaJpZM4L3i7G>
.
|
You're right - looking at it more closely, SS tends to be closer to being in order, and often is in order when I run it in my vscode workspace, but not always. |
And I'm glad to hear that UTF-16 support is on the roadmap. Any idea in what timeframe you'd expect to look at it? |
No, sorry. "Within the next year" is probably the best I can do. Hopefully sooner. |
Given there isn't an ideal match around features you wish to provide out of the box and uncertainty around the timing of the availability of those features, does it make sense to reevaluate the option of simply providing an API that extension authors can use to integrate external search providers? It would probably be useful also if the API provided the ability to invoke VS Code's builtin search as a fallback if the extension is able to detect a type of regular expression might not be supported in the external tool, it can then pass it on to VS Code. |
@BurntSushi I could also fall back to VS Code's builtin search for UTF-16 files, but don't want to duplicate the file tree walking work that ripgrep does. An easy compromise would be if ripgrep prints a message each time it encounters a UTF-16 (or binary) file. I imagine this hidden behind an option but it would also be useful for CLI users who are missing matches because they don't realize a file is an unsupported encoding. @dakaraphi Thought about it, but it seems like overkill. Creating an extension API is a lot of work and there are probably only a few search providers in the world that anyone would want to use. I want to focus on the out of box experience. |
Using ripgrep as primary and VS Code builtin as fallback seems like a good solution. I would actually be very happy with that compromise. Especially if VS Code could implement multiline search, then multiline regular expressions could just be passed on to VS Code. VS Code's builtin is fast enough that it wouldn't be a bad solution given that multiline searching will be less common. |
Speed is awesome, but I will have to strongly disagree here. Not being able to do something entirely is fairly a significant disadvantage. I certainly can understand there is no desire to support another engine that you have to implement yourself. I would suggest actually consider having a secondary external engine. Silver searcher or platinum searcher as a fallback. If Platinum searcher is easier to integrate, it doesn't have to be the fastest, but feature completeness could then be provided without having to support your own implementation. |
@dakaraphi I don't think the platinum searcher has any functionality that ripgrep doesn't have at this point. It doesn't appear to have multiline search and its regex engine is FSM based like ripgrep's. The only real choice available to you if you want multiline search and PCRE is the silver searcher. |
@BurntSushi ahh ok thanks. Yes, the most important feature for me would be multiline. That's a big one. I do use it somewhat often. PCRE would be nice, but I could live without it. |
@dakaraphi I've thought about multiline search for a long time. You folks aren't the only ones who have requested it. I re-opened the issue on ripgrep's tracker and left some thoughts: BurntSushi/ripgrep#176 (comment) |
@roblourens @BurntSushi Thanks for making this happen and bringing to VS Code in such a short time! Truly is a pleasure to use. |
Does any of this apply to searches in a file that's being edited? That is, the "Find" command, not "Find in Files". |
No |
So I have given some additional thought to feature gap of things like look arounds. I think some of the feature gap would be mitigated if the search results could be easily sent to a new editor. Then you could further search the results using the more feature rich in editor regex engine. Potentially it would also be useful to have a way to send results directly to an editor and have a much higher result limit cap. There is already a request for this for other use cases. So this would just be an additional benefit. |
Was looking for how I can use ag to search faster in VSCode and found this issue. |
@dakaraphi This is a feature of Sublime Text that I miss in VSCode -- ST just automatically dumps everything into a special, searchable "Find Results" tab that also doesn't auto-clear between searches. (Relying on this feels a bit like a crutch, like maybe I could have gotten ideal results if I'd composed my original search filters + regex better, but I end up using it often anyway because I want to keep my brain on the original task at hand.) |
@Ethan-VisualVocal having the ability to dump the results to a document tab opens up some very useful possibilities; however, I currently strongly prefer VSCode's default implementation for finding and navigating code. I find it much better at browsing the files from the results. |
Can we just use
|
@ThunderEX - |
@jonathandturner you can check config grep.fallbackToNoIndex |
I didn't realize that |
|
I don't think it supports UTF-16 either. |
I am very impressed with the performance of the new parallel search; however, there is an opportunity to take search speed to the absolute limit by optionally allowing a user to configure ripgrep as the search provider.
Even with the new search speed, ripgrep is still an order of magnitude faster. Ripgrep is actually an order of magnitude faster than pretty much anything.
http://blog.burntsushi.net/ripgrep/
The text was updated successfully, but these errors were encountered: