doublestar is a golang implementation of path pattern matching and globbing with support for “doublestar” (aka globstar: **
) patterns.
doublestar patterns match files and directories recursively. For example, if you had the following directory structure:
grandparent `-- parent |-- child1 `-- child2
You could find the children with patterns such as: **/child*
, grandparent/**/child?
, **/parent/*
, or even just **
by itself (which will return all files and directories recursively).
Bash‘s globstar is doublestar’s inspiration and, as such, works similarly. Note that the doublestar must appear as a path component by itself. A pattern such as /path**
is invalid and will be treated the same as /path*
, but /path*/**
should achieve the desired result. Additionally, /path/**
will match all directories and files under the path directory, but /path/**/
will only match directories.
v4 is a complete rewrite with a focus on performance. Additionally, doublestar has been updated to use the new io/fs package for filesystem access. As a result, it is only supported by golang v1.16+.
doublestar can be installed via go get
:
go get github.com/bmatcuk/doublestar/v4
To use it in your code, you must import it:
import "github.com/bmatcuk/doublestar/v4"
func Match(pattern, name string) (bool, error)
Match returns true if name
matches the file name pattern
(see “patterns”). name
and pattern
are split on forward slash (/
) characters and may be relative or absolute.
Match requires pattern to match all of name, not just a substring. The only possible returned error is ErrBadPattern, when pattern is malformed.
Note: this is meant as a drop-in replacement for path.Match()
which always uses '/'
as the path separator. If you want to support systems which use a different path separator (such as Windows), what you want is PathMatch()
. Alternatively, you can run filepath.ToSlash()
on both pattern and name and then use this function.
func PathMatch(pattern, name string) (bool, error)
PathMatch returns true if name
matches the file name pattern
(see “patterns”). The difference between Match and PathMatch is that PathMatch will automatically use your system's path separator to split name
and pattern
. On systems where the path separator is '\'
, escaping will be disabled.
Note: this is meant as a drop-in replacement for filepath.Match()
. It assumes that both pattern
and name
are using the system‘s path separator. If you can’t be sure of that, use filepath.ToSlash()
on both pattern
and name
, and then use the Match()
function instead.
func Glob(fsys fs.FS, pattern string) ([]string, error)
Glob returns the names of all files matching pattern or nil if there is no matching file. The syntax of patterns is the same as in Match()
. The pattern may describe hierarchical names such as usr/*/bin/ed
.
Glob ignores file system errors such as I/O errors reading directories. The only possible returned error is ErrBadPattern, reporting that the pattern is malformed.
Note: this is meant as a drop-in replacement for io/fs.Glob()
. Like io/fs.Glob()
, this function assumes that your pattern uses /
as the path separator even if that‘s not correct for your OS (like Windows). If you aren’t sure if that's the case, you can use filepath.ToSlash()
on your pattern before calling Glob()
.
type GlobWalkFunc func(path string, d fs.DirEntry) error
func GlobWalk(fsys fs.FS, pattern string, fn GlobWalkFunc) error
GlobWalk calls the callback function fn
for every file matching pattern. The syntax of pattern is the same as in Match(). The pattern may describe hierarchical names such as usr/*/bin/ed.
GlobWalk may have a small performance benefit over Glob if you do not need a slice of matches because it can avoid allocating memory for the matches. Additionally, GlobWalk gives you access to the fs.DirEntry
objects for each match, and lets you quit early by returning a non-nil error from your callback function.
GlobWalk ignores file system errors such as I/O errors reading directories. GlobWalk may return ErrBadPattern, reporting that the pattern is malformed. Additionally, if the callback function fn
returns an error, GlobWalk will exit immediately and return that error.
Like Glob(), this function assumes that your pattern uses /
as the path separator even if that‘s not correct for your OS (like Windows). If you aren’t sure if that's the case, you can use filepath.ToSlash() on your pattern before calling GlobWalk().
func SplitPattern(p string) (base, pattern string)
SplitPattern is a utility function. Given a pattern, SplitPattern will return two strings: the first string is everything up to the last slash (/
) that appears before any unescaped “meta” characters (ie, *?[{
). The second string is everything after that slash. For example, given the pattern:
../../path/to/meta*/** ^----------- split here
SplitPattern returns “../../path/to” and “meta*/**”. This is useful for initializing os.DirFS() to call Glob() because Glob() will silently fail if your pattern includes /./
or /../
. For example:
base, pattern := SplitPattern("../../path/to/meta*/**")
fsys := os.DirFS(base)
matches, err := Glob(fsys, pattern)
If SplitPattern cannot find somewhere to split the pattern (for example, meta*/**
), it will return “.” and the unaltered pattern (meta*/**
in this example).
Of course, it is your responsibility to decide if the returned base path is “safe” in the context of your application. Perhaps you could use Match() to validate against a list of approved base directories?
doublestar supports the following special terms in the patterns:
Special Terms | Meaning |
---|---|
* | matches any sequence of non-path-separators |
/**/ | matches zero or more directories |
? | matches any single non-path-separator character |
[class] | matches any single non-path-separator character against a class of characters (see “character classes”) |
{alt1,...} | matches a sequence of characters if one of the comma-separated alternatives matches |
Any character with a special meaning can be escaped with a backslash (\
).
A doublestar (**
) should appear surrounded by path separators such as /**/
. A mid-pattern doublestar (**
) behaves like bash‘s globstar option: a pattern such as path/to/**.txt
would return the same results as path/to/*.txt
. The pattern you’re looking for is path/to/**/*.txt
.
Character classes support the following:
Class | Meaning |
---|---|
[abc] | matches any single character within the set |
[a-z] | matches any single character in the range |
[^class] | matches any single character which does not match the class |
[!class] | same as ^ : negates the class |
goos: darwin goarch: amd64 pkg: github.com/bmatcuk/doublestar/v4 cpu: Intel(R) Core(TM) i7-4870HQ CPU @ 2.50GHz BenchmarkMatch-8 285639 3868 ns/op 0 B/op 0 allocs/op BenchmarkGoMatch-8 286945 3726 ns/op 0 B/op 0 allocs/op BenchmarkPathMatch-8 320511 3493 ns/op 0 B/op 0 allocs/op BenchmarkGoPathMatch-8 304236 3434 ns/op 0 B/op 0 allocs/op BenchmarkGlob-8 466 2501123 ns/op 190225 B/op 2849 allocs/op BenchmarkGlobWalk-8 476 2536293 ns/op 184017 B/op 2750 allocs/op BenchmarkGoGlob-8 463 2574836 ns/op 194249 B/op 2929 allocs/op
These benchmarks (in doublestar_test.go
) compare Match() to path.Match(), PathMath() to filepath.Match(), and Glob() + GlobWalk() to io/fs.Glob(). They only run patterns that the standard go packages can understand as well (so, no {alts}
or **
) for a fair comparison. Of course, alts and doublestars will be less performant than the other pattern meta characters.
Alts are essentially like running multiple patterns, the number of which can get large if your pattern has alts nested inside alts. This affects both matching (ie, Match()) and globbing (Glob()).
**
performance in matching is actually pretty similar to a regular *
, but can cause a large number of reads when globbing as it will need to recursively traverse your filesystem.