Commits · boltless.me/zoekt · Tangled

boltless.me / zoekt

0

fork of https://github.com/sourcegraph/zoekt

0

Commits

Author

Commit

Message

Date

0a17ccb2

scoring: reduce allocations for addScore (#670)

2y ago

5bbf05d4

fix tests (#671)

2y ago

ca7ee51e

scoring: show atom count in debug score (#669)

2y ago

c869a248

scoring: score methods and funcs the same (#666)

2y ago

Keegan Carruthers-Smith

d3fc0dce

score: remove repetition-boost (#667)

2y ago

Keegan Carruthers-Smith

7cc2872d

nix: use tag for rev in ctags derivation

2y ago

Keegan Carruthers-Smith

328fcb7a

nix: use go 1.21 and universal-ctags 6.0.0 (#664)

2y ago

Keegan Carruthers-Smith

4f214152

score: clean up debug output (#663)

2y ago

Keegan Carruthers-Smith

d8bfea1e

score: factors for headers in markdown (#661)

2y ago

Keegan Carruthers-Smith

70f5dd3f

score: always upscore symbol matches (#662)

2y ago

Keegan Carruthers-Smith

081cd037

boost graphql types in results (#659)

2y ago

Keegan Carruthers-Smith

f39e6eb5

zoekt: add debug flag to show DebugScore output (#660)

2y ago

William Bezuidenhout

16e2ff8c

logger: remove description param (#657)

2y ago

Keegan Carruthers-Smith +1

f17ff0ba

scoring: handle scip-ctags kinds (#655)

2y ago

Keegan Carruthers-Smith

659eac98

all: remove deprecated RepoList.Minimal (#624)

2y ago

Anatoli Babenia

8c5bd7de

Use Go 1.21.2 (#653)

2y ago

Keegan Carruthers-Smith

cc1b5cda

ctags: allow binary to be anything with validation (#652)

2y ago

Keegan Carruthers-Smith

1065c664

gomod: bump go-ctags

2y ago

Geoffrey Gilmore

dfc14cb6

grpc: add support for prometheus metrics that calculates message size (#651)

2y ago

Geoffrey Gilmore

2011bba5

grpc: zoekt-sourcegraph-indexserer: enable by default, support reading from SG_FEATURE_FLAG_GRPC (#650)

2y ago

Geoffrey Gilmore

089709e4

zoekt: upgrade to grpc-ecosystem/go-grpc-middleware@v2.0.0 (#648)

2y ago

19f03eed

chore: upgrade sourcegraph/log (#649)

2y ago

Keegan Carruthers-Smith

adf376d3

web: informative and verbose error message when watchdog fails (#647)

2y ago

Stefan Hengl +2

af126653

indexserver: delete tmp dir on startup (#646)

2y ago

Geoffrey Gilmore

48ed5ac5

grpc: zoekt-sourcegraph-indexserver: support retries when frontend isn't available (#645)

2y ago

Geoffrey Gilmore

2d1affd4

grpc: RepoList: actually persist "repos" field when converting to protobuf message (#644)

2y ago

Geoffrey Gilmore

3ce1f2b2

grpc: add prometheus server and client prometheus metrics (#642)

2y ago

Geoffrey Gilmore

40a9a23b

grpc: FileMatch: tweak file_name to be bytes instead of string (#641)

2y ago

Geoffrey Gilmore

f75df3d8

grpc: port messagesize interceptors and raise default client message size to 90mb (#640)

2y ago

Geoffrey Gilmore

993cfdb2

grpc: port internal error interceptors from sourcegraph/sourcegraph (#639)

2y ago

Geoffrey Gilmore

fcb279ae

grpc: zoekt-webserver: stream search: break up file matches across multiple messages (#636)

2y ago

956d775e

Extract samplingSender and use it for gRPC (#637)

2y ago

d5723536

remove bazel (#634)

2y ago

Keegan Carruthers-Smith

63da184a

stat: introduce timing stats around shard search (#633)

2y ago

9559422b

DisplayTruncator: always apply both limits (#632)

2y ago

Keegan Carruthers-Smith

eede1229

gofmt -s -w .

2y ago

Keegan Carruthers-Smith

626c7d8f

introduce DisplayTruncator (#630)

2y ago

6a428ad6

SearchOptions: add MaxMatchDisplayCount (#615)

All clients of zoekt have a shared problem: they have no reliable way to
bound the size of the SearchResult. The primary dimension that
determines the size of a SearchResult is the number of matches. None of
the existing levers zoekt provides sufficiently limit this size:
- MaxDocDisplayCount is a hard limit on the number of Files in the
SearchResult. But when a single File can have an arbitrary number of
matches for the query, you can still end up with enormous
SearchResults when this parameter is 1.

The existing *MaxMatchCount parameters are more about limiting the
amount of work zoekt does when executing queries than they are about
limiting the response size:
- TotalMaxMatchCount is a soft limit on the number of matches
across shards. But it is only evaluated after handling each shard, so
if a single shard has an enormous number of matches, the SearchResult
will be enormous.
- ShardMaxMatchCount is a soft limit on the number of matches from a
single shard. But it is only evaluated after handling each document, so
if a single document has an enormous number of matches, the
SearchResult will be enormous.
- ShardRepoMaxMatchCount, well, you get the idea.

Different clients have a differing ability to tolerate enormous
SearchResults. Sourcegraph, for example, is apparently doing just fine;
they put hard limits on the number of matches in their own server, which
is itself a client of zoekt. They're presumably able to tolerate large
responses from zoekt as it's running colocated in a datacenter
environment.

But clients that are, for example, running in browsers, and using the
less-compact JSON-encoded API, are much less able to cope with enormous
SearchResults, which can be multiple megabytes large even with the most
conservative applications of the existing parameters.

Enter MaxMatchDisplayCount, which has similar semantics to
MaxDocDisplayCount, and is used by zoekt in the exact same places as
that parameter. With this, clients can get a much better handle on the
size of zoekt SearchResults.

2y ago

9c20a034

fix tracing (#627)

2y ago

0f6564bd

trace: add service.instance.id (#629)

2y ago

Manuel Ucles +1

99233243

Create buf-breaking-check.yml (#625)

2y ago

Keegan Carruthers-Smith

f9b3ea5d

Revert "indexdata: read posting list iff all ng exist (#619)" (#626)

2y ago

b7e5070b

indexdata: read posting list iff all ng exist (#619)

2y ago

Keegan Carruthers-Smith

0aefb15e

rename ngrams to contentNgrams (#623)

2y ago

Keegan Carruthers-Smith

cbe083c9

remove ZOEKT_ENABLE_LAZY_DOC_SECTIONS (#620)

2y ago

Keegan Carruthers-Smith

1d71fd02

ci: remove sync-zoekt step (#621)

2y ago

Keegan Carruthers-Smith

34f694c3

maximise distance between ngrams (#618)

2y ago

Keegan Carruthers-Smith

2632acf4

rm ngramoffset.go from BUILD.bazel

2y ago

Keegan Carruthers-Smith +1

45f608ff

sort ngrams before looking them up (#617)

2y ago

Keegan Carruthers-Smith

3d0bdd5c

remove ngram offset code (#616)

2y ago

Keegan Carruthers-Smith

f9d3a0e2

zoekt: add fgprof for full profiling (#614)

2y ago

Keegan Carruthers-Smith

008a775b

zoekt-indexserver: use value format directive for bad conf warning

2y ago

Philipp Wollermann

9abbb8b0

zoekt-indexserver: Prevent invalid config from causing an NPE (#612)

3y ago

Keegan Carruthers-Smith

25c1ea51

all: observe missing Stats RegexpsConsidered and FlushReason (#611)

3y ago

e2e8aede

Fix template documentation comments (#610)

3y ago

Keegan Carruthers-Smith

a176bde1

go get -u -t ./... (#609)

3y ago

Keegan Carruthers-Smith

7643f3b3

matchiter: capture metric NgramLookups (#608)

3y ago

Keegan Carruthers-Smith

93f7b0c9

matchtree: capture Stats before pruning (#607)

3y ago

Rodrigo Silva Mendoza

b9e6d943

zoekt-indexserver: Check stderr for git fetch (#603)

3y ago

Keegan Carruthers-Smith

7078a585

shards: populate RepoList.Stats.Repos (#605)

3y ago

scoring: reduce allocations for addScore (#670)

0a17ccb2

Stefan Hengl

2y

fix tests (#671)

5bbf05d4

Stefan Hengl

2y

scoring: show atom count in debug score (#669)

ca7ee51e

Stefan Hengl

2y

scoring: score methods and funcs the same (#666)

c869a248

Stefan Hengl

2y

score: remove repetition-boost (#667)

d3fc0dce

Keegan Carruthers-Smith

2y

nix: use tag for rev in ctags derivation

7cc2872d

Keegan Carruthers-Smith

2y

nix: use go 1.21 and universal-ctags 6.0.0 (#664)

328fcb7a

Keegan Carruthers-Smith

2y

score: clean up debug output (#663)

4f214152

Keegan Carruthers-Smith

2y

score: factors for headers in markdown (#661)

d8bfea1e

Keegan Carruthers-Smith

2y

score: always upscore symbol matches (#662)

70f5dd3f

Keegan Carruthers-Smith

2y

boost graphql types in results (#659)

081cd037

Keegan Carruthers-Smith

2y

zoekt: add debug flag to show DebugScore output (#660)

f39e6eb5

Keegan Carruthers-Smith

2y

logger: remove description param (#657)

16e2ff8c

William Bezuidenhout

2y

scoring: handle scip-ctags kinds (#655)

f17ff0ba

Keegan Carruthers-Smith +1

2y

all: remove deprecated RepoList.Minimal (#624)

659eac98

Keegan Carruthers-Smith

2y

Use Go 1.21.2 (#653)

8c5bd7de

Anatoli Babenia

2y

ctags: allow binary to be anything with validation (#652)

cc1b5cda

Keegan Carruthers-Smith

2y

gomod: bump go-ctags

1065c664

Keegan Carruthers-Smith

2y

grpc: add support for prometheus metrics that calculates message size (#651)

dfc14cb6

Geoffrey Gilmore

2y

grpc: zoekt-sourcegraph-indexserer: enable by default, support reading from SG_FEATURE_FLAG_GRPC (#650)

2011bba5

Geoffrey Gilmore

2y

zoekt: upgrade to grpc-ecosystem/go-grpc-middleware@v2.0.0 (#648)

089709e4

Geoffrey Gilmore

2y

chore: upgrade sourcegraph/log (#649)

19f03eed

Michael Lin

2y

web: informative and verbose error message when watchdog fails (#647)

adf376d3

Keegan Carruthers-Smith

2y

indexserver: delete tmp dir on startup (#646)

af126653

Stefan Hengl +2

2y

grpc: zoekt-sourcegraph-indexserver: support retries when frontend isn't available (#645)

48ed5ac5

Geoffrey Gilmore

2y

grpc: RepoList: actually persist "repos" field when converting to protobuf message (#644)

2d1affd4

Geoffrey Gilmore

2y

grpc: add prometheus server and client prometheus metrics (#642)

3ce1f2b2

Geoffrey Gilmore

2y

grpc: FileMatch: tweak file_name to be bytes instead of string (#641)

40a9a23b

Geoffrey Gilmore

2y

grpc: port messagesize interceptors and raise default client message size to 90mb (#640)

f75df3d8

Geoffrey Gilmore

2y

grpc: port internal error interceptors from sourcegraph/sourcegraph (#639)

993cfdb2

Geoffrey Gilmore

2y

grpc: zoekt-webserver: stream search: break up file matches across multiple messages (#636)

fcb279ae

Geoffrey Gilmore

2y

Extract samplingSender and use it for gRPC (#637)

956d775e

Camden Cheek

2y

remove bazel (#634)

d5723536

Dave Try

2y

stat: introduce timing stats around shard search (#633)

63da184a

Keegan Carruthers-Smith

2y

DisplayTruncator: always apply both limits (#632)

9559422b

Ian Kerins

2y

gofmt -s -w .

eede1229

Keegan Carruthers-Smith

2y

introduce DisplayTruncator (#630)

626c7d8f

Keegan Carruthers-Smith

2y

SearchOptions: add MaxMatchDisplayCount (#615)

All clients of zoekt have a shared problem: they have no reliable way to
bound the size of the SearchResult. The primary dimension that
determines the size of a SearchResult is the number of matches. None of
the existing levers zoekt provides sufficiently limit this size:
- MaxDocDisplayCount is a hard limit on the number of Files in the
SearchResult. But when a single File can have an arbitrary number of
matches for the query, you can still end up with enormous
SearchResults when this parameter is 1.

The existing *MaxMatchCount parameters are more about limiting the
amount of work zoekt does when executing queries than they are about
limiting the response size:
- TotalMaxMatchCount is a soft limit on the number of matches
across shards. But it is only evaluated after handling each shard, so
if a single shard has an enormous number of matches, the SearchResult
will be enormous.
- ShardMaxMatchCount is a soft limit on the number of matches from a
single shard. But it is only evaluated after handling each document, so
if a single document has an enormous number of matches, the
SearchResult will be enormous.
- ShardRepoMaxMatchCount, well, you get the idea.

Different clients have a differing ability to tolerate enormous
SearchResults. Sourcegraph, for example, is apparently doing just fine;
they put hard limits on the number of matches in their own server, which
is itself a client of zoekt. They're presumably able to tolerate large
responses from zoekt as it's running colocated in a datacenter
environment.

But clients that are, for example, running in browsers, and using the
less-compact JSON-encoded API, are much less able to cope with enormous
SearchResults, which can be multiple megabytes large even with the most
conservative applications of the existing parameters.

Enter MaxMatchDisplayCount, which has similar semantics to
MaxDocDisplayCount, and is used by zoekt in the exact same places as
that parameter. With this, clients can get a much better handle on the
size of zoekt SearchResults.

6a428ad6

Ian Kerins

2y

fix tracing (#627)

9c20a034

Stefan Hengl

2y

trace: add service.instance.id (#629)

0f6564bd

Stefan Hengl

2y

Create buf-breaking-check.yml (#625)

99233243

Manuel Ucles +1

2y

Revert "indexdata: read posting list iff all ng exist (#619)" (#626)

f9b3ea5d

Keegan Carruthers-Smith

2y

indexdata: read posting list iff all ng exist (#619)

b7e5070b

Stefan Hengl

2y

rename ngrams to contentNgrams (#623)

0aefb15e

Keegan Carruthers-Smith

2y

remove ZOEKT_ENABLE_LAZY_DOC_SECTIONS (#620)

cbe083c9

Keegan Carruthers-Smith

2y

ci: remove sync-zoekt step (#621)

1d71fd02

Keegan Carruthers-Smith

2y

maximise distance between ngrams (#618)

34f694c3

Keegan Carruthers-Smith

2y

rm ngramoffset.go from BUILD.bazel

2632acf4

Keegan Carruthers-Smith

2y

sort ngrams before looking them up (#617)

45f608ff

Keegan Carruthers-Smith +1

2y

remove ngram offset code (#616)

3d0bdd5c

Keegan Carruthers-Smith

2y

zoekt: add fgprof for full profiling (#614)

f9d3a0e2

Keegan Carruthers-Smith

2y

zoekt-indexserver: use value format directive for bad conf warning

008a775b

Keegan Carruthers-Smith

2y

zoekt-indexserver: Prevent invalid config from causing an NPE (#612)

9abbb8b0

Philipp Wollermann

3y

all: observe missing Stats RegexpsConsidered and FlushReason (#611)

25c1ea51

Keegan Carruthers-Smith

3y

Fix template documentation comments (#610)

e2e8aede

Ian Kerins

3y

go get -u -t ./... (#609)

a176bde1

Keegan Carruthers-Smith

3y

matchiter: capture metric NgramLookups (#608)

7643f3b3

Keegan Carruthers-Smith

3y

matchtree: capture Stats before pruning (#607)

93f7b0c9

Keegan Carruthers-Smith

3y

zoekt-indexserver: Check stderr for git fetch (#603)

b9e6d943

Rodrigo Silva Mendoza

3y

shards: populate RepoList.Stats.Repos (#605)

7078a585

Keegan Carruthers-Smith

3y

Next