-
-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Modify EL batching to doc-wise streaming approach #12367
Merged
svlandeg
merged 109 commits into
explosion:v4
from
rmitsch:feature/docwise-generator-batching
Apr 9, 2024
Merged
Changes from 97 commits
Commits
Show all changes
109 commits
Select commit
Hold shift + click to select a range
cd98ab4
Convert Candidate from Cython to Python class.
rmitsch 5a9d8ba
Format.
rmitsch a97ef65
Fix .entity_ typo in _add_activations() usage.
rmitsch 8596fb8
Change type for mentions to look up entity candidates for to SpanGrou…
rmitsch 50b3475
Update docs.
rmitsch 0680958
Update spacy/kb/candidate.py
rmitsch 3da0712
Update doc string of BaseCandidate.__init__().
rmitsch 21fa22d
Merge branch 'refactor/el-candidates' of github.com:rmitsch/spaCy int…
rmitsch 417e8fe
Update spacy/kb/candidate.py
rmitsch 49abf4f
Rename Candidate to InMemoryCandidate, BaseCandidate to Candidate.
rmitsch fa39061
Adjust Candidate to support and mandate numerical entity IDs.
rmitsch 257bca3
Format.
rmitsch 9bd498c
Fix docstring and docs.
rmitsch 3beda2b
Merge branch 'refactor/el-candidates' into refactor/span-group-for-me…
rmitsch 61bacf8
Update website/docs/api/kb.mdx
rmitsch 46fe069
Rename alias -> mention.
rmitsch 94e57d0
Refactor Candidate attribute names. Update docs and tests accordingly.
rmitsch 38dce96
Refacor Candidate attributes and their usage.
rmitsch 5f40b3e
Format.
rmitsch 670e1ca
Fix mypy error.
rmitsch 2ac586f
Update error code in line with v4 convention.
rmitsch bb7418e
Modify EL batching system.
rmitsch e4e55b8
Update leftover get_candidates() mention in docs.
rmitsch f33f0ed
Merge branch 'v4' into feature/docwise-generator-batching
rmitsch 8b24f31
Format docs.
rmitsch d0abc32
Format.
rmitsch 4bdb359
Merge branch 'v4' into feature/docwise-generator-batching
rmitsch 8dbb74c
Merge branch 'v4' into refactor/el-candidates
rmitsch 082992a
Update spacy/kb/candidate.py
rmitsch f8a02f7
Updated error code.
rmitsch 0c63940
Merge branch 'v4' into refactor/el-candidates
rmitsch cea58ad
Simplify interface for int/str representations.
rmitsch 1ba2fc4
Update website/docs/api/kb.mdx
rmitsch 1c937db
Rename 'alias' to 'mention'.
rmitsch b476041
Port Candidate and InMemoryCandidate to Cython.
rmitsch 845864b
Remove redundant entry in setup.py.
rmitsch b0ee341
Add abstract class check.
rmitsch c61654e
Drop storing mention.
rmitsch 34e092e
Update spacy/kb/candidate.pxd
rmitsch 6fc7997
Fix entity_id refactoring problems in docstrings.
rmitsch 2705391
Drop unused InMemoryCandidate._entity_hash.
rmitsch 348dd1c
Update docstrings.
rmitsch ce23942
Merge branch 'refactor/el-candidates' of github.com:rmitsch/spaCy int…
rmitsch 649c146
Move attributes out of Candidate.
rmitsch 6adc151
Partially fix alias/mention terminology usage. Convert Candidate to i…
rmitsch 4a92176
Remove prior_prob from supported properties in Candidate. Introduce K…
rmitsch be85898
Update docstrings related to prior_prob.
rmitsch 28dbed6
Update alias/mention usage in doc(strings).
rmitsch b7b4282
Update spacy/ml/models/entity_linker.py
rmitsch 961795d
Update spacy/ml/models/entity_linker.py
rmitsch 3cfc1c6
Mention -> alias renaming. Drop Candidate.mentions(). Drop InMemoryLo…
rmitsch 80fb066
Update docstrings.
rmitsch 830939e
Fix InMemoryCandidate attribute names.
rmitsch 978fbdc
Update spacy/kb/kb.pyx
rmitsch 307bbab
Update spacy/ml/models/entity_linker.py
rmitsch 2377b67
Update W401 test.
rmitsch 4d8dce5
Update spacy/errors.py
rmitsch faede71
Update spacy/kb/kb.pyx
rmitsch 9e71adc
Use Candidate output type for toy generators in the test suite to mim…
svlandeg 0365d3d
fix docs
svlandeg b834073
fix import
svlandeg 73bdeb0
Merge branch 'refactor/el-candidates' into feature/docwise-generator-…
rmitsch cb79af3
Fix merge leftovers.
rmitsch e5be5d6
Merge branch 'v4' into feature/docwise-generator-batching
rmitsch 4974769
Merge branch 'v4' into feature/docwise-generator-batching
rmitsch 571eaf6
Update spacy/kb/kb.pyx
rmitsch 9b677ad
Update spacy/kb/kb.pyx
rmitsch 3ae31f7
Update website/docs/api/kb.mdx
rmitsch fb79b52
Update website/docs/api/entitylinker.mdx
rmitsch 10ddefa
Update spacy/kb/kb_in_memory.pyx
rmitsch 1ece9ec
Update website/docs/api/inmemorylookupkb.mdx
rmitsch cfbb4a5
Update get_candidates() docstring.
rmitsch 7aa3758
Reformat imports in entity_linker.py.
rmitsch 40e3aca
Merge branch 'v4' into feature/docwise-generator-batching
rmitsch ee5d7f4
Drop valid_ent_idx_per_doc.
rmitsch 638103e
Update docs.
rmitsch 2c80db9
Format.
rmitsch d1371d1
Simplify doc loop in predict().
rmitsch c655b36
Remove E1044 comment.
rmitsch 8aa59c4
Merge branch 'v4' into feature/docwise-generator-batching
rmitsch a258533
Fix merge errors.
rmitsch 61b2215
Format.
rmitsch aca4ada
Format.
rmitsch 5bad3d2
Format.
rmitsch 645b525
Fix merge error & tests.
rmitsch 25bce73
Format.
rmitsch 78c72d3
Merge branch 'main' into feature/docwise-generator-batching
rmitsch c8691a2
Apply suggestions from code review
rmitsch f169614
Use type alias.
rmitsch c174ebf
isort.
rmitsch d778da3
isort.
rmitsch aa87845
Lint.
rmitsch 1d2994a
Add typedefs.pyx.
rmitsch 4c7bd30
Fix typedef import.
rmitsch 7d6ae1b
Fix type aliases.
rmitsch 6401856
Format.
rmitsch af336ac
Merge branch 'upstream_main' into feature/docwise-generator-batching
svlandeg d6c7636
Update docstring and type usage.
rmitsch 8a2a7f1
Merge branch 'feature/docwise-generator-batching' of github.com:rmits…
rmitsch 5f87b6a
Add info on get_candidates(), get_candidates_batched().
rmitsch 5d1ecf1
Readd get_candidates info to v3 changelog.
rmitsch c4d4926
Update website/docs/api/entitylinker.mdx
rmitsch 2951c19
Update factory functions for backwards compatibility.
rmitsch ca1f86e
Merge branch 'feature/docwise-generator-batching' of github.com:rmits…
rmitsch 79798c0
Format.
rmitsch c187b13
Ignore mypy error.
rmitsch 9391de6
Fix mypy error.
rmitsch e83a988
Format.
rmitsch eef3de0
Add test for multiple docs with multiple entities.
rmitsch File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Docstring needs to be updated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Updated in cfbb4a5.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This still requires updating - the text refers to a single specified Span Mention.