Supports the ability to filter documents through globs. Involves more general Git functionality that is shared between the GitHub and GitLab data connectors. Prevents code duplication for functionality shared between the GitHub and GitLab ingest connectors. Renamed github-access-token, github-branch and github-file-glob to git-access-token, git-branch and git-file-glob, respectively. Also adds a test case verifying that 2 files are indeed created, as they should be.

Create a new module under unstructured/ingest/connector/ implementing the 3 abstract base classes, similar to unstructured/ingest/connector/s3_connector.py. The subclass of BaseIngestDoc overrides process_file() if extra processing logic is needed beyond what auto.partition() provides. Update unstructured/ingest/main.py with support for the new connector. Create a folder under examples/ingest that includes at least one well-documented script. Add a script test_unstructured_ingest/test-ingest-.sh. Its JSON output files should total no more than 100K.
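The connector steps described above can be pictured with a small, self-contained sketch. Everything in it is an assumption made for illustration: the three base classes are local stand-ins (only the name BaseIngestDoc appears in the text; BaseConnector and BaseConnectorConfig are guessed names), the default process_file() merely imitates what auto.partition() might hand back, and the comma-separated file_glob option is a hypothetical way to express glob filtering. The real interfaces in the library may look different.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass
from fnmatch import fnmatch
from typing import Dict, List, Optional


# Hypothetical stand-ins for the 3 abstract base classes. They are defined
# here only so the sketch runs on its own; they are NOT the library's code.
class BaseConnectorConfig(ABC):
    """Holds connector-specific settings."""


class BaseIngestDoc(ABC):
    @abstractmethod
    def get_file(self) -> str:
        """Fetch the raw document content."""

    def process_file(self) -> List[dict]:
        # Stand-in for the default processing (assumed to be auto.partition()).
        return [{"text": self.get_file()}]


class BaseConnector(ABC):
    @abstractmethod
    def get_ingest_docs(self) -> List[BaseIngestDoc]:
        """Return one ingest doc per document to process."""


@dataclass
class InMemoryConfig(BaseConnectorConfig):
    files: Dict[str, str]            # path -> content, replaces a remote source
    file_glob: Optional[str] = None  # assumed format: "*.md,*.rst"


@dataclass
class InMemoryIngestDoc(BaseIngestDoc):
    path: str
    content: str

    def get_file(self) -> str:
        return self.content

    def process_file(self) -> List[dict]:
        # Override only because extra logic is needed: tag each element
        # with the path it came from.
        elements = super().process_file()
        for element in elements:
            element["source"] = self.path
        return elements


class InMemoryConnector(BaseConnector):
    def __init__(self, config: InMemoryConfig) -> None:
        self.config = config

    def _matches(self, path: str) -> bool:
        # No glob configured means every document is kept.
        if not self.config.file_glob:
            return True
        patterns = [p.strip() for p in self.config.file_glob.split(",")]
        return any(fnmatch(path, pattern) for pattern in patterns)

    def get_ingest_docs(self) -> List[BaseIngestDoc]:
        return [
            InMemoryIngestDoc(path, content)
            for path, content in sorted(self.config.files.items())
            if self._matches(path)
        ]


config = InMemoryConfig(
    files={"README.md": "hello", "src/main.py": "print(1)", "docs/guide.rst": "x"},
    file_glob="*.md,*.rst",
)
docs = InMemoryConnector(config).get_ingest_docs()
for doc in docs:
    print(doc.path, doc.process_file())
```

Keeping the glob check inside the connector means the doc class only has to deal with fetching and processing a single file, which is also where shared Git logic could sit if two connectors needed the same filtering.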