[DOCS-4018] MongoDB Connector needs Pig documentation Created: 10/Sep/14  Updated: 11/Jan/17  Resolved: 03/Mar/15

Status: Closed
Project: Documentation
Component/s: ecosystem
Affects Version/s: None
Fix Version/s: 01112017-cleanup

Type: Task Priority: Major - P3
Reporter: Eric Daniels (Inactive) Assignee: Unassigned
Resolution: Won't Fix Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Participants:
Days since reply: 9 years, 23 weeks ago

 Description   

I'm not sure how many users run Pig with the MongoDB Hadoop connector, but no documentation exists for it. The README in the pig folder of mongo-hadoop covers nearly everything you need to know, with one caveat: if a statement depends on an earlier statement that uses MongoStorage*, an EXEC statement must be placed after the earlier statement. Without it, the jobs can execute out of order. For example, with an insert on test.a followed by an update on test.a, the update may be lost.

The Pig documentation refers to this as Implicit Dependencies.
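
A minimal Pig Latin sketch of the caveat, assuming the insert and update on test.a go through the connector's MongoInsertStorage and MongoUpdateStorage classes. The jar paths, input files, field names, and schema are hypothetical, and the constructor arguments should be checked against the README in the pig folder before relying on them:

```pig
-- Hypothetical jar locations; adjust to the local installation.
REGISTER /path/to/mongo-java-driver.jar;
REGISTER /path/to/mongo-hadoop-core.jar;
REGISTER /path/to/mongo-hadoop-pig.jar;

-- Hypothetical input file and schema, for illustration only.
users = LOAD 'users.tsv' AS (id:chararray, name:chararray);

-- First statement: insert the documents into test.a.
STORE users INTO 'mongodb://localhost:27017/test.a'
    USING com.mongodb.hadoop.pig.MongoInsertStorage('', '');

-- Pig cannot see that the update below depends on the insert above,
-- so force everything up to this point to run first.
EXEC;

-- Hypothetical second input carrying the new names.
renamed = LOAD 'renamed.tsv' AS (id:chararray, name:chararray);

-- Second statement: update the documents that were just inserted.
-- The query/update argument forms are assumed from the connector README.
STORE renamed INTO 'mongodb://localhost:27017/test.a'
    USING com.mongodb.hadoop.pig.MongoUpdateStorage(
        '{id: "\$id"}',
        '{\$set: {name: "\$name"}}',
        'id:chararray, name:chararray');
```

Without the EXEC line, Pig is free to plan both STORE statements together, which is how the update can end up running before the insert.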

Given that, it may also be useful to write some Hive documentation.

