Treat list of struct as nested dataframe

XMLWordPrintableJSON

    • Type: Improvement
    • Resolution: Unresolved
    • Priority: Unknown
    • pymongoarrow-next
    • Affects Version/s: None
    • Component/s: None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      As requested in https://github.com/mongodb-labs/mongo-arrow/issues/170:

      I have a mongo document which has a list field containing child documents.

      Pandas data frames can be nested. And PyArrow has Table and RecordBatch types.

      I would like to avoid having to call pandas.json_normalize on the child list and instead have find_all_pandas return directly a nested dataframe.

      Would it be possible to use Table or RecordBatch type in the schema to get this behaviour?

            Assignee:
            Unassigned
            Reporter:
            Steve Silvester
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

              Created:
              Updated: