  1. pymongoarrow
  2. ARROW-185

Treat list of struct as nested dataframe

    • Type: Improvement
    • Resolution: Unresolved
    • Priority: Unknown
    • 1.6
    • Affects Version/s: None
    • Component/s: None
    • None

      As requested in https://github.com/mongodb-labs/mongo-arrow/issues/170:

      I have a mongo document which has a list field containing child documents.

      Pandas data frames can be nested, and PyArrow has Table and RecordBatch types.

      I would like to avoid having to call pandas.json_normalize on the child list and instead have find_pandas_all directly return a nested dataframe.
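For context, the workaround the request wants to avoid might look roughly like the sketch below. The documents, field names (`items`, `sku`, `qty`) and values are hypothetical stand-ins for query results, not data from the original report; only `pandas.json_normalize` itself comes from the request.

```python
import pandas as pd

# Hypothetical documents, shaped like what a MongoDB query might return:
# each parent document carries a list field of child documents.
docs = [
    {"_id": 1, "name": "order-a",
     "items": [{"sku": "x1", "qty": 2}, {"sku": "y9", "qty": 1}]},
    {"_id": 2, "name": "order-b",
     "items": [{"sku": "z3", "qty": 5}]},
]

# Flatten the child list: one row per child document, with the parent
# fields named in `meta` repeated on every row.
flat = pd.json_normalize(docs, record_path="items", meta=["_id", "name"])
print(flat)
```

This produces a single flat frame, so the parent/child grouping has to be rebuilt afterwards (for example with a `groupby` on `_id`), which is the extra step the request hopes to skip.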

      Would it be possible to use Table or RecordBatch type in the schema to get this behaviour?

            Assignee:
            Unassigned
            Reporter:
            Steve Silvester (steve.silvester@mongodb.com)
            Votes:
            0
            Watchers:
            1

              Created:
              Updated: