[SERVER-84981] Streamline TPC-H data and query generation Created: 30/Nov/21  Updated: 13/Jan/24  Resolved: 26/May/22

Status: Closed
Project: Core Server
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Task Priority: Major - P3
Reporter: Alya Berciu Assignee: Backlog - Query Optimization
Resolution: Won't Fix Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Assigned Teams:
Query Optimization
Participants:

 Description   

There have been a few comments on the design suggesting improvements to the process for generating MQL queries/data using the TPC-H dbgen and qgen tools. These include:

  • Modifying dbgen/qgen to generate CSVs instead of |-separated files (note that some strings in the data contain commas)
  • Modifying dbgen/qgen to generate SQL queries for portgres or mysql instead
  • Converting date fields from strings to ISODates in the csv files before importing them into mongod using mongoimport


 Comments   
Comment by Alya Berciu [ 26/May/22 ]

These improvements are nice to have if we are generating a new scale of data (in particular if we generate data frequently). However, for now, we have all the datasets we need.

Generated at Thu Feb 08 06:56:30 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.