Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Fixed
Priority: Unknown
Fix Version/s: 4.4
Affects Version/s: 4.4
Component/s: BSON
Labels:
None

Confidence Status:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name:
None
Goal Link:
None

In _cbsonmodule.c, the _type_marker function uses PyObject_HasAttrString(object, "_type_marker") and PyObject_GetAttrString(object, "_type_marker"). In my workloads (highly nested documents with many large array fields), these functions become severe bottlenecks to performance, because they each create new Python string objects by calling PyUnicode_FromString("_type_marker") every time they run.

A simple change that helped substantially (~60% faster) was creating a global TYPEMARKERSTR object, defining it once in PyInit__cbson as PyUnicode_FromString("_type_marker"), and replacing PyObject_Has/GetAttrString(object, "_type_marker") with PyObject_Has/GetAttr(object, TYPEMARKERSTR). One caveat is that this leaks the TYPEMARKERSTR object in the case that the cbson module is unloaded.

Also, correct me if I'm wrong, but I believe these lines are redundant because the function returns type at the end regardless.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

Screenshot 2023-06-01 at 11.24.54 AM.png
323 kB
Jun 01 2023 06:28:53 PM UTC

causes

PYTHON-3798 add error checking and visit for _type_marker_str

Closed

PYTHON-3728 Coverity issue with convert_codec_options

Closed

is related to

PYTHON-3729 Speed up C BSON encoding by using PyObject_GetAttr instead of PyObject_GetAttrString

Closed

PYTHON-3718 Faster INT2STRING

Closed

PYTHON-3797 Cache commonly used strings in C extensions

Closed

related to

PYTHON-3819 Optimize BSON encoding/decoding performance

Released

(1 related to)

Assignee:: Steve Silvester
Reporter:: thalassemia N/A
Votes:: 0 Vote for this issue
Watchers:: 4 Start watching this issue

Created:: May 22 2023 09:18:57 PM UTC
Updated:: Oct 29 2023 02:27:52 AM UTC
Resolved:: May 26 2023 02:40:48 PM UTC

Details

Description

Attachments

Attachments

Issue Links

Activity

People

Dates