-
Type: Bug
-
Resolution: Unresolved
-
Priority: Major - P3
-
None
-
Affects Version/s: None
-
Component/s: None
-
ALL
We have mongodb 5.0.10, shurded cluster with one shard.
Config servers - 6 nodes, 3 in main data center, 3 in second data center. In second data center 1 node has votes = 0, all nodes in second data center have priority = 0.
Data replica set - 6 nodes, 3 in main data center, 3 in second data center. In second data center 1 node has votes = 0, all nodes in second data center have priority = 0.
During some period of time we had network problems between main and second data centers. We still had majority 3 voting nodes of 5, but app had errors:
Query failed with error code 202 and error message 'Encountered non-retryable error during query :: caused by :: could not get updated shard list from config server :: caused by :: Request 621142959 timed out, deadline was 2024-08-01T13:16:38.351+03:00, op was RemoteCommand 621142959 -- target:[gorodec-mongo-config-2:27019] db:config expDate:2024-08-01T13:16:38.351+03:00 cmd:\{ find: "shards", maxTimeMS: 30000, readConcern: { level: "majority", afterOpTime: { ts: Timestamp(1722507368, 4), t: -1 } } }' on server mongo-config-1:27017; nested exception is com.mongodb.MongoQueryException: Query failed with error code 202 and error message 'Encountered non-retryable error during query :: caused by :: could not get updated shard list from config server :: caused by :: Request 621142959 timed out, deadline was 2024-08-01T13:16:38.351+03:00, op was RemoteCommand 621142959 -- target:[gorodec-mongo-config-2:27019] db:config expDate:2024-08-01T13:16:38.351+03:00 cmd:{ find: "shards", maxTimeMS: 30000, readConcern: { lev… at ru.lanit.torgi.relations.bidder.service.BidderRelationsReplicationServiceImpl.findPrincipalBidderCodes(BidderRelationsReplicationServiceImpl.java:78) at ru.lanit.torgi.relations.bidder.service.BidderRelationsReplicationServiceImpl.getPrincipalBidderCodesWithRepresentativeSigningRole(BidderRelationsReplicationServiceImpl.java:58) at ru.lanit.torgi.charges.service.impl.PaymentsCriteriaConverterImpl.getPayerCodes(PaymentsCriteriaConverterImpl.java:159) at ru.lanit.torgi.charges.service.impl.PaymentsCriteriaConverterImpl.convertToCriteria(PaymentsCriteriaConverterImpl.java:48) at ru.lanit.torgi.charges.data.impl.ChargesRepositoryServiceImpl.createPayedAggregationQuery(ChargesRepositoryServiceImpl.java:216) at ru.lanit.torgi.charges.data.impl.ChargesRepositoryServiceImpl.getTotalCount(ChargesRepositoryServiceImpl.java:152) at ru.lanit.torgi.charges.service.impl.ChargesServiceImpl.getTotalCount(ChargesServiceImpl.java:379) at ru.lanit.torgi.charges.service.impl.ChargesServiceImpl$$FastClassBySpringCGLIB$$2e00b4f2.invoke(<generated>) at ru.lanit.torgi.charges.service.impl.ChargesServiceImpl$$EnhancerBySpringCGLIB$$5033b8d9.getTotalCount(<generated>) at ru.lanit.torgi.charges.controller.PaymentsController.getTotalCount(PaymentsController.java:55) at ru.lanit.torgi.charges.controller.PaymentsController$$FastClassBySpringCGLIB$$3430e5e2.invoke(<generated>) at ru.lanit.torgi.charges.controller.PaymentsController$$EnhancerBySpringCGLIB$$8b927082.getTotalCount(<generated>)