Uploaded image for project: 'WiredTiger'
  1. WiredTiger
  2. WT-3175

New hang in internal page split

    • Type: Icon: Bug Bug
    • Resolution: Done
    • Priority: Icon: Major - P3 Major - P3
    • WT2.9.2, 3.2.13, 3.4.3, 3.5.4
    • Affects Version/s: None
    • Component/s: None
    • None
    • Storage 2017-02-13

      Several of our Jenkins jobs have hung, and it appears to be related to internal page splits from the call stacks. That means it's likely due to the changes in either WT-3170 or WT-3088. There have been several changes merged recently, so I opened a new ticket.

      The call stacks I see are:

            7 select,__wt_sleep,__wt_page_in_func,__wt_page_swap_func,__wt_row_search,__cursor_row_search,__wt_btcur_search,__curfile_search,worker,start_thread,clone
            4 select,__wt_sleep,__wt_writelock,__split_internal_lock,__wt_split_multi,__evict_page_dirty_update,__wt_evict,__evict_page,__wt_cache_eviction_worker,__wt_cache_eviction_check,__cursor_enter,__curfile_enter,__cursor_func_init,__wt_btcur_insert,__curfile_insert,worker,start_thread,clone
            2 select,__wt_sleep,__wt_writelock,__split_internal_lock,__wt_split_multi,__evict_page_dirty_update,__wt_evict,__evict_page,__evict_lru_pages,__wt_evict_thread_run,__wt_thread_run,start_thread,clone
            2
            1 select,__wt_sleep,__wt_page_in_func,__wt_page_swap_func,__tree_walk_internal,__wt_tree_walk,__wt_btcur_next,__curfile_next,do_range_reads,worker,start_thread,clone
            1 select,__wt_sleep,__wt_page_in_func,__split_verify_intl,__split_internal,__split_parent_climb,__wt_split_multi,__evict_page_dirty_update,__wt_evict,__evict_page,__evict_lru_pages,__wt_evict_thread_run,__wt_thread_run,start_thread,clone
            1 pthread_join,stop_threads,execute_workload,start_run,start_all_runs,main
            1 pthread_cond_timedwait@@GLIBC_2.3.2,__wt_cond_wait_signal,__wt_cond_wait,__sweep_server,start_thread,clone
            1 pthread_cond_timedwait@@GLIBC_2.3.2,__wt_cond_wait_signal,__wt_cond_wait,__statlog_server,start_thread,clone
            1 pthread_cond_timedwait@@GLIBC_2.3.2,__wt_cond_wait_signal,__wt_cond_wait,__evict_pass,__evict_server,__wt_evict_thread_run,__wt_thread_run,start_thread,clone
      

      From the Jenkins wtperf test job: http://build.wiredtiger.com:8080/job/wiredtiger-test-wtperf/2433/

      The configuration being run is:

      ./bench/wtperf/wtperf -O ../bench/wtperf/stress//btree-split-stress.wtperf -o verbose=2
      

            Assignee:
            michael.cahill@mongodb.com Michael Cahill (Inactive)
            Reporter:
            alexander.gorrod@mongodb.com Alexander Gorrod
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

              Created:
              Updated:
              Resolved: