Skip to content
  • Raghavendra K T's avatar
    kvm: Handle yield_to failure return code for potential undercommit case · c45c528e
    Raghavendra K T authored
    
    
    yield_to returns -ESRCH, When source and target of yield_to
    run queue length is one. When we see three successive failures of
    yield_to we assume we are in potential undercommit case and abort
    from PLE handler.
    The assumption is backed by low probability of wrong decision
    for even worst case scenarios such as average runqueue length
    between 1 and 2.
    
    More detail on rationale behind using three tries:
    if p is the probability of finding rq length one on a particular cpu,
    and if we do n tries, then probability of exiting ple handler is:
    
     p^(n+1) [ because we would have come across one source with rq length
    1 and n target cpu rqs  with length 1 ]
    
    so
    num tries:         probability of aborting ple handler (1.5x overcommit)
     1                 1/4
     2                 1/8
     3                 1/16
    
    We can increase this probability with more tries, but the problem is
    the overhead.
    Also, If we have tried three times that means we would have iterated
    over 3 good eligible vcpus along with many non-eligible candidates. In
    worst case if we iterate all the vcpus, we reduce 1x performance and
    overcommit performance get hit.
    
    note that we do not update last boosted vcpu in failure cases.
    Thank Avi for raising question on aborting after first fail from yield_to.
    
    Reviewed-by: default avatarSrikar Dronamraju <srikar@linux.vnet.ibm.com>
    Signed-off-by: default avatarRaghavendra K T <raghavendra.kt@linux.vnet.ibm.com>
    Tested-by: default avatarChegu Vinod <chegu_vinod@hp.com>
    Signed-off-by: default avatarGleb Natapov <gleb@redhat.com>
    c45c528e