    eventfd: don't take the spinlock in eventfd_poll · e22553e2
    Chris Mason authored
    
    
    The spinlock in eventfd_poll is trying to protect the count of events so
    it can decide if it should return POLLIN, POLLERR, or POLLOUT.  But,
    because of the way we drop the lock after calling poll_wait, and drop it
    again before returning, we have the same pile of races with the lock as
    we do with a single read of ctx->count.
    
    This replaces the lock with a read barrier and single read.
    
    eventfd_write does a single bump of ctx->count, so this should not add
    new races with adding events.  eventfd_read is similar: it will do a
    single decrement with the lock held, and so we're making the race with
    concurrent readers slightly larger.
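
    As a rough illustration only (not the kernel patch itself), the
    "read barrier and single read" idea can be sketched in user space with
    C11 atomics.  The names sketch_ctx and sketch_poll are hypothetical,
    the acquire fence only approximates a kernel read barrier, and the
    thresholds used to pick POLLIN, POLLERR, and POLLOUT are assumptions
    about how the count maps to each flag.

```c
#include <poll.h>
#include <stdatomic.h>
#include <stdint.h>

/* Hypothetical user-space stand-in for the eventfd context. */
struct sketch_ctx {
    _Atomic uint64_t count;
};

/*
 * Decide the poll flags from one snapshot of the counter, without a lock.
 * The result can be stale by the time the caller looks at it, which is the
 * same race the locked version had once it dropped the lock.
 */
short sketch_poll(struct sketch_ctx *ctx)
{
    short events = 0;
    uint64_t count;

    /* Assumption: an acquire fence roughly stands in for a read barrier. */
    atomic_thread_fence(memory_order_acquire);

    /* Single read: every flag below is derived from this one value. */
    count = atomic_load_explicit(&ctx->count, memory_order_relaxed);

    if (count > 0)
        events |= POLLIN;   /* something to read */
    if (count == UINT64_MAX)
        events |= POLLERR;  /* assumed overflow marker */
    if (UINT64_MAX - 1 > count)
        events |= POLLOUT;  /* room to add at least 1 */

    return events;
}
```

    Because all three checks use the same snapshot, the returned flags are
    always consistent with each other, even if the counter changes right
    after the read.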
    
    This spinlock is the top CPU user in kernel code during one of our
    workloads.  Removing it gives us a ~2% boost.
    
    [arnd@arndb.de: avoid unused variable warning]
    [dan.carpenter@oracle.com: type bug in eventfd_poll()]
    Signed-off-by: Chris Mason <clm@fb.com>
    Cc: Davide Libenzi <davidel@xmailserver.org>
    Signed-off-by: Arnd Bergmann <a...