Skip to content
Snippets Groups Projects
  1. Jul 24, 2011
  2. Mar 31, 2011
  3. Mar 28, 2011
  4. Jan 08, 2011
  5. Dec 23, 2010
  6. Dec 16, 2010
  7. Oct 08, 2010
    • Sunil Mushran's avatar
      · 0e105d37
      Sunil Mushran authored
      ocfs2/cluster: Check slots for unconfigured live nodes
      
      o2hb currently checks slots for configured nodes only. This patch makes
      it check the slots for the live nodes too to take care of a race in which
      a node is removed from the configuration but not from the live map.
      
      Signed-off-by: default avatarSunil Mushran <sunil.mushran@oracle.com>
      0e105d37
  8. Sep 18, 2010
  9. Aug 07, 2010
  10. May 06, 2010
  11. Mar 13, 2010
  12. Feb 08, 2010
  13. Jan 26, 2010
  14. Jun 15, 2009
  15. Oct 31, 2008
  16. Aug 22, 2008
  17. May 31, 2008
  18. Apr 18, 2008
    • Sunil Mushran's avatar
      ocfs2/net: Add debug interface to o2net · 2309e9e0
      Sunil Mushran authored
      
      This patch exposes o2net information via debugfs. The information includes
      the list of sockets (sock_containers) as well as the list of outstanding
      messages (send_tracking). Useful for o2dlm debugging.
      
      (This patch is derived from an earlier one written by Zach Brown that
      exposed the same information via /proc.)
      
      [Mark: checkpatch fixes]
      
      Signed-off-by: default avatarSunil Mushran <sunil.mushran@oracle.com>
      Reviewed-by: default avatarJoel Becker <joel.becker@oracle.com>
      Signed-off-by: default avatarMark Fasheh <mfasheh@suse.com>
      2309e9e0
    • Jeff Mahoney's avatar
      ocfs2/cluster: Get rid of arguments to the timeout routines · 409753bf
      Jeff Mahoney authored
      
      We keep seeing bug reports related to NULL pointer derefs in
      o2net_set_nn_state(). When I originally wrote up the configurable timeout
      patch, I had tried to plan for multiple clusters. This was silly.
      
      The timeout routines all use o2nm_single_cluster so there's no point in
      passing an argument at all. This patch removes the arguments and kills those
      bugs dead.
      
      Signed-off-by: default avatarJeff Mahoney <jeffm@suse.com>
      Signed-off-by: default avatarMark Fasheh <mfasheh@suse.com>
      409753bf
    • Tao Ma's avatar
      ocfs2: Reconnect after idle time out. · 5cc3bf27
      Tao Ma authored
      
      Currently, o2net connects to a node on hb_up and disconnects on
      hb_down and net timeout.
      
      It disconnects on net timeout is ok, but it should attempt to
      reconnect back. This is because sometimes nodes get overloaded
      enough that the network connection breaks but the disk hb does not.
      And if we get into that situation, we either fence (unnecessarily)
      or wait for its disk hb to die (and sometimes hang in the process).
      
      So in this updated scheme, when the network disconnects, we keep
      attempting to reconnect till we succeed or we get a disk hb down
      event.
      
      If the other node is really dead, then we will eventually get a
      node down event. If not, we should be able to connect again and
      continue.
      
      Signed-off-by: default avatarTao Ma <tao.ma@oracle.com>
      Signed-off-by: default avatarMark Fasheh <mfasheh@suse.com>
      5cc3bf27
  19. Mar 10, 2008
  20. Dec 05, 2007
  21. Nov 13, 2007
  22. Aug 10, 2007
  23. Jul 11, 2007
  24. May 03, 2007
  25. Mar 14, 2007
  26. Feb 07, 2007
    • Zhen Wei's avatar
      ocfs2: introduce sc->sc_send_lock to protect outbound outbound messages · 925037bc
      Zhen Wei authored
      
      When there is a lot of multithreaded I/O usage, two threads can collide
      while sending out a message to the other nodes. This is due to the lack of
      locking between threads while sending out the messages.
      
      When a connected TCP send(), sendto(), or sendmsg() arrives in the Linux
      kernel, it eventually comes through tcp_sendmsg(). tcp_sendmsg() protects
      itself by acquiring a lock at invocation by calling lock_sock().
      tcp_sendmsg() then loops over the buffers in the iovec, allocating
      associated sk_buff's and cache pages for use in the actual send. As it does
      so, it pushes the data out to tcp for actual transmission. However, if one
      of those allocation fails (because a large number of large sends is being
      processed, for example), it must wait for memory to become available. It
      does so by jumping to wait_for_sndbuf or wait_for_memory, both of which
      eventually cause a call to sk_stream_wait_memory(). sk_stream_wait_memory()
      contains a code path that calls sk_wait_event(). Finally, sk_wait_event()
      contains the call to release_sock().
      
      The following patch adds a lock to the socket container in order to
      properly serialize outbound requests.
      
      From: Zhen Wei <zwei@novell.com>
      Acked-by: default avatarJeff Mahoney <jeffm@suse.com>
      Signed-off-by: default avatarMark Fasheh <mark.fasheh@oracle.com>
      925037bc
    • Sunil Mushran's avatar
      ocfs2: Binds listener to the configured ip address · ab81afd3
      Sunil Mushran authored
      
      This patch binds the o2net listener to the configured ip address
      instead of INADDR_ANY for security. Fixes oss.oracle.com bugzilla#814.
      
      Signed-off-by: default avatarSunil Mushran <sunil.mushran@oracle.com>
      Signed-off-by: default avatarMark Fasheh <mark.fasheh@oracle.com>
      ab81afd3
    • Kurt Hackel's avatar
      ocfs2: Added post handler callable function in o2net message handler · d74c9803
      Kurt Hackel authored
      
      Currently o2net allows one handler function per message type. This
      patch adds the ability to call another function to be called after
      the handler has returned the message to the other node.
      
      Handlers are now given the option of returning a context (in the form of a
      void **) which will be passed back into the post message handler function.
      
      Signed-off-by: default avatarKurt Hackel <kurt.hackel@oracle.com>
      Signed-off-by: default avatarSunil Mushran <sunil.mushran@oracle.com>
      Signed-off-by: default avatarMark Fasheh <mark.fasheh@oracle.com>
      d74c9803
  27. Dec 13, 2006
  28. Dec 11, 2006
    • Andrew Beekhof's avatar
      [patch 3/3] OCFS2 Configurable timeouts - Protocol changes · 828ae6af
      Andrew Beekhof authored
      
      Modify the OCFS2 handshake to ensure essential timeouts are configured
      identically on all nodes.
      
      Only allow changes when there are no connected peers
      
      Improves the logic in o2net_advance_rx() which broke now that
      sizeof(struct o2net_handshake) is greater than sizeof(struct o2net_msg)
      
      Included is the field for userspace-heartbeat timeout to avoid the need for
      further protocol changes.
      
      Uses a global spinlock to ensure the decisions to update configfs entries
      are made on the correct value.  The region covered by the spinlock when
      incrementing the counter is much larger as this is the more critical case.
      
      Small cleanup contributed by Adrian Bunk <bunk@stusta.de>
      
      Signed-off-by: default avatarAndrew Beekhof <abeekhof@suse.de>
      Signed-off-by: default avatarMark Fasheh <mark.fasheh@oracle.com>
      828ae6af
  29. Dec 08, 2006
  30. Nov 22, 2006
  31. Jun 30, 2006
Loading