SHOAL-42

masterNode.getRouteControl().isConnected returns false intermittently or all the time

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: current
    • Fix Version/s: milestone 1
    • Component/s: GMS
    • Labels:
      None
    • Environment:

      Operating System: All
      Platform: Macintosh

    • Issuezilla Id:
      42

      Description

      Hi Mo,
      I added a log message to print out the value of isConnected() in the HealthMonitor class. I ran the
      rungmsdemo.sh test on 2 terminals and I see the value printed as false all the time.
      Please comment. This is crucial since the "false" value would mean that our logic to determine
      IN_DOUBT state would need to be altered.
      Thanks
      Sheetal

      On Feb 14, 2008, at 4:27 PM, Sheetal Vartak wrote:

      Hi Mo,
      As you are aware, one of the instances started while running my MultiGroupTest suddenly
      goes into the IN_DOUBT state. I was looking at the values computed for
      masterNode.getRouteControl().isConnected(entry.id) and found that the value is sometimes false and
      sometimes true. What I don't understand is how the value can be false at one point (time = t) and
      then become true at some later point (time = t + delta).
      Can you please shed some light on this?
      Thanks
      Sheetal

        Activity

        sheetalv added a comment -

        Mo has fixed this issue in the single cluster scenario. It still does not work
        correctly in a multi-cluster environment. The MultiGroupJoinTest produces false
        failures due to isConnected() returning false intermittently.
        The test can be run in two terminals as follows:
        sh runmultigroupjointest.sh C1
        sh runmultigroupjointest.sh C2

        Change log:
        https://shoal.dev.java.net/servlets/ReadMsg?list=cvs&msgNo=531

        sheetalv added a comment -

        This issue is now resolved. Added a fix in HealthMonitor so that it no longer checks
        isConnected() for the instance running in the local VM. The HealthMonitor's
        InDoubtPeerDetector thread still iterates through all the entries but skips its
        own entry, since isConnected() returns false for the local instance itself.

        https://shoal.dev.java.net/servlets/ReadMsg?list=cvs&msgNo=536
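
For readers without access to the change log, the skip-own-entry logic described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the actual Shoal code: the class name InDoubtSketch, the method findSuspects, the localId parameter, and the use of a Predicate in place of masterNode.getRouteControl().isConnected(...) are all hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Predicate;

// Hypothetical sketch; names here are illustrative and do not come from Shoal.
public class InDoubtSketch {

    /**
     * Returns the ids of entries that look disconnected.
     * The local instance's own id is never reported, because a
     * connectivity check against oneself is expected to return false.
     *
     * @param localId     id of the instance running in this VM (assumed name)
     * @param entryIds    ids of all entries being monitored
     * @param isConnected stand-in for the real isConnected(id) call
     */
    static List<String> findSuspects(String localId,
                                     List<String> entryIds,
                                     Predicate<String> isConnected) {
        List<String> suspects = new ArrayList<>();
        for (String id : entryIds) {
            if (id.equals(localId)) {
                continue; // skip our own entry: isConnected(self) is false
            }
            if (!isConnected.test(id)) {
                suspects.add(id); // candidate for IN_DOUBT handling
            }
        }
        return suspects;
    }
}
```

With this shape, an instance that is the only disconnected "peer" in its own view no longer marks itself as a suspect, which matches the behavior the comment above describes.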


          People

          • Assignee:
            hamada
            Reporter:
            sheetalv
          • Votes:
            0
            Watchers:
            0
