Uploaded image for project: 'HPCC'
  1. HPCC
  2. HPCC-24141

Unexplained/unreported ftslave crashes

    XMLWordPrintable

Details

    • Bug
    • Status: Resolved
    • Not specified
    • Resolution: Cannot Reproduce
    • None
    • None
    • FTSlave
    • None

    Description

      There were several dozen core files from ftslave on 19th/20th May, with this stack:

      00000000 PRG 2020-05-20 08:01:06.295 389180 389180 "Starting ftslave 10.194.90.204 6814 0 6416 D20200520-080104 /var/log/HPCCSystems/dfuserver"
      00000001 PRG 2020-05-20 08:01:06.295 389180 389180 "Starting remote slave.  Master=10.194.90.204 reply=6814 port=6416"
      00000002 PRG 2020-05-20 08:01:06.295 389180 389180 "Ready to listen. reply=6814 port=6416"
      00000003 PRG 2020-05-20 08:01:06.295 389180 389180 "Ready to accept connection. reply=6814"
      00000004 PRG 2020-05-20 08:01:06.306 389180 389180 "Process incoming connection. reply=6814 got(6814,10.194.90.204)"
      00000005 PRG 2020-05-20 08:01:06.306 389180 389180 "Connection matched - continue...."
      00000006 USR 2020-05-20 08:01:06.306 389180 389180 "================================================"
      00000007 USR 2020-05-20 08:01:06.306 389180 389180 "Program:   10.194.90.143:/mnt/disk1/HPCCSystems/bin/ftslave"
      00000008 USR 2020-05-20 08:01:06.306 389180 389180 "Signal:    11 Segmentation fault"
      00000009 USR 2020-05-20 08:01:06.306 389180 389180 "Fault IP:  00007F096587C318"
      0000000A USR 2020-05-20 08:01:06.306 389180 389180 "Accessing: 0000000001FAA000"
      0000000B PRG 2020-05-20 08:01:06.306 389180 389180 "Backtrace:"
      0000000C PRG 2020-05-20 08:01:06.307 389180 389180 "  /usr/lib64/libc.so.6(+0x156318) [0x7f096587c318]"
      0000000D PRG 2020-05-20 08:01:06.307 389180 389180 "  /opt/HPCCSystems/lib/libjlib.so(_ZN12MemoryBuffer4readEjPv+0x1f) [0x7f096752d8cf]"
      0000000E PRG 2020-05-20 08:01:06.307 389180 389180 "  /opt/HPCCSystems/lib/libjlib.so(_Z11deserializeR12MemoryBufferR10MemoryAttr+0x31) [0x7f096752de21]"
      0000000F PRG 2020-05-20 08:01:06.307 389180 389180 "  /opt/HPCCSystems/lib/libdalift.so(_Z11deserializeR9CIArrayOfI14PartitionPointER12MemoryBuffer+0x54) [0x7f096c03f5e4]"
      00000010 PRG 2020-05-20 08:01:06.307 389180 389180 "  /opt/HPCCSystems/lib/libdalift.so(_ZN14TransferServer17deserializeActionER12MemoryBufferj+0xba) [0x7f096c040eba]"
      00000011 PRG 2020-05-20 08:01:06.307 389180 389180 "  ftslave() [0x40211e]"
      00000012 PRG 2020-05-20 08:01:06.307 389180 389180 "  /opt/HPCCSystems/lib/libremote.so(_ZN12CRemoteSlave3runEiPPc+0x331) [0x7f096b5ed6f1]"
      00000013 PRG 2020-05-20 08:01:06.307 389180 389180 "  ftslave() [0x401fd3]"
      00000014 PRG 2020-05-20 08:01:06.307 389180 389180 "  /usr/lib64/libc.so.6(__libc_start_main+0xf5) [0x7f0965748505]"
      00000015 PRG 2020-05-20 08:01:06.307 389180 389180 "  ftslave() [0x402026]"
      00000016 USR 2020-05-20 08:01:06.307 389180 389180 "Registers:"
      00000017 USR 2020-05-20 08:01:06.307 389180 389180 "EAX:00007F08EF278010  EBX:00007FFFAE77F500  ECX:0000000000000000  EDX:0000000074681C52  ESI:0000000001FA9F81  EDI:00007F08EF27D230"
      00000018 USR 2020-05-20 08:01:06.307 389180 389180 "R8 :00007F08EF278010  R9 :0000000000500000  R10:0000000000000022  R11:0000000000001000"
      00000019 USR 2020-05-20 08:01:06.307 389180 389180 "R12:0000000000000001  R13:00007FFFAE77F500  R14:00007FFFAE77F500  R15:00007FFFAE77F560"
      0000001A USR 2020-05-20 08:01:06.307 389180 389180 "CS:EIP:0033:00007F096587C318"
      0000001B USR 2020-05-20 08:01:06.307 389180 389180 "   ESP:00007FFFAE77F218  EBP:0000000074686F72"
      

      Attachments

        Activity

          People

            Unassigned Unassigned
            jakesmith Jake Smith
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: