Details
Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Fixed
None
Component/s: DFU Server, EclAgent, eclccserver, ESP, JLib, Sasha, Thor
None
Description
Several components fail to contact Dali if Dali is configured with a hostname that contains a dash:
Starting mydafilesrv ... [ OK ]
Starting mydali ... [ OK ]
Starting mydfuserver ... [ TIMEOUT ]
Starting myeclagent ... [ TIMEOUT ]
Starting myeclccserver ... [ TIMEOUT ]
Starting myeclscheduler ... [ TIMEOUT ]
Starting myesp ... [ TIMEOUT ]
Starting myroxie ... [ OK ]
Starting mysasha ... [ TIMEOUT ]
Starting mythor ... [ TIMEOUT ]
Sample log output from dfuserver when Dali is configured with hostname: hadoop-Virtualbox
00000000 2017-02-17 16:41:06.815 31761 31761 "ftslave log dir set to /var/log/HPCCSystems/mydfuserver"
00000001 2017-02-21 11:05:05.374 31761 31761 "jsocket(1,3007) getaddrinfo failed err = -3 : hadoop"
00000002 2017-02-21 11:05:05.381 31761 31761 "Backtrace:"
00000003 2017-02-21 11:05:05.384 31761 31761 " /home/hadoop/GIT/HPCC-build/Debug/libs/libjlib.so(_Z16printStackReportx+0x5a) [0x7ffff66645ff]"
00000004 2017-02-21 11:05:05.384 31761 31761 " /home/hadoop/GIT/HPCC-build/Debug/libs/libjlib.so(+0x1408dd) [0x7ffff67078dd]"
00000005 2017-02-21 11:05:05.384 31761 31761 " /home/hadoop/GIT/HPCC-build/Debug/libs/libjlib.so(_ZN9IpAddress5ipsetEPKc+0xfa) [0x7ffff6707c66]"
00000006 2017-02-21 11:05:05.384 31761 31761 " /home/hadoop/GIT/HPCC-build/Debug/libs/libjlib.so(+0x14d059) [0x7ffff6714059]"
00000007 2017-02-21 11:05:05.384 31761 31761 " /home/hadoop/GIT/HPCC-build/Debug/libs/libjlib.so(_ZN19SocketEndpointArray8fromTextEPKcj+0x1f4) [0x7ffff670b33c]"
00000008 2017-02-21 11:05:05.384 31761 31761 " /home/hadoop/GIT/HPCC-build/Debug/libs/libmp.so(+0xfc37) [0x7ffff6a0ac37]"
00000009 2017-02-21 11:05:05.385 31761 31761 " /home/hadoop/GIT/HPCC-build/Debug/libs/libmp.so(_Z12createIGroupPKct+0x11c) [0x7ffff6a07d6c]"
0000000A 2017-02-21 11:05:05.385 31761 31761 " /home/hadoop/GIT/HPCC-build/Debug/bin/dfuserver() [0x4182da]"
0000000B 2017-02-21 11:05:05.385 31761 31761 " /lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf0) [0x7ffff5a69830]"
0000000C 2017-02-21 11:05:05.385 31761 31761 " /home/hadoop/GIT/HPCC-build/Debug/bin/dfuserver(_start+0x29) [0x406b19]"
0000000D 2017-02-21 11:05:26.160 31761 31761 "Failed to connect to Dali Server 0.0.0.0."
0000000E 2017-02-21 11:05:26.160 31761 31761 "0.0.0.0 Retrying..."
0000000F 2017-02-21 11:05:36.161 31761 31761 "Failed to connect to Dali Server 0.0.0.0."
00000010 2017-02-21 11:05:46.163 31761 31761 "Failed to connect to Dali Server 0.0.0.0."
00000011 2017-02-21 11:05:56.165 31761 31761 "Failed to connect to Dali Server 0.0.0.0."
00000012 2017-02-21 11:05:56.165 31761 31761 "0.0.0.0 Retrying..."
00000013 2017-02-21 11:06:06.167 31761 31761 "Failed to connect to Dali Server 0.0.0.0."
00000014 2017-02-21 11:06:16.169 31761 31761 "Failed to connect to Dali Server 0.0.0.0."
00000015 2017-02-21 11:06:26.173 31761 31761 "Failed to connect to Dali Server 0.0.0.0."
00000016 2017-02-21 11:06:36.175 31761 31761 "Failed to connect to Dali Server 0.0.0.0."
00000017 2017-02-21 11:06:46.177 31761 31761 "Failed to connect to Dali Server 0.0.0.0."
00000018 2017-02-21 11:06:46.177 31761 31761 "0.0.0.0 Retrying..."
00000019 2017-02-21 11:06:56.178 31761 31761 "Failed to connect to Dali Server 0.0.0.0."
The issue can be traced back to SocketEndpointArray::fromText(), which treats the dash as an IP range delimiter. As a result only the part before the dash ("hadoop") reaches getaddrinfo, as the log above shows:
Thread #1 [dfuserver] 12721 [core: 3] (Suspended : Breakpoint)
appendv4range() at jsocket.cpp:5,815 0x7ffff6713dc1
SocketEndpointArray::fromText() at jsocket.cpp:5,935 0x7ffff670b33c
CGroup::fromText() at mpbase.cpp:557 0x7ffff6a0ac37
createIGroup() at mpbase.cpp:702 0x7ffff6a07d6c
main() at dfuserver.cpp:162 0x4182da
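To illustrate the failure mode, here is a minimal sketch of how splitting on the first dash truncates a hostname, together with one possible guard. It is not the jsocket.cpp code and not necessarily the fix that was applied: the helpers isNumericDotted, naiveHostPart and guardedHostPart are hypothetical names introduced for this example, and the range syntax "192.168.1.1-5" is assumed.

```cpp
#include <cctype>
#include <string>

// Hypothetical helper (not from jsocket.cpp): true when `s` is non-empty and
// contains only digits and dots, i.e. it could be part of an IPv4 address.
static bool isNumericDotted(const std::string &s)
{
    if (s.empty())
        return false;
    for (unsigned char c : s)
        if (!std::isdigit(c) && c != '.')
            return false;
    return true;
}

// Naive split: everything before the first '-' is taken as the host.
// This mirrors the symptom in the log, where only "hadoop" is resolved.
static std::string naiveHostPart(const std::string &text)
{
    return text.substr(0, text.find('-'));
}

// Guarded split (assumed approach): treat '-' as a range delimiter only when
// both sides are numeric; otherwise keep the whole text as a hostname.
static std::string guardedHostPart(const std::string &text)
{
    std::size_t dash = text.find('-');
    if (dash == std::string::npos)
        return text;
    std::string lhs = text.substr(0, dash);
    std::string rhs = text.substr(dash + 1);
    if (isNumericDotted(lhs) && isNumericDotted(rhs))
        return lhs; // start of an IP range such as "192.168.1.1-5"
    return text;    // a hostname that happens to contain a dash
}
```

With these helpers, naiveHostPart("hadoop-Virtualbox") returns "hadoop", while guardedHostPart("hadoop-Virtualbox") returns the full hostname and guardedHostPart("192.168.1.1-5") still returns "192.168.1.1" as the start of the range.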
Issue Links
- is duplicated by: HPCC-17270 IP range handling regression (from HPCC-17110). No longer handling files with published ranges correctly. (Resolved)