-
Story
-
Resolution: Done
-
Normal
-
None
-
None
-
RELENG Sprint 32, RELENG Sprint 33, RELENG Sprint 34
From comments, outstanding issues:
- 504 Gateway Timeouts with lftools deploy archives / Nexus unpack plugin
- Connection timeouts caused by sockets in the TIME_WAIT state, because incoming requests exceed 20-30k per second on nginx.
What has been addressed so far?
- ODL Nexus has been moved to a larger flavor (8 CPUs + 32 GiB).
- ingress/router network latency issues are resolved.
- Nexus scheduled task (to rebuild maven metadata) is disabled.
- Nexus scheduled task (to delete old snapshots) is moved to Sundays.
- The Nexus cron job that purges old archives, which was taking about 6-8 hours to complete, is fixed (CR in review).
- rdiff backup script has been disabled.
- Identified socket/port exhaustion issue, expanded port range.
- Identified an MTU issue between ingress and Nexus nodes, and that has been addressed.
- FS tweaks and noatime have been put in place for ODL Nexus.
- Network tunables are in place for handling max connections, to address the TIME_WAIT state on open sockets.
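To illustrate why expanding the port range helps with the socket/port exhaustion item above, here is a rough back-of-the-envelope sketch. It assumes the Linux TIME_WAIT duration of 60 seconds and the common default ephemeral range of 32768-60999. The expanded range 1024-65535 is only an example, since the ticket does not state which values were actually applied. The helper names are hypothetical.

```python
# Rough estimate of how many new outbound connections per second a host can
# sustain to a single (destination IP, destination port) before it runs out
# of ephemeral ports, given that each closed socket lingers in TIME_WAIT.

TIME_WAIT_SECONDS = 60  # assumption: Linux holds TIME_WAIT sockets for 60 s


def usable_ports(low: int, high: int) -> int:
    """Number of ports in an inclusive ip_local_port_range."""
    return high - low + 1


def max_new_connections_per_second(low: int, high: int,
                                   time_wait: int = TIME_WAIT_SECONDS) -> float:
    """Upper bound on sustained new connections per second to one destination."""
    return usable_ports(low, high) / time_wait


if __name__ == "__main__":
    ranges = {
        "default (assumed)": (32768, 60999),
        "expanded (example only)": (1024, 65535),
    }
    for label, (low, high) in ranges.items():
        rate = max_new_connections_per_second(low, high)
        print(f"{label}: {usable_ports(low, high)} ports, ~{rate:.0f} new conn/s")
```

The bound applies per destination address and port, so a proxy such as nginx talking to a single upstream is the case where it bites first. Connection reuse (keepalive) toward the upstream reduces the pressure further than a larger range alone.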
AI:
- Proactive monitoring and alerts are required for the Nexus system (and other internal nodes).
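As one possible starting point for the monitoring action item, the sketch below counts sockets per TCP state from the text of /proc/net/tcp and flags a high TIME_WAIT count. It is a minimal illustration, not the monitoring that was deployed: the function names and the threshold of 20000 are assumptions, and it covers IPv4 sockets only.

```python
from collections import Counter
from pathlib import Path

# State codes used in the "st" column of /proc/net/tcp.
TCP_STATES = {
    "01": "ESTABLISHED", "02": "SYN_SENT", "03": "SYN_RECV",
    "04": "FIN_WAIT1", "05": "FIN_WAIT2", "06": "TIME_WAIT",
    "07": "CLOSE", "08": "CLOSE_WAIT", "09": "LAST_ACK",
    "0A": "LISTEN", "0B": "CLOSING",
}


def count_tcp_states(proc_net_tcp_text: str) -> Counter:
    """Count sockets per TCP state from the contents of /proc/net/tcp."""
    counts = Counter()
    for line in proc_net_tcp_text.splitlines()[1:]:  # first line is the header
        fields = line.split()
        if len(fields) < 4:
            continue
        code = fields[3].upper()  # fourth column is the state code
        counts[TCP_STATES.get(code, code)] += 1
    return counts


def time_wait_alert(counts: Counter, threshold: int) -> bool:
    """True when the number of TIME_WAIT sockets exceeds the threshold."""
    return counts["TIME_WAIT"] > threshold


if __name__ == "__main__":
    path = Path("/proc/net/tcp")
    if path.exists():
        counts = count_tcp_states(path.read_text())
        print(dict(counts))
        # Hypothetical threshold; tune it to the configured port range.
        if time_wait_alert(counts, threshold=20000):
            print("ALERT: TIME_WAIT socket count is high")
```

A check like this could be run periodically and wired into whatever alerting system is already in use.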
= = = = = = = = = = ====
As per
https://rt-sso.linuxfoundation.org/Ticket/Display.html?id=60166
this error from the build logs:
02:34:02 [ERROR] Failed to execute goal on project daexim-impl: Could not resolve dependencies for project org.opendaylight.daexim:daexim-impl:bundle:1.3.4: Failed to collect dependencies at com.jayway.jsonpath:json-path-assert:jar:2.2.0: Failed to read artifact descriptor for com.jayway.jsonpath:json-path-assert:jar:2.2.0: Could not transfer artifact com.jayway.jsonpath:json-path-assert:pom:2.2.0 from/to opendaylight-mirror (https://nexus.opendaylight.org/content/repositories/public/): Network is unreachable (connect failed) -> [Help 1]
might correlate with this from odl-nexus:
[root@vex-yul-odl-nexus-1.ci logs]# grep 02:34 wrapper.log
jvm 1 | 2018-08-27 02:34:11 WARN [32473851-542750] - org.sonatype.nexus.content.internal.ContentServlet - org.eclipse.jetty.io.EofException, caused by: java.io.IOException: Connection reset by peer [client=221.215.106.202,ua=Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.1; WOW64; Trident/4.0; SLCC2; .NET CLR 2.0.50727; .NET CLR 3.5.30729; .NET CLR 3.0.30729; Media Center PC 6.0; .NET4.0C; .NET4.0E),req=GET http://nexus.opendaylight.org/content/repositories/public/org/opendaylight/integration/karaf/0.8.3/karaf-0.8.3.zip]
jvm 1 | 2018-08-27 02:34:11 ERROR [32473851-542750] - org.sonatype.nexus.web.internal.ErrorPageFilter - Internal error
jvm 1 | 2018-08-27 02:34:13 WARN [32473851-542742] - org.sonatype.nexus.content.internal.ContentServlet - org.eclipse.jetty.io.EofException, caused by: java.io.IOException: Connection reset by peer [client=221.215.106.202,ua=Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.1; WOW64; Trident/4.0; SLCC2; .NET CLR 2.0.50727; .NET CLR 3.5.30729; .NET CLR 3.0.30729; Media Center PC 6.0; .NET4.0C; .NET4.0E),req=GET http://nexus.opendaylight.org/content/repositories/public/org/opendaylight/integration/karaf/0.8.3/karaf-0.8.3.zip]
jvm 1 | 2018-08-27 02:34:13 ERROR [32473851-542742] - org.sonatype.nexus.web.internal.ErrorPageFilter - Internal error
jvm 1 | 2018-08-27 02:34:14 WARN [32473851-542732] - org.sonatype.nexus.proxy.storage.remote.httpclient.HttpClientRemoteStorage - Remote HTTP request with malformed path attempted: repository M2Repository(id=central-new), path /com/huawei/bsp/com.huawei.bsp.commonlib.roa.restserver/${commonlib.version}/com.huawei.bsp.commonlib.roa.restserver-${commonlib.version}.jar
jvm 1 | 2018-08-27 02:34:14 WARN [32473851-542732] - org.sonatype.nexus.proxy.storage.remote.httpclient.HttpClientRemoteStorage - Remote HTTP request with malformed path attempted: repository M2Repository(id=maven-restlet), path /com/huawei/bsp/com.huawei.bsp.commonlib.roa.restserver/${commonlib.version}/com.huawei.bsp.commonlib.roa.restserver-${commonlib.version}.jar
jvm 1 | 2018-08-27 02:34:14 WARN [32473851-542732] - org.sonatype.nexus.proxy.storage.remote.httpclient.HttpClientRemoteStorage - Remote HTTP request with malformed path attempted: repository M2Repository(id=servicemix), path /com/huawei/bsp/com.huawei.bsp.commonlib.roa.restserver/${commonlib.version}/com.huawei.bsp.commonlib.roa.restserver-${commonlib.version}.jar
jvm 1 | 2018-08-27 02:34:14 WARN [32473851-542732] - org.sonatype.nexus.proxy.storage.remote.httpclient.HttpClientRemoteStorage - Remote HTTP request with malformed path attempted: repository M2Repository(id=juniper-contrail), path /com/huawei/bsp/com.huawei.bsp.commonlib.roa.restserver/${commonlib.version}/com.huawei.bsp.commonlib.roa.restserver-${commonlib.version}.jar
jvm 1 | 2018-08-27 02:34:14 WARN [32473851-542732] - org.sonatype.nexus.proxy.storage.remote.httpclient.HttpClientRemoteStorage - Remote HTTP request with malformed path attempted: repository M2Repository(id=sevntu), path /com/huawei/bsp/com.huawei.bsp.commonlib.roa.restserver/${commonlib.version}/com.huawei.bsp.commonlib.roa.restserver-${commonlib.version}.jar
jvm 1 | 2018-08-27 02:34:14 WARN [32473851-542732] - org.sonatype.nexus.proxy.storage.remote.httpclient.HttpClientRemoteStorage - Remote HTTP request with malformed path attempted: repository M2Repository(id=gemini-blueprint), path /com/huawei/bsp/com.huawei.bsp.commonlib.roa.restserver/${commonlib.version}/com.huawei.bsp.commonlib.roa.restserver-${commonlib.version}.jar
jvm 1 | 2018-08-27 02:34:14 WARN [32473851-542732] - org.sonatype.nexus.proxy.storage.remote.httpclient.HttpClientRemoteStorage - Remote HTTP request with malformed path attempted: repository M2Repository(id=clojars), path /com/huawei/bsp/com.huawei.bsp.commonlib.roa.restserver/${commonlib.version}/com.huawei.bsp.commonlib.roa.restserver-${commonlib.version}.jar
jvm 1 | 2018-08-27 02:34:51 INFO [32473851-542713] - org.apache.http.impl.execchain.RetryExec - I/O exception (org.apache.http.NoHttpResponseException) caught when processing request to {}->http://svn.apache.org:80: The target server failed to respond
jvm 1 | 2018-08-27 02:34:51 INFO [32473851-542713] - org.apache.http.impl.execchain.RetryExec - Retrying request to {}->http://svn.apache.org:80
jvm 1 | 2018-08-27 02:34:55 WARN [32473851-542623] - org.sonatype.nexus.content.internal.ContentServlet - org.eclipse.jetty.io.EofException, caused by: java.io.IOException: Connection reset by peer [client=221.215.106.202,ua=Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.1; WOW64; Trident/4.0; SLCC2; .NET CLR 2.0.50727; .NET CLR 3.5.30729; .NET CLR 3.0.30729; Media Center PC 6.0; .NET4.0C; .NET4.0E),req=GET http://nexus.opendaylight.org/content/repositories/public/org/opendaylight/integration/karaf/0.8.3/karaf-0.8.3.zip]
jvm 1 | 2018-08-27 02:34:55 ERROR [32473851-542623] - org.sonatype.nexus.web.internal.ErrorPageFilter - Internal error
jvm 1 | 2018-08-27 11:02:34 WARN [etcherImpl-task] - com.sonatype.central.secure.nexus.plugin.internal.AuthtokenFetcherImpl - Failed to fetch authtoken: org.apache.http.conn.ConnectTimeoutException: Connect to secure.central.sonatype.com:443 [secure.central.sonatype.com/207.223.241.90] failed: connect timed out
jvm 1 | 2018-08-27 13:02:34 INFO [32473851-569336] - org.apache.http.impl.execchain.RetryExec - I/O exception (org.apache.http.NoHttpResponseException) caught when processing request to {}->http://svn.apache.org:80: The target server failed to respond
jvm 1 | 2018-08-27 13:02:34 INFO [32473851-569336] - org.apache.http.impl.execchain.RetryExec - Retrying request to {}->http://svn.apache.org:80
Investigate further.
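To help with that investigation, the following sketch splits wrapper.log output like the excerpt above into individual entries and tallies them by log level and client IP for a given minute. It assumes the "jvm N | timestamp LEVEL [thread] - logger - message" layout seen in the excerpt. The function names are hypothetical, and the regular expressions have only been reasoned against the lines quoted in this ticket.

```python
import re
from collections import Counter

# Entry layout assumed from the excerpt:
#   <timestamp> <LEVEL> [<thread>] - <logger> - <message>
ENTRY = re.compile(
    r"(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2})\s+(INFO|WARN|ERROR)\s+"
    r"\[[^\]]*\] - (\S+) - (.*)",
    re.DOTALL,
)
CLIENT = re.compile(r"client=([\d.]+)")


def parse_entries(text: str) -> list:
    """Split text on the 'jvm N |' prefix and parse each log entry."""
    entries = []
    for chunk in re.split(r"jvm \d+\s+\|\s*", text):
        match = ENTRY.match(chunk.strip())
        if not match:
            continue  # skips the shell prompt line and empty chunks
        timestamp, level, logger, message = match.groups()
        client = CLIENT.search(message)
        entries.append({
            "timestamp": timestamp,
            "level": level,
            "logger": logger,
            "client": client.group(1) if client else None,
        })
    return entries


def summarize(entries: list, minute_prefix: str):
    """Count levels and client IPs for entries whose timestamp starts with minute_prefix."""
    levels, clients = Counter(), Counter()
    for entry in entries:
        if not entry["timestamp"].startswith(minute_prefix):
            continue
        levels[entry["level"]] += 1
        if entry["client"]:
            clients[entry["client"]] += 1
    return levels, clients


if __name__ == "__main__":
    import sys
    if len(sys.argv) > 1:
        with open(sys.argv[1], errors="replace") as handle:
            parsed = parse_entries(handle.read())
        print(summarize(parsed, "2018-08-27 02:34"))
```

Filtering on the full "2018-08-27 02:34" prefix rather than grepping for "02:34" avoids picking up unrelated entries such as the 11:02:34 and 13:02:34 lines in the excerpt.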
- relates to
-
RELENG-1378 Refactor deploy-maven-file cmd to pure Python
- Done
-
RELENG-1504 Investigate ODL Nexus issues for the incident on Nov 16, 17
- Done