HACKER Q&A
📣 bognition

Has anyone else seen TCP connection issues in AWS US East this week?


Over the last week we've seen random TCP connection issues in the US-1 East data center in AWS. Has anyone else been seeing this?


  👤 osipovas Accepted Answer ✓
Maybe AWS US East is going to be the region with random failures?

"I want to have an AWS region where everything breaks with high frequency" https://news.ycombinator.com/item?id=24103746


👤 BookPage
It would help if you defined exactly what type of issues you experienced. Packet loss? Early RSTs? Latency? Single AZ or cross AZ? Same for VPCs or NAT'd internet traffic?

You should take some tcpdumps and open a support case.


👤 fossuser
Yes - I've been seeing failures this morning.

Searched around didn't see much on twitter beyond this: https://twitter.com/Flock/status/1294304262126804993?s=20

We think it's one AZ in us-east-1.


👤 omreaderhn
Yes I have seen sporadic connection issues with various site scraping functions my app employs. I figured it was a widespread issue but I'm glad you made a post about it that basically confirms that.

👤 RulerOf
One instance was randomly powered down about 22 hours ago.

We saw a synthetic monitor failure at midnight. Investigation of the transaction trace shows that a specific code path that should take maybe ~100ms took almost 40000ms.

It could have been unresponsive EBS. Or failure to look up the Redis server's IP address. Or some other infrastructure-level failure. The synthetic browser saw it as a 502.


👤 mweberxyz
Perhaps related: our load tests this week showed an increase in 502s from the ALB. The app server request logs indicate those requests never made it from the ALB.

👤 renewiltord
We had a full `us-west-2` 30 minute network drop-out this week. CloudTrail shows nothing.

👤 syllableai
We are seeing sporadic connection issues where tcp syn packets are dropped before reaching our elb. Have noticed off and on for a few weeks now. Still investigating and have support ticket out with aws.

👤 bassman9000
Not as much as TCP issues, but increased API call failure, across multiple services (cloudformation, ec2, rds), yes, on us-east-1. Mind you, still pretty low rate, but enough to notice some pattern.

👤 wgyn
We had a few minutes earlier this week where a machine saw packets in/out go to zero for no discernible reason.

👤 jeppesen-io
I've not seen anything across 4 AZs

👤 sg47
Someone asked for a region where everything breaks all the time.

👤 jscheel
Yep, had issues yesterday. Botched a big deploy for me too.

👤 WaxProlix
In any specific AZ or across the board?