Question-and-Answer Resource for the Building Energy Modeling Community

Restart an AWS PAT?

asked 2019-03-14 15:43:23 -0500

pow_skier

Hello,

My connection to AWS seems to time out. I've been trying to do fairly large Design of Experiments based algorithmic runs, some of which need to run overnight. Twice now PAT has stopped midstream: the connection to AWS remains open, but I no longer get any progress on models. I'm using PAT version 2.7.0 and AMI 2.7.1. In this case it hung at around 22% complete of a 10,240-case simulation. Thank you for any insight.

Regards.


Comments


Is it just PAT that is unresponsive, or is the server on AWS also unresponsive when you open it in a web browser?

David Goldwasser ( 2019-03-15 08:17:58 -0500 )

What's your server instance type? How many datapoints are you running, and how big is each datapoint file?

BrianLBall ( 2019-03-15 10:13:58 -0500 )

The server instance was m3.2xlarge, with 10,240 datapoints, and each run is approximately 50-100 MB.

The server seems alive: in the EC2 monitoring I can see that CPU activity is idle. It seems like PAT just stops sending more run instructions. The Resque monitoring shows no activity.

pow_skier ( 2019-03-15 10:24:13 -0500 )

Also try '2.7.1-largescale1'. This AMI has some load-balancing changes that keep the server node from getting overloaded with worker processes, which can make it unresponsive.

BrianLBall ( 2019-03-15 10:24:29 -0500 )

Well, 10,240 datapoints at 50 MB each is 512 GB, so your instance probably ran out of disk space. You can verify that by using the server.pem key to ssh into the server node and running 'df -h'. The user name is 'ubuntu'.

BrianLBall ( 2019-03-15 10:30:02 -0500 )
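
For anyone wanting to repeat the disk-space estimate from the comment above for their own run, here is a minimal Python sketch. The function names are made up for illustration, it assumes decimal units (1 GB = 1000 MB), and it assumes every datapoint's results stay on the server disk. The free-space helper is only a rough stand-in for 'df -h' and would need to be run on the server node itself.

```python
import shutil

MB_PER_GB = 1000  # assumption: decimal units, so 10,240 x 50 MB comes out as 512 GB


def estimate_storage_gb(datapoints: int, mb_per_datapoint: float) -> float:
    """Estimated disk space (GB) if every datapoint's results are kept on disk."""
    return datapoints * mb_per_datapoint / MB_PER_GB


def free_space_gb(path: str = "/") -> float:
    """Free space (GB) on the filesystem holding `path`; similar in spirit to 'df -h'."""
    return shutil.disk_usage(path).free / 1e9


if __name__ == "__main__":
    total_datapoints = 10_240
    # The question reports 50-100 MB per run, so estimate both ends of the range.
    for size_mb in (50, 100):
        needed = estimate_storage_gb(total_datapoints, size_mb)
        print(f"{size_mb} MB per datapoint -> about {needed:.0f} GB in total")
    print(f"Free space on this machine: {free_space_gb('/'):.1f} GB")
```

If the estimate is well above the free space reported on the server node, running out of disk is a plausible explanation for the stall, though this sketch cannot confirm it on its own.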

1 Answer


answered 2019-03-15 10:39:59 -0500

pow_skier

"Maybe I had too much too fast. Or just overplayed the part. Nothing shakin..."


Last updated: Mar 15 '19