IRC logs for #aegir, 2016-07-27 (GMT)

2016-07-26
2016-07-28
TimeNickMessage
[10:01:18]* hestenet has quit (Remote host closed the connection)
[10:03:40]* theMusician has joined #aegir
[10:13:54]* ponies has quit (Ping timeout: 250 seconds)
[10:26:21]* theMusician has quit (Quit: theMusician)
[10:33:38]* theMusician has joined #aegir
[10:40:07]* hestenet has joined #aegir
[10:41:23]* hestenet has quit (Remote host closed the connection)
[10:49:48]* gusaus has quit (Quit: gusaus)
[11:06:48]* theMusician has left #aegir ()
[11:22:30]* mstenta has quit (Ping timeout: 258 seconds)
[11:45:46]* gusaus has joined #aegir
[12:16:38]* stijnvbrande has quit (Quit: Connection closed for inactivity)
[13:38:07]* g1i7ch has quit (Quit: Leaving)
[14:30:39]* gusaus has quit (Ping timeout: 244 seconds)
[14:36:54]* gusaus has joined #aegir
[15:30:05]* mengi has quit (Quit: Leaving.)
[15:58:37]* boshtian has joined #aegir
[16:08:29]* gusaus has quit (Quit: gusaus)
[16:14:12]* stijnvbrande has joined #aegir
[16:49:03]* drupol has quit (Ping timeout: 250 seconds)
[16:52:04]* stijnvbrande has quit (Read error: Connection reset by peer)
[16:52:40]* drupol has joined #aegir
[16:54:01]* stijnvbrande has joined #aegir
[17:22:04]* elijah has quit (Ping timeout: 240 seconds)
[17:23:51]* elijah has joined #aegir
[17:26:13]* drakythe is now known as zz_drakythe
[17:52:08]* g33kg1rl has joined #aegir
[17:55:58]<g33kg1rl>Hi there, I am processing the latest upgrade for aegir3. It is hanging on reconfiguring aegir3-hostmaster.
[17:56:49]<g33kg1rl>It gets stuck on Platforms path /var/aegir/platforms is writable [success]
[17:57:10]<g33kg1rl>Has anyone encountered this issue?
[17:57:18]<viashimo>g33kg1rl: is there any extra information if you run it with --verbose --debug ?
[17:57:23]<viashimo>g33kg1rl: are you using the debian package?
[17:58:11]<g33kg1rl>I checked htop and it seems to be running the hosting-pause and hostmaster-migrate commands in the background, but they aren't using any memory or cpu.
[17:58:15]<g33kg1rl>I am using debian
[17:58:35]<g33kg1rl>This was encountered when doing apt-get upgrade
[18:00:19]<viashimo>g33kg1rl: hmm I don't think I've run into this issue (at 3.6 right now)
[18:00:33]<viashimo>I would try env DPKG_DEBUG=developer sudo apt-get install aegir
[18:01:08]<viashimo>anyway, to try and the apt upgrade to run again with extra info
[18:01:42]<viashimo>http://aegir.readthedocs.io/en/3.x/install/#8-troubleshooting-the-install
[18:03:52]<g33kg1rl>It says that dpkg was interrupted and it won't run any apt-get commands
[18:07:14]<viashimo>hrmm, maybe with DPKG_DEBUG=developer dpkg --configure aegir
[18:07:31]<viashimo>I don't really know dpkg etc all that well
[18:08:10]* kvanderw has quit (Ping timeout: 244 seconds)
[18:08:33]<viashimo>I've used something similar in the past to restart updates to the aegir package that had failed, but I don't recall the exact commands
[18:08:57]* kvanderw has joined #aegir
[18:19:55]<g33kg1rl>OK I was able to get that to work
[18:20:05]<g33kg1rl>Now it is hanging on Executing: mysql --defaults-extra-file=/tmp/drush_07o4Pu --database=odinsnrgus_0 --host=localhost --port=3306 --silent < /tmp/drush_ixjEKl
[18:25:07]<viashimo>you might be able to see what it's trying to do in mysql by using "show processlist" or "show full processlist" in an sql client
[18:27:18]<g33kg1rl>Thank you for being so helpful, I really do appreciate it :) I ran that new command and it says is running three separate processes on the hostmaster database
[18:27:27]<g33kg1rl>the command column says Sleep
[18:28:24]<viashimo>I think that means the connection are open but idle
[18:29:41]<g33kg1rl>So basically it is stuck XD
[18:30:22]<viashimo>haha yeh. is the mysql command it printed out listed in 'ps ax' ?
[18:30:44]<viashimo>or I guess with ps axf you can see the processes under the dpkg command you issued
[18:32:09]<viashimo>you could see what the processes are trying to do with "sudo strace -p PID"
[18:32:23]<viashimo>but whenever I run that I'm usually at the grasping at straws point :p
[18:35:10]<g33kg1rl>Most definitely at that point right now, still I appreciate the help. I will check out those commands
[18:36:32]<viashimo>g33kg1rl: np
[18:48:36]* gandhiano has joined #aegir
[19:01:16]* g33kg1rl has quit (Ping timeout: 250 seconds)
[19:15:17]* g33kg1rl has joined #aegir
[19:16:02]<g33kg1rl>viashimo: are you still here? Could you send me those last commands the env debug command, I was logged out of IRC and hadn't copied those to a safe place
[19:20:00]<viashimo>g33kg1rl: "ps axf" list the processes in a tree (helpful to find what's running under dpkg)
[19:20:20]<viashimo>g33kg1rl: "sudo strace -p PID" attach to a running process and see what system calls it's running
[19:20:56]<viashimo>g33kg1rl: and "DPKG_DEBUG=developer dpkg --configure aegir"
[19:21:19]<viashimo>g33kg1rl: sorry I couldn't be more helpful
[19:36:15]<g33kg1rl>So the strace provided this over and over poll([{fd=6, events=POLLIN|POLLPRI}], 1, 0) = 0 (Timeout) sendto(6, "p\0\0\0\3SELECT COUNT(t.vid) FROM ho"..., 116, 0, NULL, 0) = 116 recvfrom(6, "\1\0\0\1\1\"\0\0\2\3def\0\0\0\fCOUNT(t.vid)\0\f?"..., 16384, 0, NULL, NULL) = 67 poll([{fd=6, events=POLLIN|POLLPRI}], 1, 0) = 0 (Timeout) sendto(6, "x\0\0\0\3SELECT count(t.nid) FROM no"..., 124, 0, NULL, 0) = 124 recvfrom(6, "\1\0\0\1\1\"\0\0\2\3def
[19:36:33]<g33kg1rl>(there is more but it looks like there is a limit to what I can post)
[19:40:10]<viashimo>kind of looks like it's trying to run some sql
[19:40:21]<viashimo>but you didn't see anything in the process list
[19:41:28]<viashimo>g33kg1rl: does that look like anything in the /tmp/drush_ixjEKl file?
[19:42:53]* g33kg1rl has quit (Quit: Page closed)
[19:44:53]* g33kg1rl has joined #aegir
[19:45:25]<g33kg1rl>I don't know if you got my last messages... my internet has been funky today
[19:45:59]<viashimo>g33kg1rl: I don't think so, last I saw was the past of the strace
[19:46:03]<viashimo>paste*
[19:46:23]<g33kg1rl>the only thing in the tmp file is Show Tables
[19:46:43]<g33kg1rl>so it is stuck on mysql trying to show the tables for the hostmaster database
[19:47:25]<viashimo>strangeness
[19:47:59]<viashimo>you could try killing the mysql command (kill PID) and see if that unblocks dpkg
[19:48:12]<viashimo>I don't know if it will just fail, or try to soldier on
[19:48:51]<g33kg1rl>Thanks for the suggestions :)
[19:54:46]<g33kg1rl>Well I did that and it got a lot further than it usally does, but then it errors out with not being able to connect to the database.
[19:59:41]<viashimo>I don't really have any more ideas right now; hopefully someone else will be able to help a bit more
[20:03:45]<g33kg1rl>No problem, I just figured I would put it out there in case anyone could think of anything :)
[20:14:37]* julienfayad has joined #aegir
[20:28:48]* gandhiano has quit (Ping timeout: 258 seconds)
[20:32:49]* gandhiano has joined #aegir
[20:33:37]<g33kg1rl>OK I figured out the solution!
[20:34:02]<g33kg1rl>I had to do a manual upgrade then run the dpkg command again
[20:38:00]* gandhiano has quit (Ping timeout: 258 seconds)
[20:38:52]* g33kg1rl has quit (Quit: Page closed)
[21:44:33]* gandhiano has joined #aegir
[22:17:58]* bgm_ is now known as bgm
[22:26:30]* zombiebeard has joined #aegir
[22:42:07]* julienfayad has quit (Quit: julienfayad)
[22:48:46]* julienfayad has joined #aegir
[23:27:03]* mstenta has joined #aegir
[23:35:31]* gandhiano has quit (Ping timeout: 265 seconds)
[00:56:11]* hestenet has joined #aegir
[01:03:13]* boshtian has quit (Quit: boshtian)
[01:09:38]* gandhiano has joined #aegir
[01:10:58]* theMusician has joined #aegir
[02:17:48]* g1i7ch has joined #aegir
[02:35:00]* gandhiano has quit (Remote host closed the connection)
[03:11:56]* gusaus has joined #aegir
[03:12:22]* julienfayad has quit (Quit: julienfayad)
[03:26:38]* stijnvbrande has quit (Quit: Connection closed for inactivity)
[03:45:52]* g1i7ch has quit (Ping timeout: 240 seconds)
[03:46:50]* g1i7ch has joined #aegir
[03:57:47]* dean has joined #aegir
[04:51:07]* mengi has joined #aegir
[05:05:54]* theMusician has quit (Quit: theMusician)
[05:12:57]* anarcat has quit (Quit: rebooting)
[05:21:48]* anarcat has joined #aegir
[05:39:34]* anarcat has quit (Quit: rebooting)
[05:42:59]* tree_ has joined #aegir
[05:54:28]* g1i7ch has quit (Ping timeout: 264 seconds)
[05:54:38]* g1i7ch has joined #aegir
[05:54:42]* tree_ is now known as millenniumtree
[05:55:35]* julienfayad has joined #aegir
[06:01:07]* theMusician has joined #aegir
[06:19:08]* julienfayad has quit (Quit: julienfayad)
[06:25:12]* boshtian has joined #aegir
[06:31:43]* boshtian has quit (Quit: boshtian)
[06:50:42]* anarcat has joined #aegir
[07:27:32]* formatC_vt has quit (Ping timeout: 240 seconds)
[07:27:56]* formatC_vt has joined #aegir
[07:27:57]* formatC_vt has quit (Changing host)
[07:27:57]* formatC_vt has joined #aegir
[07:46:53]<g1i7ch>I was trying to upgrade a platform with "drush up" and received this error: Error: Class 'EntityCacheUserController' not found in project_kickstart/includes/common.inc, line 8015. Anyone know what's up?
[07:50:13]<colan>g1i7ch: maybe a problem with kickstart?
[07:50:46]<g1i7ch>You think it might be distro specific??
[07:50:58]<g1i7ch>I'll dig a little deeper
[07:51:12]<colan>well, it does say that in the error ;)
[07:51:14]* roycroft has joined #aegir
[07:51:22]<roycroft>hello again, aegir folk
[07:51:36]<roycroft>i'm making progress with my aegir upgrade/migration
[07:51:56]<g1i7ch>thanks
[07:51:59]<colan>g1i7ch: try it without aegir maybe.
[07:52:11]<roycroft>what we are doing is migrating platforms from our current (aegir 2) machine to a new one running aegir 3.x
[07:52:18]<roycroft>that's going well
[07:52:36]<roycroft>but the next step will be to move our sites from the old aegir machine to the new one
[07:52:47]<roycroft>i'm not sure what the best approach is for that
[07:52:54]<roycroft>does anyone have insights on that?
[07:53:00]<roycroft>we would move one site at a time
[07:54:32]<roycroft>and to be clear, the websites themselves are not necessarily hosted on one of the aegir machines - they mostly live on other machines
[07:54:49]<roycroft>i just need to migrate management of each site from the old aegir machine to the new one
[07:55:02]<roycroft>the old aegir machine will be repurposed when this is complete
[08:00:30]<viashimo>roycroft: hosting_remote_import supports importing into aegir 3.x from aegir 2.x installations
[08:00:44]<viashimo>roycroft: otherwise you can probably script it with drush
[08:01:39]<viashimo>then the only trick is managing dns changes / downtimes for content modification etc.
[08:03:25]<viashimo>there's caveats depending on your setup: if provision-backup doesn't give you the entire site directory (with files, modules, etc...) then remote import isn't so useful
[08:10:26]* zombiebeard has quit (Quit: zombiebeard)
[08:26:22]* theMusician has quit (Quit: theMusician)
[08:39:11]<g1i7ch>colan: It looks like it was the distro. It looks like the platform was compromised, as it was using 7.43 core. So I downloaded a fresh copy, updated with drush and now I'm migrating the sites.
[08:52:33]* mstenta has quit (Ping timeout: 240 seconds)
[08:54:49]* hestenet has quit (Remote host closed the connection)
[09:01:29]* theMusician has joined #aegir
[09:25:42]<colan>g1i7ch: great.
[09:37:04]* g1i7ch has quit (Ping timeout: 240 seconds)
[09:37:10]* g1i7ch has joined #aegir
[09:56:56]* theMusician has quit (Quit: theMusician)
[09:58:25]* theMusician has joined #aegir