IRC logs for #aegir, 2017-03-17 (GMT)

2017-03-16
2017-03-18
TimeNickMessage
[12:47:21]* v20th has quit (Quit: Leaving)
[15:51:53]* s-mutin has quit (Read error: Connection reset by peer)
[15:52:17]* s-mutin has joined #aegir
[15:54:38]* s-mutin has quit (Read error: Connection reset by peer)
[15:55:05]* s-mutin has joined #aegir
[16:15:36]* s-mutin has quit (Read error: Connection reset by peer)
[16:16:02]* s-mutin has joined #aegir
[16:19:45]* s-mutin has quit (Read error: Connection reset by peer)
[16:20:08]* s-mutin has joined #aegir
[16:21:06]* s-mutin has quit (Read error: Connection reset by peer)
[16:21:33]* s-mutin has joined #aegir
[19:26:27]* roycroft has quit (Ping timeout: 240 seconds)
[00:21:46]* ergonlog1c has joined #aegir
[00:36:40]* v20th has joined #aegir
[00:46:48]* roycroft has joined #aegir
[01:07:38]* shaneonabike has joined #aegir
[01:10:33]* shaneonabike has left #aegir ()
[01:47:55]* ybabel has joined #aegir
[02:18:12]* theMusician has joined #aegir
[03:05:21]* theMusician has quit (Ping timeout: 268 seconds)
[03:08:22]* theMusician has joined #aegir
[03:47:50]* fatguylaughing has joined #aegir
[04:03:28]<colan>helmo: still around?
[04:04:21]<colan>on one upgrade, it's hanging on "Platforms path /var/aegir/platforms is writable."
[04:05:36]<colan>not too sure how to debug that. running "env DPKG_DEBUG=yes sudo apt full-upgrade" doesn't actually produce additional output.
[04:05:47]<colan>ergonlog1c: ^^^
[04:07:32]<colan>bgm: ^^^
[04:12:29]<bgm>strace the process?
[04:12:48]<bgm>although i've run into that, usually some weird drupal cache or inifite loop :(
[04:13:03]<bgm>ex: when hosting_https would cause infinite loops
[04:13:09]<bgm>but haven't seen that on recent upgrades
[04:13:57]* ybabel has quit (Quit: ybabel)
[04:15:44]<colan>bgm: what does this mean?
[04:15:44]<colan>pselect6(40, [0 37 39], NULL, NULL, {0, 50000000}, {[], 8}) = 0 (Timeout)
[04:15:44]<colan>wait4(2727, 0x7ffe05af7cec, WNOHANG, NULL) = 0
[04:15:57]<colan>Just keeps repeating over & over.
[04:16:12]* colan isn't much of a sysadmin
[04:22:26]* theMusician has quit (Quit: theMusician)
[04:24:49]* theMusician has joined #aegir
[04:37:25]<bgm>hmm pselect() is for I/O on file descriptors
[04:38:01]<bgm>so i guess it's not an infinite loop in the code, although really hard to tell
[04:50:00]<colan>"sudo lsof -p 2982 -ad 0,37,39" gives me:
[04:50:00]<colan>COMMAND PID USER FD TYPE DEVICE SIZE/OFF NODE NAME
[04:50:01]<colan>apt 2982 root 0u CHR 136,3 0t0 6 /dev/pts/3
[04:50:01]<colan>apt 2982 root 37u CHR 5,2 0t0 86 /dev/ptmx
[04:50:01]<colan>apt 2982 root 39r FIFO 0,10 0t0 21343 pipe
[04:50:01]<colan>...which I'm not sure what to do with.
[05:06:03]<viashimo>is apt waiting for user input for some reason?
[05:48:18]<helmo42[m]>Could that be hosting-pause? It could be waiting for a task to complete
[06:09:12]<ergonlog1c>colan: I think you need "env DPKG_DEBUG=developer ..."
[06:09:41]<ergonlog1c>someone else metioned a similar issue recently on upgrading to 3.9
[06:10:15]<ergonlog1c>so this might be caused by a change in the OS
[06:13:51]<colan>viashimo: nope.
[06:14:01]<colan>helmo: i never used it so i doubt it.
[06:14:15]<colan>ergonlog1c: still no extra output.
[06:15:01]<colan>what's weird is this worked fine on 5 aegirs, including staging for the prod site i'm working on. it's low traffic, but man, that's annoying.
[06:15:23]<ergonlog1c>http://docs.aegirproject.org/en/3.x/install/#8-troubleshooting-the-install
[06:15:28]<ergonlog1c>try apt-get
[06:17:42]* theMusician has quit (Quit: theMusician)
[06:20:56]<colan>ergonlog1c: that's doesn't work at all. i get:
[06:20:56]<colan>The following packages have unmet dependencies:
[06:20:56]<colan> aegir : Depends: aegir-provision (>= 1.12) but it is not going to be installed
[06:20:56]<colan> Depends: aegir-hostmaster (>= 1.12) but it is not going to be installed
[06:21:41]<colan>"env DPKG_DEBUG=developer sudo apt-get dist-upgrade" and "env DPKG_DEBUG=developer sudo apt full-upgrade" will re-run, but then get stuck at the same spot, with no extra output.
[06:31:40]<colan>ergonlog1c: the last command the shows up in /tmp/drush_blah is "SHOW TABLES". So looks similar to https://www.drupal.org/node/2773223
[06:31:41]<hefring>https://www.drupal.org/node/2773223 => Aegir Upgrade Stuck After Platforms Path Is Writable [#2773223] => 4 comments, 1 IRC mention
[06:41:34]<roycroft>hey folks
[06:41:37]<roycroft>a quick question, if i may
[06:41:47]* theMusician has joined #aegir
[06:41:51]<roycroft>on a hosting only server what do i need to instal for the maegir master to push sites to it?
[06:42:07]<ergonlog1c>colan: "aegir3", rather than jut "aegir" (the docs need an update there
[06:42:09]<roycroft>aegir3 and aegir3-rpovision?
[06:42:15]<roycroft>provision, rather
[06:42:18]<roycroft>or just aegir3?
[06:42:45]<ergonlog1c>roycroft: aegir3-clusternode, iirc
[06:42:51]<roycroft>oh
[06:42:54]<roycroft>so none of the above :)
[06:43:26]<ergonlog1c>roycroft: sorry, that's aegir3-cluster-slave
[06:43:41]<viashimo>colan: I guess one thing looks kinda weird to me with the unmet dependencies: usually the new packages are aegir3-X right?
[06:43:41]<roycroft>right, i just determined that with apt-cache search
[06:44:12]<roycroft>that in addition to aegir3 and aegir3-provision?
[06:44:30]<ergonlog1c>roycroft: no
[06:44:31]<roycroft>i know i'm not going to install aegir3-hostmaster on the production machines
[06:44:34]<roycroft>ok, thanks
[06:44:41]* roycroft will be doing that stuff after lunch
[06:44:48]<ergonlog1c>you don't need much more that a web server on remote, generally
[06:44:51]<roycroft>right
[06:44:58]<roycroft>web server, mysql, rsync
[06:45:12]<roycroft>and an aegir account for the hostmaster use for pushing things
[06:45:35]<roycroft>and php of course :)
[06:45:37]<ergonlog1c>you can pre-seed the SSH key: http://cgit.drupalcode.org/provision/plain/debian/aegir3-cluster-slave.t...
[06:46:04]<colan>ergonlog1c: ok, thanks, runs with aegir3, but still no extra output. tried both "yes" and "developer".
[06:46:07]<roycroft>probably drush as well would be good
[06:46:22]<ergonlog1c>roycroft: not really
[06:46:52]<ergonlog1c>colan: well, that in itself is odd
[06:47:05]<roycroft>my boss likes to change the admin password for websites and then forget it
[06:47:41]<roycroft>drush can reset it just like it does on the aegir master, can't it?
[06:47:53]<roycroft>i know i can do that in the database as well, but drush is easier
[06:48:31]<roycroft>and since i have to reset them for him all the time i like easy :)
[06:48:57]<roycroft>but i do like to keep production machines as lean as possible
[06:49:07]<roycroft>so maybe i'll skip drush, at least for now
[06:49:51]<ergonlog1c>roycroft: Aegir runs a hub&spoke network model. So all the tasks are run locally and then pushed out tot he remote
[06:49:58]<ergonlog1c>the remotes are "dumb"
[06:50:21]<colan>ergonlog1c: is it possible that env won't work on stable / non-dev installs?
[06:50:32]<ergonlog1c>for example, when you build a platform on a remote, Aegir will build it locally on the master, then rsync it out to the remote
[06:51:11]<ergonlog1c>colan: no... it's an option passed to the debian package that should make everything much more verbose
[06:51:53]<ergonlog1c>roycroft: so, generally, you don't want to grant access to remotes, since changes there may well get overwritten
[06:52:37]<ergonlog1c>"drush @sitename uli" would generally only work on the master server, iirc
[06:53:14]<ergonlog1c>I know there was some work last year to make stand-alone remotes the canonical source, and so rsync back to the master after initial deployment
[06:53:29]<ergonlog1c>but I don't recall ever using that functionality myself
[06:54:19]<ergonlog1c>I usually only ever run remotes as clusters for high-availability and performance
[07:02:58]<ergonlog1c>colan: could it be a utf8mb4 issue? maybe mysql default config changed or something
[07:04:05]<helmo>colan: still a hanging upgrade?
[07:04:19]<colan>helmo: yeah :(
[07:04:41]<colan>ergonlog1c: well, it worked on staging...
[07:05:04]<helmo>when you look at the task queue ... is there anything in there?
[07:06:07]<helmo>I'v had it before on a box where it turned out that the queued hadwas not running ... so a few tasks were pening and the upgrade was waiting for them to finish
[07:07:51]<colan>helmo: there was a task running, but deleting it didn't help.
[07:08:04]<colan>also restarted hosting-queued
[07:10:03]* hestenet has quit ()
[07:13:01]<helmo>colan: what does `drush @hostmaster php-eval "echo (hosting_task_count() + hosting_task_count_running())"` return?
[07:13:33]<colan>helmo: 1
[07:14:50]<colan>so running() is returning 1.
[07:15:16]<helmo>colan: then your waiting for drush_hosting_pause_validate() in pause.hosting.inc
[07:15:45]<colan>helmo so how to flush?
[07:16:28]<helmo>find the task and delete it .... status HOSTING_TASK_PROCESSING (-1) or HOSTING_TASK_QUEUED (0)
[07:23:39]<colan>found a -1 and deleted it. re-running full-upgrade.... seems to have passed that spot....
[07:25:46]<colan>helmo: thanks!! the upgrade finished, and there don't seem to be any problems. now i want to document this somewhere.
[07:27:34]<colan>Can we add a log message at that spot, something like, "Waiting for running and queued tasks to terminate..."?
[07:27:50]<colan>So if it hangs there, it'll be obvious what's going on.
[07:29:01]<helmo42[m]>It must be the level... There is such a message.
[07:34:14]* ergonlog1c wonders where the "troubleshooting" section of our docs went...
[07:34:36]<ergonlog1c>colan: that's definitely a good 'troubleshooting' entry
[07:35:01]<colan>ergonlog1c: yeah, but it would go away if the log entry actually showed up. ;)
[07:35:59]<colan>how to turn drush_log() into something that's always present?
[07:39:14]<colan>hmm. 'message' doesn't shows up, but 'success' does.
[07:41:10]<colan>ergonlog1c: i think the problem is that the message shows up with debugging on, but debugging mesgs can't be turned on, at least in my case.
[07:57:42]* v20th has quit (Quit: Leaving)
[07:58:33]<colan>https://www.drupal.org/node/2861696
[07:58:34]<hefring>https://www.drupal.org/node/2861696 => Extra output via debugging messages cannot be enabled on Debian package upgrades [#2861696] => 0 comments, 1 IRC mention
[08:44:42]* theMusician has quit (Quit: theMusician)