<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<meta name="Generator" content="Microsoft Word 15 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
{font-family:Verdana;
panose-1:2 11 6 4 3 5 4 4 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
font-size:11.0pt;
font-family:"Calibri",sans-serif;}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
span.EmailStyle18
{mso-style-type:personal-reply;
font-family:"Calibri",sans-serif;
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;
font-family:"Calibri",sans-serif;}
@page WordSection1
{size:8.5in 11.0in;
margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
{page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang="EN-US" link="blue" vlink="purple" style="word-wrap:break-word">
<div class="WordSection1">
<p class="MsoNormal">Thanks for putting more care into the answer than I did the question.
<span style="font-family:"Segoe UI Emoji",sans-serif">😊</span> I don’t have a problem here, was just riffing on the conversation. I suspected the way to do this was use a latent representation and since Jon is talking about conformations, perhaps he already
has it.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal"><b>From:</b> Friam <friam-bounces@redfish.com> <b>On Behalf Of
</b>Roger Critchlow<br>
<b>Sent:</b> Tuesday, October 12, 2021 10:17 PM<br>
<b>To:</b> The Friday Morning Applied Complexity Coffee Group <friam@redfish.com><br>
<b>Subject:</b> Re: [FRIAM] Schwill Rock?<o:p></o:p></p>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<div>
<p class="MsoNormal">As long as you stick to basic CNO type chemistry, you can probably figure out a valid SMILES generator pretty quickly. But it might be easier to work in the explicit molecule graph instead and just use SMILES as a content name string.
The things that might go wrong after the SMILES input parses correctly are 1) a ring that's strained because it's too tight a turn, 2) steric hindrance between bulky sidechains occupying the same space, and 3) something that no one has any idea how to make.
You could probably notice 1) and 2) if you track the average energy per bond.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal">The benzodiazepines are a classic example of structuring the search around a core framework and substituting side chains onto the core. That's the way the chemists do it in the wetlab.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal">OpenEyes (<a href="https://www.eyesopen.com/">https://www.eyesopen.com/</a>) is running some online events this month that might be interesting.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal">-- rec --<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<div>
<p class="MsoNormal">On Tue, Oct 12, 2021 at 5:44 PM Marcus Daniels <<a href="mailto:marcus@snoutfarm.com" target="_blank">marcus@snoutfarm.com</a>> wrote:<o:p></o:p></p>
</div>
<blockquote style="border:none;border-left:solid #CCCCCC 1.0pt;padding:0in 0in 0in 6.0pt;margin-left:4.8pt;margin-right:0in">
<div>
<div>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto">A search algorithm that, say, proposes a prefix or a suffix to a SMILES string would need to have a way to autocomplete candidates before it could use these descriptors to guide
an optimization because the parsing step is non-trivial, never mind the sanitization step (mentioned on that web page). <o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"> <o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto">I will deflect blame on Jon for changing the topic from music to chemicals, but presumably with enough debate their aesthetic preferences in music could be formalized in theory
or some rule-based way, as is manifest here.<o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"> <o:p></o:p></p>
<div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"><b>From:</b> Friam <<a href="mailto:friam-bounces@redfish.com" target="_blank">friam-bounces@redfish.com</a>>
<b>On Behalf Of </b>Roger Critchlow<br>
<b>Sent:</b> Tuesday, October 12, 2021 2:10 PM<br>
<b>To:</b> The Friday Morning Applied Complexity Coffee Group <<a href="mailto:friam@redfish.com" target="_blank">friam@redfish.com</a>><br>
<b>Subject:</b> Re: [FRIAM] Schwill Rock?<o:p></o:p></p>
</div>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"> <o:p></o:p></p>
<div>
<div>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto">Hmm, when I was in the drug discovery canal, the "descriptors" that you could calculate from a SMILES string were legion. <o:p></o:p></p>
<div>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"> <o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto">Here's the list for RDKIT, <a href="https://www.rdkit.org/docs/GettingStartedInPython.html#list-of-available-descriptors" target="_blank">https://www.rdkit.org/docs/GettingStartedInPython.html#list-of-available-descriptors</a>.
There are one bunch that depend entirely on the formula and molecular structure. Then there's a whole other bunch you can compute if you generate 3d structures for the molecules, possibly multiplied by the number of low energy structures the molecule can
adopt.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"> <o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto">What kind of plausibility were you looking for? Does the SMILES string specify a real molecule? That's hard. There are syntax errors in SMILES, failures to close rings, valency
errors, charge errors. But there are lots of syntactically valid SMILES that won't match any known molecule, either because they're impossible or as yet to be determined. The pharmas all have their own lists of molecules of interest, but those are proprietary.
Looks like there are various online databases, none that I'm familiar with. If the SMILES parses, you can try generating a 2d depiction and a 3d structure. Those will throw exceptions if things get too weird.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"> <o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto">-- rec --<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"> <o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto">On Tue, Oct 12, 2021 at 3:22 PM Marcus Daniels <<a href="mailto:marcus@snoutfarm.com" target="_blank">marcus@snoutfarm.com</a>> wrote:<o:p></o:p></p>
</div>
</div>
<div>
<blockquote style="border:none;border-left:solid #CCCCCC 1.0pt;padding:0in 0in 0in 6.0pt;margin-left:4.8pt;margin-top:5.0pt;margin-right:0in;margin-bottom:5.0pt">
<div>
<div>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto">I was playing with RDKIT the other day, and it wasn’t obvious how to get a scalar quantity of plausibility of a molecule. It seems a SMILES string is right or wrong, and then
maybe there are some warnings that can be trapped. However, the benefits for search or fair sampling are different than the needs of correctness checks, which is deeper property. That isn’t quite a fit to the music example where aesthetic considerations
are subjective.<o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"> <o:p></o:p></p>
<div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"><b>From:</b> Friam <<a href="mailto:friam-bounces@redfish.com" target="_blank">friam-bounces@redfish.com</a>>
<b>On Behalf Of </b>Jon Zingale<br>
<b>Sent:</b> Tuesday, October 12, 2021 12:11 PM<br>
<b>To:</b> <a href="mailto:friam@redfish.com" target="_blank">friam@redfish.com</a><br>
<b>Subject:</b> Re: [FRIAM] Schwill Rock?<o:p></o:p></p>
</div>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"> <o:p></o:p></p>
<div>
<div>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"><span style="font-size:12.0pt;font-family:"Verdana",sans-serif;color:#333333">"</span><span style="font-size:12.0pt;font-family:"Arial",sans-serif;color:black">I mean from the perspective
of aesthetics. Understanding why Pandora is messing it up means sampling the deep wells."</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"><span style="font-size:12.0pt;font-family:"Verdana",sans-serif;color:#333333"> </span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"><span style="font-size:12.0pt;font-family:"Arial",sans-serif;color:black">Yes, but not more than one has to. This is why I am advocating for methods like a weighted ensemble. The
working analogy for me comes from drug discovery. It doesn't make a lot of sense to probe the same old sites and the same old conformations.</span><o:p></o:p></p>
</div>
</div>
</div>
</div>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"><br>
.-- .- -. - / .- -.-. - .. --- -. ..--.. / -.-. --- -. .--- ..- --. .- - .<br>
FRIAM Applied Complexity Group listserv<br>
Zoom Fridays 9:30a-12p Mtn UTC-6 <a href="http://bit.ly/virtualfriam" target="_blank">
bit.ly/virtualfriam</a><br>
un/subscribe <a href="http://redfish.com/mailman/listinfo/friam_redfish.com" target="_blank">
http://redfish.com/mailman/listinfo/friam_redfish.com</a><br>
FRIAM-COMIC <a href="http://friam-comic.blogspot.com/" target="_blank">http://friam-comic.blogspot.com/</a><br>
archives:<br>
5/2017 thru present <a href="https://redfish.com/pipermail/friam_redfish.com/" target="_blank">
https://redfish.com/pipermail/friam_redfish.com/</a><br>
1/2003 thru 6/2021 <a href="http://friam.383.s1.nabble.com/" target="_blank">http://friam.383.s1.nabble.com/</a><o:p></o:p></p>
</blockquote>
</div>
</div>
</div>
</div>
<p class="MsoNormal"><br>
.-- .- -. - / .- -.-. - .. --- -. ..--.. / -.-. --- -. .--- ..- --. .- - .<br>
FRIAM Applied Complexity Group listserv<br>
Zoom Fridays 9:30a-12p Mtn UTC-6 <a href="http://bit.ly/virtualfriam" target="_blank">
bit.ly/virtualfriam</a><br>
un/subscribe <a href="http://redfish.com/mailman/listinfo/friam_redfish.com" target="_blank">
http://redfish.com/mailman/listinfo/friam_redfish.com</a><br>
FRIAM-COMIC <a href="http://friam-comic.blogspot.com/" target="_blank">http://friam-comic.blogspot.com/</a><br>
archives:<br>
5/2017 thru present <a href="https://redfish.com/pipermail/friam_redfish.com/" target="_blank">
https://redfish.com/pipermail/friam_redfish.com/</a><br>
1/2003 thru 6/2021 <a href="http://friam.383.s1.nabble.com/" target="_blank">http://friam.383.s1.nabble.com/</a><o:p></o:p></p>
</blockquote>
</div>
</div>
</div>
</body>
</html>