Plemmy Release 0.3.0 - parsing `request.Response` JSON into Python objects · tjkessler/plemmy

travis@lemmy.blue · 1 year ago

Thanks for sharing! These seem to focus on LLMs/transformers, but since they use MLPs I should be able to find a way to adapt them for my use!

travis@lemmy.blue · edit-2 1 year ago

Sort of - the models are able to predict numerical property values given a large amount of data to observe during training. In other words, given the scope of known data, we can extrapolate predictions for new data. The predictive capabilities of the model are only as reliable as the data used to train it, and unfortunately in our case we only have hundreds of samples per property, as opposed to other ML tasks with millions of samples. This highlights how much time it actually takes to find, synthesize, and experimentally test molecules!

Unfortunately neural networks, especially traditional multi-layered feed-forward networks, are often seen as a “black box” approach to regression and classification, where we don’t really understand how a network learns or why its weights are tuned the way they are. Analysis methods have come a long way, but ambiguity still exists.

What we have done, however, is find the statistical significance of specific molecular substructures as they relate to combustion properties. For example, when we trained our models to predict sooting propensity (amount of pollution formed during combustion), we noticed that various algorithms such as random forest regression were putting a heck of a lot more weight into a molecular variable measuring path length (length of carbon chains, number of higher order bonds); from this, we were able to conclude that long-chain hydrocarbons with a higher number of double or triple bonds form more soot, and an idea of what mechanistic pathways we should stay away from when producing bio-oil.

As for fuel-grade molecules, we’ve found that furanic compounds and compounds with cyclohexane substructures generally have equal operating efficiency (cetane number), equal energy density (lower heating value, MJ/kg), operate well in various environments (optimal flash, boiling, and cloud points, deg. C), all while producing much less soot (yield sooting index) compared to diesel fuel. The next step is finding a cheap way to mass produce the stuff!

Recently we’ve started down the rabbit hole of fungus-derived bio-oils, terpenes (yes, those terpenes!) derived from fungus may be useful for use as soot-reducing fuel additives.

travis@lemmy.blue · 1 year ago

TL;DR, I throw a bunch of molecules at a pile of linear algebra, and hope predicted values line up with known experimental values; then I use the pile of linear algebra on novel molecules.

There’s a bit more to it than that, like how to represent molecules in a computer-readable format, generating additional input variables (molecular characteristics), input variable down-selection and/or dimensionality reduction, the specific ML models we use (feed-forward MLPs and graph convolution nets), and how to interpret results as they relate back to combustion.

From a broad perspective, our work is just a small part of a larger push from the Department of Energy to find economically-viable alternative liquid fuels. ML speeds up the process of screening candidate molecules, for example those found in bio-oil resulting from pyrolizing and catalytically-upgrading lignocellulosic biomass or other renewable sources. Our colleagues don’t have to synthesize large samples of many molecules just to test their properties and determine how they will behave in existing engines (a very costly and time-consuming process), instead we predict the properties and behaviors to highlight viable candidates so our colleagues can focus on analyzing those.

These papers (1, 2, 3) best outline the procedures and motivations for this work. PM me if you can’t get access and I’ll send you them!

travis@lemmy.blue · 1 year ago

Postdoc in engineering research - we’re using machine learning to predict chemical properties relevant to combustion, speeding up the discovery of cleaner liquid fuels as we transition away from fossil fuels!

travis@lemmy.blue · 1 year ago

Plemmy Release 0.3.0 - parsing `request.Response` JSON into Python objects · tjkessler/plemmy

travis@lemmy.blue · 1 year ago

Plemmy Release 0.3.0 - parsing `request.Response` JSON into Python objects · tjkessler/plemmy

travis@lemmy.blue · 1 year ago

Plemmy Release 0.3.0 - parsing `request.Response` JSON into Python objects · tjkessler/plemmy

travis@lemmy.blue · edit-2 1 year ago

I use a few used Dell Optiplex 7050 Micros, they’re great for the price (and have a small footprint too!)

Edit: for storage I have a HP MicroServer Gen. 10 plus

travis@lemmy.blue · 1 year ago

Self-hosting lemmy.blue!

travis@lemmy.blue · 1 year ago

YouTube TV and Spotify. There’s a workaround for everything else!

travis@lemmy.blue · 1 year ago

I like the idea of a Reddit-Lemmy cross-poster! It should make transitioning to Lemmy a bit easier for those used to seeing more Reddit-like content.

travis@lemmy.blue · 1 year ago

No problem, happy to help out the fediverse!

travis@lemmy.blue · 1 year ago

I’m not sure, I don’t think there’s a direct way to do this at the moment. Maybe there’s a way using the get_community and get_person_details functions?

travis@lemmy.blue · 1 year ago

GitHub - tjkessler/plemmy: A Python package for accessing the Lemmy API

travis@lemmy.blue · edit-2 1 year ago

GitHub - tjkessler/plemmy: A Python package for accessing the Lemmy API

travis@lemmy.blue · 1 year ago

GitHub - tjkessler/plemmy: A Python package for accessing the Lemmy API

travis@lemmy.blue · 1 year ago

Very cool, I’ll do some digging myself!

travis@lemmy.blue · 1 year ago

Interesting - it would be even more interesting if they provided some metrics as to how much time is being saved compared to RSS.

travis@lemmy.blue · 1 year ago

What’s worse:

Meanwhile Kim Dotcom, the founder of Megaupload, is continuing to fight the U.S. charges and threat of extradition. He has said he expects his former colleagues to testify against him as part of the deal they struck.

travis@lemmy.blue · edit-2 1 year ago

Ortmann was sentenced to 2 years and 7 months while van der Kolk was sentenced to 2 years and 6 months. Each had faced a maximum sentence of 10 years in prison but argued they should be allowed to serve their sentences in home detention.

Does the punishment fit the “crime”?

travis@lemmy.blue · 1 year ago

Joined! PhD with applied ML experience, looking forward to contributing.

travis@lemmy.blue · 1 year ago

He’s gonna run Twitter into the ground like you would your favorite car

travis@lemmy.blue · 1 year ago

Fantastic! An Apollo-like app will make switching so much easier for so many people

travis@lemmy.blue · 1 year ago

+1 for running TrueNAS in a VM, I’ve got one running in Proxmox. Make sure to enable hardware passthrough so TrueNAS has direct access to your drives!

travis@lemmy.blue · 1 year ago

My workflow for setting up a Lemmy instance goes something like this:

asdf

travis@lemmy.blue · 1 year ago

+1 for Docker, specifically Docker Compose. Lemmy probably isn’t the right container to learn Docker with, but once you have the fundamentals down spinning up Lemmy was pretty seamless.