Japan Drug Database

Published: December 1, 2021

In my previous job, I was finding myself doing a lot of desk research about drugs in Japan. For example, when we started a new project about a drug, it was common procedure for me to find the type of this drug, the diseases that the drug is indicated for (indications), the action mechanism, other drugs in the same field, the price of it, etc.

However, this becomes quite tedious when you are working with a drug with multiple indications. For example, some cancer drugs treat 10+ different cancer types. For this reason, finding the competitor landscape for each indication and the respective prices & manufacturers of each of these competitor drugs becomes a nightmare.

For this reason, I have decided to create a database of drugs in Japan. The main information I needed to collect for each of these drugs was:

Name of the drug
Manufacturer
Active ingredient
Approval date
Indications
Products (ml / price)

And the idea was that, by clicking on an indication, you could see all drugs that are indicated for that disease. This would make it really easy to find competitors of a certain drug for a certain indication.

Moreover, you could look at all the drugs of a manufacturer (E.g. Pfizer) and see their whole portfolio.

Finally you would get a sweeping view of all indications of a drug, and the products of that drug because some drugs have several regimes (Pill, vaccine, different dossages, etc.)

Data collection

The data is mainly collected from KEGG: Kyoto Encyclopedia of Genes and Genomes. They are doing the heavy-lifting here as they curate all the necessary information, and what we need to do is to get that information, re-organize it across indications and manufacturers and make it easy to search by users.

Data processing

Once the data is collected from KEGG, we need to do some processing which includes, removing old manufacturers and updating with their most current version, organizing data into English and Japanese, representing companies and diseases through IDs, in order to conduct exact search on them. The organized data looks like below:

Above data structure is created for each drug and finally we have a database of drugs that are approved in Japan.

Serving the data

The most difficult part of this project (like most of data science projects) was to get the data in shape. Once we have the organized data, we now need to build a front/ back end to serve this data to users.

For this, we use NEXT.JS to build a Single-page-app to create an interface that lets the user search among drugs, competitors, indications and view results: