CancerDB builds one main CSV file. cancerdb.csv
contains 1447 rows and 86 columns and is 394.8KB uncompressed. Every row is an entity and every entity is one row.
Index | Column | Values | Coverage | Example | Description | Source | Definition |
---|---|---|---|---|---|---|---|
1 | title | 1447 | 100% | National Cancer Institute | Title of the entity. | name.grammar | |
2 | uscsCasesPerYear | 1447 | 100% | 1679621404000 | Cumulative number of cases per year for a cancer type in the U.S. Cancer Statistics data set. | uscs.grammar | |
3 | uscsDeathsPerYear | 1447 | 100% | 1679621404000 | Cumulative number of deaths per year for a cancer type in the U.S. Cancer Statistics data set. | uscs.grammar | |
4 | type | 1447 | 100% | cancerType | What kind of thing is this entity? | type.grammar | |
5 | tissue | 855 | 59% | Lymphoid | Tissues from oncoTree dataset. | oncotree.mskcc.org | oncoTree.grammar |
6 | subTypes | 854 | 59% | 0 | How many subtypes of this cancer type? Data from oncoTree. | oncotree.mskcc.org | oncoTree.grammar |
7 | oncoTreeLevel | 854 | 59% | 3 | oncotree.mskcc.org | oncoTree.grammar | |
8 | parentOncoTreeId | 854 | 59% | SOFT_TISSUE | oncotree.mskcc.org | oncoTree.grammar | |
9 | oncoTreeId | 854 | 59% | AA | oncotree.mskcc.org | oncoTree.grammar | |
10 | mainType | 854 | 59% | Mature B-Cell Neoplasms | Types from oncoTree dataset. | oncotree.mskcc.org | oncoTree.grammar |
11 | umls | 542 | 37% | C0238196 | Unified Medical Language System | www.nlm.nih.gov | umls.grammar |
12 | nciCode | 526 | 36% | C40090 | NCI Concepts | ncithesaurus.nci.nih.gov | nci.grammar |
13 | wikipedia | 352 | 24% | https://en.wikipedia.org/wiki/AbbVie | URL of the entity on Wikipedia, if and only if it has a page dedicated to it. | wikipedia.org | wikipedia.grammar |
14 | website | 288 | 20% | http://biogen.com/ | website.grammar | ||
15 | country | 232 | 16% | United States | location.grammar | ||
16 | description | 173 | 12% | Cancer risk calculator | description.grammar | ||
17 | appeared | 127 | 9% | 2014 | history.grammar | ||
18 | city | 117 | 8% | New York, NY | City and state | location.grammar | |
19 | domainName | 112 | 8% | 1plus1cares.com | website.grammar | ||
20 | 101 | 7% | https://twitter.com/cityofhope | twitter.com | twitter.grammar | ||
21 | reference | 100 | 7% | http://www.scielo.org.za/scielo.php?script=sci_arttext&pid=S0256-95742015000200018 | A URL to a reference about the thing. | reference.grammar | |
22 | standsFor | 98 | 7% | If the thing is an acronym what does/did it stand for? | name.grammar | ||
23 | pubChem | 87 | 6% | 148124 | https://pubchem.ncbi.nlm.nih.gov/ | pubchem.ncbi.nlm.nih.gov | pubChem.grammar |
24 | 81 | 6% | https://facebook.com/uhcancercenter | facebook.com | facebook.grammar | ||
25 | kegg | 74 | 5% | D00491 | https://www.genome.jp/kegg/kegg1.html | genome.jp | kegg.grammar |
26 | domainName.registered | 74 | 5% | 1996 | website.grammar | ||
27 | nciImage | 73 | 5% | https://www.cancer.gov/sites/g/files/xnrzdm211/files/styles/cgov_social_media/public/cgov_image/media_image/900/300/files/mayoclinic-article.jpg | URL to the organization's image on the NCI website. | cancer.gov | nci.grammar |
28 | nciLink | 73 | 5% | https://www.cancer.gov/research/infrastructure/cancer-centers/find/mayoclinic | URL to the organization's page on the NCI website. | cancer.gov | nci.grammar |
29 | nciDesignation | 73 | 5% | Comprehensive Cancer Center | Which of the 3 NCI designations does this research center have? | cancer.gov | nci.grammar |
30 | 71 | 5% | https://linkedin.com/company/uhcancercenter | linkedin.com | linkedIn.grammar | ||
31 | routesOfAdministration | 68 | 5% | intravenous | drugs.grammar | ||
32 | drugBank | 67 | 5% | DB01229 | https://en.wikipedia.org/wiki/DrugBank | drugbank.com | drugBank.grammar |
33 | medlinePlus | 66 | 5% | a696031 | https://en.wikipedia.org/wiki/MedlinePlus | medlineplus.gov | medlinePlus.grammar |
34 | tradenames | 65 | 4% | Taxotere && Docecad && Docefrez | Tradenames for the drug. | name.grammar | |
35 | twitter.followers | 61 | 4% | 105000 | How many followers on this platform does this account have? | socialMedia.grammar | |
36 | 58 | 4% | https://instagram.com/uhcancercenter | instagram.com | instagram.grammar | ||
37 | usNewsRank | 46 | 3% | 15 | Where does this hospital rank in the U.S. News cancer rankings? https://health.usnews.com/best-hospitals/rankings/cancer | usnews.com | usNewsRank.grammar |
38 | parentOrganization | 43 | 3% | nlm | What is the parent entity(ies)? | relationships.grammar | |
39 | hostSchool | 43 | 3% | Albert Einstein College of Medicine | What is the host university or college of this research center? | cancer.gov | nci.grammar |
40 | youTubeChannel | 42 | 3% | https://www.youtube.com/UCLAJCCC | URL of the entity's YouTube channel. | youtube.com | youtube.grammar |
41 | aka | 38 | 3% | AAD | Another name for the thing. Entries can have multiple aka lines. | name.grammar | |
42 | cancerTypes | 35 | 2% | breast | Which cancer type(s) does this entity specialize in? | cancerTypes.grammar | |
43 | author | 29 | 2% | Aleksandr Solzhenitsyn | publications.grammar | ||
44 | uscsTable | 26 | 2% | 2019 | Data table from https://www.cdc.gov/cancer/uscs/dataviz/download_data.htm | uscs.grammar | |
45 | gco | 24 | 2% | https://gco.iarc.fr/today/data/factsheets/cancers/11-Liver-fact-sheet.pdf | gco.iarc.fr | gco.grammar | |
46 | uscsMortalityRate | 24 | 2% | 22% | Deaths/cases in USCS data as a percentage between 0-100. | uscs.grammar | |
47 | uscsId | 22 | 2% | Brain and Other Nervous System | The ID of the Cancer Type in the U.S. Cancer Statistics data. | uscs.grammar | |
48 | facebook.followers | 21 | 1% | 110 | How many followers on this platform does this account have? | socialMedia.grammar | |
49 | cancerDotOrg | 20 | 1% | https://cancer.org/cancer/ovarian-cancer.html | cancer.org | cancerDotOrg.grammar | |
50 | amazon | 17 | 1% | https://www.amazon.com/Anticancer-New-Life-David-Servan-Schreiber/dp/0670021644 | amazon.grammar | ||
51 | ein | 14 | 1% | 042263040 | A U.S. Employer Identification Number. | irs.grammar | |
52 | youTubeChannel.followers | 14 | 1% | 17100 | How many followers on this platform does this account have? | socialMedia.grammar | |
53 | subreddit.members.2023 | 13 | 1% | 13959 | primitives.grammar | ||
54 | subreddit | 13 | 1% | https://www.reddit.com/r/BladderCancer | Url of a subreddit(s) for this thing. | reddit.com | reddit.grammar |
55 | cancerDotGov | 13 | 1% | https://www.cancer.gov/types/bladder | cancer.gov | cancerDotGov.grammar | |
56 | linkedin.followers | 12 | 1% | 1119026 | How many followers on this platform does this account have? | socialMedia.grammar | |
57 | charityNavigator | 11 | 1% | https://www.charitynavigator.org/ein/042263040 | charitynavigator.org | charityRegistries.grammar | |
58 | instagram.followers | 11 | 1% | 13 | How many followers on this platform does this account have? | socialMedia.grammar | |
59 | nyse | 10 | 1% | https://www.nyse.com/quote/XNYS:ABBV | URL to the company's ticker on NYSE | company.grammar | |
60 | nasdaq | 10 | 1% | https://www.nasdaq.com/market-activity/stocks/vtrs | URL to the company's ticker on NASDAQ | company.grammar | |
61 | 9 | 1% | https://pinterest.com/adventhealth | pinterest.com | pinterest.grammar | ||
62 | related | 8 | 1% | mycanceriq | What entities are related? This serves as a catch all, and it is better to use a more specific relationship node. | relationships.grammar | |
63 | annualDeathsReport.2020 | 6 | 0% | 136084 | primitives.grammar | ||
64 | annualDeathsReport | 6 | 0% | US https://www.cdc.gov/cancer/dcpc/research/update-on-cancer-deaths/index.htm | Data from an annual report on cancer deaths for a particular country. | annualDeathsReport.grammar | |
65 | greatNonProfits | 5 | 0% | https://greatnonprofits.org/org/ican-international-cancer-advocacy-network | greatnonprofits.org | charityRegistries.grammar | |
66 | guideStar | 4 | 0% | https://www.guidestar.org/profile/13-1919715 | guidestar.org | charityRegistries.grammar | |
67 | company | 3 | 0% | Aiden Industries, LLC | company.grammar | ||
68 | oldName | 3 | 0% | GlaxoSmithKline plc | What is the old name(s) of this thing? | name.grammar | |
69 | youTube | 3 | 0% | https://www.youtube.com/watch?v=YWppfk2Np6A | A URL to a YouTube video about the thing. | youtube.com | youtube.grammar |
70 | closed | 2 | 0% | 2004 | history.grammar | ||
71 | 2 | 0% | https://journals.viamedica.pl/ginekologia_polska/article/download/GP.a2021.0003/55497 | publications.grammar | |||
72 | originCommunity | 2 | 0% | Dilon Technologies | history.grammar | ||
73 | githubRepo | 2 | 0% | https://github.com/cBioPortal/cbioportal | URL to a project on GitHub. | github.com | github.grammar |
74 | isOpenSource | 2 | 0% | false | Is it an open source project? | openSource.grammar | |
75 | englandAndWalesCharityDetails | 2 | 0% | https://register-of-charities.charitycommission.gov.uk/charity-search/-/charity-details/1000739 | register-of-charities.charitycommission.gov.uk | charityRegistries.grammar | |
76 | members.2023 | 2 | 0% | 18000 | primitives.grammar | ||
77 | investorRelationsPage | 2 | 0% | https://www.bms.com/investors.html | company.grammar | ||
78 | pinterest.followers | 2 | 0% | 1100 | How many followers on this platform does this account have? | socialMedia.grammar | |
79 | coursera | 1 | 0% | https://www.coursera.org/learn/breast-cancer-causes-prevention | Link to a course on Coursera. | coursera.com | coursera.grammar |
80 | nextDate | 1 | 0% | March 24, 2023 | events.grammar | ||
81 | journal | 1 | 0% | Evidence-Based Complementary and Alternative Medicine | publications.grammar | ||
82 | isPublicDomain | 1 | 0% | false | Is it a public domain project? | openSource.grammar | |
83 | github | 1 | 0% | https://github.com/thehyve | URL to the organization's GitHub page. | github.com | github.grammar |
84 | eventsPage | 1 | 0% | https://www.iarc.who.int/events/ | website.grammar | ||
85 | publicationFrequency | 1 | 0% | quarterly | How often does the publication come out? | periodicals.grammar | |
86 | wolframAlpha | 1 | 0% | https://www.wolframalpha.com/input?i=skin+cancer | wolframalpha.com | wolframAlpha.grammar |
The table above is also available as csv.