MathGroup Archive 2012

Re: Extracting Information from XBRL Files

  • To: mathgroup at
  • Subject: [mg127066] Re: Extracting Information from XBRL Files
  • From: Gregory Lypny <gregory.lypny at>
  • Date: Thu, 28 Jun 2012 04:00:38 -0400 (EDT)

Thanks Hans,

I'm just flying by the seat of my pants.  I will try your suggestion.  I need compensation tables from the DEF 14A.  I spoke to an SEC representative yesterday, and she told me that the DEF 14A is not yet available in XBRL format.  I like your download-to-notebook-format idea.

Thanks once again,


On Wed, Jun 27, 2012, at 10:19 AM, Hans Michel wrote:

> Gregory:
> I have used Mathematica to extract data from the SEC (mostly the
> EDGAR format).
> Not all of the data on the SEC website is available in XBRL format.
> For some forms I prefer the EDGAR SGML-XML-HTML-Text hybrid, with its
> fixed schema and taxonomy, without the PDF.
> The XBRL structure brings so much framing with it that parsing the
> core XML file in Mathematica should be straightforward. But attaching
> the schemas and definitions is not so easy.
> The SEC provides an RSS feed of interactive data. Mathematica can take
> an RSS feed and convert it to Notebook format.
> Import["", "RSS"]
> With that notebook format you can write code to download and extract
> the zip file sources.
> Parsing the main XML data file in an XBRL file is straightforward. It
> is attaching the schema and the meaning that is the hard part. That
> could be done in Mathematica, but to do so one would have to have a
> compelling reason not to use other tools that are specifically made
> for such a task.
> I am familiar with SGML data. I would consider the XBRL format a
> hybrid SGML-XML, given its use of schemas (DTD), entity files, etc.
> What are you trying to do?
> Hans
> -----Original Message-----
> From: Gregory Lypny [mailto:gregory.lypny at]
> Sent: Wednesday, June 27, 2012 3:09 AM
> To: mathgroup at
> Subject: Extracting Information from XBRL Files
> Hello everyone,
> This is a long shot, but has anyone used Mathematica to parse XBRL
> files such as those accessible from the SEC's (US Securities and
> Exchange Commission) EDGAR system?  XBRL is a tagged format, an offshoot of
> Gregory Lypny
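
Hans's point that parsing the main XML data file is straightforward can be illustrated with a minimal Mathematica sketch. Everything in the sample below is an assumption made for illustration: the tag names, the attribute values, and the numbers are hypothetical and do not come from any real filing, and real XBRL instances use namespaced tags and separate context and unit definitions that this sketch ignores. It only shows the general idea of importing XML as symbolic XMLElement expressions and pulling out the tagged facts with Cases.

```mathematica
(* Hypothetical, heavily simplified XBRL-like instance document. *)
(* Real filings use namespaced tags and much more framing.       *)
sample = "<xbrl>
  <Revenues contextRef=\"FY2011\" unitRef=\"USD\">1000</Revenues>
  <NetIncomeLoss contextRef=\"FY2011\" unitRef=\"USD\">250</NetIncomeLoss>
</xbrl>";

(* Parse the string into symbolic XML (XMLObject and XMLElement expressions). *)
xml = ImportString[sample, "XML"];

(* Collect every leaf element that carries a contextRef attribute *)
(* as a {tag, context, value} triple.                              *)
facts = Cases[xml,
   XMLElement[tag_, {___, "contextRef" -> ctx_, ___}, {value_String}] :>
    {tag, ctx, value},
   Infinity];

facts // TableForm
```

For a file on disk, Import[path, "XML"] should produce the same kind of expression, so the Cases pattern would apply unchanged. Attaching the schema and taxonomy meaning to each tag, which Hans describes as the harder part, is not attempted here.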
