Parsing: http://microformats.org { "items": [ { "type": [ "h-feed" ], "properties": { "category": [ "indieweb", "microformats2", "microformats2", "indieweb", "microformats2" ] }, "id": "content", "children": [ { "type": [ "h-entry" ], "properties": { "name": [ "How to Consume Microformats 2 Data" ], "url": [ "https:\/\/microformats.org\/2022\/02\/19\/how-to-consume-microformats-2-data", "https:\/\/microformats.org\/2022\/02\/19\/how-to-consume-microformats-2-data" ], "updated": [ "2022-02-19T11:48:15" ], "content": [ { "html": "

A (very) belated follow-up to Getting Started with Microformats 2, covering the basics of consuming and using microformats 2 data. Originally posted on waterpigs.co.uk.

More and more people are using microformats 2 to mark up profiles, posts, events and other data on their personal sites, enabling developers to build applications which use this data in useful and interesting ways. Whether you want to add basic support for webmention comments to your personal site, or have ambitious plans for a structured-data-aware-social-graph-search-engine-super-feed-reader, you’re going to need a solid grasp of how to parse and handle microformats 2 data.

Choose a Parser

To turn a web page containing data marked up with microformats 2 (or classic microformats, if supported) into a canonical MF2 JSON data structure, you’ll need a parser.

At the time of writing, there are actively supported microformats 2 parsers available for the following programming languages:

- Go
- JavaScript (server-side and browser)
- PHP
- Python
- Ruby
- Rust

Parsers for various other languages exist, but might not be actively supported or support recent changes to the parsing specification.

There are also various websites which you can use to experiment with microformats markup without having to download a library and write any code:

- My own live-updating php-mf2 sandbox
- The various parser comparison tools hosted on microformats.io
- Aaron Parecki’s pin13.net microformats parser for parsing either URLs or HTML fragments

If there’s not currently a parser available for your language of choice, you have a few options:

- Call the command-line tools provided by one of the existing libraries from your code, and consume the JSON they provide
- Make use of one of the online mf2 parsers capable of parsing sites, and consume the JSON it returns (only recommended for very low volume usage!)
- Write your own microformats 2 parser! There are plenty of people happy to help, and a language-agnostic test suite you can plug your implementation into for testing.

Considerations During Fetching and Parsing

Most real-world microformats data is fetched from a URL, which could potentially redirect to a different URL one or more times. The final URL in the redirect chain is called the “effective URL”. HTML often contains relative URLs, which need to be resolved against a base URL in order to be useful out of context.

If your parser has a function for “parsing microformats from a URL”, it should deal with all of this for you. If you’re making the request yourself (e.g. to use custom caching or network settings) and then passing the response HTML and base URL to the parser, make sure to use the effective URL, not the starting URL! The parser will handle relative URL resolution, but it needs to know the correct base URL.

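To see why the base URL matters, here is a small sketch using Python’s standard `urllib.parse.urljoin`. The URLs are made up for illustration: a request to a hypothetical `start_url` is assumed to have redirected to `effective_url`, and the fetched page is assumed to contain the relative URL `photo.jpg`.

```python
from urllib.parse import urljoin

# Hypothetical URLs, for illustration only.
start_url = 'http://example.com/latest'
effective_url = 'https://example.com/2022/02/a-post/'
relative_photo = 'photo.jpg'

# Resolving against the starting URL points at the wrong location...
wrong = urljoin(start_url, relative_photo)
# ...whereas the effective URL gives the location the publisher intended.
right = urljoin(effective_url, relative_photo)

print(wrong)  # http://example.com/photo.jpg
print(right)  # https://example.com/2022/02/a-post/photo.jpg
```

Resolving against the starting URL quietly produces a link which probably doesn’t exist, which is why the effective URL is the one to hand to the parser.
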
When parsing microformats, an HTTP request which returns a non-200 status code doesn’t necessarily mean that there’s nothing to parse! For example, a `410 Gone` response might contain a h-entry with a message explaining the deletion of whatever was there before.

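If you make the request yourself with Python’s standard library, note that `urllib.request.urlopen` raises `urllib.error.HTTPError` for responses such as 404 or 410, but the exception object still carries the response body. The following is a minimal sketch under that assumption; the function name `fetch_html` and its `opener` parameter are illustrative choices, not part of any mf2 library.

```python
import urllib.error
import urllib.request


def fetch_html(url, opener=urllib.request.urlopen):
    """Return (status, effective_url, body), even for error responses."""
    try:
        with opener(url) as response:
            # After any redirects, response.url is the effective URL.
            return response.status, response.url, response.read()
    except urllib.error.HTTPError as err:
        # HTTPError doubles as a response object, so the body of e.g. a
        # 410 Gone page can still be read and passed on to an mf2 parser.
        return err.code, err.url, err.read()
```

The returned body and effective URL can then be passed to whichever parser you chose, and the status kept alongside them in case your application wants to treat deleted content differently.
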
Storing Raw HTML vs Parsed Canonical JSON vs Derived Data

When consuming microformats 2 data, you’ll most often be fetching raw HTML from a URL, parsing it to canonical JSON, then finally processing it into a simpler, cleaned and sanitised format ready for use in your website or application. That’s three different representations of the same data. You’ll most likely end up storing the derived data somewhere for quick access, but what about the other two?

Experience shows that, over time:

- the way a particular application cleans up mf2 data will be tweaked and improved as you add new features and handle unexpected edge-cases
- mf2 parsers gradually get improved, fixing bugs and occasionally adding entirely new features.

Therefore, if it makes sense for your use case, I recommend archiving a copy of the original HTML as well as your derived data, leaving out the intermediate canonical JSON. That way, you can easily create scripts or background jobs to update all the derived data based on the original HTML, taking advantage of both parser improvements and improvements to your own code at the same time, without having to re-fetch potentially hundreds of potentially broken links.

As mentioned in the previous section, if you archive original HTML for re-parsing, you’ll need to additionally store the effective URL for correct relative URL resolution.

For some languages, there are already libraries (such as XRay for PHP) which will perform common cleaning and sanitisation for you. If the assumptions with which these libraries are built suit your applications, you may be able to avoid a lot of the hard work of handling raw microformats 2 data structures!

If not, read on…

Navigating Microformat Structures

A parsed page may contain a number of microformat data structures (mf structs), in various different places.

Take a look at the parsed canonical microformats JSON for the article you’re reading right now, for example.

`items` is a list of top-level mf structs, each of which may contain nested mf structs either under their `properties` or `children` keys.

Each individual mf struct is guaranteed to have at least two keys, `type` and `properties`. `type` is the primary way of identifying what sort of thing that struct represents (e.g. a person, a post, an event). Structs can have more than one type if they represent multiple things at once without wanting to nest them. For example, a post detailing an event might be both a h-entry and a h-event at the same time. Structs can also have additional top-level keys such as `id` and `lang`.

Generally speaking, `type` information is most useful when dealing with top-level mf structs, and mf structs nested under a `children` key. Nested mf structs found in `properties` will also have `type` information, but their usage is usually implied by the property name they’re found under.

For many common use cases (e.g. a homepage feed and profile) there are several different ways people might nest mf structs to achieve the same goals, so it’s important that your code is capable of searching the entire tree, rather than just looking at the top-level mf structs.
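
One way to search the entire tree is a small recursive traversal which takes a filtering callback. This is only a sketch: the names `find_structs` and `find_by_type` are hypothetical, and it assumes that any property value which is a dict with a `type` key is a nested mf struct.

```python
def find_structs(parsed, matches):
    """Return every mf struct in parsed mf2 JSON for which matches(struct) is true."""
    found = []

    def visit(struct):
        if matches(struct):
            found.append(struct)
        # Nested structs can appear among the property values...
        for values in struct.get('properties', {}).values():
            for value in values:
                if isinstance(value, dict) and 'type' in value:
                    visit(value)
        # ...or under the children key.
        for child in struct.get('children', []):
            visit(child)

    for item in parsed.get('items', []):
        visit(item)
    return found


def find_by_type(parsed, mf_type):
    """Convenience wrapper: all structs of a given type, e.g. 'h-card'."""
    return find_structs(parsed, lambda s: mf_type in s.get('type', []))
```

For example, `find_by_type(parsed, 'h-entry')` would find posts whether they sit at the top level, inside an h-feed’s `children`, or under a property of another struct.
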
Never assume that the microformat struct you’re looking for will be in the top-level of the `items` list! You need to search the whole tree.

I recommend writing some functions which can traverse a mf tree and return all structs which match a filtering callback. This can then be used as a basis for writing more specific convenience functions for common tasks such as finding all microformats on a page of a particular type, or where a certain property matches a certain value.

See my microformats2 PHP functions for some working examples.

Possible Property Values

Each key in a mf struct’s `properties` dict maps to a list of values for that property. Every property may map to multiple values, and those values may be a mixture of any of the following:

A plain string value, containing no HTML, and leaving HTML entities unescaped (e.g. `<`)

    {
      "items": [{
        "type": ["h-card"],
        "properties": {
          "name": ["Barnaby Walters"]
        }
      }]
    }

(In future examples I will leave out the encapsulating `{"items": [{"type": [•••], •••}]}` for brevity, focusing on the `properties` key of a single mf struct.)

An embedded HTML struct, containing two keys: `html`, which maps to an HTML representation of the property, and `value`, mapping to a plain text version.

    "properties": {
      "content": [{
        "html": "<p>The content of a post, as <strong>raw HTML</strong> (or not).</p>",
        "value": "The content of a post, as raw HTML (or not)."
      }]
    }

An img/alt struct, containing the URL of a parsed image under `value`, and its alt text under `alt`.

    "properties": {
      "photo": [{
        "value": "https://example.com/profile-photo.jpg",
        "alt": "Example Person"
      }]
    }

A nested microformat data structure, with an additional `value` key containing a plaintext representation of the data contained within.

    "properties": {
      "author": [{
        "type": ["h-card"],
        "properties": {
          "name": ["Barnaby Walters"]
        },
        "value": "Barnaby Walters"
      }]
    }

All properties may have more than one value. In cases where you expect a single property value (e.g. `name`), simply take the first one you find, and in cases where you expect multiple values, use all values you consider valid. There are also some cases where it may make sense to use multiple values, but to prioritise one based on some heuristic. For example, an h-card may have multiple `url` values, in which case the first one is usually the “canonical” URL, and further URLs refer to external profiles.

Let’s look at the implications of each of the potential property value structures in turn.

Firstly, never assume that a property value will be a plaintext string.
Microformats publishers can nest microformats, embedded content and img/alt structures in a variety of different ways, and your consuming code should be as flexible as possible.

To partially make up for this complexity, you can always rely on the `value` key of nested structs to provide you with an equivalent plaintext value, regardless of what type of struct you’ve found.

When you start consuming microformats 2, write a function like this, and get into the habit of using it every time you want a single, plaintext value from a property:

    def get_first_plaintext(mf_struct, property_name):
        try:
            first_val = mf_struct['properties'][property_name][0]
            if isinstance(first_val, str):
                return first_val
            else:
                # Nested mf, embedded HTML and img/alt structs all carry
                # a plaintext equivalent under their 'value' key.
                return first_val['value']
        except (IndexError, KeyError):
            # The property is missing, empty, or has no plaintext value.
            return None

Secondly, never assume that a particular property will contain an embedded HTML struct. This usually applies to `content`, but is relevant anywhere your application expects embedded HTML. If you want to reliably get a value encoded as raw HTML, then you need to:

1. Check whether the first property value is an embedded HTML struct (i.e. has an `html` key). If so, take the value of the `html` key
2. Otherwise, get the first plaintext property value using the approach above, and HTML-escape it
3. If neither is found, the property has no value.

In Python 3.5+, that could look something like this:

    from html import escape

    def get_first_html(mf_struct, property_name):
        try:
            first_val = mf_struct['properties'][property_name][0]
            if isinstance(first_val, dict) and 'html' in first_val:
                return first_val['html']
            else:
                # Fall back to the plaintext value, escaped so that it is
                # safe to treat as HTML.
                plaintext_val = get_first_plaintext(mf_struct, property_name)

                if plaintext_val is not None:
                    plaintext_val = escape(plaintext_val)

                return plaintext_val
        except (IndexError, KeyError):
            return None

In some cases, it may make sense for your application to be aware of whether a value was parsed as embedded HTML or a plain text string, and to store/treat them differently. In all other cases, always use a function like this when you’re expecting embedded HTML data.

Thirdly, when expecting an image URL, check for an img/alt structure, falling back to the plain text value (and either assuming an empty alt text or inferring an appropriate one, depending on your specific use case).
Something like this could be a good starting point:

    def get_img_alt(mf_struct, property_name):
        try:
            first_val = mf_struct['properties'][property_name][0]
            if isinstance(first_val, dict) and 'alt' in first_val:
                return first_val
            else:
                # No img/alt struct: build one from the plaintext URL,
                # assuming an empty alt text.
                plaintext_val = get_first_plaintext(mf_struct, property_name)

                if plaintext_val is not None:
                    return {'value': plaintext_val, 'alt': ''}

                return None
        except (IndexError, KeyError):
            return None

Finally, in cases where you expect a nested microformat, you might end up getting something else. This is the hardest case to deal with, and the one which depends the most on the specific data and use-case you’re dealing with. For example, if you’re expecting a nested h-card under an `author` property, but get something else, you could use any of the following approaches:

- If you got a plain string which doesn’t look like a URL, treat it as the `name` property of an implied h-card structure with no other properties (and if you need a URL, you could potentially take the hostname of the effective URL, if it works in context as a useful fallback value)
- If you got an img/alt struct, you could treat the `value` as the `photo` property, the `alt` as the `name` property, and potentially even take the hostname of the `photo` URL to be the implied fallback `url` property (although that’s pushing it a bit, and in most cases it’s probably better to just leave out the `url`)
- If you got an embedded HTML struct, take its plaintext `value` and use one of the first two approaches
- If you got a plain string, check to see if it looks like a URL. If so, fetch that URL and look for a representative h-card to use as the author value
- If you get an embedded mf struct with a `url` property but no `photo`, you could fetch the `url`, look for a representative h-card (more on that in the next section) and see if it has a `photo` property
- Treat the `author` property as invalid and run the h-entry (or entire page if relevant) through the authorship algorithm

The first three are general principles which can be applied to many scenarios where you expect an embedded mf struct but find something else. The last three, however, are examples of a common trend in consuming microformats 2 data: for many common use-cases, there are well-thought-through algorithms you can use to interpret data in a standardised way.

Know Your Algorithms and Vocabularies

The authorship algorithm mentioned above is one of several more-or-less formally established algorithms used to solve common problems in indieweb usages of microformats 2. Some others which are worth knowing about include:

- “Who wrote this post?”: authorship algorithm
- “There’s more than one h-card on this page, which one should I use?”: representative h-card
- “I want to get a paginated feed of posts from this page”: How to consume h-feed
- “How do I find and display the main post on this page?”: How to consume h-entry
- “I received a response to one of my posts via webmention, how do I display it?”: How to display comments

Library implementations of these algorithms exist for some languages, although they often deviate slightly from the exact text.
See if you can find one which meets your needs, and if not, write your own and share it with the community!

In addition to the formal consumption algorithms, it’s worth looking through the definitions of the microformats vocabularies you’re using (as well as testing with real-world data) and adding support for properties or publishing techniques you might not have thought of the first time around. Some examples to get you started:

- If an h-card has no valid `photo`, see if there’s a valid `logo` you can use instead
- When presenting a h-entry with a featured photo, check both the `photo` property and the `featured` property, as one or the other might be used in different scenarios
- When dealing with address or location data (e.g. on an h-card, h-entry or h-event), be aware that either might be present in various different forms. Co-ordinates might be separate `latitude` and `longitude` properties, a combined plaintext `geo` property, or an embedded `h-geo`. Addresses might be separate top-level properties or an embedded h-adr. There are many variations which are totally valid to publish, and your consuming code should be as liberal as possible in what it accepts.
- If a h-entry contains images which are marked up with `u-photo` within the `e-content`, they’ll be present both in the `content` `html` key and also under the `photo` property. If your app shows the embedded `content` HTML rather than using the plaintext version, and also supports `photo` properties (which may also be present outside the `content`), you may have to sniff the presence of photos within the `content`, and either remove them from it or ignore the corresponding `photo` properties to avoid showing photos twice.

Sanitise, Validate, and Truncate

In the vast majority of cases, consuming microformats 2 data involves handling, storing and potentially re-publishing untrusted and potentially dangerous input data. Preventing XSS and other attacks is out of the scope of the microformats parsing algorithm, so the data your parser gives you is just as dangerous as the original source. You need to take your own measures for sanitising and truncating it so you can store and display it safely.

Covering every possible injection and XSS attack is out of the scope of this article, so I highly recommend referring to the OWASP resources on XSS Prevention, Unicode Attacks and Injection Attacks for more information.

Other than that, the following ideas are a good start:

- Use plaintext values where possible, only using embedded HTML when absolutely necessary
- Pass everything (HTML or not) through a well-respected HTML sanitizer such as PHP’s HTML Purifier. Configure it to make sure that embedded HTML can’t interfere with your own markup or CSS. It probably shouldn’t contain any JavaScript ever, either.
- In any case where you’re expecting a value with a specific format, validate it as appropriate.
- More specifically, everywhere that you expect a URL, check that what you got was actually a URL. If you’re using the URL as an image, consider fetching it and checking its content type
- Consider either proxying resources such as images, or storing local copies of them (reducing size and resolution as necessary), to avoid mixed content issues, potential attacks, and missing images if the links break in the future.
- Decide on relevant maximum length values for each separate piece of external content, and truncate them as necessary. Ideally, use a language-aware truncation algorithm to avoid breaking words apart. When the content of a post is truncated, consider adding a “Read More” link for convenience.

Test with Real-World Data

The web is a diverse place, and microformats are a flexible, permissive method of marking up structured data. There are often several different yet perfectly valid ways to achieve the same goal, and as a good consumer of mf2 data, your application should strive to accept as many of them as possible!

The best way to test this is with real-world data. If your application is built with a particular source of data in mind, then start off with testing it against that.
If you want to be able to handle a wider variety of sources, the best way is to determine what vocabularies and publishing use-cases your application consumes, and look at the Examples sections of the relevant indieweb.org wiki pages for real-world sites to test your code against.

Don’t forget to test your code against examples you’ve published on your own personal site!

Next Steps

Hopefully this article helped you avoid a lot of common gotchas, and gave you a good head-start towards successfully consuming real-world microformats 2 data.

If you have questions or issues, or want to share something cool you’ve built, come and join us in the indieweb chat room.", "value": "A (very) belated follow up to Getting Started with Microformats 2, covering the basics of consuming and using microformats 2 data. Originally posted on waterpigs.co.uk.\nMore and more people are using microformats 2 to mark up profiles, posts, events and other data on their personal sites, enabling developers to build applications which use this data in useful and interesting ways. Whether you want to add basic support for webmention comments to your personal site, or have ambitious plans for a structured-data-aware-social-graph-search-engine-super-feed-reader, you\u2019re going to need a solid grasp of how to parse and handle microformats 2 data. 
Choose a Parser\nTo turn a web page containing data marked up with microformats 2 (or classic microformats, if supported) into a canonical MF2 JSON data structure, you\u2019ll need a parser.\nAt the time of writing, there are actively supported microformats 2 parsers available for the following programming languages: Go Javascript (server-side and browser) PHP Python Ruby Rust\nParsers for various other languages exist, but might not be actively supported or support recent changes to the parsing specification.\nThere are also various websites which you can use to experiment with microformats markup without having to download a library and write any code: My own live-updating php-mf2 sandbox The various parser comparison tools hosted on microformats.io Aaron Parecki\u2019s pin13.net microformats parser for parsing either URLs or HTML fragments\nIf there\u2019s not currently a parser available for your language of choice, you have a few options: Call the command-line tools provided by one of the existing libraries from your code, and consume the JSON they provide Make use of one of the online mf2 parsers capable of parsing sites, and consume the JSON it returns (only recommended for very low volume usage!) Write your own microformats 2 parser! There are plenty of people happy to help, and a language-agnostic test suite you can plug your implementation into for testing. Considerations During Fetching and Parsing\nMost real-world microformats data is fetched from a URL, which could potentially redirect to a different URL one or more times. The final URL in the redirect chain is called the \u201ceffective URL\u201d. HTML often contains relative URLs, which need to be resolved against a base URL in order to be useful out of context.\nIf your parser has a function for \u201cparsing microformats from a URL\u201d, it should deal with all of this for you. If you\u2019re making the request yourself (e.g. 
to use custom caching or network settings) and then passing the response HTML and base URL to the parser, make sure to use the effective URL, not the starting URL! The parser will handle relative URL resolution, but it needs to know the correct base URL.\nWhen parsing microformats, an HTTP request which returns a non-200 value doesn\u2019t necessarily mean that there\u2019s nothing to parse! For example, a 410 Gone response might contain a h-entry with a message explaining the deletion of whatever was there before. Storing Raw HTML vs Parsed Canonical JSON vs Derived Data\nWhen consuming microformats 2 data, you\u2019ll most often be fetching raw HTML from a URL, parsing it to canonical JSON, then finally processing it into a simpler, cleaned and sanitised format ready for use in your website or application. That\u2019s three different representations of the same data \u2014 you\u2019ll most likely end up storing the derived data somewhere for quick access, but what about the other two?\nExperience shows that, over time: the way a particular application cleans up mf2 data will be tweaked and improved as you add new features and handle unexpected edge-cases mf2 parsers gradually get improved, fixing bugs and occasionally adding entirely new features.\nTherefore, if it makes sense for your use case, I recommend archiving a copy of the original HTML as well as your derived data, leaving out the intermediate canonical JSON. 
That way, you can easily create scripts or background jobs to update all the derived data based on the original HTML, taking advantage of both parser improvements and improvements to your own code at the same time, without having to re-fetch potentially hundreds of potentially broken links.\nAs mentioned in the previous section, if you archive original HTML for re-parsing, you\u2019ll need to additionally store the effective URL for correct relative URL resolution.\nFor some languages, there are already libraries (such as XRay for PHP) which will perform common cleaning and sanitisation for you. If the assumptions with which these libraries are built suit your applications, you may be able to avoid a lot of the hard work of handling raw microformats 2 data structures!\nIf not, read on\u2026 Navigating Microformat Structures\nA parsed page may contain a number of microformat data structures (mf structs), in various different places.\nTake a look at the parsed canonical microformats JSON for the article you\u2019re reading right now, for example.\nitems is a list of top-level mf structs, each of which may contain nested mf structs either under their properties or children keys.\nEach individual mf struct is guaranteed to have at least two keys, type and properties. type is the primary way of identifying what sort of thing that struct represents (e.g. a person, a post, an event). Structs can have more than one type if they represent multiple things at once without wanting to nest them \u2014 for example, a post detailing an event might be both a h-entry and a h-event at the same time. Structs can also have additional top-level keys such as id and lang.\nGenerally speaking, type information is most useful when dealing with top-level mf structs, and mf structs nested under a children key. 
Nested mf structs found in properties will also have type information, but their usage is usually implied by the property name they\u2019re found under.\nFor many common use cases (e.g. a homepage feed and profile) there are several different ways people might nest mf structs to achieve the same goals, so it\u2019s important that your code is capable of searching the entire tree, rather than just looking at the top-level mf structs. Never assume that the microformat struct you\u2019re looking for will be in the top-level of the items list! You need to search the whole tree.\nI recommend writing some functions which can traverse a mf tree and return all structs which match a filtering callback. This can then be used as a basis for writing more specific convenience functions for common tasks such as finding all microformats on a page of a particular type, or where a certain property matches a certain value.\nSee my microformats2 PHP functions for some working examples. Possible Property Values\nEach key in a mf struct\u2019s properties dict maps to a list of values for that property. Every property may map to multiple values, and those values may be a mixture of any of the following:\nA plain string value, containing no HTML, and leaving HTML entities unescaped (e.g. <) { \"items\": [{ \"type\": [\"h-card\"], \"properties\": { \"name\": [\"Barnaby Walters\"] } }] }\n(In future examples I will leave out the encapsulating {\"items\": [{\"type\": [\u2022\u2022\u2022], \u2022\u2022\u2022}]} for brevity, focusing on the properties key of a single mf struct.)\nAn embedded HTML struct, containing two keys: html, which maps to an HTML representation of the property, and value, mapping to a plain text version. 
\"properties\": { \"content\": [{ \"html\": \" The content of a post, as raw HTML<\/strong> (or not).<\/p>\", \"value\": \"The content of a post, as raw HTML (or not).\" }] }\nAn img\/alt struct, containing the URL of a parsed image under value, and its alt text under alt. \"properties\": { \"photo\": [{ \"value\": \"https:\/\/example.com\/profile-photo.jpg\", \"alt\": \"Example Person\" }] }\nA nested microformat data structure, with an additional value key containing a plaintext representation of the data contained within. \"properties\": { \"author\": [{ \"type\": [\"h-card\"], \"properties\": { \"name\": [\"Barnaby Walters\"] }, \"value\": \"Barnaby Walters }] }\nAll properties may have more than one value. In cases where you expect a single property value (e.g. name), simply take the first one you find, and in cases where you expect multiple values, use all values you consider valid. There are also some cases where it may make sense to use multiple values, but to prioritise one based on some heuristic \u2014 for example, an h-card may have multiple url values, in which case the first one is usually the \u201ccanonical\u201d URL, and further URLs refer to external profiles.\nLet\u2019s look at the implications of each of the potential property value structures in turn.\nFirstly, Never assume that a property value will be a plaintext string. 
Microformats publishers can nest microformats, embedded content and img\/alt structures in a variety of different ways, and your consuming code should be as flexible as possible.\nTo partially make up for this complexity, you can always rely on the value key of nested structs to provide you with an equivalent plaintext value, regardless of what type of struct you\u2019ve found.\nWhen you start consuming microformats 2, write a function like this, and get into the habit of using it every time you want a single, plaintext value from a property: def get_first_plaintext(mf_struct, property_name): try: first_val = mf_struct['properties'][property_name][0] if isinstance(first_val, str): return first_val else: return first_val['value'] except (IndexError, KeyError): return None\nSecondly, Never assume that a particular property will contain an embedded HTML struct \u2014 this usually applies to content, but is relevant anywhere your application expects embedded HTML. If you want to reliably get a value encoded as raw HTML, then you need to: Check whether the first property value is an embedded HTML struct (i.e. has an html key). If so, take the value of the html key Otherwise, get the first plaintext property value using the approach above, and HTML-escape it If neither is found, the property has no value.\nIn Python 3.5+, that could look something like this: from html import escape def get_first_html(mf_struct, property_name): try: first_val = mf_struct['properties'][property_name][0] if isinstance(first_val, dict) and 'html' in first_val: return first_val['html'] else: plaintext_val = get_first_plaintext(mf_struct, property_name) if plaintext_val is not None: plaintext_val = escape(plaintext_val) return plaintext_val except (IndexError, KeyError): return None\nIn some cases, it may make sense for your application to be aware of whether a value was parsed as embedded HTML or a plain text string, and to store\/treat them differently. 
In all other cases, always use a function like this when you\u2019re expecting embedded HTML data.\nThirdly, when expecting an image URL, check for an img\/alt structure, falling back to the plain text value (and either assuming an empty alt text or inferring an appropriate one, depending on your specific use case). Something like this could be a good starting point:\ndef get_img_alt(mf_struct, property_name):\n    try:\n        first_val = mf_struct['properties'][property_name][0]\n        if isinstance(first_val, dict) and 'alt' in first_val:\n            return first_val\n        else:\n            plaintext_val = get_first_plaintext(mf_struct, property_name)\n            if plaintext_val is not None:\n                return {'value': plaintext_val, 'alt': ''}\n            return None\n    except (IndexError, KeyError):\n        return None\nFinally, in cases where you expect a nested microformat, you might end up getting something else. This is the hardest case to deal with, and the one which depends the most on the specific data and use-case you\u2019re dealing with. For example, if you\u2019re expecting a nested h-card under an author property, but get something else, you could use any of the following approaches: If you got a plain string which doesn\u2019t look like a URL, treat it as the name property of an implied h-card structure with no other properties (and if you need a URL, you could potentially take the hostname of the effective URL, if it works in context as a useful fallback value). If you got an img\/alt struct, you could treat the value as the photo property, the alt as the name property, and potentially even take the hostname of the photo URL to be the implied fallback url property (although that\u2019s pushing it a bit, and in most cases it\u2019s probably better to just leave out the url). If you got an embedded HTML struct, take its plaintext value and use one of the first two approaches. If you got a plain string, check to see if it looks like a URL. 
If so, fetch that URL and look for a representative h-card to use as the author value. If you get an embedded mf struct with a url property but no photo, you could fetch the url, look for a representative h-card (more on that in the next section) and see if it has a photo property. Treat the author property as invalid and run the h-entry (or entire page if relevant) through the authorship algorithm.\nThe first three are general principles which can be applied to many scenarios where you expect an embedded mf struct but find something else. The last three, however, are examples of a common trend in consuming microformats 2 data: for many common use-cases, there are well-thought-through algorithms you can use to interpret data in a standardised way.\nKnow Your Algorithms and Vocabularies\nThe authorship algorithm mentioned above is one of several more-or-less formally established algorithms used to solve common problems in indieweb usages of microformats 2. Some others which are worth knowing about include: \u201cWho wrote this post?\u201d: authorship algorithm; \u201cThere\u2019s more than one h-card on this page, which one should I use?\u201d: representative h-card; \u201cI want to get a paginated feed of posts from this page\u201d: How to consume h-feed; \u201cHow do I find and display the main post on this page?\u201d: How to consume h-entry; \u201cI received a response to one of my posts via webmention, how do I display it?\u201d: How to display comments.\nLibrary implementations of these algorithms exist for some languages, although they often deviate slightly from the exact text. 
See if you can find one which meets your needs, and if not, write your own and share it with the community!\nIn addition to the formal consumption algorithms, it\u2019s worth looking through the definitions of the microformats vocabularies you\u2019re using (as well as testing with real-world data) and adding support for properties or publishing techniques you might not have thought of the first time around. Some examples to get you started: If an h-card has no valid photo, see if there\u2019s a valid logo you can use instead. When presenting an h-entry with a featured photo, check both the photo property and the featured property, as one or the other might be used in different scenarios. When dealing with address or location data (e.g. on an h-card, h-entry or h-event), be aware that either might be present in various different forms. Co-ordinates might be separate latitude and longitude properties, a combined plaintext geo property, or an embedded h-geo. Addresses might be separate top-level properties or an embedded h-adr. There are many variations which are totally valid to publish, and your consuming code should be as liberal as possible in what it accepts. If an h-entry contains images which are marked up with u-photo within the e-content, they\u2019ll be present both in the content html key and also under the photo property. If your app shows the embedded content HTML rather than using the plaintext version, and also supports photo properties (which may also be present outside the content), you may have to sniff the presence of photos within the content, and either remove them from it or ignore the corresponding photo properties to avoid showing photos twice.\nSanitise, Validate, and Truncate\nIn the vast majority of cases, consuming microformats 2 data involves handling, storing and potentially re-publishing untrusted and potentially dangerous input data. 
Preventing XSS and other attacks is out of the scope of the microformats parsing algorithm, so the data your parser gives you is just as dangerous as the original source. You need to take your own measures for sanitising and truncating it so you can store and display it safely.\nCovering every possible injection and XSS attack is out of the scope of this article, so I highly recommend referring to the OWASP resources on XSS Prevention, Unicode Attacks and Injection Attacks for more information.\nOther than that, the following ideas are a good start: Use plaintext values where possible, only using embedded HTML when absolutely necessary. Pass everything (HTML or not) through a well-respected HTML sanitizer such as PHP\u2019s HTML Purifier. Configure it to make sure that embedded HTML can\u2019t interfere with your own markup or CSS. It probably shouldn\u2019t contain any JavaScript ever, either. In any case where you\u2019re expecting a value with a specific format, validate it as appropriate. More specifically, everywhere that you expect a URL, check that what you got was actually a URL. If you\u2019re using the URL as an image, consider fetching it and checking its content type. Consider either proxying resources such as images, or storing local copies of them (reducing size and resolution as necessary), to avoid mixed content issues, potential attacks, and missing images if the links break in the future. Decide on relevant maximum length values for each separate piece of external content, and truncate them as necessary. Ideally, use a language-aware truncation algorithm to avoid breaking words apart. When the content of a post is truncated, consider adding a \u201cRead More\u201d link for convenience.\nTest with Real-World Data\nThe web is a diverse place, and microformats are a flexible, permissive method of marking up structured data. 
There are often several different yet perfectly valid ways to achieve the same goal, and as a good consumer of mf2 data, your application should strive to accept as many of them as possible!\nThe best way to test this is with real-world data. If your application is built with a particular source of data in mind, then start off with testing it against that. If you want to be able to handle a wider variety of sources, the best way is to determine what vocabularies and publishing use-cases your application consumes, and look at the Examples sections of the relevant indieweb.org wiki pages for real-world sites to test your code against.\nDon\u2019t forget to test your code against examples you\u2019ve published on your own personal site!\nNext Steps\nHopefully this article helped you avoid a lot of common gotchas, and gave you a good head-start towards successfully consuming real-world microformats 2 data.\nIf you have questions or issues, or want to share something cool you\u2019ve built, come and join us in the indieweb chat room." 
} ], "author": [ { "type": [ "h-card" ], "properties": { "name": [ "waterpigs.co.uk\/" ], "url": [ "http:\/\/microformats.org\/" ], "photo": [ { "value": "http:\/\/1.gravatar.com\/avatar\/4a57cddee3c50aefa893005dcdd33b64?s=16&d=mm&r=pg", "alt": "" } ] }, "value": "waterpigs.co.uk\/" } ] }, "id": "post-524", "children": [ { "type": [ "h-card" ], "properties": { "name": [ "Aaron Parecki" ], "url": [ "https:\/\/aaronparecki.com" ] } } ] }, { "type": [ "h-entry" ], "properties": { "name": [ "Google confirms Microformats are still a recommended metadata format for content" ], "category": [ "indieweb", "microformats2" ], "url": [ "https:\/\/microformats.org\/2020\/03\/04\/google-confirms-microformats-are-still-a-recommended-metadata-format-for-content", "https:\/\/microformats.org\/2020\/03\/04\/google-confirms-microformats-are-still-a-recommended-metadata-format-for-content" ], "updated": [ "2020-03-04T10:48:02" ], "content": [ { "html": " \nThis post originally appeared on Jamie Tanna\u2019s site<\/a>.<\/p>\n Google announced that they are removing support for the data-vocabulary metadata<\/a> markup that could be used to provide rich search results on its Search Engine.<\/p>\n In a Twitter exchange, John Mueller, a Webmaster Trends Analyst at Google, confirmed that Microformats<\/a> are still being supported by Google at this time:<\/p>\n \nYes, we still support them.<\/p>\n\u2014 \ud83c\udf4c John \ud83c\udf4c (@JohnMu) January 21, 2020<\/a><\/p><\/blockquote>\n John also confirmed that he knows of no upcoming plans to deprecate Microformats:<\/p>\n \nWe don\u2019t have any plans for changes to announce there at the moment. I don\u2019t know off-hand how broadly microformats are used, my guess is it\u2019s much more than data-vocabulary. 
That said \u2026 https:\/\/t.co\/ZCE7rTKmPa<\/a><\/p>\n \u2014 \ud83c\udf4c John \ud83c\udf4c (@JohnMu) January 21, 2020<\/a><\/p><\/blockquote>\n This is an especially great result due to the way that Google is quite happy to abandon various metadata formats, as noted in our 7th anniversary blog post<\/a>, almost 8 years ago. With this announcement, Microformats are now the longest-supported metadata format that Google parses, since at least 2009<\/a>!<\/p>\n With the continued growth of Microformats across the IndieWeb<\/a>, we expect that Google will extend its Microformats support accordingly.<\/p>\n<\/div>", "value": "This post originally appeared on Jamie Tanna\u2019s site.\nGoogle announced that they are removing support for the data-vocabulary metadata markup that could be used to provide rich search results on its Search Engine.\nIn a Twitter exchange, John Mueller, a Webmaster Trends Analyst at Google, confirmed that Microformats are still being supported by Google at this time:\nYes, we still support them.\n\u2014 \ud83c\udf4c John \ud83c\udf4c (@JohnMu) January 21, 2020\nJohn also confirmed that he knows of no upcoming plans to deprecate Microformats:\nWe don\u2019t have any plans for changes to announce there at the moment. I don\u2019t know off-hand how broadly microformats are used, my guess is it\u2019s much more than data-vocabulary. That said \u2026 https:\/\/t.co\/ZCE7rTKmPa\n\u2014 \ud83c\udf4c John \ud83c\udf4c (@JohnMu) January 21, 2020\nThis is an especially great result due to the way that Google is quite happy to abandon various metadata formats, as noted in our 7th anniversary blog post, almost 8 years ago. With this announcement, Microformats are now the longest-supported metadata format that Google parses, since at least 2009!\nWith the continued growth of Microformats across the IndieWeb, we expect that Google will extend its Microformats support accordingly." 
} ], "author": [ { "type": [ "h-card" ], "properties": { "name": [ "jamietanna" ], "url": [ "http:\/\/microformats.org\/" ], "photo": [ { "value": "http:\/\/1.gravatar.com\/avatar\/702c2c3657b87396c41f14251af663c4?s=16&d=mm&r=pg", "alt": "" } ] }, "value": "jamietanna" } ] }, "id": "post-491" }, { "type": [ "h-entry" ], "properties": { "name": [ "microformats.org Year 14 \u2014 Welcome New Admins" ], "category": [ "microformats2" ], "url": [ "https:\/\/microformats.org\/2018\/06\/22\/microformats-org-year-14-welcome-new-admins", "https:\/\/microformats.org\/2018\/06\/22\/microformats-org-year-14-welcome-new-admins" ], "updated": [ "2018-06-22T15:14:41" ], "content": [ { "html": " In microformats.org year 14, we welcome new admins<\/a>: Aaron Parecki<\/a>, Gregor Morrill<\/a>, Martijn van der Ven<\/a>, and Sven Knebel<\/a>! All have been active for years, helping welcome new members and doing essential wiki gardening & microformats2 parser updates<\/a>!<\/p>\n Originally posted at: tantek.com<\/a><\/p>", "value": "In microformats.org year 14, we welcome new admins: Aaron Parecki, Gregor Morrill, Martijn van der Ven, and Sven Knebel! All have been active for years, helping welcome new members and doing essential wiki gardening & microformats2 parser updates!\nOriginally posted at: tantek.com" } ], "author": [ { "type": [ "h-card" ], "properties": { "name": [ "Tantek" ], "url": [ "http:\/\/microformats.org\/" ], "photo": [ { "value": "http:\/\/0.gravatar.com\/avatar\/02cd45622e90350cc061aaaa02229195?s=16&d=mm&r=pg", "alt": "" } ] }, "value": "Tantek" } ] }, "id": "post-480" }, { "type": [ "h-entry" ], "properties": { "name": [ "Happy 13th to microformats.org!" 
], "category": [ "indieweb", "microformats2" ], "url": [ "https:\/\/microformats.org\/2018\/06\/21\/happy-13th-to-microformats-org", "https:\/\/microformats.org\/2018\/06\/21\/happy-13th-to-microformats-org" ], "updated": [ "2018-06-21T08:40:46" ], "content": [ { "html": " With more use of microformats2<\/a>, especially among the growing indieweb<\/a> network of websites, we\u2019ve iterated key<\/a> specs<\/a> for real-world needs and are seeing more active community members. More updates & posts coming up!<\/p>\n Originally posted on tantek.com<\/a>.<\/p>", "value": "With more use of microformats2, especially among the growing indieweb network of websites, we\u2019ve iterated key specs for real-world needs and are seeing more active community members. More updates & posts coming up!\nOriginally posted on tantek.com." } ], "author": [ { "type": [ "h-card" ], "properties": { "name": [ "Tantek" ], "url": [ "http:\/\/microformats.org\/" ], "photo": [ { "value": "http:\/\/0.gravatar.com\/avatar\/02cd45622e90350cc061aaaa02229195?s=16&d=mm&r=pg", "alt": "" } ] }, "value": "Tantek" } ] }, "id": "post-475" }, { "type": [ "h-entry" ], "properties": { "name": [ "Improving the php-mf2 parser" ], "url": [ "https:\/\/microformats.org\/2017\/06\/22\/improving-the-php-mf2-parser", "https:\/\/microformats.org\/2017\/06\/22\/improving-the-php-mf2-parser" ], "updated": [ "2017-06-22T09:13:53" ], "content": [ { "html": " During the past year, the popular php-mf2<\/a> microformats parser has received quite a few improvements. My site runs ProcessWire and one of the plugins for it uses php-mf2, so I have been spending some time on it.<\/p>\n My own experience with microformats started when I discovered the hCard microformat<\/a>. I was impressed with the novelty of adding some simple HTML classes around contact information and having a browser extension parse it into an address book. 
Years later, when I started to get involved in the IndieWeb community, I learned a lot more about microformats2 and they became a key building block of my personal site.<\/p>\n php-mf2 is now much better at backwards-compatible parsing of microformats1. This is important because software should be able to consistently consume content whether it\u2019s marked up with microformats1, microformats2, or a combination. An experimental feature for parsing language attributes has also been added. Finally, it\u2019s now using the microformats test suite. Several other parsers use this test suite as well. This will make it easier to catch bugs and improve all of the different parsers.<\/p>\n php-mf2 is a stable library that\u2019s ready to be installed in your software to start consuming microformats. It is currently used in Known<\/a>, WordPress plugins<\/a>, and ProcessWire plugins<\/a> for richer social interactions. It\u2019s also used in tools like XRay<\/a> and microformats.io<\/a>. I\u2019m looking forward to more improvements to php-mf2 in the coming year as well as more software using it!<\/p>\n Original published at: https:\/\/gregorlove.com\/2017\/06\/improving-the-php-mf2-parser\/<\/a><\/p>", "value": "During the past year, the popular php-mf2 microformats parser has received quite a few improvements. My site runs ProcessWire and one of the plugins for it uses php-mf2, so I have been spending some time on it.\nMy own experience with microformats started when I discovered the hCard microformat. I was impressed with the novelty of adding some simple HTML classes around contact information and having a browser extension parse it into an address book. Years later, when I started to get involved in the IndieWeb community, I learned a lot more about microformats2 and they became a key building block of my personal site.\nphp-mf2 is now much better at backwards-compatible parsing of microformats1. 
This is important because software should be able to consistently consume content whether it\u2019s marked up with microformats1, microformats2, or a combination. An experimental feature for parsing language attributes has also been added. Finally, it\u2019s now using the microformats test suite. Several other parsers use this test suite as well. This will make it easier to catch bugs and improve all of the different parsers.\nphp-mf2 is a stable library that\u2019s ready to be installed in your software to start consuming microformats. It is currently used in Known, WordPress plugins, and ProcessWire plugins for richer social interactions. It\u2019s also used in tools like XRay and microformats.io. I\u2019m looking forward to more improvements to php-mf2 in the coming year as well as more software using it!\nOriginal published at: https:\/\/gregorlove.com\/2017\/06\/improving-the-php-mf2-parser\/" } ], "author": [ { "type": [ "h-card" ], "properties": { "name": [ "gRegor Morrill" ], "url": [ "http:\/\/microformats.org\/" ], "photo": [ { "value": "http:\/\/1.gravatar.com\/avatar\/aca81ab5bf69a4626c91edc811cea208?s=16&d=mm&r=pg", "alt": "" } ] }, "value": "gRegor Morrill" } ] }, "id": "post-469" } ] } ], "rels": { "shortcut": [ "http:\/\/microformats.org\/favicon.ico" ], "icon": [ "http:\/\/microformats.org\/favicon.ico", "https:\/\/microformats.org\/media\/2020\/06\/microformats-logo-150x150.png", "https:\/\/microformats.org\/media\/2020\/06\/microformats-logo.png" ], "profile": [ "http:\/\/microformats.org\/profile\/specs", "http:\/\/microformats.org\/profile\/hatom" ], "dns-prefetch": [ "http:\/\/s.w.org" ], "alternate": [ "https:\/\/microformats.org\/feed", "https:\/\/microformats.org\/comments\/feed" ], "stylesheet": [ "http:\/\/microformats.org\/wordpress\/wp-content\/plugins\/openid\/f\/openid.css?ver=519", "http:\/\/microformats.org\/wordpress\/wp-includes\/css\/dist\/block-library\/style.min.css?ver=5.4.15", 
"http:\/\/microformats.org\/wordpress\/wp-content\/themes\/microformats\/style.css?ver=1.0", "http:\/\/microformats.org\/wordpress\/wp-content\/themes\/microformatscss\/print.css?ver=1.0" ], "https:\/\/api.w.org\/": [ "https:\/\/microformats.org\/wp-json\/" ], "EditURI": [ "https:\/\/microformats.org\/wordpress\/xmlrpc.php?rsd" ], "wlwmanifest": [ "http:\/\/microformats.org\/wordpress\/wp-includes\/wlwmanifest.xml" ], "apple-touch-icon": [ "https:\/\/microformats.org\/media\/2020\/06\/microformats-logo.png" ], "bookmark": [ "https:\/\/microformats.org\/2022\/02\/19\/how-to-consume-microformats-2-data", "https:\/\/microformats.org\/2020\/03\/04\/google-confirms-microformats-are-still-a-recommended-metadata-format-for-content", "https:\/\/microformats.org\/2018\/06\/22\/microformats-org-year-14-welcome-new-admins", "https:\/\/microformats.org\/2018\/06\/21\/happy-13th-to-microformats-org", "https:\/\/microformats.org\/2017\/06\/22\/improving-the-php-mf2-parser" ], "canonical": [ "https:\/\/www.jvt.me\/posts\/2020\/03\/02\/google-microformats-support\/", "https:\/\/gregorlove.com\/2017\/06\/improving-the-php-mf2-parser\/" ], "tag": [ "https:\/\/microformats.org\/tag\/indieweb", "https:\/\/microformats.org\/tag\/microformats2" ] }, "rel-urls": { "http:\/\/microformats.org\/favicon.ico": { "type": "image\/ico", "rels": [ "icon", "shortcut" ] }, "http:\/\/microformats.org\/profile\/specs": { "rels": [ "profile" ] }, "http:\/\/microformats.org\/profile\/hatom": { "rels": [ "profile" ] }, "http:\/\/s.w.org": { "rels": [ "dns-prefetch" ] }, "https:\/\/microformats.org\/feed": { "title": "Microformats \u00bb Feed", "type": "application\/rss+xml", "rels": [ "alternate" ] }, "https:\/\/microformats.org\/comments\/feed": { "title": "Microformats \u00bb Comments Feed", "type": "application\/rss+xml", "rels": [ "alternate" ] }, "http:\/\/microformats.org\/wordpress\/wp-content\/plugins\/openid\/f\/openid.css?ver=519": { "media": "all", "type": "text\/css", "rels": [ "stylesheet" 
] }, "http:\/\/microformats.org\/wordpress\/wp-includes\/css\/dist\/block-library\/style.min.css?ver=5.4.15": { "media": "all", "type": "text\/css", "rels": [ "stylesheet" ] }, "http:\/\/microformats.org\/wordpress\/wp-content\/themes\/microformats\/style.css?ver=1.0": { "media": "screen", "type": "text\/css", "rels": [ "stylesheet" ] }, "http:\/\/microformats.org\/wordpress\/wp-content\/themes\/microformatscss\/print.css?ver=1.0": { "media": "print", "type": "text\/css", "rels": [ "stylesheet" ] }, "https:\/\/microformats.org\/wp-json\/": { "rels": [ "https:\/\/api.w.org\/" ] }, "https:\/\/microformats.org\/wordpress\/xmlrpc.php?rsd": { "title": "RSD", "type": "application\/rsd+xml", "rels": [ "EditURI" ] }, "http:\/\/microformats.org\/wordpress\/wp-includes\/wlwmanifest.xml": { "type": "application\/wlwmanifest+xml", "rels": [ "wlwmanifest" ] }, "https:\/\/microformats.org\/media\/2020\/06\/microformats-logo-150x150.png": { "rels": [ "icon" ] }, "https:\/\/microformats.org\/media\/2020\/06\/microformats-logo.png": { "rels": [ "apple-touch-icon", "icon" ] }, "https:\/\/microformats.org\/2022\/02\/19\/how-to-consume-microformats-2-data": { "title": "Permanent Link to How to Consume Microformats 2 Data", "text": "How to Consume Microformats 2 Data", "rels": [ "bookmark" ] }, "https:\/\/microformats.org\/2020\/03\/04\/google-confirms-microformats-are-still-a-recommended-metadata-format-for-content": { "title": "Permanent Link to Google confirms Microformats are still a recommended metadata format for content", "text": "Google confirms Microformats are still a recommended metadata format for content", "rels": [ "bookmark" ] }, "https:\/\/www.jvt.me\/posts\/2020\/03\/02\/google-microformats-support\/": { "text": "originally appeared on Jamie Tanna\u2019s site", "rels": [ "canonical" ] }, "https:\/\/microformats.org\/tag\/indieweb": { "text": "indieweb", "rels": [ "tag" ] }, "https:\/\/microformats.org\/tag\/microformats2": { "text": "microformats2", "rels": [ "tag" ] 
}, "https:\/\/microformats.org\/2018\/06\/22\/microformats-org-year-14-welcome-new-admins": { "title": "Permanent Link to microformats.org Year 14 \u2014 Welcome New Admins", "text": "microformats.org Year 14 \u2014 Welcome New Admins", "rels": [ "bookmark" ] }, "https:\/\/microformats.org\/2018\/06\/21\/happy-13th-to-microformats-org": { "title": "Permanent Link to Happy 13th to microformats.org!", "text": "Happy 13th to microformats.org!", "rels": [ "bookmark" ] }, "https:\/\/microformats.org\/2017\/06\/22\/improving-the-php-mf2-parser": { "title": "Permanent Link to Improving the php-mf2 parser", "text": "Improving the php-mf2 parser", "rels": [ "bookmark" ] }, "https:\/\/gregorlove.com\/2017\/06\/improving-the-php-mf2-parser\/": { "text": "https:\/\/gregorlove.com\/2017\/06\/improving-the-php-mf2-parser\/", "rels": [ "canonical" ] } } }