2017 © Pedro Peláez
 

library http-parser

A tool for scraping URL resources

image

hypejunction/http-parser

A tool for scraping URL resources

  • Wednesday, October 4, 2017
  • by hypeJunction
  • Repository
  • 3 Watchers
  • 3 Stars
  • 922 Installations
  • PHP
  • 2 Dependents
  • 0 Suggesters
  • 3 Forks
  • 1 Open issues
  • 8 Versions
  • 18 % Grown

The README.md

HTTP Parser

Code Coverage Scrutinizer Code Quality Build Status, (*1)

Parses HTTP resources into serializable arrays of oEmbed-like metatags, (*2)

Features

  • Parses HTTP resources into oEmbed-like arrays of metatags
  • Handy for rendering URL cards

Security

  • Make sure to implement a domain whitelist, as this method is not entirely secure and prone to XSS

Sample Output

{
    "url": "https://soundcloud.com/phobosrecords/maksim-light-pulse",
    "canonical": "https://soundcloud.com/phobosrecords/maksim-light-pulse",
    "oembed_url": "https://soundcloud.com/oembed?url=https%3A%2F%2Fsoundcloud.com%2Fphobosrecords%2Fmaksim-light-pulse&format=json",
    "icons": [
        "https://a-v2.sndcdn.com/assets/images/sc-icons/favicon-2cadd14b.ico"
    ],
    "metatags": {
        "keywords": "record, sounds, share, sound, audio, tracks, music, soundcloud",
        "referrer": "origin",
        "google-site-verification": "dY0CigqM8Inubs_hgrYMwk-zGchKwrvJLcvI_G8631Q",
        "viewport": "width=device-width,minimum-scale=1,maximum-scale=1,user-scalable=no",
        "fb:app_id": "19507961798",
        "og:site_name": "SoundCloud",
        "twitter:app:name:iphone": [
            "SoundCloud",
            "SoundCloud"
        ],
        "description": "Stream PHS014: Maksim Dark & Light Breath - Finger Pulse (Original Mix) by Phobos Records from desktop or your mobile device",
        "twitter:title": "PHS014: Maksim Dark & Light Breath - Finger Pulse (Original Mix)",
        "twitter:description": "Release Date: 01.02.2016\n
PHS014: Maksim Dark & Light Breath \"BELRUS EP\".\npro.beatport.com/release/belrus-ep/1683847\n\nfacebook.com/phobosrecords",
        "twitter:player": "https://w.soundcloud.com/player/?url=https%3A%2F%2Fapi.soundcloud.com%2Ftracks%2F244902712&auto_play=false&show_artwork=true&visual=true&origin=twitter",
        "twitter:player:height": "400",
        "twitter:player:width": "435",
        "twitter:image:src": "https://i1.sndcdn.com/artworks-000145456354-01w756-t500x500.jpg",
        "twitter:card": "audio",
        "twitter:audio:artist_name": "Phobos Records",
        "twitter:audio:source": "https://api-partners.soundcloud.com/twitter/tracks/soundcloud:sounds:244902712/vmap",
        "twitter:app:url:googleplay": "soundcloud://sounds:244902712",
        "al:ios:app_store_id": "336353151",
        "al:android:package": "com.soundcloud.android",
        "og:type": "soundcloud:sound",
        "og:url": "https://soundcloud.com/phobosrecords/maksim-light-pulse",
        "og:image:width": "500",
        "al:web:should_fallback": "false",
        "soundcloud:user": "https://soundcloud.com/phobosrecords",
        "soundcloud:play_count": "611",
        "soundcloud:comments_count": "11",
        "soundcloud:like_count": "40",
        "msapplication-tooltip": "Launch SoundCloud",
        "msapplication-tileimage": "https://a-v2.sndcdn.com/assets/images/sc-icons/win8-2dc974a1.png",
        "msapplication-tilecolor": "#ff5500",
        "msapplication-starturl": "https://soundcloud.com"
    },
    "tags": [
        "record",
        "sounds",
        "share",
        "sound",
        "audio",
        "tracks",
        "music",
        "soundcloud"
    ],
    "provider_name": "SoundCloud",
    "description": "Stream PHS014: Maksim Dark & Light Breath - Finger Pulse (Original Mix) by Phobos Records from desktop or your mobile device",
    "title": "PHS014: Maksim Dark & Light Breath - Finger Pulse (Original Mix) by Phobos Records",
    "resource_type": "soundcloud:sound",
    "thumbnails": [
        "https://i1.sndcdn.com/artworks-000145456354-01w756-t500x500.jpg",
        "https://a-v2.sndcdn.com/assets/images/loader-dark-45940ae3.gif"
    ],
    "type": "rich",
    "version": "1",
    "width": "100%",
    "height": "400",
    "html": "<iframe width=\"100%\" height=\"400\" scrolling=\"no\" frameborder=\"no\" src=\"https://w.soundcloud.com/player/?visual=true&url=https%3A%2F%2Fapi.soundcloud.com%2Ftracks%2F244902712&show_artwork=true\"></iframe>",
    "author_name": "Phobos Records",
    "author_url": "https://soundcloud.com/phobosrecords",
    "provider_url": "http://soundcloud.com",
    "thumbnail_url": "http://i1.sndcdn.com/artworks-000145456354-01w756-t500x500.jpg"
}

Usage

// sample index.php
require __DIR__ . '/vendor/autoload.php';

$client = new \GuzzleHttp\Client([
    'headers' => [
        'User-Agent' => 'Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9.2.12) Gecko/20101026 Firefox/3.6.12',
    ],
    'allow_redirects' => [
        'max' => 3,
        'strict' => true,
        'referer' => true,
        'protocols' => ['http', 'https']
    ],
    'timeout' => 5,
    'connect_timeout' => 5,
    'verify' => false,
]);
$parser = new \hypeJunction\Parser($client);

$url = $_GET['url'];

header('Content-Type: application/json');

echo json_encode($parser->parse($url));

The Versions

04/10 2017

dev-master

9999999-dev http://hypejunction.com

A tool for scraping URL resources

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

by Ismayil Khayredinov

plugin open graph card scraper elgg twitter tags

04/10 2017

0.1.3

0.1.3.0 http://hypejunction.com

A tool for scraping URL resources

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

by Ismayil Khayredinov

plugin open graph card scraper elgg twitter tags

21/04 2017

0.1.2

0.1.2.0 http://hypejunction.com

A tool for scraping URL resources

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

by Ismayil Khayredinov

plugin open graph card scraper elgg twitter tags

21/04 2017

0.1.1

0.1.1.0 http://hypejunction.com

A tool for scraping URL resources

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

by Ismayil Khayredinov

plugin open graph card scraper elgg twitter tags

18/04 2017

0.1.0

0.1.0.0 http://hypejunction.com

A tool for scraping URL resources

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

by Ismayil Khayredinov

plugin open graph card scraper elgg twitter tags

18/02 2017

0.0.2

0.0.2.0 http://hypejunction.com

A tool for scraping URL resources

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

by Ismayil Khayredinov

plugin open graph card scraper elgg twitter tags

03/02 2017

0.0.1

0.0.1.0 http://hypejunction.com

A tool for scraping URL resources

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

by Ismayil Khayredinov

plugin open graph card scraper elgg twitter tags

08/12 2016

4.2.3

4.2.3.0 http://hypejunction.com

A tool for scraping URL resources

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

by Ismayil Khayredinov

plugin open graph card scraper elgg twitter tags