yt-dlp/devscripts/prepare_manpage.py

#!/usr/bin/env python3
from __future__ import unicode_literals

import io
import optparse
import os.path
import re

ROOT_DIR = os.path.dirname(os.path.dirname(os.path.abspath(__file__)))
README_FILE = os.path.join(ROOT_DIR, 'README.md')

PREFIX = r'''%yt-dlp(1)

# NAME

yt\-dlp \- A youtube-dl fork with additional features and patches

# SYNOPSIS

**yt-dlp** \[OPTIONS\] URL [URL...]

# DESCRIPTION

'''


def main():
    parser = optparse.OptionParser(usage='%prog OUTFILE.md')
    options, args = parser.parse_args()
    if len(args) != 1:
        parser.error('Expected an output filename')

    outfile, = args

    with io.open(README_FILE, encoding='utf-8') as f:
        readme = f.read()

    readme = filter_excluded_sections(readme)
    readme = move_sections(readme)
    readme = filter_options(readme)

    with io.open(outfile, 'w', encoding='utf-8') as outf:
        outf.write(PREFIX + readme)


def filter_excluded_sections(readme):
    EXCLUDED_SECTION_BEGIN_STRING = re.escape('<!-- MANPAGE: BEGIN EXCLUDED SECTION -->')
    EXCLUDED_SECTION_END_STRING = re.escape('<!-- MANPAGE: END EXCLUDED SECTION -->')
    return re.sub(
        rf'(?s){EXCLUDED_SECTION_BEGIN_STRING}.+?{EXCLUDED_SECTION_END_STRING}\n',
        '', readme)


def move_sections(readme):
    MOVE_TAG_TEMPLATE = '<!-- MANPAGE: MOVE "%s" SECTION HERE -->'
    sections = re.findall(rf'(?m)^{re.escape(MOVE_TAG_TEMPLATE) % "(.+)"}$', readme)

    for section_name in sections:
        move_tag = MOVE_TAG_TEMPLATE % section_name
        if readme.count(move_tag) > 1:
            raise Exception(f'There is more than one occurrence of "{move_tag}". This is unexpected')

        sections = re.findall(rf'(?sm)(^# {re.escape(section_name)}.+?)(?=^# )', readme)
        if len(sections) < 1:
            raise Exception(f'The section {section_name} does not exist')
        elif len(sections) > 1:
            raise Exception(f'There are multiple occurrences of section {section_name}, this is unhandled')

        readme = readme.replace(sections[0], '', 1).replace(move_tag, sections[0], 1)
    return readme


def filter_options(readme):
    section = re.search(r'(?sm)^# USAGE AND OPTIONS\n.+?(?=^# )', readme).group(0)
    options = '# OPTIONS\n'
    for line in section.split('\n')[1:]:
        if line.lstrip().startswith('-'):
            split = re.split(r'\s{2,}', line.lstrip())
            # Description string may start with `-` as well. If there is
            # only one piece then it's a description bit not an option.
            if len(split) > 1:
                option, description = split
                split_option = option.split(' ')

                if not split_option[-1].startswith('-'):  # metavar
                    option = ' '.join(split_option[:-1] + [f'*{split_option[-1]}*'])

                # Pandoc's definition_lists. See http://pandoc.org/README.html
                options += f'\n{option}\n:   {description}\n'
                continue
        options += line.lstrip() + '\n'

    return readme.replace(section, options, 1)


if __name__ == '__main__':
    main()
[cleanup] Point all shebang to `python3` (#372) Authored by: fstirlitz 2021-06-03 05:43:42 -04:00			`#!/usr/bin/env python3`
[test_unicode_literals] Arm unicode_literals check From now on, the line from __future__ import unicode_literals should be contained in every single Python file lest we run into any more 2.x/3.x issues. Going forward, we're likely to develop on 3.x only and would likely miss subtle bugs otherwise. 2014-11-26 14:01:20 -05:00			`from __future__ import unicode_literals`
add prepare_manpage 2014-05-13 08:21:21 -04:00
			`import io`
[devscripts/prepare_manpage] Fix manpage generation on Windows 2016-05-28 23:06:10 -04:00			`import optparse`
add prepare_manpage 2014-05-13 08:21:21 -04:00			`import os.path`
			`import re`

			`ROOT_DIR = os.path.dirname(os.path.dirname(os.path.abspath(__file__)))`
			`README_FILE = os.path.join(ROOT_DIR, 'README.md')`

Completely change project name to yt-dlp (#85) * All modules and binary names are changed * All documentation references changed * yt-dlp no longer loads youtube-dlc config files * All URLs changed to point to organization account Co-authored-by: Pccode66 Co-authored-by: pukkandan 2021-02-24 13:45:56 -05:00			`PREFIX = r'''%yt-dlp(1)`
[devscripts/prepare_manpage] Fix manpage generation on Windows 2016-05-28 23:06:10 -04:00
			`# NAME`

[docs] Improve manpage format (#2003) Closes #1448 Authored by: iw0nderhow, pukkandan 2021-12-16 20:23:04 -05:00			`yt\-dlp \- A youtube-dl fork with additional features and patches`
[devscripts/prepare_manpage] Fix manpage generation on Windows 2016-05-28 23:06:10 -04:00
			`# SYNOPSIS`

Completely change project name to yt-dlp (#85) * All modules and binary names are changed * All documentation references changed * yt-dlp no longer loads youtube-dlc config files * All URLs changed to point to organization account Co-authored-by: Pccode66 Co-authored-by: pukkandan 2021-02-24 13:45:56 -05:00			`yt-dlp \[OPTIONS\] URL [URL...]`
[devscripts/prepare_manpage] Fix manpage generation on Windows 2016-05-28 23:06:10 -04:00
[docs] Improve manpage format (#2003) Closes #1448 Authored by: iw0nderhow, pukkandan 2021-12-16 20:23:04 -05:00			`# DESCRIPTION`

[devscripts/prepare_manpage] Fix manpage generation on Windows 2016-05-28 23:06:10 -04:00			`'''`


			`def main():`
			`parser = optparse.OptionParser(usage='%prog OUTFILE.md')`
			`options, args = parser.parse_args()`
			`if len(args) != 1:`
			`parser.error('Expected an output filename')`

			`outfile, = args`

			`with io.open(README_FILE, encoding='utf-8') as f:`
			`readme = f.read()`

[docs] Improve manpage format (#2003) Closes #1448 Authored by: iw0nderhow, pukkandan 2021-12-16 20:23:04 -05:00			`readme = filter_excluded_sections(readme)`
			`readme = move_sections(readme)`
[devscripts/prepare_manpage] Fix manpage generation on Windows 2016-05-28 23:06:10 -04:00			`readme = filter_options(readme)`

			`with io.open(outfile, 'w', encoding='utf-8') as outf:`
[docs] Improve manpage format (#2003) Closes #1448 Authored by: iw0nderhow, pukkandan 2021-12-16 20:23:04 -05:00			`outf.write(PREFIX + readme)`


			`def filter_excluded_sections(readme):`
			`EXCLUDED_SECTION_BEGIN_STRING = re.escape('<!-- MANPAGE: BEGIN EXCLUDED SECTION -->')`
			`EXCLUDED_SECTION_END_STRING = re.escape('<!-- MANPAGE: END EXCLUDED SECTION -->')`
			`return re.sub(`
			`rf'(?s){EXCLUDED_SECTION_BEGIN_STRING}.+?{EXCLUDED_SECTION_END_STRING}\n',`
			`'', readme)`


			`def move_sections(readme):`
			`MOVE_TAG_TEMPLATE = '<!-- MANPAGE: MOVE "%s" SECTION HERE -->'`
			`sections = re.findall(rf'(?m)^{re.escape(MOVE_TAG_TEMPLATE) % "(.+)"}$', readme)`

			`for section_name in sections:`
			`move_tag = MOVE_TAG_TEMPLATE % section_name`
			`if readme.count(move_tag) > 1:`
			`raise Exception(f'There is more than one occurrence of "{move_tag}". This is unexpected')`

			`sections = re.findall(rf'(?sm)(^# {re.escape(section_name)}.+?)(?=^# )', readme)`
			`if len(sections) < 1:`
			`raise Exception(f'The section {section_name} does not exist')`
			`elif len(sections) > 1:`
			`raise Exception(f'There are multiple occurrences of section {section_name}, this is unhandled')`

			`readme = readme.replace(sections[0], '', 1).replace(move_tag, sections[0], 1)`
			`return readme`
[devscripts/prepare_manpage] Fix manpage generation on Windows 2016-05-28 23:06:10 -04:00
[doc] Better formatting of youtube-dl.1 (closes #6510) 2015-09-13 08:10:23 -04:00
			`def filter_options(readme):`
[docs] Improve manpage format (#2003) Closes #1448 Authored by: iw0nderhow, pukkandan 2021-12-16 20:23:04 -05:00			`section = re.search(r'(?sm)^# USAGE AND OPTIONS\n.+?(?=^# )', readme).group(0)`
			`options = '# OPTIONS\n'`
			`for line in section.split('\n')[1:]:`
			`if line.lstrip().startswith('-'):`
			`split = re.split(r'\s{2,}', line.lstrip())`
			# Description string may start with `-` as well. If there is
			`# only one piece then it's a description bit not an option.`
			`if len(split) > 1:`
			`option, description = split`
			`split_option = option.split(' ')`

			`if not split_option[-1].startswith('-'): # metavar`
			`option = ' '.join(split_option[:-1] + [f'{split_option[-1]}'])`

			`# Pandoc's definition_lists. See http://pandoc.org/README.html`
			`options += f'\n{option}\n: {description}\n'`
			`continue`
			`options += line.lstrip() + '\n'`

			`return readme.replace(section, options, 1)`
[doc] Better formatting of youtube-dl.1 (closes #6510) 2015-09-13 08:10:23 -04:00
Update coding style after pycodestyle 2.1.0 In pycodestyle 2.1.0, E305 was introduced, which requires two blank lines after top level declarations, too. See https://github.com/PyCQA/pycodestyle/issues/400 See also #10689; thanks @stepshal for first mentioning this issue and initial patches 2016-11-17 06:42:56 -05:00
[devscripts/prepare_manpage] Fix manpage generation on Windows 2016-05-28 23:06:10 -04:00			`if __name__ == '__main__':`
			`main()`